| Definition | Escherichia coli ED1a chromosome, complete genome. |
|---|---|
| Accession | NC_011745 |
| Length | 5,209,548 |
The map label for this gene is yfgF [H]
Identifier: 218690618
GI number: 218690618
Start: 2877143
End: 2879386
Strand: Reverse
Name: yfgF [H]
Synonym: ECED1_2927
Alternate gene names: 218690618
Gene position: 2879386-2877143 (Counterclockwise)
Preceding gene: 218690619
Following gene: 218690613
Centisome position: 55.27
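The coordinate fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, not the database's own method: the record does not state how the centisome position is derived, so the formula used here (the gene's 5' coordinate as a percentage of chromosome length) is an assumption that happens to reproduce the listed value. The function names are hypothetical.

```python
# Values copied from the record above.
CHROMOSOME_LENGTH = 5_209_548
START, END = 2_877_143, 2_879_386  # reverse strand, so the gene's 5' end is END


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, chromosome_length: int) -> float:
    """Position as a percentage of chromosome length (assumed definition)."""
    return round(100 * position / chromosome_length, 2)


length = gene_length(START, END)
codons = length // 3
amino_acids = codons - 1  # the final codon of the gene sequence (TGA) is a stop

print(length, amino_acids, centisome(END, CHROMOSOME_LENGTH))
```

Under these assumptions the script prints 2244, 747 and 55.27, matching the gene sequence length, the number of amino acids and the centisome position given in the record.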
GC content: 46.79
Gene sequence:
>2244_bases ATGAAACTGAATGCAACTTATATAAAAATACGTGATAAATGGTGGGGGCTTCCGCTGTTCCTGCCTTCTTTAATCTTGCC CATTTTCGCCCACATTAATACTTTCGCGCATATTTCTTCCGGTGAAGTTTTTCTCTTTTATCTGCCACTGGCACTGATGA TCAGCATGATGATGTTTTTCAGCTGGGCGGCATTGCCAGGGATCGCCTTAGGGATTTTTGTCCGCAAATATGCAGAGCTG GGTTTTTACGAAACGCTCTCATTAACGGCTAATTTTATTATCATTATCATTCTCTGTTGGGGCGGTTACAGGGTTTTTAC CCCCCGGCGTAACAACGTTTCACATGGTGATAGCCGTTTAATTTCCCAGCGACTATTCTGGCAGATTGTGTTTCCTGCAA CGCTGTTTCTGATACTTTTCCAGTTTGCTGCGTTTGTAGGGTTACTGGCGAGCAGAGAAAATCTGGTCGGCGTCATGCCC TTTAACCTCGGGACCTTAATCAATTATCAGGCCTTGCTGGTGGGTAATCTGATTGGTGTCCCGCTGTGCTACTTCATCAT TCGGGTGGTGCGAAATCCGTTTTATTTACGTAGCTATTATTCGCAATTAAAACAGCAGGTTGATGCCAAAGTTACCAAAA AAGAGTTCGCAATCTGGCTACTGGCATTAGGTGCTTTACTATTGCTGTTATGCATGCCGTTAAATGAAAAAAGCACGATT TTTAGCACCAATTACACCTTGTCATTATTGCTGCCCCTGATGATGTGGGGAGCGATGCGCTATGGTTATAAGCTGATTTC ATTGCTCTGGGCGGTCGTGTTGATGATCAGCATCCACAGCTATCAAAATTACATTCCCATTTATCCTGGCTATACCACGC AGCTCACCATAACCTCCTCCAGTTATCTGGTATTCTCTTTTATTGTCAATTATATGGCTGTACTGGCAACCCGTCAGCGA GCGGTAGTCAGACGCATTCAGCGGCTTGCGTATGTGGACCCGGTGGTTCATCTGCCAAATGTTCGCGCCCTGAATCGCGC GTTACGTGATGCCCCCTGGTCTGCGCTTTGTTATTTACGCATCCCTGGCATGGAAATGCTGGTTAAGAATTATGGCATCA TGCTGCGGATTCAATACAAGCAAAAACTTTCTCACTGGCTGTCACCATTGTTGGAACCGGGTGAAGATGTTTATCAGCTT TCGGGTAACGATCTCGCGCTGCGGCTGAATACAGAATCGCACCAGGAGCGCATTACCGCACTGGATAGCCATCTCAAGCA ATTTCGTTTCTTTTGGGATGGAATGCCGATGCAACCGCAGATTGGCGTCAGTTACTGCTATGTGCGCTCGCCAGTGAATC ATATCTACCTGCTGCTGGGAGAGCTAAATACAGTGGCCGAACTTTCCATCGTGACCAACGCCCCGGAAAATATGCAGCGT CGCGGAGCAATGTATTTGCAACGCGAATTGAAAGATAAAGTCGCGATGATGAATCGGCTACAGCGGGCGCTGGAACACAA CCATTTTTTCCTGATGGCCCAGCCGATTACCGGTATGCGTGGTGATGTCTACCATGAAATTCTTCTGCGCATGAAAGGTG AGAATGATGAACTGATCAGCCCCGATAGCTTCTTACCGGTCGCGCACGAATTTGGTTTATCGTCGAGTATCGACATGTGG GTCATTGAGCATACGCTGCAATTTATGGCTGAAAACAGAGCGAAGATGCCCGCTCACCGTTTTGCTATTAATCTGTCTCC AACCTCGGTATGTCAGGCTCGTTTTCCTGTTGAAGTCAGTCAGCTGCTGGCTAAATATCAGATTGAAGCGTGGCAACTTA TTTTTGAAGTCACCGAAAGTAATGCTCTGACCAATGTTAAGCAGGCGCAAATCACCTTGCAGCATCTTCAGGAATTAGGC 
TGCCAGATTGCGATTGATGATTTCGGCACCGGCTACGCCAGCTATGCGCGGCTTAAAAATGTGAATGCCGATCTGCTTAA AATTGACGGCAGTTTTATCCGCAATATTGTGTCAAATAGTCTGGATTATCAGATAGTGGCGTCGATTTGCCACCTGGCGC GAATGAAGAAAATGCGGGTAGTGGCAGAGTACGTTGAAAACGAAGAGATCCGCGAGGCGGTGCTCTCTTTGGGGATCGAT TATATGCAGGGTTATCTTATTGGTAAGCCGCAACCGTTAATTGATACGCTGAATGAAATCGAACCCATTCGCGAAAGTGC CTGA
Upstream 100 bases:
>100_bases CATTGCTCTCTATAGTGATGAATAATCATCATTCGAAGTCAGGTGGGATGCCTGTCTGAATACACCTCCTTCAGGATGTG GGGGATTCTGCTGAGCATCT
Downstream 100 bases:
>100_bases ATAATGCGGGCCGACATTTCTCGTCGGCCCGCAAAGCATTAAGCGGCGATTTCTGGTGTACTTTCTTCTTCAATTTTCAA CCGCCAGCCAGCCACGCCTT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 747; Mature: 747
Protein sequence:
>747_residues MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRLISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID YMQGYLIGKPQPLIDTLNEIEPIRESA
Sequences:
>Translated_747_residues MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRLISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID YMQGYLIGKPQPLIDTLNEIEPIRESA >Mature_747_residues MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRLISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID YMQGYLIGKPQPLIDTLNEIEPIRESA
Specific function: Truncated proteins consisting of the GGDEF/EAL domains (residues 319-747) or of the EAL domain alone (481-747) have c-di-GMP phosphodiesterase activity. They do not have diguanylate cyclase activity. Cyclic-di-GMP is a second messenger which controls cel
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 EAL domain [H]
Homologues:
Organism=Escherichia coli, GI1788849, Length=747, Percent_Identity=99.330655957162, Blast_Score=1531, Evalue=0.0
Organism=Escherichia coli, GI87082096, Length=714, Percent_Identity=30.1120448179272, Blast_Score=317, Evalue=1e-87
Organism=Escherichia coli, GI1790496, Length=236, Percent_Identity=31.7796610169492, Blast_Score=132, Evalue=1e-31
Organism=Escherichia coli, GI87081743, Length=263, Percent_Identity=30.0380228136882, Blast_Score=124, Evalue=2e-29
Organism=Escherichia coli, GI1787541, Length=439, Percent_Identity=26.4236902050114, Blast_Score=120, Evalue=3e-28
Organism=Escherichia coli, GI87081921, Length=462, Percent_Identity=25.5411255411255, Blast_Score=119, Evalue=6e-28
Organism=Escherichia coli, GI226510982, Length=252, Percent_Identity=28.968253968254, Blast_Score=111, Evalue=1e-25
Organism=Escherichia coli, GI1788502, Length=240, Percent_Identity=28.75, Blast_Score=111, Evalue=2e-25
Organism=Escherichia coli, GI1786507, Length=233, Percent_Identity=30.4721030042918, Blast_Score=110, Evalue=2e-25
Organism=Escherichia coli, GI87081980, Length=239, Percent_Identity=31.3807531380753, Blast_Score=103, Evalue=5e-23
Organism=Escherichia coli, GI87081845, Length=258, Percent_Identity=29.0697674418605, Blast_Score=100, Evalue=6e-22
Organism=Escherichia coli, GI1787055, Length=250, Percent_Identity=26.8, Blast_Score=100, Evalue=7e-22
Organism=Escherichia coli, GI1789650, Length=225, Percent_Identity=23.1111111111111, Blast_Score=67, Evalue=4e-12
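The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records for sorting or filtering. The sketch below is one possible way to do this, assuming every entry carries the same six fields in the same order as in this record. The `Hit` type and `parse_hits` function are hypothetical names, not part of any database tooling.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI<digits>, Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<pid>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the text, in order of appearance."""
    return [
        Hit(m["organism"], m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_PATTERN.finditer(text)
    ]


# The first two entries of the Homologues field, copied from the record.
sample = (
    "Organism=Escherichia coli, GI1788849, Length=747, "
    "Percent_Identity=99.330655957162, Blast_Score=1531, Evalue=0.0, "
    "Organism=Escherichia coli, GI87082096, Length=714, "
    "Percent_Identity=30.1120448179272, Blast_Score=317, Evalue=1e-87,"
)

for hit in parse_hits(sample):
    print(hit.gi, round(hit.percent_identity, 2), hit.evalue)
```

Because the pattern matches each entry independently, it works whether the entries are separated by commas on one line or placed on separate lines.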
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000160
- InterPro: IPR001633
- InterPro: IPR007895 [H]
Pfam domain/function: PF00563 EAL; PF05231 MASE1 [H]
EC number: 3.1.4.52 [H]
Molecular weight: Translated: 85666; Mature: 85666
Theoretical pI: Translated: 9.11; Mature: 9.11
Prosite motif: PS50883 EAL
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
4.0 %Met (Translated Protein)
5.1 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
4.0 %Met (Mature Protein)
5.1 %Cys+Met (Mature Protein)
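Residue percentages of this kind are simple counts over the protein sequence. The following is a minimal sketch of that calculation, assuming the percentage is the residue count divided by the sequence length and rounded to one decimal place. The record does not say how its figures were produced, and `residue_percent` is a hypothetical helper. For brevity the example uses only the first 20 residues of the protein sequence, so its output is not comparable to the full-length values listed above.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = sequence.upper()
    count = sum(seq.count(residue) for residue in residues)
    return round(100 * count / len(seq), 1)


# First 20 residues of the protein sequence in this record (illustration only).
fragment = "MKLNATYIKIRDKWWGLPLF"

print(residue_percent(fragment, "C"))   # cysteine
print(residue_percent(fragment, "M"))   # methionine
print(residue_percent(fragment, "CM"))  # cysteine + methionine
```

Passing the full 747-residue sequence instead of `fragment` would allow the listed Cys and Met percentages to be checked directly.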
Secondary structure:
>Translated Secondary Structure MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFF CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHEECCCCEEHHHHHHHHHHHHHHHH SWAALPGIALGIFVRKYAELGFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRL HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEEEEECCCEEEECCCCCCCCCCHHHH ISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMPFNLGTLINYQALLVGNLIGV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHH PLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCE FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSS EECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEEECC SYLVFSFIVNYMAVLATRQRAVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLR CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCHHHHHHHHHCCCCCEEEEEE IPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQLSGNDLALRLNTESHQERITA CCCHHHHHHCCCEEEEEEHHHHHHHHHHHHCCCCCHHEEECCCEEEEEECCHHHHHHHHH LDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR HHHHHHHHHHHHCCCCCCCCCCCEEEEECCCHHHHEEHHHCCCCHHEEEEEECCCHHHHH RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELIS HHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCC PDSFLPVAHEFGLSSSIDMWVIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVS CCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHEEEECCHHHHHHCCCCHHHH QLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELGCQIAIDDFGTGYASYARLKN HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHC VNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID CCCCEEEECHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCHH YMQGYLIGKPQPLIDTLNEIEPIRESA HHCCEECCCCCHHHHHHHHHHHHHCCC >Mature Secondary Structure MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFF CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHEECCCCEEHHHHHHHHHHHHHHHH SWAALPGIALGIFVRKYAELGFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRL HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEEEEECCCEEEECCCCCCCCCCHHHH ISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMPFNLGTLINYQALLVGNLIGV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHH 
PLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCE FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSS EECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEEECC SYLVFSFIVNYMAVLATRQRAVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLR CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCHHHHHHHHHCCCCCEEEEEE IPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQLSGNDLALRLNTESHQERITA CCCHHHHHHCCCEEEEEEHHHHHHHHHHHHCCCCCHHEEECCCEEEEEECCHHHHHHHHH LDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR HHHHHHHHHHHHCCCCCCCCCCCEEEEECCCHHHHEEHHHCCCCHHEEEEEECCCHHHHH RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELIS HHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCC PDSFLPVAHEFGLSSSIDMWVIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVS CCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHEEEECCHHHHHHCCCCHHHH QLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELGCQIAIDDFGTGYASYARLKN HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHC VNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID CCCCEEEECHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCHH YMQGYLIGKPQPLIDTLNEIEPIRESA HHCCEECCCCCHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9205837; 9278503 [H]