Definition Escherichia coli ED1a chromosome, complete genome.
Accession NC_011745
Length 5,209,548

The map label for this gene is yfgF [H]

Identifier: 218690618

GI number: 218690618

Start: 2877143

End: 2879386

Strand: Reverse

Name: yfgF [H]

Synonym: ECED1_2927

Alternate gene names: 218690618

Gene position: 2879386-2877143 (Counterclockwise)

Preceding gene: 218690619

Following gene: 218690613

Centisome position: 55.27
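
As a cross-check, the coordinate fields above agree with one another. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome value is the gene's 5' coordinate (2879386 on the reverse strand) expressed as a percentage of the chromosome length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record; the formulas are assumed conventions.
GENOME_LENGTH = 5_209_548   # "Length" field
START = 2_877_143           # "Start" field
END = 2_879_386             # "End" field (5' end of a reverse-strand gene)


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(START, END))
    print(centisome(END, GENOME_LENGTH))
```

Under these assumptions the span is 2244 bases, matching the gene sequence header, and the centisome value rounds to 55.27.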

GC content: 46.79

Gene sequence:

>2244_bases
ATGAAACTGAATGCAACTTATATAAAAATACGTGATAAATGGTGGGGGCTTCCGCTGTTCCTGCCTTCTTTAATCTTGCC
CATTTTCGCCCACATTAATACTTTCGCGCATATTTCTTCCGGTGAAGTTTTTCTCTTTTATCTGCCACTGGCACTGATGA
TCAGCATGATGATGTTTTTCAGCTGGGCGGCATTGCCAGGGATCGCCTTAGGGATTTTTGTCCGCAAATATGCAGAGCTG
GGTTTTTACGAAACGCTCTCATTAACGGCTAATTTTATTATCATTATCATTCTCTGTTGGGGCGGTTACAGGGTTTTTAC
CCCCCGGCGTAACAACGTTTCACATGGTGATAGCCGTTTAATTTCCCAGCGACTATTCTGGCAGATTGTGTTTCCTGCAA
CGCTGTTTCTGATACTTTTCCAGTTTGCTGCGTTTGTAGGGTTACTGGCGAGCAGAGAAAATCTGGTCGGCGTCATGCCC
TTTAACCTCGGGACCTTAATCAATTATCAGGCCTTGCTGGTGGGTAATCTGATTGGTGTCCCGCTGTGCTACTTCATCAT
TCGGGTGGTGCGAAATCCGTTTTATTTACGTAGCTATTATTCGCAATTAAAACAGCAGGTTGATGCCAAAGTTACCAAAA
AAGAGTTCGCAATCTGGCTACTGGCATTAGGTGCTTTACTATTGCTGTTATGCATGCCGTTAAATGAAAAAAGCACGATT
TTTAGCACCAATTACACCTTGTCATTATTGCTGCCCCTGATGATGTGGGGAGCGATGCGCTATGGTTATAAGCTGATTTC
ATTGCTCTGGGCGGTCGTGTTGATGATCAGCATCCACAGCTATCAAAATTACATTCCCATTTATCCTGGCTATACCACGC
AGCTCACCATAACCTCCTCCAGTTATCTGGTATTCTCTTTTATTGTCAATTATATGGCTGTACTGGCAACCCGTCAGCGA
GCGGTAGTCAGACGCATTCAGCGGCTTGCGTATGTGGACCCGGTGGTTCATCTGCCAAATGTTCGCGCCCTGAATCGCGC
GTTACGTGATGCCCCCTGGTCTGCGCTTTGTTATTTACGCATCCCTGGCATGGAAATGCTGGTTAAGAATTATGGCATCA
TGCTGCGGATTCAATACAAGCAAAAACTTTCTCACTGGCTGTCACCATTGTTGGAACCGGGTGAAGATGTTTATCAGCTT
TCGGGTAACGATCTCGCGCTGCGGCTGAATACAGAATCGCACCAGGAGCGCATTACCGCACTGGATAGCCATCTCAAGCA
ATTTCGTTTCTTTTGGGATGGAATGCCGATGCAACCGCAGATTGGCGTCAGTTACTGCTATGTGCGCTCGCCAGTGAATC
ATATCTACCTGCTGCTGGGAGAGCTAAATACAGTGGCCGAACTTTCCATCGTGACCAACGCCCCGGAAAATATGCAGCGT
CGCGGAGCAATGTATTTGCAACGCGAATTGAAAGATAAAGTCGCGATGATGAATCGGCTACAGCGGGCGCTGGAACACAA
CCATTTTTTCCTGATGGCCCAGCCGATTACCGGTATGCGTGGTGATGTCTACCATGAAATTCTTCTGCGCATGAAAGGTG
AGAATGATGAACTGATCAGCCCCGATAGCTTCTTACCGGTCGCGCACGAATTTGGTTTATCGTCGAGTATCGACATGTGG
GTCATTGAGCATACGCTGCAATTTATGGCTGAAAACAGAGCGAAGATGCCCGCTCACCGTTTTGCTATTAATCTGTCTCC
AACCTCGGTATGTCAGGCTCGTTTTCCTGTTGAAGTCAGTCAGCTGCTGGCTAAATATCAGATTGAAGCGTGGCAACTTA
TTTTTGAAGTCACCGAAAGTAATGCTCTGACCAATGTTAAGCAGGCGCAAATCACCTTGCAGCATCTTCAGGAATTAGGC
TGCCAGATTGCGATTGATGATTTCGGCACCGGCTACGCCAGCTATGCGCGGCTTAAAAATGTGAATGCCGATCTGCTTAA
AATTGACGGCAGTTTTATCCGCAATATTGTGTCAAATAGTCTGGATTATCAGATAGTGGCGTCGATTTGCCACCTGGCGC
GAATGAAGAAAATGCGGGTAGTGGCAGAGTACGTTGAAAACGAAGAGATCCGCGAGGCGGTGCTCTCTTTGGGGATCGAT
TATATGCAGGGTTATCTTATTGGTAAGCCGCAACCGTTAATTGATACGCTGAATGAAATCGAACCCATTCGCGAAAGTGC
CTGA
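
The GC content field of this record is a percentage. A minimal sketch of the conventional calculation follows, assuming the value is (G + C) divided by the total number of bases in the gene sequence; the record does not say how the figure was derived, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, rounded to two decimals.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # Toy input only; paste the 2244-base sequence above to check the record.
    print(gc_content("ATGC"))
```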

Upstream 100 bases:

>100_bases
CATTGCTCTCTATAGTGATGAATAATCATCATTCGAAGTCAGGTGGGATGCCTGTCTGAATACACCTCCTTCAGGATGTG
GGGGATTCTGCTGAGCATCT

Downstream 100 bases:

>100_bases
ATAATGCGGGCCGACATTTCTCGTCGGCCCGCAAAGCATTAAGCGGCGATTTCTGGTGTACTTTCTTCTTCAATTTTCAA
CCGCCAGCCAGCCACGCCTT

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 747; Mature: 747
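
The residue count follows from the gene length: 2244 bases form 748 codons, and the final codon (TGA) is a stop codon that is not translated. The sketch below illustrates that arithmetic; `protein_length` is an illustrative name, and the assumption is that the coding sequence includes exactly one terminal stop codon.

```python
def protein_length(cds_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_length bases."""
    if cds_length <= 0 or cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(protein_length(2244))
```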

Protein sequence:

>747_residues
MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL
GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRLISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP
FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI
FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR
AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL
SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR
RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW
VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG
CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID
YMQGYLIGKPQPLIDTLNEIEPIRESA

Sequences:

>Translated_747_residues
MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL
GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRLISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP
FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI
FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR
AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL
SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR
RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW
VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG
CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID
YMQGYLIGKPQPLIDTLNEIEPIRESA
>Mature_747_residues
MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL
GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRLISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP
FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI
FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR
AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL
SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR
RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW
VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG
CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID
YMQGYLIGKPQPLIDTLNEIEPIRESA

Specific function: Truncated proteins consisting of the GGDEF/EAL domains (residues 319-747) or of the EAL domain alone (481-747) have c-di-GMP phosphodiesterase activity. They do not have diguanylate cyclase activity. Cyclic-di-GMP is a second messenger which controls cel

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 EAL domain [H]

Homologues:

Organism=Escherichia coli, GI1788849, Length=747, Percent_Identity=99.330655957162, Blast_Score=1531, Evalue=0.0,
Organism=Escherichia coli, GI87082096, Length=714, Percent_Identity=30.1120448179272, Blast_Score=317, Evalue=1e-87,
Organism=Escherichia coli, GI1790496, Length=236, Percent_Identity=31.7796610169492, Blast_Score=132, Evalue=1e-31,
Organism=Escherichia coli, GI87081743, Length=263, Percent_Identity=30.0380228136882, Blast_Score=124, Evalue=2e-29,
Organism=Escherichia coli, GI1787541, Length=439, Percent_Identity=26.4236902050114, Blast_Score=120, Evalue=3e-28,
Organism=Escherichia coli, GI87081921, Length=462, Percent_Identity=25.5411255411255, Blast_Score=119, Evalue=6e-28,
Organism=Escherichia coli, GI226510982, Length=252, Percent_Identity=28.968253968254, Blast_Score=111, Evalue=1e-25,
Organism=Escherichia coli, GI1788502, Length=240, Percent_Identity=28.75, Blast_Score=111, Evalue=2e-25,
Organism=Escherichia coli, GI1786507, Length=233, Percent_Identity=30.4721030042918, Blast_Score=110, Evalue=2e-25,
Organism=Escherichia coli, GI87081980, Length=239, Percent_Identity=31.3807531380753, Blast_Score=103, Evalue=5e-23,
Organism=Escherichia coli, GI87081845, Length=258, Percent_Identity=29.0697674418605, Blast_Score=100, Evalue=6e-22,
Organism=Escherichia coli, GI1787055, Length=250, Percent_Identity=26.8, Blast_Score=100, Evalue=7e-22,
Organism=Escherichia coli, GI1789650, Length=225, Percent_Identity=23.1111111111111, Blast_Score=67, Evalue=4e-12,
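
Each homologue line above is a comma-separated list of `key=value` fields plus a bare `GI…` token. A minimal parsing sketch follows, assuming that layout holds for every line; `parse_homologue` and the `"GI"` key are names introduced here for illustration, not part of the database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate the trailing comma on each line
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields; keys follow the spelling used in the record.
    casts = (("Length", int), ("Blast_Score", int),
             ("Percent_Identity", float), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


if __name__ == "__main__":
    line = ("Organism=Escherichia coli, GI1790496, Length=236, "
            "Percent_Identity=31.7796610169492, Blast_Score=132, Evalue=1e-31,")
    print(parse_homologue(line))
```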

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000160
- InterPro:   IPR001633
- InterPro:   IPR007895 [H]

Pfam domain/function: PF00563 EAL; PF05231 MASE1 [H]

EC number: 3.1.4.52 [H]

Molecular weight: Translated: 85666; Mature: 85666

Theoretical pI: Translated: 9.11; Mature: 9.11

Prosite motif: PS50883 EAL

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
4.0 %Met     (Translated Protein)
5.1 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
4.0 %Met     (Mature Protein)
5.1 %Cys+Met (Mature Protein)
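
The percentages above are residue counts divided by the protein length. A short sketch of that calculation, assuming one-decimal rounding as in the listed values; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    hits = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * hits / len(seq), 1)


if __name__ == "__main__":
    toy = "MKCLM"  # toy peptide, not the protein in this record
    print(residue_percent(toy, "C"))
    print(residue_percent(toy, "M"))
    print(residue_percent(toy, "CM"))
```

For a 747-residue protein, the listed values are consistent with 8 Cys and 30 Met (38 combined), since 8/747, 30/747 and 38/747 round to 1.1, 4.0 and 5.1 percent.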

Secondary structure:

>Translated Secondary Structure
MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFF
CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHEECCCCEEHHHHHHHHHHHHHHHH
SWAALPGIALGIFVRKYAELGFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRL
HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEEEEECCCEEEECCCCCCCCCCHHHH
ISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMPFNLGTLINYQALLVGNLIGV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHH
PLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCE
FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSS
EECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEEECC
SYLVFSFIVNYMAVLATRQRAVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCHHHHHHHHHCCCCCEEEEEE
IPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQLSGNDLALRLNTESHQERITA
CCCHHHHHHCCCEEEEEEHHHHHHHHHHHHCCCCCHHEEECCCEEEEEECCHHHHHHHHH
LDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR
HHHHHHHHHHHHCCCCCCCCCCCEEEEECCCHHHHEEHHHCCCCHHEEEEEECCCHHHHH
RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELIS
HHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCC
PDSFLPVAHEFGLSSSIDMWVIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVS
CCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHEEEECCHHHHHHCCCCHHHH
QLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELGCQIAIDDFGTGYASYARLKN
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHC
VNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID
CCCCEEEECHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCHH
YMQGYLIGKPQPLIDTLNEIEPIRESA
HHCCEECCCCCHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFF
CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHEECCCCEEHHHHHHHHHHHHHHHH
SWAALPGIALGIFVRKYAELGFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDSRL
HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEEEEECCCEEEECCCCCCCCCCHHHH
ISQRLFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMPFNLGTLINYQALLVGNLIGV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHH
PLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFAIWLLALGALLLLLCMPLNEKSTI
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCE
FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSS
EECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEEECC
SYLVFSFIVNYMAVLATRQRAVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCHHHHHHHHHCCCCCEEEEEE
IPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQLSGNDLALRLNTESHQERITA
CCCHHHHHHCCCEEEEEEHHHHHHHHHHHHCCCCCHHEEECCCEEEEEECCHHHHHHHHH
LDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR
HHHHHHHHHHHHCCCCCCCCCCCEEEEECCCHHHHEEHHHCCCCHHEEEEEECCCHHHHH
RGAMYLQRELKDKVAMMNRLQRALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELIS
HHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCC
PDSFLPVAHEFGLSSSIDMWVIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVS
CCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHEEEECCHHHHHHCCCCHHHH
QLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELGCQIAIDDFGTGYASYARLKN
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHC
VNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMRVVAEYVENEEIREAVLSLGID
CCCCEEEECHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCHH
YMQGYLIGKPQPLIDTLNEIEPIRESA
HHCCEECCCCCHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9205837; 9278503 [H]