Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is yfeA [H]
Identifier: 157161868
GI number: 157161868
Start: 2547045
End: 2549234
Strand: Reverse
Name: yfeA [H]
Synonym: EcHS_A2532
Alternate gene names: 157161868
Gene position: 2549234-2547045 (Counterclockwise)
Preceding gene: 157161871
Following gene: 157161867
Centisome position: 54.9
GC content: 49.04
Gene sequence:
>2190_bases ATGTTTGTGGAGCATAACCTGATAAAAAATATCAAGATATTCACACTAGCGTTTACGCTCACCGTGGTACTTATTCAGCT ATCCCGTTTTATTTCGCCACTTGCCATTATCCATTCCAGTTATATCTTTCTGGCGTGGATGCCACTGTGCGTAATGCTGT CAATCTTGTTTATCTTTGGCTGGCGCGGTGTCGTTCCCGTTTTATGCGGGATGTTTTGCACCAATCTGTGGAACTTTCAT CTCTCTTTTTTACAGACTGCGGTCATGCTCGGTAGCCAGACGTTTGTCGTGTTGTGTGCCTGCGCAATATTACGCTGGCA GCTGGGGACGCGTTGGCGTTATGGATTGACCAGCCGATATGTCTGGCAACGTCTGTTCTGGCTTGGTTTGGTGACGCCGA TCGGCATCAAATGCAGCATGTATCTTGTGGGGAGCTTCTTTGATTTCCCGCTAAAGATATCCACCTTTTTCGGCGATGCG GATGCCATTTTCACGGTCGTTGATTTGCTAAGCCTTTTCACCGCTGTGCTGATTTACAACATGCTTTTCTACTATCTCAC TCGCATGATTGTAAGTCCCCACTTTGCGCAGATATTGTGGCGCAGGGATATCGCTCCGTCGTTGGGCAAAGAGAAACGCG CATTTACCTTAAGCTGGCTGGCAGCGTTAAGCGTGCTGCTACTTCTGTTGTGCACACCTTATGAAAACGACTTTATTGCC GGTTACCTGGTACCCGTTTTCTTCATCATCTTTACCCTCGGGGTCGGTAAGCTTCGCTATCCGTTTTTAAATCTCACCTG GGCTGTTTCAACGCTTTGCCTTCTGAATTACAACCAGAACTTTTTGCAAGGGGTGGAAACCGAATATTCGCTGGCATTTA TTCTTGCGGTGCTGATTTCCTTTAGCGTTTGCCTGCTCTATATGGTGCGCATTTATCATCGCAGTGAATGGCTTAATCGC CGCTGGCATTTGCAGGCGCTGACAGATCCGTTAACGCTCCTACCCAACTTTCGTGCGTTGGAACAAGCGCCGGAGCAAGA GGCGGGCAAGAGTTTTTGCTGCCTGCGCATTGATAATCTTGAGTTTATGAGTCGTCATTACGGCTTAATGATGCGCGTTC ACTGTATCCGCTCAATTTGCCGTACGCTGCTGCCGTTGATGCAGGAAAACGAAAAGTTGTATCAATTGCCGGGTAGTGAA CTGCTGTTAGTGCTGAGCGGGCCGGAAACGGAAGGGCGACTCCAGCATATGGTTAACATCCTGAATAGTCGGCAAATTCA CTGGAACAATACCGGGCTGGATATGGGCTATGGTGCTGCCTGGGGGCGTTTTGATGGAAATCAGGAAACCCTGCAACCCT TGTTGGGGCAGTTAAGCTGGCTGGCGGAGCAATCCTGCGCACATCATCATGTGCTGGCGCTGGATAGCAGAGAGGAGATG GTTTCCGGGCAGACCACTAAACAGGTGCTATTGCTGAATACCATTCGCACGGCGTTAGATCAGGGTGATTTGCTGCTCTA CGCCCAGCCAATTCGCAACAAAGAGGGTGAAGGTTATGATGAGATCCTCGCGCGACTGAAATATGACGGCGGCATTATGA CCCCGGATAAGTTTCTGCCCCTTATTGCTCAGTTTAACCTTAGCGCGCGTTTTGATTTGCAAGTGCTGGAATCCTTGTTG AAGTGGCTGGCAACACACCCTTGCGACAAAAAAGGACCGCGCTTTTCAGTCAATTTAATGCCGCTCACGCTGCTGCAAAA GAATATTGCCGGGCGGATTATTCGTCTGTTTAAGCGTTATCACATCTCCCCGCAGGCGGTCATTCTTGAGATCACCGAGG AGCAGGCGTTTTCTAACGCAGAAAGCAGCATGTACAACATCGAGCAGCTGCATAAGTTTGGTTTCCGGATTGCGATTGAT GACTTTGGCACCGGATATGCCAACTACGGACGGTTAAAGCGTTTGCAGGCTGATATCATCAAAATTGATGGCGTCTTTGT GAAAGATATTGTCACGAACACGCTGGATGCGATGATTGTGAGATCAATTACCGATCTGGCGAAAGCGAAGTCATTGAGTG TGGTCGCGGAGTTTGTCGAGACGCAACAGCAGCAGGCGCTATTGCATAAGCTCGGGGTGCAATATCTGCAAGGGTATTTG ATTGGTCGCCCGCAGCCATTAGCTGATTAA
Upstream 100 bases:
>100_bases TGAAAAATCGGCAACAGGCTGGCCCCCTGTTTGCTTCGCGATGCGAATAAACTTATTATTTGTGTGCCTGAAAACCCCGA TCAGTGAGAGTAGTGTACTC
Downstream 100 bases:
>100_bases CAGGCGTAAAAAAACCGGGGAATTATCCCATAAGCGCTAACTTAAGGGTTGTGGTATTACGCCTGATATGATTTAACGTG CCGATGAATTACTCTCACGA
Product: diguanylate cyclase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 729; Mature: 729
Protein sequence:
>729_residues MFVEHNLIKNIKIFTLAFTLTVVLIQLSRFISPLAIIHSSYIFLAWMPLCVMLSILFIFGWRGVVPVLCGMFCTNLWNFH LSFLQTAVMLGSQTFVVLCACAILRWQLGTRWRYGLTSRYVWQRLFWLGLVTPIGIKCSMYLVGSFFDFPLKISTFFGDA DAIFTVVDLLSLFTAVLIYNMLFYYLTRMIVSPHFAQILWRRDIAPSLGKEKRAFTLSWLAALSVLLLLLCTPYENDFIA GYLVPVFFIIFTLGVGKLRYPFLNLTWAVSTLCLLNYNQNFLQGVETEYSLAFILAVLISFSVCLLYMVRIYHRSEWLNR RWHLQALTDPLTLLPNFRALEQAPEQEAGKSFCCLRIDNLEFMSRHYGLMMRVHCIRSICRTLLPLMQENEKLYQLPGSE LLLVLSGPETEGRLQHMVNILNSRQIHWNNTGLDMGYGAAWGRFDGNQETLQPLLGQLSWLAEQSCAHHHVLALDSREEM VSGQTTKQVLLLNTIRTALDQGDLLLYAQPIRNKEGEGYDEILARLKYDGGIMTPDKFLPLIAQFNLSARFDLQVLESLL KWLATHPCDKKGPRFSVNLMPLTLLQKNIAGRIIRLFKRYHISPQAVILEITEEQAFSNAESSMYNIEQLHKFGFRIAID DFGTGYANYGRLKRLQADIIKIDGVFVKDIVTNTLDAMIVRSITDLAKAKSLSVVAEFVETQQQQALLHKLGVQYLQGYL IGRPQPLAD
Sequences:
>Translated_729_residues MFVEHNLIKNIKIFTLAFTLTVVLIQLSRFISPLAIIHSSYIFLAWMPLCVMLSILFIFGWRGVVPVLCGMFCTNLWNFH LSFLQTAVMLGSQTFVVLCACAILRWQLGTRWRYGLTSRYVWQRLFWLGLVTPIGIKCSMYLVGSFFDFPLKISTFFGDA DAIFTVVDLLSLFTAVLIYNMLFYYLTRMIVSPHFAQILWRRDIAPSLGKEKRAFTLSWLAALSVLLLLLCTPYENDFIA GYLVPVFFIIFTLGVGKLRYPFLNLTWAVSTLCLLNYNQNFLQGVETEYSLAFILAVLISFSVCLLYMVRIYHRSEWLNR RWHLQALTDPLTLLPNFRALEQAPEQEAGKSFCCLRIDNLEFMSRHYGLMMRVHCIRSICRTLLPLMQENEKLYQLPGSE LLLVLSGPETEGRLQHMVNILNSRQIHWNNTGLDMGYGAAWGRFDGNQETLQPLLGQLSWLAEQSCAHHHVLALDSREEM VSGQTTKQVLLLNTIRTALDQGDLLLYAQPIRNKEGEGYDEILARLKYDGGIMTPDKFLPLIAQFNLSARFDLQVLESLL KWLATHPCDKKGPRFSVNLMPLTLLQKNIAGRIIRLFKRYHISPQAVILEITEEQAFSNAESSMYNIEQLHKFGFRIAID DFGTGYANYGRLKRLQADIIKIDGVFVKDIVTNTLDAMIVRSITDLAKAKSLSVVAEFVETQQQQALLHKLGVQYLQGYL IGRPQPLAD >Mature_729_residues MFVEHNLIKNIKIFTLAFTLTVVLIQLSRFISPLAIIHSSYIFLAWMPLCVMLSILFIFGWRGVVPVLCGMFCTNLWNFH LSFLQTAVMLGSQTFVVLCACAILRWQLGTRWRYGLTSRYVWQRLFWLGLVTPIGIKCSMYLVGSFFDFPLKISTFFGDA DAIFTVVDLLSLFTAVLIYNMLFYYLTRMIVSPHFAQILWRRDIAPSLGKEKRAFTLSWLAALSVLLLLLCTPYENDFIA GYLVPVFFIIFTLGVGKLRYPFLNLTWAVSTLCLLNYNQNFLQGVETEYSLAFILAVLISFSVCLLYMVRIYHRSEWLNR RWHLQALTDPLTLLPNFRALEQAPEQEAGKSFCCLRIDNLEFMSRHYGLMMRVHCIRSICRTLLPLMQENEKLYQLPGSE LLLVLSGPETEGRLQHMVNILNSRQIHWNNTGLDMGYGAAWGRFDGNQETLQPLLGQLSWLAEQSCAHHHVLALDSREEM VSGQTTKQVLLLNTIRTALDQGDLLLYAQPIRNKEGEGYDEILARLKYDGGIMTPDKFLPLIAQFNLSARFDLQVLESLL KWLATHPCDKKGPRFSVNLMPLTLLQKNIAGRIIRLFKRYHISPQAVILEITEEQAFSNAESSMYNIEQLHKFGFRIAID DFGTGYANYGRLKRLQADIIKIDGVFVKDIVTNTLDAMIVRSITDLAKAKSLSVVAEFVETQQQQALLHKLGVQYLQGYL IGRPQPLAD
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 GGDEF domain [H]
Homologues:
Organism=Escherichia coli, GI87082096, Length=729, Percent_Identity=99.8628257887517, Blast_Score=1495, Evalue=0.0, Organism=Escherichia coli, GI1788849, Length=711, Percent_Identity=29.817158931083, Blast_Score=311, Evalue=8e-86, Organism=Escherichia coli, GI1790496, Length=235, Percent_Identity=31.4893617021277, Blast_Score=119, Evalue=6e-28, Organism=Escherichia coli, GI87081743, Length=244, Percent_Identity=30.7377049180328, Blast_Score=116, Evalue=6e-27, Organism=Escherichia coli, GI1787541, Length=427, Percent_Identity=25.5269320843091, Blast_Score=110, Evalue=3e-25, Organism=Escherichia coli, GI1787055, Length=512, Percent_Identity=24.21875, Blast_Score=109, Evalue=5e-25, Organism=Escherichia coli, GI226510982, Length=211, Percent_Identity=33.175355450237, Blast_Score=104, Evalue=2e-23, Organism=Escherichia coli, GI87081980, Length=242, Percent_Identity=29.7520661157025, Blast_Score=102, Evalue=9e-23, Organism=Escherichia coli, GI1788502, Length=243, Percent_Identity=30.0411522633745, Blast_Score=101, Evalue=1e-22, Organism=Escherichia coli, GI87081845, Length=235, Percent_Identity=29.7872340425532, Blast_Score=100, Evalue=3e-22, Organism=Escherichia coli, GI1786507, Length=210, Percent_Identity=31.9047619047619, Blast_Score=100, Evalue=6e-22, Organism=Escherichia coli, GI87081921, Length=265, Percent_Identity=27.5471698113208, Blast_Score=93, Evalue=7e-20, Organism=Escherichia coli, GI1787410, Length=167, Percent_Identity=25.748502994012, Blast_Score=64, Evalue=2e-11,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001054 - InterPro: IPR000160 - InterPro: IPR001633 - InterPro: IPR007895 [H]
Pfam domain/function: PF00563 EAL; PF05231 MASE1 [H]
EC number: NA
Molecular weight: Translated: 83453; Mature: 83453
Theoretical pI: Translated: 8.45; Mature: 8.45
Prosite motif: PS50883 EAL ; PS50887 GGDEF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein) 2.7 %Met (Translated Protein) 4.8 %Cys+Met (Translated Protein) 2.1 %Cys (Mature Protein) 2.7 %Met (Mature Protein) 4.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MFVEHNLIKNIKIFTLAFTLTVVLIQLSRFISPLAIIHSSYIFLAWMPLCVMLSILFIFG CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC WRGVVPVLCGMFCTNLWNFHLSFLQTAVMLGSQTFVVLCACAILRWQLGTRWRYGLTSRY CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCHHHH VWQRLFWLGLVTPIGIKCSMYLVGSFFDFPLKISTFFGDADAIFTVVDLLSLFTAVLIYN HHHHHHHHHHHCCCCCEEHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHH MLFYYLTRMIVSPHFAQILWRRDIAPSLGKEKRAFTLSWLAALSVLLLLLCTPYENDFIA HHHHHHHHHHHCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHH GYLVPVFFIIFTLGVGKLRYPFLNLTWAVSTLCLLNYNQNFLQGVETEYSLAFILAVLIS HHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHH FSVCLLYMVRIYHRSEWLNRRWHLQALTDPLTLLPNFRALEQAPEQEAGKSFCCLRIDNL HHHHHHHHHHHHHHHHHHCCCEEHHHHCCHHHHCCCCHHHHCCCHHHCCCCEEEEEECCH EFMSRHYGLMMRVHCIRSICRTLLPLMQENEKLYQLPGSELLLVLSGPETEGRLQHMVNI HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHEECCCCCEEEEEECCCCHHHHHHHHHH LNSRQIHWNNTGLDMGYGAAWGRFDGNQETLQPLLGQLSWLAEQSCAHHHVLALDSREEM HHCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHH VSGQTTKQVLLLNTIRTALDQGDLLLYAQPIRNKEGEGYDEILARLKYDGGIMTPDKFLP HCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHH LIAQFNLSARFDLQVLESLLKWLATHPCDKKGPRFSVNLMPLTLLQKNIAGRIIRLFKRY HHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHH HISPQAVILEITEEQAFSNAESSMYNIEQLHKFGFRIAIDDFGTGYANYGRLKRLQADII CCCCCEEEEEEEHHHHHCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHH KIDGVFVKDIVTNTLDAMIVRSITDLAKAKSLSVVAEFVETQQQQALLHKLGVQYLQGYL HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IGRPQPLAD CCCCCCCCC >Mature Secondary Structure MFVEHNLIKNIKIFTLAFTLTVVLIQLSRFISPLAIIHSSYIFLAWMPLCVMLSILFIFG CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC WRGVVPVLCGMFCTNLWNFHLSFLQTAVMLGSQTFVVLCACAILRWQLGTRWRYGLTSRY CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCHHHH VWQRLFWLGLVTPIGIKCSMYLVGSFFDFPLKISTFFGDADAIFTVVDLLSLFTAVLIYN HHHHHHHHHHHCCCCCEEHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHH MLFYYLTRMIVSPHFAQILWRRDIAPSLGKEKRAFTLSWLAALSVLLLLLCTPYENDFIA HHHHHHHHHHHCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHH GYLVPVFFIIFTLGVGKLRYPFLNLTWAVSTLCLLNYNQNFLQGVETEYSLAFILAVLIS HHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHH FSVCLLYMVRIYHRSEWLNRRWHLQALTDPLTLLPNFRALEQAPEQEAGKSFCCLRIDNL HHHHHHHHHHHHHHHHHHCCCEEHHHHCCHHHHCCCCHHHHCCCHHHCCCCEEEEEECCH EFMSRHYGLMMRVHCIRSICRTLLPLMQENEKLYQLPGSELLLVLSGPETEGRLQHMVNI HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHEECCCCCEEEEEECCCCHHHHHHHHHH LNSRQIHWNNTGLDMGYGAAWGRFDGNQETLQPLLGQLSWLAEQSCAHHHVLALDSREEM HHCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHH VSGQTTKQVLLLNTIRTALDQGDLLLYAQPIRNKEGEGYDEILARLKYDGGIMTPDKFLP HCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHH LIAQFNLSARFDLQVLESLLKWLATHPCDKKGPRFSVNLMPLTLLQKNIAGRIIRLFKRY HHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHH HISPQAVILEITEEQAFSNAESSMYNIEQLHKFGFRIAIDDFGTGYANYGRLKRLQADII CCCCCEEEEEEEHHHHHCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHH KIDGVFVKDIVTNTLDAMIVRSITDLAKAKSLSVVAEFVETQQQQALLHKLGVQYLQGYL HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IGRPQPLAD CCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9205837; 9278503; 2201776 [H]