Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is ybdR [H]
Identifier: 157160105
GI number: 157160105
Start: 678169
End: 679407
Strand: Direct
Name: ybdR [H]
Synonym: EcHS_A0660
Alternate gene names: 157160105
Gene position: 678169-679407 (Clockwise)
Preceding gene: 157160103
Following gene: 157160115
Centisome position: 14.6
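The coordinate-derived values in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the gene start expressed as a percentage of the genome length (4,643,538). Neither convention is stated in this record, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(678169, 679407)
print(length)                                # 1239
# 1239 bases are 413 codons; dropping the final stop codon leaves 412 residues.
print(length // 3 - 1)                       # 412
print(round(centisome(678169, 4643538), 1))  # 14.6
```

Under these assumptions the results agree with the 1239-base gene sequence, the 412-residue protein, and the centisome position of 14.6 listed in the record.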
GC content: 50.04
Gene sequence:
>1239_bases ATGAAAGCATTGACTTATCACGGCCCACATCACGTTCAGGTAGAAAATGTTCCCGATCCGGGCGTTGAACAGGCAGATGA TATTATTCTGCGTATTACGGCAACGGCGATCTGTGGCTCTGACCTCCATCTTTATCGAGGCAAAATACCTCAGGTTAAAC ATGGCGATATTTTTGGTCATGAATTTATGGGGGAAGTAGTTGAAACCGGAAAGGACGTAAAAAATTTGCAAAAAGGCGAC CGAGTGGTAATTCCGTTCGTCATTGCTTGTGGCGACTGTTTTTTCTGTCGATTGCAACAATATGCCGCCTGCGAAAATAC CAATGCGGGTAAAGGCGCTGCGCTCAATAAAAAACAGATACCAGCTCCAGCGGCATTGTTTGGTTATAGTCACCTGTATG GCGGCGTTCCTGGTGGGCAGGCGGAATATGTCCGCGTCCCTAAAGGGAATGTGGGGCCGTTTAAAGTACCGCCTTTGCTT TCAGATGATAAAGCGCTTTTCCTTTCTGATATTCTGCCAACGGCATGGCAGGCAGCAAAAAATGCGCAGATCCAACAAGG TTCAAGCGTTGCAGTCTATGGTGCTGGTCCTGTGGGATTGTTGACAATCGCCTGTGCACGGTTGCTCGGTGCGGAACAGA TTTTTGTTGTTGATCATCATCCCTACCGCTTGCATTTCGCCGCCGACCGCTACGGCGCGATCCCGATTAATTTTGATGAA GACAGCGATCCGGCACAGTCAATTATTGAACAAACGGCAGGTCACCGGGGCGTGGATGCAGTAATAGACGCCGTCGGTTT TGAAGCGAAAGGCAGCACCACGGAAACGGTGCTGACTAACCTGAAACTGGAGGGCAGCAGCGGTAAAGCGTTGCGTCAGT GTATTGCGGCGGTCAGGCGTGGCGGCATTGTTAGCGTACCGGGCGTCTACGCTGGATTTATTCACGGTTTCCTGTTTGGC GACGCCTTTGATAAAGGGTTGTCGTTTAAAATGGGACAGACCCACGTTCACGCATGGCTGGGAGAATTATTACCGTTAAT TGAGAAAGGATTACTGAAACCAGAAGAAATTGTTACCCACTATATGCCGTTTGAAGAGGCCGGCCGGGGATATGAGATTT TCGAAAAACGTGAAGAGGAGTGCCGTAAGGTGATTCTGGTACCCGGTGCACAAAACGCAGAGGCGGCGCAGAAGGCGGTT TCAGGTCTGGTGAATGCGATGCCGGGGGGAACAATATGA
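The GC content reported for this gene can be recomputed from the sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole gene sequence with whitespace ignored. `gc_content` is a hypothetical helper, not part of any database tooling.

```python
def gc_content(seq: str) -> float:
    """Percentage of G or C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input: two of the four bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

Under this definition, a value of 50.04 for a 1239-base gene corresponds to 620 G or C bases.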
Upstream 100 bases:
>100_bases ACACCAGGAATTCTCCCAAAACCTGCGGTACCGCCCGTTTTCCCGCTGTGATAGCTACCCTTAAAGACTGACTCTTTTTT GAACTGTCTCTGGAGGTTGC
Downstream 100 bases:
>100_bases TCGTCAGGAGTGGTTTTCGAGGTAAAGGACAGCCATGACGATAATCGCCGCCATAATCAGAAATCCTATCAGGATGTAAA ATGCTTCTGCCATGGTTATT
Product: zinc-binding dehydrogenase family oxidoreductase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 412; Mature: 412
Protein sequence:
>412_residues MKALTYHGPHHVQVENVPDPGVEQADDIILRITATAICGSDLHLYRGKIPQVKHGDIFGHEFMGEVVETGKDVKNLQKGD RVVIPFVIACGDCFFCRLQQYAACENTNAGKGAALNKKQIPAPAALFGYSHLYGGVPGGQAEYVRVPKGNVGPFKVPPLL SDDKALFLSDILPTAWQAAKNAQIQQGSSVAVYGAGPVGLLTIACARLLGAEQIFVVDHHPYRLHFAADRYGAIPINFDE DSDPAQSIIEQTAGHRGVDAVIDAVGFEAKGSTTETVLTNLKLEGSSGKALRQCIAAVRRGGIVSVPGVYAGFIHGFLFG DAFDKGLSFKMGQTHVHAWLGELLPLIEKGLLKPEEIVTHYMPFEEAGRGYEIFEKREEECRKVILVPGAQNAEAAQKAV SGLVNAMPGGTI
Sequences:
>Translated_412_residues MKALTYHGPHHVQVENVPDPGVEQADDIILRITATAICGSDLHLYRGKIPQVKHGDIFGHEFMGEVVETGKDVKNLQKGD RVVIPFVIACGDCFFCRLQQYAACENTNAGKGAALNKKQIPAPAALFGYSHLYGGVPGGQAEYVRVPKGNVGPFKVPPLL SDDKALFLSDILPTAWQAAKNAQIQQGSSVAVYGAGPVGLLTIACARLLGAEQIFVVDHHPYRLHFAADRYGAIPINFDE DSDPAQSIIEQTAGHRGVDAVIDAVGFEAKGSTTETVLTNLKLEGSSGKALRQCIAAVRRGGIVSVPGVYAGFIHGFLFG DAFDKGLSFKMGQTHVHAWLGELLPLIEKGLLKPEEIVTHYMPFEEAGRGYEIFEKREEECRKVILVPGAQNAEAAQKAV SGLVNAMPGGTI
>Mature_412_residues MKALTYHGPHHVQVENVPDPGVEQADDIILRITATAICGSDLHLYRGKIPQVKHGDIFGHEFMGEVVETGKDVKNLQKGD RVVIPFVIACGDCFFCRLQQYAACENTNAGKGAALNKKQIPAPAALFGYSHLYGGVPGGQAEYVRVPKGNVGPFKVPPLL SDDKALFLSDILPTAWQAAKNAQIQQGSSVAVYGAGPVGLLTIACARLLGAEQIFVVDHHPYRLHFAADRYGAIPINFDE DSDPAQSIIEQTAGHRGVDAVIDAVGFEAKGSTTETVLTNLKLEGSSGKALRQCIAAVRRGGIVSVPGVYAGFIHGFLFG DAFDKGLSFKMGQTHVHAWLGELLPLIEKGLLKPEEIVTHYMPFEEAGRGYEIFEKREEECRKVILVPGAQNAEAAQKAV SGLVNAMPGGTI
Specific function: Unknown
COG id: COG1063
COG function: function code ER; Threonine dehydrogenase and related Zn-dependent dehydrogenases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the zinc-containing alcohol dehydrogenase family. Class-III subfamily [H]
Homologues:
Organism=Homo sapiens, GI71565154, Length=248, Percent_Identity=30.6451612903226, Blast_Score=91, Evalue=2e-18
Organism=Homo sapiens, GI156523966, Length=388, Percent_Identity=23.1958762886598, Blast_Score=75, Evalue=1e-13
Organism=Homo sapiens, GI4501939, Length=376, Percent_Identity=23.1382978723404, Blast_Score=73, Evalue=4e-13
Organism=Homo sapiens, GI71565152, Length=361, Percent_Identity=24.3767313019391, Blast_Score=70, Evalue=4e-12
Organism=Escherichia coli, GI1786825, Length=412, Percent_Identity=99.5145631067961, Blast_Score=839, Evalue=0.0
Organism=Escherichia coli, GI1790045, Length=400, Percent_Identity=28, Blast_Score=110, Evalue=2e-25
Organism=Escherichia coli, GI1787863, Length=394, Percent_Identity=23.6040609137056, Blast_Score=105, Evalue=5e-24
Organism=Escherichia coli, GI226510992, Length=385, Percent_Identity=25.7142857142857, Blast_Score=96, Evalue=5e-21
Organism=Escherichia coli, GI87082125, Length=261, Percent_Identity=27.9693486590038, Blast_Score=89, Evalue=4e-19
Organism=Escherichia coli, GI1790718, Length=216, Percent_Identity=28.2407407407407, Blast_Score=83, Evalue=4e-17
Organism=Escherichia coli, GI1788407, Length=317, Percent_Identity=25.8675078864353, Blast_Score=75, Evalue=6e-15
Organism=Escherichia coli, GI1788075, Length=381, Percent_Identity=25.4593175853018, Blast_Score=73, Evalue=4e-14
Organism=Escherichia coli, GI1786552, Length=246, Percent_Identity=28.0487804878049, Blast_Score=73, Evalue=4e-14
Organism=Escherichia coli, GI87081918, Length=255, Percent_Identity=27.4509803921569, Blast_Score=67, Evalue=3e-12
Organism=Caenorhabditis elegans, GI25146526, Length=276, Percent_Identity=25.3623188405797, Blast_Score=70, Evalue=3e-12
Organism=Caenorhabditis elegans, GI71997431, Length=276, Percent_Identity=25, Blast_Score=67, Evalue=2e-11
Organism=Caenorhabditis elegans, GI17565904, Length=247, Percent_Identity=26.3157894736842, Blast_Score=65, Evalue=5e-11
Organism=Saccharomyces cerevisiae, GI6320033, Length=405, Percent_Identity=26.6666666666667, Blast_Score=101, Evalue=2e-22
Organism=Saccharomyces cerevisiae, GI6319258, Length=301, Percent_Identity=29.2358803986711, Blast_Score=94, Evalue=3e-20
Organism=Saccharomyces cerevisiae, GI6319257, Length=296, Percent_Identity=29.0540540540541, Blast_Score=87, Evalue=7e-18
Organism=Drosophila melanogaster, GI17737895, Length=209, Percent_Identity=27.7511961722488, Blast_Score=75, Evalue=8e-14
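The homologue entries above share one repeating field layout (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so they can be read into records for sorting or filtering. The sketch below assumes that layout holds for every entry. The pattern and the `parse_homologues` name are illustrative, not part of any database tooling.

```python
import re

# One homologue entry; \s* tolerates spaces or line breaks between fields.
HOMOLOGUE_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list:
    """Return one dict per homologue entry found in text."""
    records = []
    for m in HOMOLOGUE_RE.finditer(text):
        records.append({
            "organism": m.group("organism").strip(),
            "gi": m.group("gi"),
            "length": int(m.group("length")),
            "percent_identity": float(m.group("identity")),
            "blast_score": int(m.group("score")),
            "evalue": float(m.group("evalue")),
        })
    return records


# Two entries copied from the list above.
sample = (
    "Organism=Homo sapiens, GI71565154, Length=248, "
    "Percent_Identity=30.6451612903226, Blast_Score=91, Evalue=2e-18, "
    "Organism=Escherichia coli, GI1786825, Length=412, "
    "Percent_Identity=99.5145631067961, Blast_Score=839, Evalue=0.0,"
)
best = max(parse_homologues(sample), key=lambda r: r["blast_score"])
print(best["organism"], best["gi"])  # Escherichia coli 1786825
```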
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013149
- InterPro: IPR013154
- InterPro: IPR002085
- InterPro: IPR002328
- InterPro: IPR011032
- InterPro: IPR016040 [H]
Pfam domain/function: PF08240 ADH_N; PF00107 ADH_zinc_N [H]
EC number: NA
Molecular weight: Translated: 44189; Mature: 44189
Theoretical pI: Translated: 6.56; Mature: 6.56
Prosite motif: PS00059 ADH_ZINC
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein)
1.2 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.9 %Cys (Mature Protein)
1.2 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
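The Cys/Met percentages can be reproduced by counting residues in the protein sequence. This is a minimal sketch, assuming each value is a residue count divided by the total number of residues, rounded to one decimal place. `cys_met_content` is a hypothetical helper.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys (C), Met (M), and Cys+Met in a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Synthetic 412-residue stand-in with 8 Cys and 5 Met (not the real sequence).
print(cys_met_content("C" * 8 + "M" * 5 + "A" * 399))
# {'Cys': 1.9, 'Met': 1.2, 'Cys+Met': 3.2}
```

For a 412-residue protein, the reported 1.9, 1.2, and 3.2 percent are consistent with 8 Cys and 5 Met residues.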
Secondary structure:
>Translated Secondary Structure
MKALTYHGPHHVQVENVPDPGVEQADDIILRITATAICGSDLHLYRGKIPQVKHGDIFGH
CCCEECCCCCEEEECCCCCCCCCCCCCEEEEEEEHHHCCCCCHHHCCCCCCCCCCCCCCH
EFMGEVVETGKDVKNLQKGDRVVIPFVIACGDCFFCRLQQYAACENTNAGKGAALNKKQI
HHHHHHHHCCHHHHHHHCCCEEEEEHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCC
PAPAALFGYSHLYGGVPGGQAEYVRVPKGNVGPFKVPPLLSDDKALFLSDILPTAWQAAK
CCCHHHHHHHHHHCCCCCCCCCEEECCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHC
NAQIQQGSSVAVYGAGPVGLLTIACARLLGAEQIFVVDHHPYRLHFAADRYGAIPINFDE
CCCCCCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEEECCCEEEEEECCCCCCEEECCCC
DSDPAQSIIEQTAGHRGVDAVIDAVGFEAKGSTTETVLTNLKLEGSSGKALRQCIAAVRR
CCCHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCHHHEEEEEEEECCCCHHHHHHHHHHHC
GGIVSVPGVYAGFIHGFLFGDAFDKGLSFKMGQTHVHAWLGELLPLIEKGLLKPEEIVTH
CCEEECCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHH
YMPFEEAGRGYEIFEKREEECRKVILVPGAQNAEAAQKAVSGLVNAMPGGTI
HCCHHHCCCCHHHHHHHHHHCCEEEECCCCCCHHHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure
MKALTYHGPHHVQVENVPDPGVEQADDIILRITATAICGSDLHLYRGKIPQVKHGDIFGH
CCCEECCCCCEEEECCCCCCCCCCCCCEEEEEEEHHHCCCCCHHHCCCCCCCCCCCCCCH
EFMGEVVETGKDVKNLQKGDRVVIPFVIACGDCFFCRLQQYAACENTNAGKGAALNKKQI
HHHHHHHHCCHHHHHHHCCCEEEEEHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCC
PAPAALFGYSHLYGGVPGGQAEYVRVPKGNVGPFKVPPLLSDDKALFLSDILPTAWQAAK
CCCHHHHHHHHHHCCCCCCCCCEEECCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHC
NAQIQQGSSVAVYGAGPVGLLTIACARLLGAEQIFVVDHHPYRLHFAADRYGAIPINFDE
CCCCCCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEEECCCEEEEEECCCCCCEEECCCC
DSDPAQSIIEQTAGHRGVDAVIDAVGFEAKGSTTETVLTNLKLEGSSGKALRQCIAAVRR
CCCHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCHHHEEEEEEEECCCCHHHHHHHHHHHC
GGIVSVPGVYAGFIHGFLFGDAFDKGLSFKMGQTHVHAWLGELLPLIEKGLLKPEEIVTH
CCEEECCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHH
YMPFEEAGRGYEIFEKREEECRKVILVPGAQNAEAAQKAVSGLVNAMPGGTI
HCCHHHCCCCHHHHHHHHHHCCEEEECCCCCCHHHHHHHHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: Zn [C]
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8905232; 9278503 [H]