| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is yjgI [H]
Identifier: 157163726
GI number: 157163726
Start: 4509834
End: 4510547
Strand: Reverse
Name: yjgI [H]
Synonym: EcHS_A4505
Alternate gene names: 157163726
Gene position: 4510547-4509834 (Counterclockwise)
Preceding gene: 157163729
Following gene: 157163725
Centisome position: 97.14
GC content: 54.9
Gene sequence:
>714_bases ATGGGCGCTTTTACAGGTAAGACAGTTCTCATCCTCGGTGGTAGCCGTGGAATCGGTGCCGCTATCGTACGTCGTTTCGT CACCGATGGGGCCAATGTACGATTCACCTATGCGGGGTCGAAAGATGCCGCTGAACGCCTGGCACAAGAGACGGGAGCGA CAGCAGTATTCACAGATAGTGCTGACAGAGACGCTGTTATTGATGTCGTTCGTAAGAGCGGCGCATTGGATATCCTCGTG GTAAATGCAGGTATTGGCGTCTTTGGCGATGCCCTGGAATTAAATGCCGACGATATTGATCGCCTTTTCAAAATCAATAT TCATGCTCCTTACCATGCCTCCGTTGAAGCCGCCCGGCAGATGCCCGAAGGCGGGCGCATCTTAATCATCGGCTCCGTGA ATGGCGATCGTATGCCTGTTGCAGGCATGGCTGCTTATGCCGCCAGCAAATCTGCCCTGCAAGGCATGGCGCGCGGGCTG GCCCGTGATTTTGGACCGCGTGGGATCACCATCAACGTCGTCCAGCCAGGGCCAATTGATACCGACGCTAATCCCGCCAA CGGGCCAATGCGCGATATGTTGCATGGTTTTATGGCTATCAAAAGACATGGGCAACCGGAAGAGGTCGCTGGTATGGTCG CATGGTTAGCAGGGCCAGAAGCCAGCTTTGTTACCGGCGCGATGCATACCATTGATGGCGCGTTTGGCGCATAA
Upstream 100 bases:
>100_bases TGGAACGCGAGATTGTTTTTTAGTGACCATTACAAAACTTGTTGACAGAAAGTTAAAACAGTTTTGTAATGCATGTTACA TAATAAATCAAGGAGTCCTT
Downstream 100 bases:
>100_bases CCGACTACGCTCAATTAAGCCCAGCCATTTCCCATGATGTCTGGGTTTTGTTTACTCACGTCGTCCGCTAAAAGCGGCTC CTGGTAAATATAAATCTTCT
Product: oxidoreductase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 237; Mature: 236
Protein sequence:
>237_residues MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDSADRDAVIDVVRKSGALDILV VNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGL ARDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA
Sequences:
>Translated_237_residues MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDSADRDAVIDVVRKSGALDILV VNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGL ARDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA >Mature_236_residues GAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDSADRDAVIDVVRKSGALDILVV NAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLA RDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the short-chain dehydrogenases/reductases (SDR) family [H]
Homologues:
Organism=Homo sapiens, GI32483357, Length=244, Percent_Identity=28.6885245901639, Blast_Score=103, Evalue=1e-22, Organism=Homo sapiens, GI5031737, Length=241, Percent_Identity=26.5560165975104, Blast_Score=96, Evalue=4e-20, Organism=Homo sapiens, GI126723750, Length=240, Percent_Identity=27.5, Blast_Score=77, Evalue=1e-14, Organism=Homo sapiens, GI33667109, Length=194, Percent_Identity=26.2886597938144, Blast_Score=74, Evalue=1e-13, Organism=Homo sapiens, GI126723191, Length=183, Percent_Identity=28.4153005464481, Blast_Score=72, Evalue=5e-13, Organism=Homo sapiens, GI7705925, Length=241, Percent_Identity=25.3112033195021, Blast_Score=71, Evalue=8e-13, Organism=Homo sapiens, GI40254992, Length=236, Percent_Identity=28.3898305084746, Blast_Score=69, Evalue=3e-12, Organism=Homo sapiens, GI66933014, Length=245, Percent_Identity=26.530612244898, Blast_Score=69, Evalue=3e-12, Organism=Homo sapiens, GI19923817, Length=252, Percent_Identity=26.1904761904762, Blast_Score=68, Evalue=7e-12, Organism=Homo sapiens, GI223718074, Length=215, Percent_Identity=29.7674418604651, Blast_Score=65, Evalue=7e-11, Organism=Escherichia coli, GI2367365, Length=237, Percent_Identity=98.3122362869198, Blast_Score=464, Evalue=1e-132, Organism=Escherichia coli, GI87082100, Length=254, Percent_Identity=34.251968503937, Blast_Score=115, Evalue=2e-27, Organism=Escherichia coli, GI1787335, Length=241, Percent_Identity=36.5145228215768, Blast_Score=111, Evalue=4e-26, Organism=Escherichia coli, GI1789378, Length=246, Percent_Identity=31.7073170731707, Blast_Score=110, Evalue=7e-26, Organism=Escherichia coli, GI1788459, Length=249, Percent_Identity=26.9076305220884, Blast_Score=92, Evalue=4e-20, Organism=Escherichia coli, GI2367175, Length=242, Percent_Identity=32.2314049586777, Blast_Score=89, Evalue=2e-19, Organism=Escherichia coli, GI87082160, Length=250, Percent_Identity=27.6, Blast_Score=88, Evalue=4e-19, Organism=Escherichia coli, GI1786812, Length=248, Percent_Identity=29.8387096774194, Blast_Score=82, Evalue=4e-17, Organism=Escherichia coli, GI1787905, Length=241, Percent_Identity=31.9502074688797, Blast_Score=81, Evalue=7e-17, Organism=Escherichia coli, GI1790717, Length=245, Percent_Identity=27.3469387755102, Blast_Score=72, Evalue=4e-14, Organism=Escherichia coli, GI1789208, Length=176, Percent_Identity=29.5454545454545, Blast_Score=68, Evalue=6e-13, Organism=Escherichia coli, GI1786701, Length=189, Percent_Identity=31.2169312169312, Blast_Score=63, Evalue=2e-11, Organism=Caenorhabditis elegans, GI17560676, Length=248, Percent_Identity=28.2258064516129, Blast_Score=97, Evalue=5e-21, Organism=Caenorhabditis elegans, GI17561402, Length=254, Percent_Identity=30.3149606299213, Blast_Score=97, Evalue=5e-21, Organism=Caenorhabditis elegans, GI115534694, Length=245, Percent_Identity=29.3877551020408, Blast_Score=84, Evalue=5e-17, Organism=Caenorhabditis elegans, GI17555706, Length=244, Percent_Identity=29.5081967213115, Blast_Score=80, Evalue=9e-16, Organism=Caenorhabditis elegans, GI71994600, Length=247, Percent_Identity=27.9352226720648, Blast_Score=78, Evalue=3e-15, Organism=Caenorhabditis elegans, GI17560150, Length=262, Percent_Identity=28.2442748091603, Blast_Score=78, Evalue=4e-15, Organism=Caenorhabditis elegans, GI17562908, Length=265, Percent_Identity=29.0566037735849, Blast_Score=77, Evalue=8e-15, Organism=Caenorhabditis elegans, GI17563726, Length=251, Percent_Identity=28.6852589641434, Blast_Score=76, Evalue=1e-14, Organism=Caenorhabditis elegans, GI17562906, Length=261, Percent_Identity=28.3524904214559, Blast_Score=75, Evalue=4e-14, Organism=Caenorhabditis elegans, GI17562990, Length=260, Percent_Identity=28.4615384615385, Blast_Score=74, Evalue=6e-14, Organism=Caenorhabditis elegans, GI25147288, Length=242, Percent_Identity=27.2727272727273, Blast_Score=74, Evalue=6e-14, Organism=Caenorhabditis elegans, GI17562904, Length=250, Percent_Identity=28, Blast_Score=72, Evalue=2e-13, Organism=Caenorhabditis elegans, GI17536651, Length=254, Percent_Identity=29.1338582677165, Blast_Score=72, Evalue=2e-13, Organism=Caenorhabditis elegans, GI71994604, Length=192, Percent_Identity=27.6041666666667, Blast_Score=72, Evalue=3e-13, Organism=Caenorhabditis elegans, GI193204405, Length=264, Percent_Identity=28.4090909090909, Blast_Score=71, Evalue=4e-13, Organism=Caenorhabditis elegans, GI72000259, Length=260, Percent_Identity=26.5384615384615, Blast_Score=70, Evalue=7e-13, Organism=Caenorhabditis elegans, GI17560332, Length=273, Percent_Identity=27.4725274725275, Blast_Score=70, Evalue=8e-13, Organism=Caenorhabditis elegans, GI17562910, Length=270, Percent_Identity=25.5555555555556, Blast_Score=69, Evalue=2e-12, Organism=Caenorhabditis elegans, GI17531453, Length=262, Percent_Identity=26.7175572519084, Blast_Score=69, Evalue=2e-12, Organism=Caenorhabditis elegans, GI17508651, Length=245, Percent_Identity=25.7142857142857, Blast_Score=68, Evalue=4e-12, Organism=Caenorhabditis elegans, GI17544670, Length=195, Percent_Identity=28.7179487179487, Blast_Score=67, Evalue=7e-12, Organism=Caenorhabditis elegans, GI17538486, Length=266, Percent_Identity=28.5714285714286, Blast_Score=64, Evalue=5e-11, Organism=Caenorhabditis elegans, GI17565030, Length=271, Percent_Identity=27.6752767527675, Blast_Score=64, Evalue=1e-10, Organism=Drosophila melanogaster, GI24644339, Length=247, Percent_Identity=33.6032388663968, Blast_Score=100, Evalue=1e-21, Organism=Drosophila melanogaster, GI21355319, Length=242, Percent_Identity=30.1652892561983, Blast_Score=94, Evalue=6e-20, Organism=Drosophila melanogaster, GI23397609, Length=243, Percent_Identity=30.8641975308642, Blast_Score=93, Evalue=1e-19, Organism=Drosophila melanogaster, GI24639444, Length=248, Percent_Identity=30.241935483871, Blast_Score=89, Evalue=2e-18, Organism=Drosophila melanogaster, GI21357041, Length=253, Percent_Identity=30.4347826086957, Blast_Score=89, Evalue=2e-18, Organism=Drosophila melanogaster, GI24643142, Length=244, Percent_Identity=28.6885245901639, Blast_Score=89, Evalue=3e-18, Organism=Drosophila melanogaster, GI28571526, Length=250, Percent_Identity=29.2, Blast_Score=89, Evalue=3e-18, Organism=Drosophila melanogaster, GI24644337, Length=171, Percent_Identity=32.7485380116959, Blast_Score=77, Evalue=1e-14, Organism=Drosophila melanogaster, GI17737361, Length=255, Percent_Identity=28.6274509803922, Blast_Score=65, Evalue=4e-11,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002198 - InterPro: IPR002347 - InterPro: IPR016040 - InterPro: IPR020904 [H]
Pfam domain/function: PF00106 adh_short [H]
EC number: 1.-.-.- [C]
Molecular weight: Translated: 24589; Mature: 24458
Theoretical pI: Translated: 6.36; Mature: 6.36
Prosite motif: PS00061 ADH_SHORT
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 4.2 %Met (Translated Protein) 4.2 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 3.8 %Met (Mature Protein) 3.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDS CCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHCCEEEEECC ADRDAVIDVVRKSGALDILVVNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQ CCCHHHHHHHHCCCCEEEEEEECCCCCCCCEEECCHHHCCEEEEEEECCCCCCHHHHHHC MPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLARDFGPRGITINVVQPGPID CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCC TDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCC >Mature Secondary Structure GAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDS CCCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHCCEEEEECC ADRDAVIDVVRKSGALDILVVNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQ CCCHHHHHHHHCCCCEEEEEEECCCCCCCCEEECCHHHCCEEEEEEECCCCCCHHHHHHC MPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLARDFGPRGITINVVQPGPID CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCC TDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7610040; 9278503 [H]