The gene/protein map for NC_009800 is currently unavailable.
Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is yjgI [H]

Identifier: 157163726

GI number: 157163726

Start: 4509834

End: 4510547

Strand: Reverse

Name: yjgI [H]

Synonym: EcHS_A4505

Alternate gene names: 157163726

Gene position: 4510547-4509834 (Counterclockwise)

Preceding gene: 157163729

Following gene: 157163725

Centisome position: 97.14

GC content: 54.9

Gene sequence:

>714_bases
ATGGGCGCTTTTACAGGTAAGACAGTTCTCATCCTCGGTGGTAGCCGTGGAATCGGTGCCGCTATCGTACGTCGTTTCGT
CACCGATGGGGCCAATGTACGATTCACCTATGCGGGGTCGAAAGATGCCGCTGAACGCCTGGCACAAGAGACGGGAGCGA
CAGCAGTATTCACAGATAGTGCTGACAGAGACGCTGTTATTGATGTCGTTCGTAAGAGCGGCGCATTGGATATCCTCGTG
GTAAATGCAGGTATTGGCGTCTTTGGCGATGCCCTGGAATTAAATGCCGACGATATTGATCGCCTTTTCAAAATCAATAT
TCATGCTCCTTACCATGCCTCCGTTGAAGCCGCCCGGCAGATGCCCGAAGGCGGGCGCATCTTAATCATCGGCTCCGTGA
ATGGCGATCGTATGCCTGTTGCAGGCATGGCTGCTTATGCCGCCAGCAAATCTGCCCTGCAAGGCATGGCGCGCGGGCTG
GCCCGTGATTTTGGACCGCGTGGGATCACCATCAACGTCGTCCAGCCAGGGCCAATTGATACCGACGCTAATCCCGCCAA
CGGGCCAATGCGCGATATGTTGCATGGTTTTATGGCTATCAAAAGACATGGGCAACCGGAAGAGGTCGCTGGTATGGTCG
CATGGTTAGCAGGGCCAGAAGCCAGCTTTGTTACCGGCGCGATGCATACCATTGATGGCGCGTTTGGCGCATAA

Upstream 100 bases:

>100_bases
TGGAACGCGAGATTGTTTTTTAGTGACCATTACAAAACTTGTTGACAGAAAGTTAAAACAGTTTTGTAATGCATGTTACA
TAATAAATCAAGGAGTCCTT

Downstream 100 bases:

>100_bases
CCGACTACGCTCAATTAAGCCCAGCCATTTCCCATGATGTCTGGGTTTTGTTTACTCACGTCGTCCGCTAAAAGCGGCTC
CTGGTAAATATAAATCTTCT

Product: oxidoreductase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 237; Mature: 236

Protein sequence:

>237_residues
MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDSADRDAVIDVVRKSGALDILV
VNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGL
ARDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA

Sequences:

>Translated_237_residues
MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDSADRDAVIDVVRKSGALDILV
VNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGL
ARDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA
>Mature_236_residues
GAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDSADRDAVIDVVRKSGALDILVV
NAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLA
RDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the short-chain dehydrogenases/reductases (SDR) family [H]

Homologues:

Organism=Homo sapiens, GI32483357, Length=244, Percent_Identity=28.6885245901639, Blast_Score=103, Evalue=1e-22,
Organism=Homo sapiens, GI5031737, Length=241, Percent_Identity=26.5560165975104, Blast_Score=96, Evalue=4e-20,
Organism=Homo sapiens, GI126723750, Length=240, Percent_Identity=27.5, Blast_Score=77, Evalue=1e-14,
Organism=Homo sapiens, GI33667109, Length=194, Percent_Identity=26.2886597938144, Blast_Score=74, Evalue=1e-13,
Organism=Homo sapiens, GI126723191, Length=183, Percent_Identity=28.4153005464481, Blast_Score=72, Evalue=5e-13,
Organism=Homo sapiens, GI7705925, Length=241, Percent_Identity=25.3112033195021, Blast_Score=71, Evalue=8e-13,
Organism=Homo sapiens, GI40254992, Length=236, Percent_Identity=28.3898305084746, Blast_Score=69, Evalue=3e-12,
Organism=Homo sapiens, GI66933014, Length=245, Percent_Identity=26.530612244898, Blast_Score=69, Evalue=3e-12,
Organism=Homo sapiens, GI19923817, Length=252, Percent_Identity=26.1904761904762, Blast_Score=68, Evalue=7e-12,
Organism=Homo sapiens, GI223718074, Length=215, Percent_Identity=29.7674418604651, Blast_Score=65, Evalue=7e-11,
Organism=Escherichia coli, GI2367365, Length=237, Percent_Identity=98.3122362869198, Blast_Score=464, Evalue=1e-132,
Organism=Escherichia coli, GI87082100, Length=254, Percent_Identity=34.251968503937, Blast_Score=115, Evalue=2e-27,
Organism=Escherichia coli, GI1787335, Length=241, Percent_Identity=36.5145228215768, Blast_Score=111, Evalue=4e-26,
Organism=Escherichia coli, GI1789378, Length=246, Percent_Identity=31.7073170731707, Blast_Score=110, Evalue=7e-26,
Organism=Escherichia coli, GI1788459, Length=249, Percent_Identity=26.9076305220884, Blast_Score=92, Evalue=4e-20,
Organism=Escherichia coli, GI2367175, Length=242, Percent_Identity=32.2314049586777, Blast_Score=89, Evalue=2e-19,
Organism=Escherichia coli, GI87082160, Length=250, Percent_Identity=27.6, Blast_Score=88, Evalue=4e-19,
Organism=Escherichia coli, GI1786812, Length=248, Percent_Identity=29.8387096774194, Blast_Score=82, Evalue=4e-17,
Organism=Escherichia coli, GI1787905, Length=241, Percent_Identity=31.9502074688797, Blast_Score=81, Evalue=7e-17,
Organism=Escherichia coli, GI1790717, Length=245, Percent_Identity=27.3469387755102, Blast_Score=72, Evalue=4e-14,
Organism=Escherichia coli, GI1789208, Length=176, Percent_Identity=29.5454545454545, Blast_Score=68, Evalue=6e-13,
Organism=Escherichia coli, GI1786701, Length=189, Percent_Identity=31.2169312169312, Blast_Score=63, Evalue=2e-11,
Organism=Caenorhabditis elegans, GI17560676, Length=248, Percent_Identity=28.2258064516129, Blast_Score=97, Evalue=5e-21,
Organism=Caenorhabditis elegans, GI17561402, Length=254, Percent_Identity=30.3149606299213, Blast_Score=97, Evalue=5e-21,
Organism=Caenorhabditis elegans, GI115534694, Length=245, Percent_Identity=29.3877551020408, Blast_Score=84, Evalue=5e-17,
Organism=Caenorhabditis elegans, GI17555706, Length=244, Percent_Identity=29.5081967213115, Blast_Score=80, Evalue=9e-16,
Organism=Caenorhabditis elegans, GI71994600, Length=247, Percent_Identity=27.9352226720648, Blast_Score=78, Evalue=3e-15,
Organism=Caenorhabditis elegans, GI17560150, Length=262, Percent_Identity=28.2442748091603, Blast_Score=78, Evalue=4e-15,
Organism=Caenorhabditis elegans, GI17562908, Length=265, Percent_Identity=29.0566037735849, Blast_Score=77, Evalue=8e-15,
Organism=Caenorhabditis elegans, GI17563726, Length=251, Percent_Identity=28.6852589641434, Blast_Score=76, Evalue=1e-14,
Organism=Caenorhabditis elegans, GI17562906, Length=261, Percent_Identity=28.3524904214559, Blast_Score=75, Evalue=4e-14,
Organism=Caenorhabditis elegans, GI17562990, Length=260, Percent_Identity=28.4615384615385, Blast_Score=74, Evalue=6e-14,
Organism=Caenorhabditis elegans, GI25147288, Length=242, Percent_Identity=27.2727272727273, Blast_Score=74, Evalue=6e-14,
Organism=Caenorhabditis elegans, GI17562904, Length=250, Percent_Identity=28, Blast_Score=72, Evalue=2e-13,
Organism=Caenorhabditis elegans, GI17536651, Length=254, Percent_Identity=29.1338582677165, Blast_Score=72, Evalue=2e-13,
Organism=Caenorhabditis elegans, GI71994604, Length=192, Percent_Identity=27.6041666666667, Blast_Score=72, Evalue=3e-13,
Organism=Caenorhabditis elegans, GI193204405, Length=264, Percent_Identity=28.4090909090909, Blast_Score=71, Evalue=4e-13,
Organism=Caenorhabditis elegans, GI72000259, Length=260, Percent_Identity=26.5384615384615, Blast_Score=70, Evalue=7e-13,
Organism=Caenorhabditis elegans, GI17560332, Length=273, Percent_Identity=27.4725274725275, Blast_Score=70, Evalue=8e-13,
Organism=Caenorhabditis elegans, GI17562910, Length=270, Percent_Identity=25.5555555555556, Blast_Score=69, Evalue=2e-12,
Organism=Caenorhabditis elegans, GI17531453, Length=262, Percent_Identity=26.7175572519084, Blast_Score=69, Evalue=2e-12,
Organism=Caenorhabditis elegans, GI17508651, Length=245, Percent_Identity=25.7142857142857, Blast_Score=68, Evalue=4e-12,
Organism=Caenorhabditis elegans, GI17544670, Length=195, Percent_Identity=28.7179487179487, Blast_Score=67, Evalue=7e-12,
Organism=Caenorhabditis elegans, GI17538486, Length=266, Percent_Identity=28.5714285714286, Blast_Score=64, Evalue=5e-11,
Organism=Caenorhabditis elegans, GI17565030, Length=271, Percent_Identity=27.6752767527675, Blast_Score=64, Evalue=1e-10,
Organism=Drosophila melanogaster, GI24644339, Length=247, Percent_Identity=33.6032388663968, Blast_Score=100, Evalue=1e-21,
Organism=Drosophila melanogaster, GI21355319, Length=242, Percent_Identity=30.1652892561983, Blast_Score=94, Evalue=6e-20,
Organism=Drosophila melanogaster, GI23397609, Length=243, Percent_Identity=30.8641975308642, Blast_Score=93, Evalue=1e-19,
Organism=Drosophila melanogaster, GI24639444, Length=248, Percent_Identity=30.241935483871, Blast_Score=89, Evalue=2e-18,
Organism=Drosophila melanogaster, GI21357041, Length=253, Percent_Identity=30.4347826086957, Blast_Score=89, Evalue=2e-18,
Organism=Drosophila melanogaster, GI24643142, Length=244, Percent_Identity=28.6885245901639, Blast_Score=89, Evalue=3e-18,
Organism=Drosophila melanogaster, GI28571526, Length=250, Percent_Identity=29.2, Blast_Score=89, Evalue=3e-18,
Organism=Drosophila melanogaster, GI24644337, Length=171, Percent_Identity=32.7485380116959, Blast_Score=77, Evalue=1e-14,
Organism=Drosophila melanogaster, GI17737361, Length=255, Percent_Identity=28.6274509803922, Blast_Score=65, Evalue=4e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002198
- InterPro:   IPR002347
- InterPro:   IPR016040
- InterPro:   IPR020904 [H]

Pfam domain/function: PF00106 adh_short [H]

EC number: 1.-.-.- [C]

Molecular weight: Translated: 24589; Mature: 24458

Theoretical pI: Translated: 6.36; Mature: 6.36

Prosite motif: PS00061 ADH_SHORT

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
4.2 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDS
CCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHCCEEEEECC
ADRDAVIDVVRKSGALDILVVNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQ
CCCHHHHHHHHCCCCEEEEEEECCCCCCCCEEECCHHHCCEEEEEEECCCCCCHHHHHHC
MPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLARDFGPRGITINVVQPGPID
CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCC
TDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA
CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCC
>Mature Secondary Structure 
GAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAERLAQETGATAVFTDS
CCCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHCCEEEEECC
ADRDAVIDVVRKSGALDILVVNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQ
CCCHHHHHHHHCCCCEEEEEEECCCCCCCCEEECCHHHCCEEEEEEECCCCCCHHHHHHC
MPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLARDFGPRGITINVVQPGPID
CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCC
TDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA
CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7610040; 9278503 [H]