Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is yjgB [H]

Identifier: 157163744

GI number: 157163744

Start: 4531319

End: 4532338

Strand: Reverse

Name: yjgB [H]

Synonym: EcHS_A4525

Alternate gene names: 157163744

Gene position: 4532338-4531319 (Counterclockwise)

Preceding gene: 157163746

Following gene: 157163742

Centisome position: 97.61
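
The coordinate fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the first base of the gene expressed as a percentage of the genome length; the record does not state the formula, so treat that definition as a guess that happens to reproduce the listed value. The function names are illustrative only.

```python
# Values copied from the Length, Start and End fields of this record.
GENOME_LENGTH = 4_643_538
START, END = 4_531_319, 4_532_338  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive 1-based interval."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
# On the reverse strand the first base of the gene is the higher coordinate.
first_base = END
position_pct = centisome(first_base, GENOME_LENGTH)
print(length, position_pct)
```

Under these assumptions the script gives 1020 bases and 97.61, matching the sequence header and the centisome field. 1020 bases correspond to 340 codons, that is 339 residues plus the stop codon.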

GC content: 54.31

Gene sequence:

>1020_bases
ATGTCAATGATAAAAAGCTATGCCGCAAAAGAAGCGGGCGGCGAACTGGAAGTTTATGAGTACGATCCCGGTGAGCTGAA
GCCACAAGATGTTGAAGTGCAGGTGGATTACTGCGGGATCTGCCATTCCGATCTGTCGATGATCGATAACGAATGGGGAT
TTTCACAATATCCGCTGGTTGCCGGGCATGAGGTGATTGGTCGCGTGGTGGCGCTCGGGAGTGCCGCGCAGGATAAAGGT
TTGCAGGTCGGTCAGCGTGTCGGGATTGGCTGGACAGCGCGTAGCTGTGGTCACTGCGACGCCTGTATTAGCGGAAATCA
GATCAACTGTGAGCAAGGTGCGGTGCCAACAATTATGAATCGCGGAGGTTTTGCCGAGAAGTTGCGTGTAGACTGGCAAT
GGGTTATTCCACTGCCGGAAAATATCGACATTGAATCTGCCGGGCCGCTGTTGTGCGGCGGTATCACGGTCTTTAAACCA
CTGTTGATGCACCATATCACTGCTACCAGCCGCGTTGGGGTAATTGGTATTGGCGGGCTGGGGCATATCGCTATAAAACT
TCTGCACGCAATGGGATGTGAGGTGACGGCCTTTAGTTCTAATCCGGCGAAAGAGCAGGAATTGCTGGCGATGGGTGCCG
ATAAAGTGGTGAATAGCCGCGATCCGCAGGCACTGAAAGCACTGGCGGGGCAGTTTGATCTCATTATCAATACCGTGAAC
GTCAGCCTCGACTGGCAGCCTTATTTTGAGGCGCTGACGTACGGCGGTAATTTCCACACTGTCGGTGCGGTTCTCACGCC
GCTGTCTGTTCCGGCCTTTACGTTAATTGCGGGCGATCGCAGCGTCTCTGGCTCTGCTACCGGCACGCCTTATGAACTGC
GTAAGCTGATGCGCTTTGCCGCCCGCAGCAAGGTTGCGCCGACAACCGAACTGTTCCCGATGTCGAAAATTAACGACGCC
ATCCAGCATGTGCGCGACGGTAAGGCGCGTTACCGCGTGGTGTTGAAAGCCGATTTTTGA
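
The GC content field can be recomputed from the sequence block above. This is a minimal sketch that assumes GC content means the percentage of G and C bases over the whole gene sequence; `gc_percent` is a hypothetical helper, not a function of any tool named in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 24 bases of the gene sequence, used only as a short demonstration.
first_bases = "ATGTCAATGATAAAAAGCTATGCC"
example = gc_percent(first_bases)
print(example)
```

Passing the full 1020-base sequence (with or without its line breaks) instead of the short prefix gives the figure to compare with the GC content field.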

Upstream 100 bases:

>100_bases
TCAGCATTGCGTACAGCGATGTGTAACCTTTGTCACACTCCATGCACCCCGCCCTGCCATGCTCTACACTTCCCAGACCA
CACCAGAGAAGGACCAAAAA

Downstream 100 bases:

>100_bases
AAATCATTCGCAGCGCTGATCTGAGGCGCTGCCCTCTTTCGCACATATTCTGTTTTGTCGTATCGCCAGCACAGCCTGCC
GACATTGTTCGGTGACATTG

Product: zinc-binding dehydrogenase family oxidoreductase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 339; Mature: 338

Protein sequence:

>339_residues
MSMIKSYAAKEAGGELEVYEYDPGELKPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKG
LQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRVDWQWVIPLPENIDIESAGPLLCGGITVFKP
LLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQELLAMGADKVVNSRDPQALKALAGQFDLIINTVN
VSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDA
IQHVRDGKARYRVVLKADF
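
The protein sequence follows from the gene sequence by codon translation. The sketch below assumes the standard codon assignments (which the bacterial genetic code shares) and assumes that the mature form is simply the translated form without its initiator methionine, as the 339 and 338 residue counts suggest. `translate` is an illustrative helper.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence listed in this record.
n_terminus = translate("ATGTCAATGATAAAAAGCTATGCC")
c_terminus = translate("GCCGATTTTTGA")
mature_n_terminus = n_terminus[1:]  # drop the initiator Met (assumption)
print(n_terminus, c_terminus, mature_n_terminus)
```

The two fragments translate to MSMIKSYA and ADF, which agree with the start and end of the protein sequence above.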

Sequences:

>Translated_339_residues
MSMIKSYAAKEAGGELEVYEYDPGELKPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKG
LQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRVDWQWVIPLPENIDIESAGPLLCGGITVFKP
LLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQELLAMGADKVVNSRDPQALKALAGQFDLIINTVN
VSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDA
IQHVRDGKARYRVVLKADF
>Mature_338_residues
SMIKSYAAKEAGGELEVYEYDPGELKPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKGL
QVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRVDWQWVIPLPENIDIESAGPLLCGGITVFKPL
LMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQELLAMGADKVVNSRDPQALKALAGQFDLIINTVNV
SLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDAI
QHVRDGKARYRVVLKADF

Specific function: Unknown

COG id: COG1064

COG function: function code R; Zn-dependent alcohol dehydrogenases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the zinc-containing alcohol dehydrogenase family [H]

Homologues:

Organism=Escherichia coli, GI87082401, Length=339, Percent_Identity=99.1150442477876, Blast_Score=699, Evalue=0.0,
Organism=Escherichia coli, GI1786518, Length=346, Percent_Identity=35.2601156069364, Blast_Score=192, Evalue=3e-50,
Organism=Escherichia coli, GI87081918, Length=318, Percent_Identity=28.6163522012579, Blast_Score=117, Evalue=1e-27,
Organism=Escherichia coli, GI1788075, Length=342, Percent_Identity=24.2690058479532, Blast_Score=71, Evalue=1e-13,
Organism=Escherichia coli, GI226510992, Length=285, Percent_Identity=25.2631578947368, Blast_Score=67, Evalue=2e-12,
Organism=Escherichia coli, GI1790718, Length=325, Percent_Identity=26.1538461538462, Blast_Score=66, Evalue=3e-12,
Organism=Escherichia coli, GI1787863, Length=129, Percent_Identity=33.3333333333333, Blast_Score=62, Evalue=6e-11,
Organism=Caenorhabditis elegans, GI71988145, Length=333, Percent_Identity=28.8288288288288, Blast_Score=114, Evalue=6e-26,
Organism=Caenorhabditis elegans, GI17562584, Length=341, Percent_Identity=28.7390029325513, Blast_Score=111, Evalue=5e-25,
Organism=Caenorhabditis elegans, GI17562582, Length=331, Percent_Identity=26.5861027190332, Blast_Score=87, Evalue=9e-18,
Organism=Saccharomyces cerevisiae, GI6323980, Length=330, Percent_Identity=31.8181818181818, Blast_Score=159, Evalue=8e-40,
Organism=Saccharomyces cerevisiae, GI6319949, Length=339, Percent_Identity=32.7433628318584, Blast_Score=155, Evalue=6e-39,
Organism=Saccharomyces cerevisiae, GI6323961, Length=337, Percent_Identity=29.673590504451, Blast_Score=129, Evalue=6e-31,
Organism=Saccharomyces cerevisiae, GI6319621, Length=344, Percent_Identity=29.9418604651163, Blast_Score=125, Evalue=7e-30,
Organism=Saccharomyces cerevisiae, GI6324486, Length=340, Percent_Identity=27.6470588235294, Blast_Score=120, Evalue=3e-28,
Organism=Saccharomyces cerevisiae, GI6323729, Length=336, Percent_Identity=27.0833333333333, Blast_Score=115, Evalue=1e-26,
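
The homologue lines above share one comma-separated layout, so they can be read into typed records for sorting or filtering. The following is a sketch that assumes every line follows exactly the layout shown here; `Hit` and `parse_hit` are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: int
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<pid>[0-9.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[^,\s]+)"
)


def parse_hit(line: str) -> Hit:
    """Parse one homologue line; raise ValueError if the layout differs."""
    match = HIT_PATTERN.search(line)
    if match is None:
        raise ValueError(f"unrecognised homologue line: {line!r}")
    return Hit(
        organism=match["organism"].strip(),
        gi=int(match["gi"]),
        length=int(match["length"]),
        percent_identity=float(match["pid"]),
        blast_score=int(match["score"]),
        evalue=float(match["evalue"]),
    )


hit = parse_hit(
    "Organism=Escherichia coli, GI1786518, Length=346, "
    "Percent_Identity=35.2601156069364, Blast_Score=192, Evalue=3e-50,"
)
print(hit.organism, hit.gi, round(hit.percent_identity, 1), hit.evalue)
```

Rounding the identity on output, as in the last line, avoids carrying the 13 decimal places that the raw lines report.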

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013149
- InterPro:   IPR013154
- InterPro:   IPR002085
- InterPro:   IPR002328
- InterPro:   IPR006140
- InterPro:   IPR011032
- InterPro:   IPR016040 [H]

Pfam domain/function: PF08240 ADH_N; PF00107 ADH_zinc_N [H]

EC number: NA

Molecular weight: Translated: 36516; Mature: 36385

Theoretical pI: Translated: 6.23; Mature: 6.23

Prosite motif: PS00059 ADH_ZINC; PS00065 D_2_HYDROXYACID_DH_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.4 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
2.4 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
4.7 %Cys+Met (Mature Protein)
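
The percentages above are residue counts divided by sequence length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in the listing; `residue_percent` is an illustrative helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(residue) for residue in residues)
    return round(100.0 * hits / len(seq), 1)


# Residues 36-55 of the translated protein, used as a short demonstration.
fragment = "DYCGICHSDLSMIDNEWGFS"
cys = residue_percent(fragment, "C")
met = residue_percent(fragment, "M")
both = residue_percent(fragment, "CM")
print(cys, met, both)
```

For the full sequences, a hand count gives 8 Cys and 9 Met in the 339-residue translated form (2.4, 2.7 and 5.0 percent) and 8 Cys and 8 Met in the 338-residue mature form (2.4, 2.4 and 4.7 percent), consistent with the listed values.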

Secondary structure:

>Translated Secondary Structure
MSMIKSYAAKEAGGELEVYEYDPGELKPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLV
CCHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEEEEEECCCHHHHHHHCCCCCCCCCCEE
AGHEVIGRVVALGSAAQDKGLQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMN
HHHHHHHHHHHHCCCCCCCCCHHHHHCCCCEECCCCCCCHHHCCCCCCCCCCCCCCHHHC
RGGFAEKLRVDWQWVIPLPENIDIESAGPLLCGGITVFKPLLMHHITATSRVGVIGIGGL
CCCCCCEEEECEEEEEECCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHCCCCEEEECCH
GHIAIKLLHAMGCEVTAFSSNPAKEQELLAMGADKVVNSRDPQALKALAGQFDLIINTVN
HHHHHHHHHHHCCEEEEECCCCCHHHHHHHHCHHHHCCCCCHHHHHHHHCCEEEEEEEEE
VSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFA
EEECCCHHHHHHHCCCCEEEHHHHHCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHH
ARSKVAPTTELFPMSKINDAIQHVRDGKARYRVVLKADF
HHCCCCCCHHCCCHHHHHHHHHHHHCCCEEEEEEEEECC
>Mature Secondary Structure 
SMIKSYAAKEAGGELEVYEYDPGELKPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLV
CHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEEEEEECCCHHHHHHHCCCCCCCCCCEE
AGHEVIGRVVALGSAAQDKGLQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMN
HHHHHHHHHHHHCCCCCCCCCHHHHHCCCCEECCCCCCCHHHCCCCCCCCCCCCCCHHHC
RGGFAEKLRVDWQWVIPLPENIDIESAGPLLCGGITVFKPLLMHHITATSRVGVIGIGGL
CCCCCCEEEECEEEEEECCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHCCCCEEEECCH
GHIAIKLLHAMGCEVTAFSSNPAKEQELLAMGADKVVNSRDPQALKALAGQFDLIINTVN
HHHHHHHHHHHCCEEEEECCCCCHHHHHHHHCHHHHCCCCCHHHHHHHHCCEEEEEEEEE
VSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFA
EEECCCHHHHHHHCCCCEEEHHHHHCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHH
ARSKVAPTTELFPMSKINDAIQHVRDGKARYRVVLKADF
HHCCCCCCHHCCCHHHHHHHHHHHHCCCEEEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: Zn [C]

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Oxidoreductases; Acting on the CH-OH group of donors; With NAD or NADP as acceptor [C]

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7610040; 9278503 [H]