Definition Mycobacterium sp. MCS chromosome, complete genome.
Accession NC_008146
Length 5,705,448

Click here to switch to the map view.

The map label for this gene is adhE [H]

Identifier: 108797177

GI number: 108797177

Start: 215365

End: 216942

Strand: Reverse

Name: adhE [H]

Synonym: Mmcs_0197

Alternate gene names: 108797177

Gene position: 216942-215365 (Counterclockwise)

Preceding gene: 108797178

Following gene: 108797176

Centisome position: 3.8

GC content: 71.04

Gene sequence:

>1578_bases
ATGAGCGACATCCCGGCAGGGCGAGCGGAGCGACGGGAGATTGCGGCAGCCGGCCACATGCTCGAGCGCGCCCGGTGGGC
GGCACGGGCATATGCCGACTACGACCAGGCCGCCGTTTCGGCCATCACCACCGCGGTCGCCGACGCCGCGTACGGCGCGG
CCGACCGGTTCGCCGCCGAGGCCGTCGCCGAGACAGGGATGGGCGTCGTCGCCGACAAGGTGCTCAAGAACCAGGCCTGC
TCCCGCGGCATCCTCGACTACTACCGCGAACAGGACTTCGTGTCCCCGCGGGTCGACCCGGACAGCAAGATCGTCGAGAT
CCCGCGGCCCGCCGGCGTGGTGCTGGCGCTGACCCCGACGACCAACCCCGTCTCCACCGTGTACTTCAAGGTGCTGCTCG
CCCTGATGACGCGCAACGCCGTCGTCGTCGCACCCCATCCGCGGGCCAAGCGGTGCTCGGCCGACGCCGCCCGACTGCTC
GCCGACGCCGCGATCGCCGCGGGCGCGCCCGACGGGATCGTCCAGGTGGTCGAGGAACCGTCGATCCCACTGGTCGAGGC
GTTGATGGCCGATGAGCGCACCGACGTCATCGTCGCCACCGGCGGCACCGGTGTGGTGCGGGCCGCGTACTCGTCGGGCA
CCCCGGCACTCGGTGTCGGCCCGGGCAACGTGCCCGTACTCGTCGACGCCAGCGCCGACATCACCGCGGCGGCCAAGCGC
ATCGTCGACAGCAAGGCGTTCGACAACTCGGTGCTGTGCACCAACGAGTCGGTGTTGATCGTCGAGGACTCGGTCGCCGA
TGCGCTGCGTTCGGCGATGACCCGCGCCGGTGCGCACATCCTCGACGCCGATGCCACAGAACGCCTGCGCGCCTACATGT
TCGCCGACGGTCACCTCAACACCGATGTGGTCGGGCGCGACGCCGCGTGGATCGCCGGACAGGCCGGGATCCGGGTGACA
CCGAAGACGCGCGTGCTCGTCGCCCCGTTCGACCACGTGATCAGCGAGGAGATGCTCGCCCACGAGAAGCTCTCGCCGGT
GATCGGGATGACGACGGTGCCCGACGCCGCGCGCGGCATCCGCGCCGCCCGCGCGGTGGTGCGCATCGGCGGCGCCGGCC
ACTCGGCGGCCATCCACAGCGAAAACGCGTCTGTCATCACCGAATTCGCCACCCAAGTGCCGGTGCTGCGGGTATCGGTC
AACGTCGGCAACAGCACCGGCAGCTCGGGCCTGGAGACCAACCTGGCACCGTCGATGACGATCGGCACCGGCTTCGTCGG
CCGCAGCTCCATCGGTGAGAACCTGCGCCCCGACAACCTGATGAACTGGGCCCGCATCGCCTACAACAGCGCGCCCGGGG
TGGCCATGCCGAGCTTCGCGGGCATCGACCCGTGGCGGTCACCGGCCGGACCGGTGCCCGAATATCCGCGCGCCTCCAAC
GATCGCGGCGCACCGCCGGTGTCCCCGTCACGCAGCGCCCCGGCGGCACGCCGCGCGGCCGACCCGAGCATCGAGGCCCT
GCGGGCCGAACTGCGGGCGCTGGTCGTCGAAGAGCTCGCACAACTGATCAAGAGGTGA

Upstream 100 bases:

>100_bases
AACCGGGTCTCCTGCTCGGCCGCGACCTGTGCGAGGACGTGCTGGACCGGCTGGAGGTCGCGGTGGGCCGGGCGAAGACT
GCAGCCAGGAAGGGACGGGC

Downstream 100 bases:

>100_bases
CCCGTGGCTGAACTGCGTTCCTTCATCTTCATCGATCGGCTTCAGCCGCAGACGATGTCGTACCTGGGCACCTGGATCAA
GGGCGCCCTGCCGCGGGCGA

Product: aldehyde dehydrogenase

Products: NA

Alternate protein names: Alcohol dehydrogenase; ADH; Acetaldehyde dehydrogenase [acetylating]; ACDH [H]

Number of amino acids: Translated: 525; Mature: 524

Protein sequence:

>525_residues
MSDIPAGRAERREIAAAGHMLERARWAARAYADYDQAAVSAITTAVADAAYGAADRFAAEAVAETGMGVVADKVLKNQAC
SRGILDYYREQDFVSPRVDPDSKIVEIPRPAGVVLALTPTTNPVSTVYFKVLLALMTRNAVVVAPHPRAKRCSADAARLL
ADAAIAAGAPDGIVQVVEEPSIPLVEALMADERTDVIVATGGTGVVRAAYSSGTPALGVGPGNVPVLVDASADITAAAKR
IVDSKAFDNSVLCTNESVLIVEDSVADALRSAMTRAGAHILDADATERLRAYMFADGHLNTDVVGRDAAWIAGQAGIRVT
PKTRVLVAPFDHVISEEMLAHEKLSPVIGMTTVPDAARGIRAARAVVRIGGAGHSAAIHSENASVITEFATQVPVLRVSV
NVGNSTGSSGLETNLAPSMTIGTGFVGRSSIGENLRPDNLMNWARIAYNSAPGVAMPSFAGIDPWRSPAGPVPEYPRASN
DRGAPPVSPSRSAPAARRAADPSIEALRAELRALVVEELAQLIKR

Sequences:

>Translated_525_residues
MSDIPAGRAERREIAAAGHMLERARWAARAYADYDQAAVSAITTAVADAAYGAADRFAAEAVAETGMGVVADKVLKNQAC
SRGILDYYREQDFVSPRVDPDSKIVEIPRPAGVVLALTPTTNPVSTVYFKVLLALMTRNAVVVAPHPRAKRCSADAARLL
ADAAIAAGAPDGIVQVVEEPSIPLVEALMADERTDVIVATGGTGVVRAAYSSGTPALGVGPGNVPVLVDASADITAAAKR
IVDSKAFDNSVLCTNESVLIVEDSVADALRSAMTRAGAHILDADATERLRAYMFADGHLNTDVVGRDAAWIAGQAGIRVT
PKTRVLVAPFDHVISEEMLAHEKLSPVIGMTTVPDAARGIRAARAVVRIGGAGHSAAIHSENASVITEFATQVPVLRVSV
NVGNSTGSSGLETNLAPSMTIGTGFVGRSSIGENLRPDNLMNWARIAYNSAPGVAMPSFAGIDPWRSPAGPVPEYPRASN
DRGAPPVSPSRSAPAARRAADPSIEALRAELRALVVEELAQLIKR
>Mature_524_residues
SDIPAGRAERREIAAAGHMLERARWAARAYADYDQAAVSAITTAVADAAYGAADRFAAEAVAETGMGVVADKVLKNQACS
RGILDYYREQDFVSPRVDPDSKIVEIPRPAGVVLALTPTTNPVSTVYFKVLLALMTRNAVVVAPHPRAKRCSADAARLLA
DAAIAAGAPDGIVQVVEEPSIPLVEALMADERTDVIVATGGTGVVRAAYSSGTPALGVGPGNVPVLVDASADITAAAKRI
VDSKAFDNSVLCTNESVLIVEDSVADALRSAMTRAGAHILDADATERLRAYMFADGHLNTDVVGRDAAWIAGQAGIRVTP
KTRVLVAPFDHVISEEMLAHEKLSPVIGMTTVPDAARGIRAARAVVRIGGAGHSAAIHSENASVITEFATQVPVLRVSVN
VGNSTGSSGLETNLAPSMTIGTGFVGRSSIGENLRPDNLMNWARIAYNSAPGVAMPSFAGIDPWRSPAGPVPEYPRASND
RGAPPVSPSRSAPAARRAADPSIEALRAELRALVVEELAQLIKR

Specific function: This enzyme has probably two activities:ADH, and ACDH [H]

COG id: COG1012

COG function: function code C; NAD-dependent aldehyde dehydrogenases

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: In the C-terminal section; belongs to the iron- containing alcohol dehydrogenase family [H]

Homologues:

Organism=Escherichia coli, GI1787493, Length=440, Percent_Identity=38.6363636363636, Blast_Score=281, Evalue=1e-76,
Organism=Escherichia coli, GI1788797, Length=378, Percent_Identity=36.5079365079365, Blast_Score=175, Evalue=8e-45,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001670
- InterPro:   IPR018211
- InterPro:   IPR016161
- InterPro:   IPR016163
- InterPro:   IPR016162
- InterPro:   IPR015590
- InterPro:   IPR012079 [H]

Pfam domain/function: PF00171 Aldedh; PF00465 Fe-ADH [H]

EC number: =1.1.1.1; =1.2.1.10 [H]

Molecular weight: Translated: 54789; Mature: 54657

Theoretical pI: Translated: 6.13; Mature: 6.13

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSDIPAGRAERREIAAAGHMLERARWAARAYADYDQAAVSAITTAVADAAYGAADRFAAE
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
AVAETGMGVVADKVLKNQACSRGILDYYREQDFVSPRVDPDSKIVEIPRPAGVVLALTPT
HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEECCC
TNPVSTVYFKVLLALMTRNAVVVAPHPRAKRCSADAARLLADAAIAAGAPDGIVQVVEEP
CCHHHHHHHHHHHHHHHCCCEEECCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCC
SIPLVEALMADERTDVIVATGGTGVVRAAYSSGTPALGVGPGNVPVLVDASADITAAAKR
CCHHHHHHHCCCCCCEEEEECCCCEEEEEECCCCCEEECCCCCCCEEEECCCCHHHHHHH
IVDSKAFDNSVLCTNESVLIVEDSVADALRSAMTRAGAHILDADATERLRAYMFADGHLN
HHHHHCCCCCEEECCCCEEEEEHHHHHHHHHHHHHCCCEEECCCHHHHHHHHEEECCCCC
TDVVGRDAAWIAGQAGIRVTPKTRVLVAPFDHVISEEMLAHEKLSPVIGMTTVPDAARGI
CCEECCCCCEEECCCCEEECCCCEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH
RAARAVVRIGGAGHSAAIHSENASVITEFATQVPVLRVSVNVGNSTGSSGLETNLAPSMT
HHHHHHHEECCCCCCCEEECCCHHHHHHHHHHCCEEEEEEEECCCCCCCCCCCCCCCCEE
IGTGFVGRSSIGENLRPDNLMNWARIAYNSAPGVAMPSFAGIDPWRSPAGPVPEYPRASN
ECCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
DRGAPPVSPSRSAPAARRAADPSIEALRAELRALVVEELAQLIKR
CCCCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
SDIPAGRAERREIAAAGHMLERARWAARAYADYDQAAVSAITTAVADAAYGAADRFAAE
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
AVAETGMGVVADKVLKNQACSRGILDYYREQDFVSPRVDPDSKIVEIPRPAGVVLALTPT
HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEECCC
TNPVSTVYFKVLLALMTRNAVVVAPHPRAKRCSADAARLLADAAIAAGAPDGIVQVVEEP
CCHHHHHHHHHHHHHHHCCCEEECCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCC
SIPLVEALMADERTDVIVATGGTGVVRAAYSSGTPALGVGPGNVPVLVDASADITAAAKR
CCHHHHHHHCCCCCCEEEEECCCCEEEEEECCCCCEEECCCCCCCEEEECCCCHHHHHHH
IVDSKAFDNSVLCTNESVLIVEDSVADALRSAMTRAGAHILDADATERLRAYMFADGHLN
HHHHHCCCCCEEECCCCEEEEEHHHHHHHHHHHHHCCCEEECCCHHHHHHHHEEECCCCC
TDVVGRDAAWIAGQAGIRVTPKTRVLVAPFDHVISEEMLAHEKLSPVIGMTTVPDAARGI
CCEECCCCCEEECCCCEEECCCCEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH
RAARAVVRIGGAGHSAAIHSENASVITEFATQVPVLRVSVNVGNSTGSSGLETNLAPSMT
HHHHHHHEECCCCCCCEEECCCHHHHHHHHHHCCEEEEEEEECCCCCCCCCCCCCCCCEE
IGTGFVGRSSIGENLRPDNLMNWARIAYNSAPGVAMPSFAGIDPWRSPAGPVPEYPRASN
ECCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
DRGAPPVSPSRSAPAARRAADPSIEALRAELRALVVEELAQLIKR
CCCCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8226639; 8300540; 11466286 [H]