| Definition | Clostridium botulinum A2 str. Kyoto chromosome, complete genome. |
|---|---|
| Accession | NC_012563 |
| Length | 4,155,278 |
Click here to switch to the map view.
The map label for this gene is adhE [H]
Identifier: 226947576
GI number: 226947576
Start: 434540
End: 437128
Strand: Direct
Name: adhE [H]
Synonym: CLM_0413
Alternate gene names: 226947576
Gene position: 434540-437128 (Clockwise)
Preceding gene: 226947575
Following gene: 226947577
Centisome position: 10.46
GC content: 32.52
Gene sequence:
>2589_bases ATGAAAATTACCACTACTGAGGAACTAATAAGAAAAATTGAAAAAATCAAAGAGGCACAAAAGATTTATTCCACATATTC TCAAGATAAGGTAGACAAAATATTTAAAGCAGCAGCTATAGCAGCTAACAAAGAAAGAATAAAACTTGCAAAAATGGCAG TAGAAGAAACTGGCATGGGTATTGTAGAAGATAAAGTTATAAAAAATCATTTTGCATCTGAATATATTTATAACAAATAT AAAGATGAAAAAACCTGTGGTGTAATAGAAAAGGATGAAGCCTTTGGGTTAACAAAAATTGCAGAACCAATAGGTGTTAT TGCGGCAATAGTTCCAACAACTAATCCAACCTCAACAGCTATTTTCAAAGCTTTAATAGCTTTAAAAACTAGAAATGGTA TAATTTTTTCACCTCATCCAAGAGCTAAAAAATCTACAATAATGGCAGCTAAGATTGTTTTAGATGCAGCAGTACAAGCA GGGGCACCAAAAGAAATAATAGGTTGGATAGATGAACCTACCTTAGAATTGTCTAATGCAGTAATGAGTAATTCAAATCT AGTTCTTGCAACAGGTGGACCAGGTATGGTTAAAGCGGCATACTCTTCAGGAAAGCCAGCTATAGGAGTAGGACCTGGAA ATGTACCAGCTATAATATACGAAACAGCAGATATTAAAATGGCTGTAAGCTCAGTGGTACTTTCTAAAACTTTTGATAAT GGTATGATTTGTGCATCAGAACAATCAGTAATTGTGATGAATAGCATATATGAAGAAGTAAAAAAAGAATTTGTAATTAG AGGAGCCTATGTATTAAATAAAGAAGAGATAGAAAAAGTTAAGAAAATAATTTTAGTTAATGGAAATGTAAATGCTAAAA TTGTTGGTCAAACACCACAAAAGATAGGTGAAATGGCTGGAGTAAAAGTTCCAGATTGGGCTAAACTTTTAGTTGGAGAA GTTCAATCTGTAGAATTAGAAGAACCATTCTCTCACGAAAAACTATCACCAGTTTTAGCAATGTATAAAGTTAAAACTTA TGAAGAGGCTTTAACTAAAGCTGAAAGATTAGTAGAATTAGGAGGATTTGGACATACATCTTCATTGTATATAAACACAG TAAAATGCAAAGAAGAAGTAGAAAAGTTCTCAAATAATATGAAAACAGGAAGAACAATTATAAATATGCCATCAGCCCAA GGTGGTATAGGTGATATATATAACTTTAAATTAGCACCATCTCTAACTCTTGGTTGCGGATCCTGGGGTGGAAACTCAGT ATCAGAGAATGTAGGACCAAAACACTTATTAAATATCAAAAATGTAGCTGAGAGGAGAGAGAATATGCTTTGGTTTAGAG TACCAGAAAAAGTTTACTTTAAATACGGATGTCTTCCAATAGCTTTAAAAGAATTAAAAAGAATGAATAAGAAAAAGGCA TTTATAGTTACAGATAAAGTATTATATGAATTAGGAGTTGCTAAAAAAGCAACAGATGTGTTAGATGAGATAGGAATAAA TTATAAGGTATTCTTTGATGTAGCGCCAGATCCAACTTTAGAAACAGCAAAAAAAGGTGCAAAAGAAATGGTAGATTTTA ATCCTGATACAATAATAGCTATAGGCGGAGGATCTGCTATGGATGCTGCTAAAATAATGTGGGTAATGTATGAACATCCA GAAGCAGAATTCGAAGATTTAGCTATGAGATTTATGGATATAAGAAAAAGAGTATATGAGTTCCCACATATGGGAGATAA AGCAATGATGATATCAGTAGCTACTTCAGCAGGTACTGGTTCAGAAGTAACTCCTTTTGCTGTTATAACTGATGAAAAAA CAGGTGTGAAATATCCACTAGCAGATTATGAATTAACTCCAGATATGGCTATAGTAGATGCAGATTTAATGCTTAATATG CCAAAGGGATTAACTGCTGCATCAGGAATTGATGCATTAACTCATGCAGTAGAAGCTTATGTATCAGTTATGGCATCAGA GTATACAGATGGACTATGTTTAGAGGCTATAAAAACCATATTTGAATATCTACCAAAAGCTTATAAAGAAGGAGCCCAAG ATATAGAAGCAAGAGAAAAAATGGCTCATGCATCAACTATAGCAGGTATGGCTTTTGCAAATGCCTTCTTAGGAGTATGC CACTCTATGGCTCATAAATTAGGATCTATGCATCATGTACCACATGGTATAGCTAATGCTTTATTAATAAATGAAACAAT TAAATTTAATTCTGAGGATATGCCAAGAAAGCAAACTGCTTTCCCACAATATAAATATCCAAATGCTAAGGCTAAATATG CTAATATAGCAGATTATTTATCATTAGGTGGAAAAACTCCAGAAGAAAAAGTTGAGCTATTAATAAAAGCTATAGATAAA TTAAAAGCAGAAGTAAATATTCCAACCTCTATAGAAGAGGCAGGGATATCAAAGGATAAATTCTTTAAAACTTTAGATGA GATGTCCGAACAAGCTTTTGATGATCAATGTACAGGAGCTAATCCAAGATACCCATTAATAAGTGAAATAAAACAAATGT ATACTAATGTTTTTAGTACTAAAAAATAG
Upstream 100 bases:
>100_bases AGAAGTTATAAGGTATTAGTAATATTAATATAAATTAATACTTTTTATATTTTAGATAAGTTTGTTAAAAAATAAACTAT TATTAAAAGAGGTGTTATTA
Downstream 100 bases:
>100_bases GATAGTTTAATTTATATAGCATAAAATATTATTATATATATTTTATAGAAAGCTAAAATTTTAATTTTACCCAAAACTAA ACCATTTATACAAAAAATAA
Product: bifunctional acetaldehyde-CoA/alcohol dehydrogenase
Products: NA
Alternate protein names: Alcohol dehydrogenase; ADH; Acetaldehyde dehydrogenase [acetylating]; ACDH [H]
Number of amino acids: Translated: 862; Mature: 862
Protein sequence:
>862_residues MKITTTEELIRKIEKIKEAQKIYSTYSQDKVDKIFKAAAIAANKERIKLAKMAVEETGMGIVEDKVIKNHFASEYIYNKY KDEKTCGVIEKDEAFGLTKIAEPIGVIAAIVPTTNPTSTAIFKALIALKTRNGIIFSPHPRAKKSTIMAAKIVLDAAVQA GAPKEIIGWIDEPTLELSNAVMSNSNLVLATGGPGMVKAAYSSGKPAIGVGPGNVPAIIYETADIKMAVSSVVLSKTFDN GMICASEQSVIVMNSIYEEVKKEFVIRGAYVLNKEEIEKVKKIILVNGNVNAKIVGQTPQKIGEMAGVKVPDWAKLLVGE VQSVELEEPFSHEKLSPVLAMYKVKTYEEALTKAERLVELGGFGHTSSLYINTVKCKEEVEKFSNNMKTGRTIINMPSAQ GGIGDIYNFKLAPSLTLGCGSWGGNSVSENVGPKHLLNIKNVAERRENMLWFRVPEKVYFKYGCLPIALKELKRMNKKKA FIVTDKVLYELGVAKKATDVLDEIGINYKVFFDVAPDPTLETAKKGAKEMVDFNPDTIIAIGGGSAMDAAKIMWVMYEHP EAEFEDLAMRFMDIRKRVYEFPHMGDKAMMISVATSAGTGSEVTPFAVITDEKTGVKYPLADYELTPDMAIVDADLMLNM PKGLTAASGIDALTHAVEAYVSVMASEYTDGLCLEAIKTIFEYLPKAYKEGAQDIEAREKMAHASTIAGMAFANAFLGVC HSMAHKLGSMHHVPHGIANALLINETIKFNSEDMPRKQTAFPQYKYPNAKAKYANIADYLSLGGKTPEEKVELLIKAIDK LKAEVNIPTSIEEAGISKDKFFKTLDEMSEQAFDDQCTGANPRYPLISEIKQMYTNVFSTKK
Sequences:
>Translated_862_residues MKITTTEELIRKIEKIKEAQKIYSTYSQDKVDKIFKAAAIAANKERIKLAKMAVEETGMGIVEDKVIKNHFASEYIYNKY KDEKTCGVIEKDEAFGLTKIAEPIGVIAAIVPTTNPTSTAIFKALIALKTRNGIIFSPHPRAKKSTIMAAKIVLDAAVQA GAPKEIIGWIDEPTLELSNAVMSNSNLVLATGGPGMVKAAYSSGKPAIGVGPGNVPAIIYETADIKMAVSSVVLSKTFDN GMICASEQSVIVMNSIYEEVKKEFVIRGAYVLNKEEIEKVKKIILVNGNVNAKIVGQTPQKIGEMAGVKVPDWAKLLVGE VQSVELEEPFSHEKLSPVLAMYKVKTYEEALTKAERLVELGGFGHTSSLYINTVKCKEEVEKFSNNMKTGRTIINMPSAQ GGIGDIYNFKLAPSLTLGCGSWGGNSVSENVGPKHLLNIKNVAERRENMLWFRVPEKVYFKYGCLPIALKELKRMNKKKA FIVTDKVLYELGVAKKATDVLDEIGINYKVFFDVAPDPTLETAKKGAKEMVDFNPDTIIAIGGGSAMDAAKIMWVMYEHP EAEFEDLAMRFMDIRKRVYEFPHMGDKAMMISVATSAGTGSEVTPFAVITDEKTGVKYPLADYELTPDMAIVDADLMLNM PKGLTAASGIDALTHAVEAYVSVMASEYTDGLCLEAIKTIFEYLPKAYKEGAQDIEAREKMAHASTIAGMAFANAFLGVC HSMAHKLGSMHHVPHGIANALLINETIKFNSEDMPRKQTAFPQYKYPNAKAKYANIADYLSLGGKTPEEKVELLIKAIDK LKAEVNIPTSIEEAGISKDKFFKTLDEMSEQAFDDQCTGANPRYPLISEIKQMYTNVFSTKK >Mature_862_residues MKITTTEELIRKIEKIKEAQKIYSTYSQDKVDKIFKAAAIAANKERIKLAKMAVEETGMGIVEDKVIKNHFASEYIYNKY KDEKTCGVIEKDEAFGLTKIAEPIGVIAAIVPTTNPTSTAIFKALIALKTRNGIIFSPHPRAKKSTIMAAKIVLDAAVQA GAPKEIIGWIDEPTLELSNAVMSNSNLVLATGGPGMVKAAYSSGKPAIGVGPGNVPAIIYETADIKMAVSSVVLSKTFDN GMICASEQSVIVMNSIYEEVKKEFVIRGAYVLNKEEIEKVKKIILVNGNVNAKIVGQTPQKIGEMAGVKVPDWAKLLVGE VQSVELEEPFSHEKLSPVLAMYKVKTYEEALTKAERLVELGGFGHTSSLYINTVKCKEEVEKFSNNMKTGRTIINMPSAQ GGIGDIYNFKLAPSLTLGCGSWGGNSVSENVGPKHLLNIKNVAERRENMLWFRVPEKVYFKYGCLPIALKELKRMNKKKA FIVTDKVLYELGVAKKATDVLDEIGINYKVFFDVAPDPTLETAKKGAKEMVDFNPDTIIAIGGGSAMDAAKIMWVMYEHP EAEFEDLAMRFMDIRKRVYEFPHMGDKAMMISVATSAGTGSEVTPFAVITDEKTGVKYPLADYELTPDMAIVDADLMLNM PKGLTAASGIDALTHAVEAYVSVMASEYTDGLCLEAIKTIFEYLPKAYKEGAQDIEAREKMAHASTIAGMAFANAFLGVC HSMAHKLGSMHHVPHGIANALLINETIKFNSEDMPRKQTAFPQYKYPNAKAKYANIADYLSLGGKTPEEKVELLIKAIDK LKAEVNIPTSIEEAGISKDKFFKTLDEMSEQAFDDQCTGANPRYPLISEIKQMYTNVFSTKK
Specific function: This enzyme has probably two activities:ADH, and ACDH [H]
COG id: COG1012
COG function: function code C; NAD-dependent aldehyde dehydrogenases
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: In the C-terminal section; belongs to the iron- containing alcohol dehydrogenase family [H]
Homologues:
Organism=Homo sapiens, GI133922590, Length=327, Percent_Identity=28.4403669724771, Blast_Score=100, Evalue=1e-20, Organism=Escherichia coli, GI1787493, Length=862, Percent_Identity=62.4129930394432, Blast_Score=1109, Evalue=0.0, Organism=Escherichia coli, GI1789163, Length=405, Percent_Identity=33.8271604938272, Blast_Score=229, Evalue=5e-61, Organism=Escherichia coli, GI48994951, Length=406, Percent_Identity=35.4679802955665, Blast_Score=218, Evalue=9e-58, Organism=Escherichia coli, GI87082107, Length=403, Percent_Identity=31.5136476426799, Blast_Score=200, Evalue=3e-52, Organism=Escherichia coli, GI1788797, Length=375, Percent_Identity=32.2666666666667, Blast_Score=164, Evalue=2e-41, Organism=Escherichia coli, GI1789386, Length=405, Percent_Identity=23.9506172839506, Blast_Score=77, Evalue=5e-15, Organism=Caenorhabditis elegans, GI17537053, Length=328, Percent_Identity=25.9146341463415, Blast_Score=108, Evalue=8e-24, Organism=Saccharomyces cerevisiae, GI6321181, Length=401, Percent_Identity=33.6658354114713, Blast_Score=216, Evalue=1e-56, Organism=Saccharomyces cerevisiae, GI6323821, Length=371, Percent_Identity=23.1805929919137, Blast_Score=67, Evalue=1e-11, Organism=Drosophila melanogaster, GI24657991, Length=324, Percent_Identity=26.8518518518519, Blast_Score=88, Evalue=2e-17,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001670 - InterPro: IPR018211 - InterPro: IPR016161 - InterPro: IPR016163 - InterPro: IPR016162 - InterPro: IPR015590 - InterPro: IPR012079 [H]
Pfam domain/function: PF00171 Aldedh; PF00465 Fe-ADH [H]
EC number: =1.1.1.1; =1.2.1.10 [H]
Molecular weight: Translated: 94649; Mature: 94649
Theoretical pI: Translated: 7.19; Mature: 7.19
Prosite motif: PS00913 ADH_IRON_1 ; PS00060 ADH_IRON_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein) 4.1 %Met (Translated Protein) 5.0 %Cys+Met (Translated Protein) 0.9 %Cys (Mature Protein) 4.1 %Met (Mature Protein) 5.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKITTTEELIRKIEKIKEAQKIYSTYSQDKVDKIFKAAAIAANKERIKLAKMAVEETGMG CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCC IVEDKVIKNHFASEYIYNKYKDEKTCGVIEKDEAFGLTKIAEPIGVIAAIVPTTNPTSTA HHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCHHHHHHHHHHHEEEECCCCCHHHH IFKALIALKTRNGIIFSPHPRAKKSTIMAAKIVLDAAVQAGAPKEIIGWIDEPTLELSNA HHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHH VMSNSNLVLATGGPGMVKAAYSSGKPAIGVGPGNVPAIIYETADIKMAVSSVVLSKTFDN HCCCCCEEEEECCCCCEEEECCCCCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHCCC GMICASEQSVIVMNSIYEEVKKEFVIRGAYVLNKEEIEKVKKIILVNGNVNAKIVGQTPQ CEEEECCCCEEEHHHHHHHHHHHHHHHHHEEECHHHHHHHHEEEEECCCCCEEEECCCHH KIGEMAGVKVPDWAKLLVGEVQSVELEEPFSHEKLSPVLAMYKVKTYEEALTKAERLVEL HHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH GGFGHTSSLYINTVKCKEEVEKFSNNMKTGRTIINMPSAQGGIGDIYNFKLAPSLTLGCG CCCCCCCCEEEEEHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCEEEEECCCEEEECC SWGGNSVSENVGPKHLLNIKNVAERRENMLWFRVPEKVYFKYGCLPIALKELKRMNKKKA CCCCCCCCCCCCHHHHHHHHHHHHHHHCEEEEECCHHHHHHHCCHHHHHHHHHHHCCCCE FIVTDKVLYELGVAKKATDVLDEIGINYKVFFDVAPDPTLETAKKGAKEMVDFNPDTIIA EEEEHHHHHHHCCHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEE IGGGSAMDAAKIMWVMYEHPEAEFEDLAMRFMDIRKRVYEFPHMGDKAMMISVATSAGTG ECCCCHHHHHHEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCC SEVTPFAVITDEKTGVKYPLADYELTPDMAIVDADLMLNMPKGLTAASGIDALTHAVEAY CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEHHHHEECCCCCCHHHHHHHHHHHHHHH VSVMASEYTDGLCLEAIKTIFEYLPKAYKEGAQDIEAREKMAHASTIAGMAFANAFLGVC HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH HSMAHKLGSMHHVPHGIANALLINETIKFNSEDMPRKQTAFPQYKYPNAKAKYANIADYL HHHHHHHCCCCCCCHHHHHHHHEECCCCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHH SLGGKTPEEKVELLIKAIDKLKAEVNIPTSIEEAGISKDKFFKTLDEMSEQAFDDQCTGA HCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCCCCC NPRYPLISEIKQMYTNVFSTKK CCCCHHHHHHHHHHHHHHCCCC >Mature Secondary Structure MKITTTEELIRKIEKIKEAQKIYSTYSQDKVDKIFKAAAIAANKERIKLAKMAVEETGMG CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCC IVEDKVIKNHFASEYIYNKYKDEKTCGVIEKDEAFGLTKIAEPIGVIAAIVPTTNPTSTA HHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCHHHHHHHHHHHEEEECCCCCHHHH IFKALIALKTRNGIIFSPHPRAKKSTIMAAKIVLDAAVQAGAPKEIIGWIDEPTLELSNA HHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHH VMSNSNLVLATGGPGMVKAAYSSGKPAIGVGPGNVPAIIYETADIKMAVSSVVLSKTFDN HCCCCCEEEEECCCCCEEEECCCCCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHCCC GMICASEQSVIVMNSIYEEVKKEFVIRGAYVLNKEEIEKVKKIILVNGNVNAKIVGQTPQ CEEEECCCCEEEHHHHHHHHHHHHHHHHHEEECHHHHHHHHEEEEECCCCCEEEECCCHH KIGEMAGVKVPDWAKLLVGEVQSVELEEPFSHEKLSPVLAMYKVKTYEEALTKAERLVEL HHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH GGFGHTSSLYINTVKCKEEVEKFSNNMKTGRTIINMPSAQGGIGDIYNFKLAPSLTLGCG CCCCCCCCEEEEEHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCEEEEECCCEEEECC SWGGNSVSENVGPKHLLNIKNVAERRENMLWFRVPEKVYFKYGCLPIALKELKRMNKKKA CCCCCCCCCCCCHHHHHHHHHHHHHHHCEEEEECCHHHHHHHCCHHHHHHHHHHHCCCCE FIVTDKVLYELGVAKKATDVLDEIGINYKVFFDVAPDPTLETAKKGAKEMVDFNPDTIIA EEEEHHHHHHHCCHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEE IGGGSAMDAAKIMWVMYEHPEAEFEDLAMRFMDIRKRVYEFPHMGDKAMMISVATSAGTG ECCCCHHHHHHEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCC SEVTPFAVITDEKTGVKYPLADYELTPDMAIVDADLMLNMPKGLTAASGIDALTHAVEAY CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEHHHHEECCCCCCHHHHHHHHHHHHHHH VSVMASEYTDGLCLEAIKTIFEYLPKAYKEGAQDIEAREKMAHASTIAGMAFANAFLGVC HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH HSMAHKLGSMHHVPHGIANALLINETIKFNSEDMPRKQTAFPQYKYPNAKAKYANIADYL HHHHHHHCCCCCCCHHHHHHHHEECCCCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHH SLGGKTPEEKVELLIKAIDKLKAEVNIPTSIEEAGISKDKFFKTLDEMSEQAFDDQCTGA HCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCCCCC NPRYPLISEIKQMYTNVFSTKK CCCCHHHHHHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8226639; 8300540; 11466286 [H]