| Definition | Bacillus licheniformis ATCC 14580, complete genome. |
|---|---|
| Accession | NC_006322 |
| Length | 4,222,645 |
The map label for this gene is dhaT [H]
Identifier: 52787385
GI number: 52787385
Start: 3529116
End: 3530282
Strand: Reverse
Name: dhaT [H]
Synonym: BLi03701
Alternate gene names: 52787385
Gene position: 3530282-3529116 (Counterclockwise)
Preceding gene: 52787386
Following gene: 52787384
Centisome position: 83.6
GC content: 50.64
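The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, not the database's own method, that recomputes the gene length from Start and End, an approximate centisome position, and the GC percentage of a sequence. It assumes that the centisome position is the gene coordinate expressed as a percentage of the genome length (4,222,645) and that coordinates are 1-based and inclusive. The function names are hypothetical, and the GC example uses only the first 12 bases of the gene sequence, so it does not reproduce the reported 50.64.

```python
GENOME_LENGTH = 4_222_645  # Length field of NC_006322
START = 3_529_116          # Start field
END = 3_530_282            # End field


def gene_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


print(gene_length(START, END))                     # 1167
print(round(centisome(START, GENOME_LENGTH), 1))   # 83.6
print(gc_content("ATGCCAGCGAAT"))                  # 50.0 (first 12 bases only)
```

Under these assumptions the computed length matches the 1167 bases listed for the gene sequence, and the centisome value rounds to 83.6 whether Start or End is used.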
Gene sequence:
>1167_bases ATGCCAGCGAATTATCAATTTCGGACAGCCGGTCATATCGTTGCAGGTGAACATTCCATCCGCCGGCTGAAAGATCATGT GCGAACGATTGTACCGAAGGCGGACAGGGCGCTGATTATTACCCAGCCGTCGATCGTCAAGCTCGGCGCGATCGGAGAGG TACAGGCTCAGCTTACAGAGATCGGCATTCAGTCAGATGTTTGCACAGCCATATTGCCGGAACCGACCGTACAAAATATT GAAGACGTATTTAGAGGCATCCGTGAGAAACGATACGACATTCTCATCGGAATCGGCGGAGGCAGCGTTTTGGACGGAAC GAAAATATTGTCCGTGCTGCAAACGAATTCAAAAAAGGTCGAGGAGCTGCTCGGGACAGATTTGGTCGAAAAGCCGGGAA TCCCGACCGTGCTGATTCCGAGTACTTCGGGGACGGGCGCAGAGGTGACGCCGAACGCGATCGTTACGCTTCCAGAAGAG GAATTGAAAGTCGGCATCGTCAGTCCGCTTCTTTTGCCGAAACTTGTCATTCTTGACCCGGTCATCACCCTTGGATTGCC AAAACCGATTACGGCGGCTACCGGAATGGATGCATTCACTCATTCGCTTGAATCGTTCATTTCGACGAAAGCAAACCCGA TCAGCGATATGTTCGCTTTAGAATCCATCAGATTGATTTCGGCAAGTATTGTGGAAGCATATGAAAACGGTTCATCGATA CAAGCGAGAGAAAACATGCTGCTCGGTTCGACGTATGGCGGAATGGCGCTGACGGCTGCCGGTACTGCTGCCGTTCACGC CCTCGCCTATCCGCTTGGGGGGAAATACCGAATTTCCCACGGTGTCGCCAATTCGATGCTCCTTCCGCATGTGATGGAGT TTAATATGGATGCCATTACAGAACGGCTGTCTCTTGCGGCAGAAACAATGGGAATCGTTGCATCTGATTTGACCGCCGAA CAAGCGGCTGAGGCCGTTGTGCAGAAAATAAGAGAGTGGACGGAGCGGCTGAATATTCCTCAGGATTTAAAAGCGTTCGG CGTAACCGCAAGCGACGTGGACGATTTAGCCGACTCAGCTTCAAAAGTGACGCGCCTGCTTCATAACAATCCGAAACCGC TCAGTCTGGAAAATATAAAAGACATCTATCGAAAACTAATCAACTAA
Upstream 100 bases:
>100_bases GGCCCGCCGAAAGCGCCCGTCAAAGAACTGACCGGTCCGGCGCTTTTGGAGGTCGAAAAAATGGTTGCTTCCTATCAATC AAAATAGGGAGGTTTCAAGA
Downstream 100 bases:
>100_bases AGGATGAAGACATTTGAAAATTGCAATCATCGCGGATGATTTGACGGGAGCGAACGACTGCGGCGGTCAGCTTGTTCATT ACGGGATGGATGTTTCCGTC
Product: hypothetical protein
Products: NA
Alternate protein names: 1,3-propanediol oxidoreductase; 3-hydroxypropionaldehyde reductase [H]
Number of amino acids: Translated: 388; Mature: 387
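The two residue counts can be related to the 1167-base gene sequence with a short sketch. It assumes that the final codon is a stop codon (the gene sequence ends in TAA) and that the mature form differs from the translated form only by removal of the initiator methionine, which is what the two sequences in this record show. The function names are hypothetical.

```python
def translated_residues(cds_bases: int) -> int:
    """Residue count for a coding sequence that includes its stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bases // 3 - 1  # subtract the stop codon


def mature_sequence(protein: str) -> str:
    """Drop a leading initiator Met, if present (assumed processing step)."""
    return protein[1:] if protein.startswith("M") else protein


n_terminus = "MPANYQFRTAGH"  # first 12 residues of the translated protein
print(translated_residues(1167))    # 388
print(mature_sequence(n_terminus))  # PANYQFRTAGH
```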
Protein sequence:
>388_residues MPANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTEIGIQSDVCTAILPEPTVQNI EDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKVEELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEE ELKVGIVSPLLLPKLVILDPVITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAITERLSLAAETMGIVASDLTAE QAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSASKVTRLLHNNPKPLSLENIKDIYRKLIN
Sequences:
>Translated_388_residues MPANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTEIGIQSDVCTAILPEPTVQNI EDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKVEELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEE ELKVGIVSPLLLPKLVILDPVITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAITERLSLAAETMGIVASDLTAE QAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSASKVTRLLHNNPKPLSLENIKDIYRKLIN
>Mature_387_residues PANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTEIGIQSDVCTAILPEPTVQNIE DVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKVEELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEEE LKVGIVSPLLLPKLVILDPVITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSIQ ARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAITERLSLAAETMGIVASDLTAEQ AAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSASKVTRLLHNNPKPLSLENIKDIYRKLIN
Specific function: Most active with substrates containing two primary alcohol groups separated by one or two carbon atoms. In the physiological direction, 3-hydroxypropionaldehyde is the preferred substrate [H]
COG id: COG1454
COG function: function code C; Alcohol dehydrogenase, class IV
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the iron-containing alcohol dehydrogenase family [H]
Homologues:
Organism=Homo sapiens, GI133922590, Length=388, Percent_Identity=24.7422680412371, Blast_Score=114, Evalue=1e-25
Organism=Escherichia coli, GI48994951, Length=349, Percent_Identity=36.3896848137536, Blast_Score=224, Evalue=8e-60
Organism=Escherichia coli, GI1789163, Length=371, Percent_Identity=31.266846361186, Blast_Score=196, Evalue=2e-51
Organism=Escherichia coli, GI87082107, Length=326, Percent_Identity=34.6625766871166, Blast_Score=173, Evalue=2e-44
Organism=Escherichia coli, GI1787493, Length=361, Percent_Identity=28.5318559556787, Blast_Score=140, Evalue=2e-34
Organism=Escherichia coli, GI1789386, Length=358, Percent_Identity=24.5810055865922, Blast_Score=98, Evalue=1e-21
Organism=Caenorhabditis elegans, GI17537053, Length=391, Percent_Identity=26.5984654731458, Blast_Score=161, Evalue=5e-40
Organism=Saccharomyces cerevisiae, GI6321181, Length=371, Percent_Identity=34.2318059299191, Blast_Score=221, Evalue=2e-58
Organism=Drosophila melanogaster, GI24657991, Length=394, Percent_Identity=27.1573604060914, Blast_Score=133, Evalue=2e-31
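The homologue entries follow a regular key=value layout, so they can be turned into records for sorting or filtering. The sketch below is one possible parser, not a tool supplied with this record; the regular expression, the function name `parse_homologues`, and the dictionary keys are assumptions made for illustration. The sample text contains only the first two entries from the list.

```python
import re

# One homologue entry: organism, GI number, length, identity, score, E-value.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\deE.+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Extract every homologue entry found in the text as a dictionary."""
    hits = []
    for match in RECORD.finditer(text):
        hits.append({
            "organism": match["organism"],
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": int(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI133922590, Length=388, "
    "Percent_Identity=24.7422680412371, Blast_Score=114, Evalue=1e-25, "
    "Organism=Escherichia coli, GI48994951, Length=349, "
    "Percent_Identity=36.3896848137536, Blast_Score=224, Evalue=8e-60,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda hit: hit["score"])
print(best["organism"], best["gi"])  # Escherichia coli 48994951
```

Because the pattern matches entry by entry, it works whether the entries are separated by commas on a single line or placed one per line.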
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001670
- InterPro: IPR018211 [H]
Pfam domain/function: PF00465 Fe-ADH [H]
EC number: 1.1.1.202 [H]
Molecular weight: Translated: 41576; Mature: 41444
Theoretical pI: Translated: 5.44; Mature: 5.44
Prosite motif: PS00913 ADH_IRON_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
2.3 %Cys+Met (Mature Protein)
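Percentages of this kind are residue counts divided by protein length. The sketch below shows that calculation on a short toy peptide rather than the full protein; the function name `residue_percent` and the toy sequence are hypothetical. The reported values appear consistent with 1 Cys and 9 Met in the 388-residue translated protein (1/388 is about 0.3 %, 9/388 about 2.3 %), with one fewer Met in the mature form.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(residue) for residue in residues)
    return round(100.0 * hits / len(protein), 1)


toy_peptide = "MACMKLLGAM"  # illustrative only: 10 residues, 1 Cys, 3 Met
print(residue_percent(toy_peptide, "C"))   # 10.0
print(residue_percent(toy_peptide, "M"))   # 30.0
print(residue_percent(toy_peptide, "CM"))  # 40.0
```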
Secondary structure:
>Translated Secondary Structure
MPANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTE
CCCCCCEECCCCEEECHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEECCCHHHHHHHHH
IGIQSDVCTAILPEPTVQNIEDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKV
HCCCCCCHHHHCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCHHHH
EELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEEELKVGIVSPLLLPKLVILDP
HHHHCCCHHCCCCCCEEEEECCCCCCCEECCCCEEECCHHHHHHHHHHHHHHHHHHHHHH
VITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI
HHHHCCCCCCHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC
QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAIT
CHHHCEEEECCCCCEEEEHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHCCHHHHH
ERLSLAAETMGIVASDLTAEQAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHH
SKVTRLLHNNPKPLSLENIKDIYRKLIN
HHHHHHHHCCCCCCCHHHHHHHHHHHCC
>Mature Secondary Structure
PANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTE
CCCCCEECCCCEEECHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEECCCHHHHHHHHH
IGIQSDVCTAILPEPTVQNIEDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKV
HCCCCCCHHHHCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCHHHH
EELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEEELKVGIVSPLLLPKLVILDP
HHHHCCCHHCCCCCCEEEEECCCCCCCEECCCCEEECCHHHHHHHHHHHHHHHHHHHHHH
VITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI
HHHHCCCCCCHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC
QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAIT
CHHHCEEEECCCCCEEEEHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHCCHHHHH
ERLSLAAETMGIVASDLTAEQAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHH
SKVTRLLHNNPKPLSLENIKDIYRKLIN
HHHHHHHHCCCCCCCHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7721705 [H]