Definition Bacillus licheniformis ATCC 14580, complete genome.
Accession NC_006322
Length 4,222,645

The map label for this gene is dhaT [H]

Identifier: 52787385

GI number: 52787385

Start: 3529116

End: 3530282

Strand: Reverse

Name: dhaT [H]

Synonym: BLi03701

Alternate gene names: 52787385

Gene position: 3530282-3529116 (Counterclockwise)

Preceding gene: 52787386

Following gene: 52787384

Centisome position: 83.6
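
The gene length and centisome position in this record follow from the coordinates by simple arithmetic. A minimal Python sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the start coordinate expressed as a percentage of the genome length (the record does not state the convention; the function names are illustrative only):

```python
# Values copied from the record above.
GENOME_LENGTH = 4_222_645
START = 3_529_116
END = 3_530_282

def gene_length(start, end):
    """Length in bases for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1

def centisome(position, genome_length):
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length

print(gene_length(START, END))                     # 1167
print(round(centisome(START, GENOME_LENGTH), 1))   # 83.6
```

Using the end coordinate instead of the start gives the same value after rounding to one decimal place, so the record's figure does not distinguish the two conventions.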

GC content: 50.64

Gene sequence:

>1167_bases
ATGCCAGCGAATTATCAATTTCGGACAGCCGGTCATATCGTTGCAGGTGAACATTCCATCCGCCGGCTGAAAGATCATGT
GCGAACGATTGTACCGAAGGCGGACAGGGCGCTGATTATTACCCAGCCGTCGATCGTCAAGCTCGGCGCGATCGGAGAGG
TACAGGCTCAGCTTACAGAGATCGGCATTCAGTCAGATGTTTGCACAGCCATATTGCCGGAACCGACCGTACAAAATATT
GAAGACGTATTTAGAGGCATCCGTGAGAAACGATACGACATTCTCATCGGAATCGGCGGAGGCAGCGTTTTGGACGGAAC
GAAAATATTGTCCGTGCTGCAAACGAATTCAAAAAAGGTCGAGGAGCTGCTCGGGACAGATTTGGTCGAAAAGCCGGGAA
TCCCGACCGTGCTGATTCCGAGTACTTCGGGGACGGGCGCAGAGGTGACGCCGAACGCGATCGTTACGCTTCCAGAAGAG
GAATTGAAAGTCGGCATCGTCAGTCCGCTTCTTTTGCCGAAACTTGTCATTCTTGACCCGGTCATCACCCTTGGATTGCC
AAAACCGATTACGGCGGCTACCGGAATGGATGCATTCACTCATTCGCTTGAATCGTTCATTTCGACGAAAGCAAACCCGA
TCAGCGATATGTTCGCTTTAGAATCCATCAGATTGATTTCGGCAAGTATTGTGGAAGCATATGAAAACGGTTCATCGATA
CAAGCGAGAGAAAACATGCTGCTCGGTTCGACGTATGGCGGAATGGCGCTGACGGCTGCCGGTACTGCTGCCGTTCACGC
CCTCGCCTATCCGCTTGGGGGGAAATACCGAATTTCCCACGGTGTCGCCAATTCGATGCTCCTTCCGCATGTGATGGAGT
TTAATATGGATGCCATTACAGAACGGCTGTCTCTTGCGGCAGAAACAATGGGAATCGTTGCATCTGATTTGACCGCCGAA
CAAGCGGCTGAGGCCGTTGTGCAGAAAATAAGAGAGTGGACGGAGCGGCTGAATATTCCTCAGGATTTAAAAGCGTTCGG
CGTAACCGCAAGCGACGTGGACGATTTAGCCGACTCAGCTTCAAAAGTGACGCGCCTGCTTCATAACAATCCGAAACCGC
TCAGTCTGGAAAATATAAAAGACATCTATCGAAAACTAATCAACTAA
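
The GC content field can be recomputed from a FASTA-style block such as the one above. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the whole sequence. A toy record is used here instead of the full gene.

```python
def fasta_body(text):
    """Join the sequence lines of one FASTA-style record, skipping '>' headers."""
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )

def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Toy input (hypothetical), laid out like the record's sequence blocks.
toy = ">8_bases\nGGCC\nAATT\n"
print(gc_content(fasta_body(toy)))  # 50.0
```

Pasting the 1167-base block into `fasta_body` and passing the result to `gc_content` should allow the reported value to be checked.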

Upstream 100 bases:

>100_bases
GGCCCGCCGAAAGCGCCCGTCAAAGAACTGACCGGTCCGGCGCTTTTGGAGGTCGAAAAAATGGTTGCTTCCTATCAATC
AAAATAGGGAGGTTTCAAGA

Downstream 100 bases:

>100_bases
AGGATGAAGACATTTGAAAATTGCAATCATCGCGGATGATTTGACGGGAGCGAACGACTGCGGCGGTCAGCTTGTTCATT
ACGGGATGGATGTTTCCGTC

Product: hypothetical protein

Products: NA

Alternate protein names: 1,3-propanediol oxidoreductase; 3-hydroxypropionaldehyde reductase [H]

Number of amino acids: Translated: 388; Mature: 387

Protein sequence:

>388_residues
MPANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTEIGIQSDVCTAILPEPTVQNI
EDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKVEELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEE
ELKVGIVSPLLLPKLVILDPVITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI
QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAITERLSLAAETMGIVASDLTAE
QAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSASKVTRLLHNNPKPLSLENIKDIYRKLIN
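
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard genetic code and a hypothetical `translate` helper; it is applied only to the first 30 and last 15 bases of the gene as listed in this record.

```python
# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGCCAGCGAATTATCAATTTCGGACAGCC"))  # MPANYQFRTA
print(translate("AAACTAATCAACTAA"))                 # KLIN

# 1167 bases give 389 codons; dropping the stop codon leaves 388 residues.
print(1167 // 3 - 1)                                # 388
```

The mature form listed in the record is the same sequence without the initial methionine, which accounts for the 387-residue count.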

Sequences:

>Translated_388_residues
MPANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTEIGIQSDVCTAILPEPTVQNI
EDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKVEELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEE
ELKVGIVSPLLLPKLVILDPVITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI
QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAITERLSLAAETMGIVASDLTAE
QAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSASKVTRLLHNNPKPLSLENIKDIYRKLIN
>Mature_387_residues
PANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTEIGIQSDVCTAILPEPTVQNIE
DVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKVEELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEEE
LKVGIVSPLLLPKLVILDPVITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSIQ
ARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAITERLSLAAETMGIVASDLTAEQ
AAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSASKVTRLLHNNPKPLSLENIKDIYRKLIN

Specific function: Most active with substrates containing two primary alcohol groups separated by one or two carbon atoms. In the physiological direction, 3-hydroxypropionaldehyde is the preferred substrate [H]

COG id: COG1454

COG function: function code C; Alcohol dehydrogenase, class IV

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the iron-containing alcohol dehydrogenase family [H]

Homologues:

Organism=Homo sapiens, GI133922590, Length=388, Percent_Identity=24.7422680412371, Blast_Score=114, Evalue=1e-25,
Organism=Escherichia coli, GI48994951, Length=349, Percent_Identity=36.3896848137536, Blast_Score=224, Evalue=8e-60,
Organism=Escherichia coli, GI1789163, Length=371, Percent_Identity=31.266846361186, Blast_Score=196, Evalue=2e-51,
Organism=Escherichia coli, GI87082107, Length=326, Percent_Identity=34.6625766871166, Blast_Score=173, Evalue=2e-44,
Organism=Escherichia coli, GI1787493, Length=361, Percent_Identity=28.5318559556787, Blast_Score=140, Evalue=2e-34,
Organism=Escherichia coli, GI1789386, Length=358, Percent_Identity=24.5810055865922, Blast_Score=98, Evalue=1e-21,
Organism=Caenorhabditis elegans, GI17537053, Length=391, Percent_Identity=26.5984654731458, Blast_Score=161, Evalue=5e-40,
Organism=Saccharomyces cerevisiae, GI6321181, Length=371, Percent_Identity=34.2318059299191, Blast_Score=221, Evalue=2e-58,
Organism=Drosophila melanogaster, GI24657991, Length=394, Percent_Identity=27.1573604060914, Blast_Score=133, Evalue=2e-31,
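
Each homologue line is a comma-separated list of `Key=Value` fields, with the GI number given as a bare `GI…` token and a trailing comma. A minimal parsing sketch under that assumed layout (the function name and the choice of output keys are illustrative):

```python
def parse_homologue(line):
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number appears without a 'key=' prefix.
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

line = ("Organism=Escherichia coli, GI48994951, Length=349, "
        "Percent_Identity=36.3896848137536, Blast_Score=224, Evalue=8e-60,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
```

Parsing every line this way makes it straightforward to sort the hits by BLAST score or E-value.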

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001670
- InterPro:   IPR018211 [H]

Pfam domain/function: PF00465 Fe-ADH [H]

EC number: 1.1.1.202 [H]

Molecular weight: Translated: 41576; Mature: 41444

Theoretical pI: Translated: 5.44; Mature: 5.44

Prosite motif: PS00913 ADH_IRON_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
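
The Cys/Met percentages are residue counts divided by the sequence length. A minimal sketch, assuming the values are rounded to one decimal place as in the table above; the helper name is hypothetical and a toy peptide is used instead of the full protein.

```python
from collections import Counter

def cys_met_content(protein):
    """Return (%Cys, %Met, %Cys+Met) for a one-letter protein sequence."""
    if not protein:
        raise ValueError("empty sequence")
    counts = Counter(protein.upper())
    n = len(protein)
    cys = 100.0 * counts["C"] / n
    met = 100.0 * counts["M"] / n
    both = 100.0 * (counts["C"] + counts["M"]) / n
    return round(cys, 1), round(met, 1), round(both, 1)

# Toy peptide (hypothetical): 1 Cys and 2 Met in 4 residues.
print(cys_met_content("MCMA"))  # (25.0, 50.0, 75.0)
```

Applied to the translated and mature sequences in this record, the same function should allow the six values listed above to be checked.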

Secondary structure:

>Translated Secondary Structure
MPANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTE
CCCCCCEECCCCEEECHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEECCCHHHHHHHHH
IGIQSDVCTAILPEPTVQNIEDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKV
HCCCCCCHHHHCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCHHHH
EELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEEELKVGIVSPLLLPKLVILDP
HHHHCCCHHCCCCCCEEEEECCCCCCCEECCCCEEECCHHHHHHHHHHHHHHHHHHHHHH
VITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI
HHHHCCCCCCHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC
QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAIT
CHHHCEEEECCCCCEEEEHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHCCHHHHH
ERLSLAAETMGIVASDLTAEQAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHH
SKVTRLLHNNPKPLSLENIKDIYRKLIN
HHHHHHHHCCCCCCCHHHHHHHHHHHCC
>Mature Secondary Structure 
PANYQFRTAGHIVAGEHSIRRLKDHVRTIVPKADRALIITQPSIVKLGAIGEVQAQLTE
CCCCCEECCCCEEECHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEECCCHHHHHHHHH
IGIQSDVCTAILPEPTVQNIEDVFRGIREKRYDILIGIGGGSVLDGTKILSVLQTNSKKV
HCCCCCCHHHHCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCHHHH
EELLGTDLVEKPGIPTVLIPSTSGTGAEVTPNAIVTLPEEELKVGIVSPLLLPKLVILDP
HHHHCCCHHCCCCCCEEEEECCCCCCCEECCCCEEECCHHHHHHHHHHHHHHHHHHHHHH
VITLGLPKPITAATGMDAFTHSLESFISTKANPISDMFALESIRLISASIVEAYENGSSI
HHHHCCCCCCHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC
QARENMLLGSTYGGMALTAAGTAAVHALAYPLGGKYRISHGVANSMLLPHVMEFNMDAIT
CHHHCEEEECCCCCEEEEHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHCCHHHHH
ERLSLAAETMGIVASDLTAEQAAEAVVQKIREWTERLNIPQDLKAFGVTASDVDDLADSA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHH
SKVTRLLHNNPKPLSLENIKDIYRKLIN
HHHHHHHHCCCCCCCHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7721705 [H]