Definition Mycobacterium tuberculosis H37Ra, complete genome.
Accession NC_009525
Length 4,419,977

Click here to switch to the map view.

The map label for this gene is tauD [C]

Identifier: 148659859

GI number: 148659859

Start: 108092

End: 108961

Strand: Direct

Name: tauD [C]

Synonym: MRA_0101

Alternate gene names: 148659859

Gene position: 108092-108961 (Clockwise)

Preceding gene: 148659858

Following gene: 148659860

Centisome position: 2.45

GC content: 58.85

Gene sequence:

>870_bases
ATGACGCTTAAGGTCAAAGGCGAGGGACTCGGTGCGCAGGTCACAGGGGTCGATCCCAAGAATCTGGACGATATAACCAC
CGACGAGATCCGGGATATCGTTTACACGAACAAGCTCGTTGTGCTAAAAGACGTCCATCCGTCTCCGCGGGAGTTCATCA
AACTCGGCAGGATAATTGGACAAATCGTTCCGTATTACGAACCCATGTACCATCACGAAGACCACCCGGAGATCTTTGTC
TCCTCCACTGAGGAAGGTCAGGGGGTCCCAAAAACCGGCGCGTTCTGGCATATCGACTATATGTTTATGCCGGAACCTTT
CGCGTTTTCCATGGTGCTGCCGCTGGCGGTGCCTGGACACGACCGCGGGACCTATTTCATCGATCTCGCCAGGGTCTGGC
AGTCGCTGCCCGCCGCCAAGCGAGACCCGGCCCGCGGAACCGTCAGCACCCACGACCCTCGACGCCACATCAAGATCCGA
CCCAGCGACGTCTACCGGCCCATCGGAGAGGTATGGGACGAGATCAACCGGACCACGCCCCCAATAAAGTGGCCTACGGT
CATCCGGCACCCAAAGACCGGCCAAGAGATCCTCTACATCTGCGCGACGGGCACCACCAAGATCGAGGACAAGGACGGCA
ATCCGGTTGATCCGGAGGTGCTGCAAGAACTCATGGCCGCGACCGGACAGCTCGATCCTGAGTACCAGTCGCCGTTCATA
CATACTCAGCACTACCAGGTTGGCGACATCATCTTGTGGGACAACCGGGTTCTCATGCACCGAGCGAAGCACGGCAGCGC
CGCGGGCACTCTGACGACCTACCGCCTGACCATGCTTGATGGCCTCAAGACGCCGGGATACGCGGCATGA

Upstream 100 bases:

>100_bases
AATGGTTCAACTGTCGTCGCACAGCACAAGCACTACAGTCCCGTTGCTGCCCACTACCTGGACAACCGACGCCGAACAAT
GAACAAGGAGAAAAGAACCG

Downstream 100 bases:

>100_bases
GCCACACCGACTTGACGCCCTGCACACGGGTGCTGGCATCCAGCGGCACGGTTCCGATCGCAGAGGAACTGCTGGCCAGA
GTGCTCGAGCCCTACTCCTG

Product: putative dioxygenase

Products: Aminoacetaldehyde; CO2; H+; Sulfite; Succinate [C]

Alternate protein names: NA

Number of amino acids: Translated: 289; Mature: 288

Protein sequence:

>289_residues
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPREFIKLGRIIGQIVPYYEPMYHHEDHPEIFV
SSTEEGQGVPKTGAFWHIDYMFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIR
PSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFI
HTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA

Sequences:

>Translated_289_residues
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPREFIKLGRIIGQIVPYYEPMYHHEDHPEIFV
SSTEEGQGVPKTGAFWHIDYMFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIR
PSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFI
HTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
>Mature_288_residues
TLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPREFIKLGRIIGQIVPYYEPMYHHEDHPEIFVS
STEEGQGVPKTGAFWHIDYMFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIRP
SDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFIH
TQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA

Specific function: Catalyzes The Conversion Of Taurine And Alpha Ketoglutarate To Sulfite, Aminoacetaldehyde And Succinate. Required For The Utilization Of Taurine (2-Aminoethanesulfonic Acid) As An Alternative Sulfur Source. Pentane-Sulfonic Acid, 3- (N-Morpholino)Propanes

COG id: COG2175

COG function: function code Q; Probable taurine catabolism dioxygenase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the tfdA dioxygenase family

Homologues:

Organism=Escherichia coli, GI1786565, Length=278, Percent_Identity=26.978417266187, Blast_Score=78, Evalue=7e-16,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y097_MYCTU (P67755)

Other databases:

- EMBL:   BX842572
- EMBL:   AE000516
- PIR:   A70751
- RefSeq:   NP_214611.1
- RefSeq:   NP_334514.1
- ProteinModelPortal:   P67755
- EnsemblBacteria:   EBMYCT00000003780
- EnsemblBacteria:   EBMYCT00000072936
- GeneID:   886942
- GeneID:   922930
- GenomeReviews:   AE000516_GR
- GenomeReviews:   AL123456_GR
- KEGG:   mtc:MT0106
- KEGG:   mtu:Rv0097
- TIGR:   MT0106
- TubercuList:   Rv0097
- GeneTree:   EBGT00050000016277
- HOGENOM:   HBG567532
- OMA:   EEILYIC
- ProtClustDB:   CLSK790261
- InterPro:   IPR003819

Pfam domain/function: PF02668 TauD

EC number: 1.14.11.17 [C]

Molecular weight: Translated: 32642; Mature: 32510

Theoretical pI: Translated: 6.58; Mature: 6.58

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.8 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPREFIKLGRIIG
CEEEEECCCCCEEEECCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHH
QIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDYMFMPEPFAFSMVLPLAVPGH
HHHHHHCHHHCCCCCCEEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEEEEEECCCC
DRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIRPSDVYRPIGEVWDEINRTTP
CCCEEEEEHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHCCCCC
PIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFI
CCCCCCEEECCCCCCEEEEEEECCCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCEE
HTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
EECCEEECCEEEECCCEEEEECCCCCCCCEEEEEEEEEHHCCCCCCCCC
>Mature Secondary Structure 
TLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPREFIKLGRIIG
EEEEECCCCCEEEECCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHH
QIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDYMFMPEPFAFSMVLPLAVPGH
HHHHHHCHHHCCCCCCEEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEEEEEECCCC
DRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIRPSDVYRPIGEVWDEINRTTP
CCCEEEEEHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHCCCCC
PIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFI
CCCCCCEEECCCCCCEEEEEEECCCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCEE
HTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
EECCEEECCEEEECCCEEEEECCCCCCCCEEEEEEEEEHHCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: Ascorbate. [C]

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: 2-Oxoglutarate; O2; Taurine [C]

Specific reaction: 2-Oxoglutarate + O2 + Taurine --> Aminoacetaldehyde + CO2 + H+ + Sulfite + Succinate [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9634230; 12218036