The gene/protein map for NC_008312 is currently unavailable.
Definition Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession NC_008312
Length 7,750,108

Click here to switch to the map view.

The map label for this gene is thiG

Identifier: 113475862

GI number: 113475862

Start: 3462429

End: 3463268

Strand: Reverse

Name: thiG

Synonym: Tery_2221

Alternate gene names: 113475862

Gene position: 3463268-3462429 (Counterclockwise)

Preceding gene: 113475864

Following gene: 113475861

Centisome position: 44.69

GC content: 40.48

Gene sequence:

>840_bases
ATGCAAACAGTTGAAAAATTAACTACAGAAACTCTAGAAAAACCTCTAATTATTGCCGGTAAAAAGTTTACCTCTCGTCT
GATGACAGGCACAGGCAAGTATCCGACTATTGAGACAATGCAGCAAAGTATAGAGGCTAGTAAATGTGAAATTATCACTG
TAGCAGTACGACGAGTACAAACTCAAGCTCCTGGCCATAAAGGGTTAGCAGAAGCTATAGACTGGCAAAAAGTCTGGATG
TTGCCTAATACTGCTGGTTGTCAAACTGCTGAAGATGCAGTAAGAGTAGCCAGGTTGGGTAGAGAAATGGCAAAGTTGTT
AGGCCAAGAAGATAATAATTTTGTCAAACTAGAAGTTATTCCTGACTCTAAATATTTACTGCCAGACCCTATCGGAACTC
TACAAGCCGCAGAACAGTTAATTAAAGAAGGTTTTGCTGTATTACCTTATATTAATGCTGACCCACTTCTAGCTAAAAGA
CTTGAGGAGGCTGGTTGTTCTACAGTTATGCCTTTGGGTTCACCCATAGGCTCAGGCCAGGGAATACAAAATGCAGCCAA
TATTTCCATAATTATTGACAATTCTACTGTTCCTGTGGTAATAGATGCCGGTATCGGAACTCCCAGTGAAGCAACTCAAG
CAATGGAAATGGGAGCTGATGCTTTACTAATTAATTCAGCGATCGCTTTAGCCAAAAATCCGCCCATCATGGCAAAAGCT
ATGGGAATGGCCACAGAATCAGGACGTTTAGCTTATCTAGCTGGCAGAATACCGAAAAAAAGTTATGCTACTCCTTCTTC
TCCTGTAACTGGAAAGATTAATACAACTACTTCAGAATAA

Upstream 100 bases:

>100_bases
TAGGAAGCAGAAGGAACAGGAGTTGGAGATATTTTTAAGAAGTTCAGTTATTAGACATAATATTTAGGACTGCTATAGTT
ATAAACATCAAAATTAAACA

Downstream 100 bases:

>100_bases
TAATTTCTTATATTCTCATATTCTCCGAATTTCTGTTAGGCTAAGTAATAGTTAAGAAATAATAAAGAAATAGGGACTTT
GGGTTATGCCATATACAACA

Product: thiazole synthase

Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]

Alternate protein names: NA

Number of amino acids: Translated: 279; Mature: 279

Protein sequence:

>279_residues
MQTVEKLTTETLEKPLIIAGKKFTSRLMTGTGKYPTIETMQQSIEASKCEIITVAVRRVQTQAPGHKGLAEAIDWQKVWM
LPNTAGCQTAEDAVRVARLGREMAKLLGQEDNNFVKLEVIPDSKYLLPDPIGTLQAAEQLIKEGFAVLPYINADPLLAKR
LEEAGCSTVMPLGSPIGSGQGIQNAANISIIIDNSTVPVVIDAGIGTPSEATQAMEMGADALLINSAIALAKNPPIMAKA
MGMATESGRLAYLAGRIPKKSYATPSSPVTGKINTTTSE

Sequences:

>Translated_279_residues
MQTVEKLTTETLEKPLIIAGKKFTSRLMTGTGKYPTIETMQQSIEASKCEIITVAVRRVQTQAPGHKGLAEAIDWQKVWM
LPNTAGCQTAEDAVRVARLGREMAKLLGQEDNNFVKLEVIPDSKYLLPDPIGTLQAAEQLIKEGFAVLPYINADPLLAKR
LEEAGCSTVMPLGSPIGSGQGIQNAANISIIIDNSTVPVVIDAGIGTPSEATQAMEMGADALLINSAIALAKNPPIMAKA
MGMATESGRLAYLAGRIPKKSYATPSSPVTGKINTTTSE
>Mature_279_residues
MQTVEKLTTETLEKPLIIAGKKFTSRLMTGTGKYPTIETMQQSIEASKCEIITVAVRRVQTQAPGHKGLAEAIDWQKVWM
LPNTAGCQTAEDAVRVARLGREMAKLLGQEDNNFVKLEVIPDSKYLLPDPIGTLQAAEQLIKEGFAVLPYINADPLLAKR
LEEAGCSTVMPLGSPIGSGQGIQNAANISIIIDNSTVPVVIDAGIGTPSEATQAMEMGADALLINSAIALAKNPPIMAKA
MGMATESGRLAYLAGRIPKKSYATPSSPVTGKINTTTSE

Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5- phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H(2)S

COG id: COG2022

COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis

Gene ontology:

Cell location: Cytoplasm

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiG family

Homologues:

Organism=Escherichia coli, GI48994993, Length=261, Percent_Identity=49.8084291187739, Blast_Score=250, Evalue=7e-68,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THIG_TRIEI (Q112X4)

Other databases:

- EMBL:   CP000393
- RefSeq:   YP_721923.1
- ProteinModelPortal:   Q112X4
- SMR:   Q112X4
- STRING:   Q112X4
- GeneID:   4243254
- GenomeReviews:   CP000393_GR
- KEGG:   ter:Tery_2221
- NMPDR:   fig|203124.1.peg.1421
- eggNOG:   COG2022
- HOGENOM:   HBG296821
- OMA:   PIIIDAG
- PhylomeDB:   Q112X4
- ProtClustDB:   PRK00208
- BioCyc:   TERY203124:TERY_2221-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00443
- InterPro:   IPR013785
- InterPro:   IPR008867
- Gene3D:   G3DSA:3.20.20.70

Pfam domain/function: PF05690 ThiG; SSF110399 ThiG

EC number: NA

Molecular weight: Translated: 29634; Mature: 29634

Theoretical pI: Translated: 6.05; Mature: 6.05

Prosite motif: NA

Important sites: ACT_SITE 116-116 BINDING 177-177

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.9 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MQTVEKLTTETLEKPLIIAGKKFTSRLMTGTGKYPTIETMQQSIEASKCEIITVAVRRVQ
CCHHHHHHHHHHCCCEEEECHHHHHHHHCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHH
TQAPGHKGLAEAIDWQKVWMLPNTAGCQTAEDAVRVARLGREMAKLLGQEDNNFVKLEVI
HCCCCCCCHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEE
PDSKYLLPDPIGTLQAAEQLIKEGFAVLPYINADPLLAKRLEEAGCSTVMPLGSPIGSGQ
CCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCEECCCCCCCCCCC
GIQNAANISIIIDNSTVPVVIDAGIGTPSEATQAMEMGADALLINSAIALAKNPPIMAKA
CCCCCCEEEEEEECCCEEEEEECCCCCCHHHHHHHHHCCCEEEHHHHHHHCCCCCHHHHH
MGMATESGRLAYLAGRIPKKSYATPSSPVTGKINTTTSE
HCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEECCCCCC
>Mature Secondary Structure
MQTVEKLTTETLEKPLIIAGKKFTSRLMTGTGKYPTIETMQQSIEASKCEIITVAVRRVQ
CCHHHHHHHHHHCCCEEEECHHHHHHHHCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHH
TQAPGHKGLAEAIDWQKVWMLPNTAGCQTAEDAVRVARLGREMAKLLGQEDNNFVKLEVI
HCCCCCCCHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEE
PDSKYLLPDPIGTLQAAEQLIKEGFAVLPYINADPLLAKRLEEAGCSTVMPLGSPIGSGQ
CCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCEECCCCCCCCCCC
GIQNAANISIIIDNSTVPVVIDAGIGTPSEATQAMEMGADALLINSAIALAKNPPIMAKA
CCCCCCEEEEEEECCCEEEEEECCCCCCHHHHHHHHHCCCEEEHHHHHHHCCCCCHHHHH
MGMATESGRLAYLAGRIPKKSYATPSSPVTGKINTTTSE
HCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEECCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]

Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA