Definition: Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession: NC_003062
Length: 2,841,580

The map label for this gene is thiG

Identifier: 15889824

GI number: 15889824

Start: 2537959

End: 2538732

Strand: Reverse

Name: thiG

Synonym: Atu2566

Alternate gene names: 15889824

Gene position: 2538732-2537959 (Counterclockwise)
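
The Start, End and Strand fields can be cross-checked against the stated sequence length. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene sequence shown in this record is the coding strand (the reverse complement of the chromosome's forward strand). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START = 2537959
END = 2538732


def gene_length(start: int, end: int) -> int:
    """Number of bases in a 1-based, inclusive interval."""
    return abs(end - start) + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string made of A, C, G and T."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


print(gene_length(START, END))             # 774
# Last 12 bases of the listed gene sequence, as they would read on the
# forward strand if the gene lies on the reverse strand (assumption).
print(reverse_complement("GTTTTCTCCTGA"))  # TCAGGAGAAAAC
```

The computed length of 774 agrees with the size given in the gene sequence header.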

Preceding gene: 15889825

Following gene: 159185269

Centisome position: 89.34
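
The centisome position expresses a coordinate as a percentage of the chromosome length. As a rough check, the listed value is reproduced when the higher coordinate (2538732, the first base of the gene on the reverse strand) is divided by the chromosome length. That choice of coordinate is inferred from the numbers in this record, not from documentation, and the names below are illustrative.

```python
GENOME_LENGTH = 2_841_580  # Length field of the record
GENE_FIRST_BASE = 2_538_732  # higher coordinate; assumed reference point


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


print(centisome(GENE_FIRST_BASE, GENOME_LENGTH))  # 89.34
```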

GC content: 61.76

Gene sequence:

>774_bases
ATGCTGACGCTTTATGGCCGCGAGGTCTCGTCCCGCCTTCTGCTCGGCACGGCGCGTTATCCGTCTCCCGCTGTGCTTGC
GGATGCGGTGCGGGCCAGCAATACGGATATTCTGACGATTTCGTTGCGACGGGAAATGGCAGGGGCCAAGAAGGGCGGTC
AGTTTTTCGAGCTGATCCGCGAGCTGGATCGCCATATCCTGCCCAATACGGCTGGTTGCCACACCGCCAAGGAAGCGGTG
CTGACGGCGAAGATGGCCCGCGAGGTATTCCGTACGGACTGGATCAAGCTGGAGGTCATTGGCCATCACGATACGTTGCA
GCCCGATGTCTTTGCGCTTGTCGAGGCGGCAAAAATCCTGTGTGACGAGGGTTTCGAAGTCTTTCCCTATACGACGGACG
ATTTGGTCGTGGCCGAGAAACTGCTGGAGGCTGGTTGCAGGGTGCTGATGCCCTGGTGCGCGCCGATCGGCAGCGCCATG
GGGCCGCTCAATCTGACGGCGCTGAAATCGATGCGGGCGCGGTTTCCCGAAGTACCACTGATCGTGGATGCCGGTATCGG
CCGGCCCTCCCATGCGGTGACCGTGATGGAGCTTGGTTACGATGCCGTTCTCCTCAACACGGCGGTTGCGGGTGCGGGCG
ACCCAGTCGGCATGGCGGAGGCTTTTGCCCGCGCCATAGAGGCAGGCCATCAGGCGTATCTTTCGGGGCCGCTGGAGCCA
CGAGACATGGCCGTTCCTTCGACCCCTGTTATCGGGACGGCGGTTTTCTCCTGA
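
The GC content field can be recomputed from any gene sequence as the percentage of G and C bases. The sketch below is a minimal version that assumes an unambiguous A/C/G/T sequence. The function name is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, to 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


print(gc_content("ATGC"))  # 50.0
# Arithmetic check only: over 774 bases, 478 G or C bases is the one
# integer count that rounds to the listed 61.76.
print(round(100.0 * 478 / 774, 2))  # 61.76
```

Passing the full 774-base sequence (with line breaks removed) to `gc_content` should allow the listed value to be confirmed directly.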

Upstream 100 bases:

>100_bases
GTCAATGGCGATCTCGTGCATTCGGAGGAACGCGCCGGTTACGTGCTGAAGGCGTTCGACCGTGTCGAAATCCTTTCGCC
GATGCAGGGAGGTTGAGACC

Downstream 100 bases:

>100_bases
TCGCCGCTTTCCGCCAAGGTCGGGCATGTCACCTTTTGCGTTGACAGTGTCCCGTGGCGGCGGGATACCGGGAGTAAAGG
GGAACAGCATGGCGGCACAC

Product: thiazole synthase

Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]

Alternate protein names: NA

Number of amino acids: Translated: 257; Mature: 257

Protein sequence:

>257_residues
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV
LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM
GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
RDMAVPSTPVIGTAVFS
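
The relationship between the 774-base gene and the 257-residue protein (258 codons, the last being a stop) can be illustrated with a small translator. This is a minimal sketch using the standard codon assignments, not the pipeline that produced this record. The names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base until a stop codon or the end."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTGACGCTTTATGGC"))  # MLTLYG (first six codons)
print(translate("GTTTTCTCCTGA"))        # VFS (last three residues + stop)
print((774 - 3) // 3)                   # 257 residues before the stop
```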

Sequences:

>Translated_257_residues
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV
LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM
GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
RDMAVPSTPVIGTAVFS
>Mature_257_residues
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV
LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM
GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
RDMAVPSTPVIGTAVFS

Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H2S.

COG id: COG2022

COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiG family

Homologues:

Organism=Escherichia coli, GI48994993, Length=252, Percent_Identity=39.6825396825397, Blast_Score=172, Evalue=2e-44
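
A homologue entry in this comma-separated key=value layout can be split into fields for downstream use. The parser below is a sketch for this one layout only. It assumes the bare `GI…` token is the only field without an `=` sign, and `parse_homologue` is a hypothetical helper name.

```python
def parse_homologue(line: str) -> dict:
    """Split a 'key=value, key=value' homologue entry into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI number without '='
    return record


entry = parse_homologue(
    "Organism=Escherichia coli, GI48994993, Length=252, "
    "Percent_Identity=39.6825396825397, Blast_Score=172, Evalue=2e-44"
)
print(entry["Organism"], entry["GI"])
print(round(float(entry["Percent_Identity"]), 2))  # 39.68
```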

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THIG_AGRT5 (Q8UCD2)

Other databases:

- EMBL:   AE007869
- PIR:   A97667
- PIR:   AE2891
- RefSeq:   NP_355505.1
- ProteinModelPortal:   Q8UCD2
- SMR:   Q8UCD2
- STRING:   Q8UCD2
- GeneID:   1134604
- GenomeReviews:   AE007869_GR
- KEGG:   atu:Atu2566
- eggNOG:   COG2022
- HOGENOM:   HBG296821
- OMA:   VAIRRTN
- PhylomeDB:   Q8UCD2
- ProtClustDB:   PRK00208
- BioCyc:   ATUM176299-1:ATU2566-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00443
- InterPro:   IPR013785
- InterPro:   IPR008867
- Gene3D:   G3DSA:3.20.20.70

Pfam domain/function: PF05690 ThiG; SSF110399 ThiG

EC number: NA

Molecular weight: Translated: 27582; Mature: 27582
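
A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water for the free termini. The sketch below assumes that approach and uses commonly tabulated average residue masses. The record does not state which mass table was used, so small differences from the listed value are possible.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594,
    "N": 114.1038, "D": 115.0886, "Q": 128.1307, "K": 128.1741,
    "E": 129.1155, "M": 131.1926, "H": 137.1411, "F": 147.1766,
    "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Average mass of an unmodified peptide chain, in daltons."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# First six residues of the protein, as a small worked input.
print(round(molecular_weight("MLTLYG"), 2))
```

Applying `molecular_weight` to the full 257-residue sequence gives an estimate to compare with the listed value.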

Theoretical pI: Translated: 5.64; Mature: 5.64

Prosite motif: NA

Important sites: ACT_SITE 96-96 BINDING 157-157

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6 %Cys     (Translated Protein)
3.5 %Met     (Translated Protein)
5.1 %Cys+Met (Translated Protein)
1.6 %Cys     (Mature Protein)
3.5 %Met     (Mature Protein)
5.1 %Cys+Met (Mature Protein)
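
The Cys/Met percentages follow directly from residue counts in the 257-residue protein sequence listed in this record. A minimal sketch, with illustrative names:

```python
# Protein sequence copied from the record (257 residues).
PROTEIN = (
    "MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV"
    "LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM"
    "GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP"
    "RDMAVPSTPVIGTAVFS"
)


def percent_of(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    hits = sum(1 for aa in protein if aa in residues)
    return round(100.0 * hits / len(protein), 1)


print(PROTEIN.count("C"), PROTEIN.count("M"))  # 4 9
print(percent_of(PROTEIN, "C"))   # 1.6
print(percent_of(PROTEIN, "M"))   # 3.5
print(percent_of(PROTEIN, "CM"))  # 5.1
```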

Secondary structure:

>Translated Secondary Structure
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIR
CEEEECCCHHCCEEEEECCCCCHHHHHHHHHCCCCCEEEEEEHHHHCCCCCCHHHHHHHH
ELDRHILPNTAGCHTAKEAVLTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKIL
HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHH
CDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAMGPLNLTALKSMRARFPEVPL
HHCCCEEECCCCHHHHHHHHHHHCCHHEEHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCE
IVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
EEECCCCCCCCEEEEECCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHCCCCCC
RDMAVPSTPVIGTAVFS
CCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIR
CEEEECCCHHCCEEEEECCCCCHHHHHHHHHCCCCCEEEEEEHHHHCCCCCCHHHHHHHH
ELDRHILPNTAGCHTAKEAVLTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKIL
HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHH
CDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAMGPLNLTALKSMRARFPEVPL
HHCCCEEECCCCHHHHHHHHHHHCCHHEEHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCE
IVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
EEECCCCCCCCEEEEECCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHCCCCCC
RDMAVPSTPVIGTAVFS
CCCCCCCCCCCCCCCCC
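
The secondary structure rows pair each residue with a one-letter state. The sketch below assumes the usual three-state reading (H = helix, E = strand, C = coil), which the record does not spell out, and tallies the states in a prediction string. The function name is illustrative.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Fraction of positions in each secondary-structure state."""
    counts = Counter(states)
    total = len(states)
    return {state: counts[state] / total for state in sorted(counts)}


# First ten predicted states from the record, as a small worked input.
print(state_fractions("CEEEECCCHH"))  # {'C': 0.4, 'E': 0.4, 'H': 0.2}
```

Concatenating the state rows (every second line of the block above) gives the full 257-position string for the same calculation.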

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]

Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11743193; 11743194