Definition: Vibrio cholerae M66-2 chromosome I, complete genome.
Accession: NC_012578
Length: 2,892,523 bp

The map label for this gene is thiG

Identifier: 227080301

GI number: 227080301

Start: 64185

End: 64955

Strand: Direct

Name: thiG

Synonym: VCM66_0065

Alternate gene names: 227080301

Gene position: 64185-64955 (Clockwise)

Preceding gene: 227080300

Following gene: 227080302

Centisome position: 2.22
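
The gene length and centisome value above can be cross-checked from the coordinates. The sketch below assumes that the centisome position is the start coordinate expressed as a percentage of the replicon length (2,892,523 for NC_012578); that definition and the function names are illustrative assumptions, not something stated by the source database.

```python
GENOME_LENGTH = 2_892_523  # length of NC_012578 as listed in this record


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int = GENOME_LENGTH) -> float:
    """Start position as a percentage of replicon length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(64185, 64955))  # 771
print(centisome(64185))           # 2.22
```

Under these assumptions both values agree with the record: 771 bases and a centisome position of 2.22.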

GC content: 50.45

Gene sequence:

>771_bases
ATGCTCAAGATTGCAGATAAAACGTTTCAATCCAGACTGTTTACTGGGACTGGAAAATTCTCTAATCGCCATGTCATGGC
TGAAGCATTGGCCGCCTCAGGTTCAGAGTTGGTGACCATGGCGTTAAAGCGGATTGATTTGGCTCAGCGCGACGATGACA
TTCTTGCGCCGCTACTCAGTATGCAGATGAATCTCTTACCTAACACATCTGGAGCCAAAAATGCTGCTGATGCGGTGTAT
GCCGCCGAGCTAGCCCGAGAAGCCTTAGCCACGAACTGGCTTAAGTTGGAGATTCATCCTGATCCCAAATATCTAATGCC
CGATCCCATCGAAACCCTGCTTGCGGCAGAACAATTGGTGAAGCAAGGTTTTATTGTGTTGCCGTATTGTCATGCTGACC
CTGTCTTGTGCAAAAGATTGGAAGAGGTTGGCTGCGCTGCGGTTATGCCCCTCGGTTCACCGATTGGCAGCAATCAAGGC
TTGGCGTCGAAAACCTTTCTTGAAATCATCATTGACCAAGCCAAGGTGCCGGTCATTGTCGATGCGGGGATTGGGTCGCC
ATCCGATGCTGCGCAAGCGATGGAGCTTGGCGCCGACGCTGTGTTAGTGAATACCGCTATTGCCGCGGCGCATGACCCCA
TCGCTATGGCAAAAGCCTTTAAGCTAGCGGTCGAAGCTGGGCGAATGGCTTATGAATCCGGCTTGCCATCACGAGTAAAA
ATGGCGACGGCATCAAGTCCATTAACTGGATTTTTGGATTTTGTATCATGA
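
A GC content figure such as the one listed for this gene is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation; the function name is hypothetical, and the example input is only the first line of the gene sequence above, so its result is not expected to equal the whole-gene value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks (as in FASTA-style blocks) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 80 bases of the thiG gene sequence from this record.
first_line = (
    "ATGCTCAAGATTGCAGATAAAACGTTTCAATCCAGACTGTTTACTGGGAC"
    "TGGAAAATTCTCTAATCGCCATGTCATGGC"
)
print(gc_content(first_line))
```

Passing the full 771-base sequence, with or without line breaks, gives the whole-gene percentage.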

Upstream 100 bases:

>100_bases
TCAATGGGCAGGTTGTCCCAAGAAGTGAATGGCAACACACCAAGCTTAATTCAGGGGATGAGATTTCCCTTTTCCAAGCG
ATAGCAGGAGGCTAAATCAG

Downstream 100 bases:

>100_bases
GCTTCATTACCCACTTCCAACAATTAGGCTGGGATGACAGTCGGCTATCGATTTATGGTAAGACAGCGCGGGATGTTGAA
CGCGCTTTATCTTCACCTAA

Product: thiazole synthase

Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]

Alternate protein names: NA

Number of amino acids: Translated: 256; Mature: 256

Protein sequence:

>256_residues
MLKIADKTFQSRLFTGTGKFSNRHVMAEALAASGSELVTMALKRIDLAQRDDDILAPLLSMQMNLLPNTSGAKNAADAVY
AAELAREALATNWLKLEIHPDPKYLMPDPIETLLAAEQLVKQGFIVLPYCHADPVLCKRLEEVGCAAVMPLGSPIGSNQG
LASKTFLEIIIDQAKVPVIVDAGIGSPSDAAQAMELGADAVLVNTAIAAAHDPIAMAKAFKLAVEAGRMAYESGLPSRVK
MATASSPLTGFLDFVS
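
The protein sequence follows from the gene sequence by codon-wise translation: 771 bases are 257 codons, the last of which (TGA) is a stop, leaving 256 residues. The sketch below assumes the standard genetic code, whose codon assignments are also used by the bacterial translation table; the names are illustrative and the code is not the tool used to build this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, in TCAG codon order; '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the thiG gene sequence from this record.
print(translate("ATGCTCAAGATTGCAGATAAAACGTTTCAA"))  # MLKIADKTFQ
```

The output matches the first ten residues of the protein sequence listed above.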

Sequences:

>Translated_256_residues
MLKIADKTFQSRLFTGTGKFSNRHVMAEALAASGSELVTMALKRIDLAQRDDDILAPLLSMQMNLLPNTSGAKNAADAVY
AAELAREALATNWLKLEIHPDPKYLMPDPIETLLAAEQLVKQGFIVLPYCHADPVLCKRLEEVGCAAVMPLGSPIGSNQG
LASKTFLEIIIDQAKVPVIVDAGIGSPSDAAQAMELGADAVLVNTAIAAAHDPIAMAKAFKLAVEAGRMAYESGLPSRVK
MATASSPLTGFLDFVS
>Mature_256_residues
MLKIADKTFQSRLFTGTGKFSNRHVMAEALAASGSELVTMALKRIDLAQRDDDILAPLLSMQMNLLPNTSGAKNAADAVY
AAELAREALATNWLKLEIHPDPKYLMPDPIETLLAAEQLVKQGFIVLPYCHADPVLCKRLEEVGCAAVMPLGSPIGSNQG
LASKTFLEIIIDQAKVPVIVDAGIGSPSDAAQAMELGADAVLVNTAIAAAHDPIAMAKAFKLAVEAGRMAYESGLPSRVK
MATASSPLTGFLDFVS

Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H(2)S.

COG id: COG2022

COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiG family

Homologues:

Organism=Escherichia coli, GI48994993, Length=253, Percent_Identity=71.5415019762846, Blast_Score=374, Evalue=1e-105
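
A homologue entry of this form is a comma-separated list of key=value fields plus a bare GI token. The sketch below shows one way to read such a line into a dictionary; the function name and the choice to store the GI number under a "GI" key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse 'Key=value, GI<number>, ...' into a dict of strings."""
    record = {}
    for token in line.split(","):
        token = token.strip()
        if not token:
            continue  # tolerate a trailing comma
        if "=" in token:
            key, value = token.split("=", 1)
            record[key.strip()] = value.strip()
        elif token.startswith("GI"):
            record["GI"] = token[2:]  # bare GI token, number only
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI48994993, Length=253, "
    "Percent_Identity=71.5415019762846, Blast_Score=374, Evalue=1e-105,"
)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```

Values are kept as strings so that the caller decides how to convert them, for example with int() for Length or float() for Evalue.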

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THIG_VIBC3 (A5F4E2)

Other databases:

- EMBL:   CP000627
- EMBL:   CP001235
- ProteinModelPortal:   A5F4E2
- SMR:   A5F4E2
- STRING:   A5F4E2
- GenomeReviews:   CP000627_GR
- GenomeReviews:   CP001235_GR
- KEGG:   vco:VC0395_A2449
- eggNOG:   COG2022
- HOGENOM:   HBG296821
- OMA:   VAIRRTN
- ProtClustDB:   PRK00208
- BioCyc:   VCHO345073:VC0395_A2449-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00443
- InterPro:   IPR013785
- InterPro:   IPR008867
- Gene3D:   G3DSA:3.20.20.70

Pfam domain/function: PF05690 ThiG; SSF110399 ThiG

EC number: NA

Molecular weight: Translated: 27090; Mature: 27090

Theoretical pI: Translated: 5.09; Mature: 5.09

Prosite motif: NA

Important sites: ACT_SITE 95-95; BINDING 156-156

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
4.3 %Met     (Translated Protein)
5.5 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
4.3 %Met     (Mature Protein)
5.5 %Cys+Met (Mature Protein)
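
The Cys/Met percentages are residue counts divided by the protein length. As a minimal sketch, assuming one-decimal rounding as in the table above, the values can be recomputed from the 256-residue sequence in this record; the helper name is hypothetical.

```python
# Protein sequence of thiG as listed in this record (256 residues).
PROTEIN = (
    "MLKIADKTFQSRLFTGTGKFSNRHVMAEALAASGSELVTMALKRIDLAQRDDDILAPLLSMQMNLLPNTSGAKNAADAVY"
    "AAELAREALATNWLKLEIHPDPKYLMPDPIETLLAAEQLVKQGFIVLPYCHADPVLCKRLEEVGCAAVMPLGSPIGSNQG"
    "LASKTFLEIIIDQAKVPVIVDAGIGSPSDAAQAMELGADAVLVNTAIAAAHDPIAMAKAFKLAVEAGRMAYESGLPSRVK"
    "MATASSPLTGFLDFVS"
)


def percent_of(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


print(percent_of(PROTEIN, "C"))   # 1.2
print(percent_of(PROTEIN, "M"))   # 4.3
print(percent_of(PROTEIN, "CM"))  # 5.5
```

The sequence contains 3 Cys and 11 Met residues, which reproduces the listed 1.2, 4.3 and 5.5 percent. Because the translated and mature sequences are identical here, both sets of values coincide.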

Secondary structure:

>Translated Secondary Structure
MLKIADKTFQSRLFTGTGKFSNRHVMAEALAASGSELVTMALKRIDLAQRDDDILAPLLS
CCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHH
MQMNLLPNTSGAKNAADAVYAAELAREALATNWLKLEIHPDPKYLMPDPIETLLAAEQLV
HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCHHHHHHHHHHHH
KQGFIVLPYCHADPVLCKRLEEVGCAAVMPLGSPIGSNQGLASKTFLEIIIDQAKVPVIV
HCCCEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCCCCEEE
DAGIGSPSDAAQAMELGADAVLVNTAIAAAHDPIAMAKAFKLAVEAGRMAYESGLPSRVK
ECCCCCCHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEE
MATASSPLTGFLDFVS
EECCCCCHHHHHHHCC
>Mature Secondary Structure
MLKIADKTFQSRLFTGTGKFSNRHVMAEALAASGSELVTMALKRIDLAQRDDDILAPLLS
CCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHH
MQMNLLPNTSGAKNAADAVYAAELAREALATNWLKLEIHPDPKYLMPDPIETLLAAEQLV
HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCHHHHHHHHHHHH
KQGFIVLPYCHADPVLCKRLEEVGCAAVMPLGSPIGSNQGLASKTFLEIIIDQAKVPVIV
HCCCEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCCCCEEE
DAGIGSPSDAAQAMELGADAVLVNTAIAAAHDPIAMAKAFKLAVEAGRMAYESGLPSRVK
ECCCCCCHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEE
MATASSPLTGFLDFVS
EECCCCCHHHHHHHCC
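
The secondary structure blocks alternate a line of residues with a line of per-residue states. The sketch below pairs the two and tallies the states, assuming the conventional three-state letters (H for helix, E for strand, C for coil); the record itself does not define them, and the function names are illustrative.

```python
from collections import Counter


def pair_lines(lines):
    """Pair alternating sequence/state lines into (residue, state) tuples."""
    pairs = []
    for seq, states in zip(lines[0::2], lines[1::2]):
        if len(seq) != len(states):
            raise ValueError("sequence and state lines differ in length")
        pairs.extend(zip(seq, states))
    return pairs


def state_counts(lines):
    """Count how many residues fall in each predicted state."""
    return Counter(state for _, state in pair_lines(lines))


# Last residue/state line pair of the prediction in this record.
block = ["MATASSPLTGFLDFVS", "EECCCCCHHHHHHHCC"]
counts = state_counts(block)
print(counts["H"], counts["E"], counts["C"])  # 7 2 7
```

Passing all ten lines of a block, without its header line, gives the counts for the whole protein.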

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]

Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA