Definition Rhodopseudomonas palustris HaA2, complete genome.
Accession NC_007778
Length 5,331,656

The map label for this gene is thiG

Identifier: 86749079

GI number: 86749079

Start: 2223326

End: 2224108

Strand: Reverse

Name: thiG

Synonym: RPB_1956

Alternate gene names: 86749079

Gene position: 2224108-2223326 (Counterclockwise)

Preceding gene: 86749080

Following gene: 86749078

Centisome position: 41.72
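
The coordinate fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the centisome position is the first base of the gene on its own strand (2224108 for this reverse-strand gene) expressed as a percentage of the genome length. That definition is inferred from the numbers in this record, not stated by it, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature spanning start..end (1-based, inclusive)."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of the genome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 5_331_656          # "Length" field of this record
START, END = 2_223_326, 2_224_108  # "Start" and "End" fields

length = gene_length(START, END)
codons = length // 3               # includes the terminal stop codon
residues = codons - 1              # stop codon is not translated

# Assumption: a reverse-strand gene begins at its "End" coordinate.
position = round(centisome(END, GENOME_LENGTH), 2)

print(length, residues, position)
```

Under these assumptions the values agree with the record: 783 bases, 260 residues, and a centisome position of 41.72.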

GC content: 68.33

Gene sequence:

>783_bases
ATGGTGAAATTCTACGACCGCGAGATCTCCTCGCGCCTGCTGATCGGCAGCGCGCTGTATCCGTCGCCGGCGATCATGCA
GGATTCCATCCGCGAATCCGGCGCGGACATCGTCACCGTGTCGCTGCGCCGCGAGGCCGCCGGCGGCAAGGCGGGCGATC
AGTTCTGGTCGCTGATCCGCGAGCTCGGCGTCACCGTGCTGCCGAACACCGCCGGCTGCCGCAGCGTGCGCGAGGCGGTG
ACCACGGCAAAGCTGGCGCGCGAATTGTTCGGCACCGCATGGATCAAGCTCGAAGTCATCGCCGACAACGACACGCTGCA
GCCCGACGTCGTCGGCTTGGTCGAAGCGGCGCAAATCCTGACCAAGGACGGCTTCGAGGTGTTTCCCTATTGCACCGAGG
ATCTGTCGGTGGCGATGCGGCTGGTCGATGCCGGCTGCCGCGTCATCATGCCGTGGGCGGCGCCGATCGGCAGCGCGCGC
GGCATCGTCGCTCGCGACGCGCTGAAGCTGCTGCGCGACCGCCTGCCCGATATCACCCTCGTCGTCGATGCCGGCCTCGG
CGCGCCGAGCCACGCGGCCGAAGCGATGGAGCTCGGCTACGACGCCGTCCTCCTCAACACCGCGATCGCCAAAGCCGAAG
ATCCGGTGGCGATGGCCCGCGCCTTCAAGCTCGCGGTCGAAGCCGGCCGCACCGGATTCGAGGCCGGGCTGATGGGCGCC
CGCGATTTCGCCTCCCCCTCAACCCCCGTGATTGGGACCCCGTTCTGGCATGCCGTATCCTGA
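
The GC content field can be recomputed from a FASTA-style block such as the one above. The sketch below is one straightforward way to do it, assuming GC content means the percentage of G and C bases over all bases in the sequence; `gc_percent` is a hypothetical helper, and the short input is a toy example rather than the thiG sequence.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G/C bases in a FASTA-style block.

    Header lines (starting with '>') and whitespace are ignored.
    """
    bases = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if line.strip() and not line.startswith(">")
    ).upper()
    if not bases:
        raise ValueError("no sequence found")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy input: 6 of 8 bases are G or C.
toy = ">8_bases\nATGC\nGGCC\n"
print(round(gc_percent(toy), 2))
```

Passing the full 783-base block to the same function gives a value to compare against the reported 68.33.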

Upstream 100 bases:

>100_bases
GACCCAGTATCCCAGAGCGCCGGCGTTCAGCCGCTAACTCTCTGGGATACTGGATCCCCGCTTTCGCGGGGATGACGGCC
TTTTGTGGAGCAACCCACGC

Downstream 100 bases:

>100_bases
TCGCTTCTATCCCGTCGTCGACAGCATCGCGTGGGTCAAACGCCTCGCCGCGCTCGGCGTCGGCACCGTGCAACTCCGCG
CCAAGGACCTCGACGACGGC

Product: thiazole synthase

Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]

Alternate protein names: NA

Number of amino acids: Translated: 260; Mature: 260

Protein sequence:

>260_residues
MVKFYDREISSRLLIGSALYPSPAIMQDSIRESGADIVTVSLRREAAGGKAGDQFWSLIRELGVTVLPNTAGCRSVREAV
TTAKLARELFGTAWIKLEVIADNDTLQPDVVGLVEAAQILTKDGFEVFPYCTEDLSVAMRLVDAGCRVIMPWAAPIGSAR
GIVARDALKLLRDRLPDITLVVDAGLGAPSHAAEAMELGYDAVLLNTAIAKAEDPVAMARAFKLAVEAGRTGFEAGLMGA
RDFASPSTPVIGTPFWHAVS

Sequences:

>Translated_260_residues
MVKFYDREISSRLLIGSALYPSPAIMQDSIRESGADIVTVSLRREAAGGKAGDQFWSLIRELGVTVLPNTAGCRSVREAV
TTAKLARELFGTAWIKLEVIADNDTLQPDVVGLVEAAQILTKDGFEVFPYCTEDLSVAMRLVDAGCRVIMPWAAPIGSAR
GIVARDALKLLRDRLPDITLVVDAGLGAPSHAAEAMELGYDAVLLNTAIAKAEDPVAMARAFKLAVEAGRTGFEAGLMGA
RDFASPSTPVIGTPFWHAVS
>Mature_260_residues
MVKFYDREISSRLLIGSALYPSPAIMQDSIRESGADIVTVSLRREAAGGKAGDQFWSLIRELGVTVLPNTAGCRSVREAV
TTAKLARELFGTAWIKLEVIADNDTLQPDVVGLVEAAQILTKDGFEVFPYCTEDLSVAMRLVDAGCRVIMPWAAPIGSAR
GIVARDALKLLRDRLPDITLVVDAGLGAPSHAAEAMELGYDAVLLNTAIAKAEDPVAMARAFKLAVEAGRTGFEAGLMGA
RDFASPSTPVIGTPFWHAVS

Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H(2)S.

COG id: COG2022

COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiG family

Homologues:

Organism=Escherichia coli, GI48994993, Length=252, Percent_Identity=42.0634920634921, Blast_Score=198, Evalue=3e-52

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THIG_RHOP2 (Q2IYP6)

Other databases:

- EMBL:   CP000250
- RefSeq:   YP_485575.1
- ProteinModelPortal:   Q2IYP6
- SMR:   Q2IYP6
- STRING:   Q2IYP6
- GeneID:   3908035
- GenomeReviews:   CP000250_GR
- KEGG:   rpb:RPB_1956
- eggNOG:   COG2022
- HOGENOM:   HBG296821
- OMA:   PIIIDAG
- ProtClustDB:   PRK00208
- BioCyc:   RPAL316058:RPB_1956-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00443
- InterPro:   IPR013785
- InterPro:   IPR008867
- Gene3D:   G3DSA:3.20.20.70

Pfam domain/function: PF05690 ThiG; SSF110399 ThiG

EC number: NA

Molecular weight: Translated: 27701; Mature: 27701

Theoretical pI: Translated: 4.85; Mature: 4.85

Prosite motif: NA

Important sites: ACT_SITE 96-96 BINDING 157-157

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
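
The Cys/Met percentages can be reproduced directly from the 260-residue protein sequence in this record. This is a minimal sketch, assuming each percentage is the residue count divided by the total sequence length and rounded to one decimal place; the variable names are illustrative.

```python
# Protein sequence as given in this record (translated and mature are identical).
PROTEIN = (
    "MVKFYDREISSRLLIGSALYPSPAIMQDSIRESGADIVTVSLRREAAGGKAGDQFWSLIRELGVTVLPNTAGCRSVREAV"
    "TTAKLARELFGTAWIKLEVIADNDTLQPDVVGLVEAAQILTKDGFEVFPYCTEDLSVAMRLVDAGCRVIMPWAAPIGSAR"
    "GIVARDALKLLRDRLPDITLVVDAGLGAPSHAAEAMELGYDAVLLNTAIAKAEDPVAMARAFKLAVEAGRTGFEAGLMGA"
    "RDFASPSTPVIGTPFWHAVS"
)


def percent(count: int, total: int) -> float:
    """Share of `count` in `total`, as a percentage rounded to one decimal."""
    return round(100.0 * count / total, 1)


n_cys = PROTEIN.count("C")
n_met = PROTEIN.count("M")
total = len(PROTEIN)

pct_cys = percent(n_cys, total)
pct_met = percent(n_met, total)
pct_both = percent(n_cys + n_met, total)

print(n_cys, n_met, pct_cys, pct_met, pct_both)
```

With 3 cysteines and 7 methionines in the sequence, this yields 1.2, 2.7 and 3.8 percent, matching the values listed above.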

Secondary structure:

>Translated Secondary Structure
MVKFYDREISSRLLIGSALYPSPAIMQDSIRESGADIVTVSLRREAAGGKAGDQFWSLIR
CCEECCHHHHHHEEECCCCCCCCHHHHHHHHHCCCCEEEEEEEHHCCCCCCHHHHHHHHH
ELGVTVLPNTAGCRSVREAVTTAKLARELFGTAWIKLEVIADNDTLQPDVVGLVEAAQIL
HCCCEECCCCHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHH
TKDGFEVFPYCTEDLSVAMRLVDAGCRVIMPWAAPIGSARGIVARDALKLLRDRLPDITL
HCCCCEECCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEE
VVDAGLGAPSHAAEAMELGYDAVLLNTAIAKAEDPVAMARAFKLAVEAGRTGFEAGLMGA
EEECCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHCCCC
RDFASPSTPVIGTPFWHAVS
HHCCCCCCCEECCCCHHCCC
>Mature Secondary Structure
MVKFYDREISSRLLIGSALYPSPAIMQDSIRESGADIVTVSLRREAAGGKAGDQFWSLIR
CCEECCHHHHHHEEECCCCCCCCHHHHHHHHHCCCCEEEEEEEHHCCCCCCHHHHHHHHH
ELGVTVLPNTAGCRSVREAVTTAKLARELFGTAWIKLEVIADNDTLQPDVVGLVEAAQIL
HCCCEECCCCHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHH
TKDGFEVFPYCTEDLSVAMRLVDAGCRVIMPWAAPIGSARGIVARDALKLLRDRLPDITL
HCCCCEECCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEE
VVDAGLGAPSHAAEAMELGYDAVLLNTAIAKAEDPVAMARAFKLAVEAGRTGFEAGLMGA
EEECCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHCCCC
RDFASPSTPVIGTPFWHAVS
HHCCCCCCCEECCCCHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]

Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA