Definition Frankia sp. EAN1pec chromosome, complete genome.
Accession NC_009921
Length 8,982,042

The map label for this gene is thiG

Identifier: 158313672

GI number: 158313672

Start: 2205245

End: 2206156

Strand: Direct

Name: thiG

Synonym: Franean1_1836

Alternate gene names: 158313672

Gene position: 2205245-2206156 (Clockwise)
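The coordinates above can be checked against the sequence header (912 bases) and the translated length (303 residues). The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function name `gene_span` is hypothetical and used only for illustration.

```python
def gene_span(start: int, end: int) -> dict:
    """Summarize a gene from 1-based, inclusive start/end coordinates."""
    bases = end - start + 1          # inclusive span
    if bases % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = bases // 3
    # Assumption: the last codon is a stop codon and is not translated.
    return {"bases": bases, "codons": codons, "residues": codons - 1}


span = gene_span(2205245, 2206156)
print(span)
```

Under these assumptions the span works out to 912 bases, 304 codons and 303 translated residues, which agrees with the values listed elsewhere in this record.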

Preceding gene: 158313669

Following gene: 158313673

Centisome position: 24.55
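The centisome position appears to be the gene start expressed as a percentage of the chromosome length. A minimal sketch, assuming that definition and using the start coordinate and genome length given in this record (the function name `centisome` is hypothetical):

```python
def centisome(start: int, genome_length: int) -> float:
    """Return the start coordinate as a percentage of the genome length."""
    return 100.0 * start / genome_length


position = round(centisome(2205245, 8982042), 2)
print(position)  # 24.55
```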

GC content: 74.89
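GC content is conventionally the percentage of G and C bases in the sequence. The sketch below assumes that convention and counts only A, C, G and T; the function name `gc_content` is hypothetical. It can be applied to the 912-base gene sequence listed in this record.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases of a sequence."""
    seq = "".join(sequence.split()).upper()   # drop line breaks and spaces
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# First 24 bases of the gene sequence, used here only as a short example.
print(gc_content("ATGACACAGCGGGTCACCGAGGTC"))  # 62.5
```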

Gene sequence:

>912_bases
ATGACACAGCGGGTCACCGAGGTCCGACGCGAGCCGGACAAGTCCGATCATCCGGAGTCCGATCATTCGGAGTCCGATCA
TCCGGAGCGCGGTCATCCCGACCCGTTCCGGATCGCCGGCACCGTCTACGCCAGCCGGCTCCTCGTCGGCACCGGCAAGT
TCGCGAGCCATCCGGTCATGCGCGACAGCCTGGTCGCCTCGGGGGCGGACATCGTCACCGTCGCCCTGCGCCGGGTCGAC
CTGAGCCGCGCGGGGGAGGGCGACGTGCTCGACTTCGTCCCGGCCGGCATGACGCTGCTGCCGAACACCTCCGGCGCGCA
GGACGCGGCCGAGGCGCTGCGGCTGGCCCGGCTCGGCCGCGCGGCGACCGGGACGTCCCTGGTGAAGCTGGAGGTCACGC
CGGATCCGCGCACCCTCGCGCCGGACCCGATCGAGACGCTGCGCGCCGCCGAGCTGATGGTCGCCGACGGGTTCACCGTG
CTCCCGTACTGCTCGGCCGACCCGGTGCTGGCACGCCGGCTCGAGGAGGCCGGCTGCGCCACGGTGATGCCGCTGGGTAG
CTGGATCGGTTCCAACCGCGGCCTGCGCACCCGCGACGCGATCGAGGCGATCGTGGAGACCGCCGGGGTCCCGGTGGTGG
TGGACGCCGGCATCGGCGCGCCCTCCGACGCCGCCGAGGCGATGGAGATCGGGGCGGACGCGGTGCTCGTCAACACGGCG
ATCGCGATCGCCGCCGACCCGGTCGCGATGGCCCGGGCCTTCGCGCTCGCGACCATCGCCGGGCGGATGGCCCACCTCGC
CGGCAGGCCGCGGGCGGGCAGCGCCACCGTGGCCGAGGCGTCCTCTCCGCTCACCGGTTTCCTGGGCGCGGTACCCGGCG
GCCTGCCCGGTCTGCCCGGCGGGGGCGGCTGA

Upstream 100 bases:

>100_bases
ACCGGGGAGCCCGCGGACGCGGGCTGAGAGGCCGGCCAGACTGCCGCCGGCGACCCGCATACCCGATCGCGGTCATGCGC
GCGTGGGGAGGAACCATCTG

Downstream 100 bases:

>100_bases
TGGCCAGCCCGGCAGGGCTGTTCGCCCGCGAGCTCGCCGCGCTCGACATCCCGGCGCTCGCCCGTGTCTCGGTCGAGGCC
GACGAGGCGCGGGTCGACGC

Product: thiazole synthase

Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]

Alternate protein names: NA

Number of amino acids: Translated: 303; Mature: 302

Sequences:

>Translated_303_residues
MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVD
LSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTV
LPYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA
IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG
>Mature_302_residues
TQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVDL
SRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTVL
PYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTAI
AIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG

Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H(2)S.

COG id: COG2022

COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiG family

Homologues:

Organism=Escherichia coli, GI48994993, Length=255, Percent_Identity=58.8235294117647, Blast_Score=278, Evalue=2e-76
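A homologue entry of this form can be split into named fields for downstream use. The sketch below is one possible parser, not part of the database itself; the function name `parse_homologue` is hypothetical. It also back-calculates the number of identical positions, assuming the percent identity was computed over the listed length of 255.

```python
def parse_homologue(line: str) -> dict:
    """Split a comma-separated homologue entry into a field dictionary."""
    fields = {}
    for part in line.strip().strip(",").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]      # bare GI number without a key
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI48994993, Length=255, "
    "Percent_Identity=58.8235294117647, Blast_Score=278, Evalue=2e-76"
)
# Assumption: identity was computed over the listed length.
identical = round(int(hit["Length"]) * float(hit["Percent_Identity"]) / 100)
print(hit["Organism"], hit["GI"], identical)
```

Under that assumption, the long decimal corresponds to 150 identical positions out of 255.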

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THIG_FRASN (A8LFH1)

Other databases:

- EMBL:   CP000820
- RefSeq:   YP_001506180.1
- ProteinModelPortal:   A8LFH1
- SMR:   A8LFH1
- GeneID:   5670238
- GenomeReviews:   CP000820_GR
- KEGG:   fre:Franean1_1836
- HOGENOM:   HBG296821
- OMA:   VAIRRTN
- ProtClustDB:   PRK00208
- BioCyc:   FSP1855:FRANEAN1_1836-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00443
- InterPro:   IPR013785
- InterPro:   IPR008867
- Gene3D:   G3DSA:3.20.20.70

Pfam domain/function: PF05690 ThiG; SSF110399 ThiG

EC number: NA

Molecular weight: Translated: 31014; Mature: 30882

Theoretical pI: Translated: 4.91; Mature: 4.91

Prosite motif: NA

Important sites: ACT_SITE 129-129; BINDING 190-190
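The site coordinates can be mapped back to residues in the protein sequence. This sketch assumes the positions are 1-based and refer to the translated (303-residue) sequence rather than the mature one; the helper name `residue_at` is hypothetical.

```python
# Translated protein sequence as listed in this record (303 residues).
PROTEIN = (
    "MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVD"
    "LSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTV"
    "LPYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA"
    "IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG"
)


def residue_at(sequence: str, position: int) -> str:
    """Return the residue at a 1-based position."""
    if not 1 <= position <= len(sequence):
        raise IndexError("position outside the sequence")
    return sequence[position - 1]


print(residue_at(PROTEIN, 129))  # K
print(residue_at(PROTEIN, 190))  # G
```

With this numbering, the annotated active site falls on a lysine and the binding site on a glycine.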

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
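The percentages above can be reproduced by counting residues directly. This is a minimal sketch, assuming the mature protein is the translated sequence with the initiator methionine removed and that values are rounded to one decimal place; the function name `cys_met_content` is hypothetical.

```python
# Translated protein sequence as listed in this record (303 residues).
TRANSLATED = (
    "MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVD"
    "LSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTV"
    "LPYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA"
    "IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG"
)
# Assumption: the mature form lacks only the initiator methionine.
MATURE = TRANSLATED[1:]


def cys_met_content(sequence: str) -> dict:
    """Return Cys, Met and combined content as percentages (one decimal)."""
    n = len(sequence)
    cys = sequence.count("C")
    met = sequence.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


print(cys_met_content(TRANSLATED))  # {'Cys': 0.7, 'Met': 2.6, 'Cys+Met': 3.3}
print(cys_met_content(MATURE))      # {'Cys': 0.7, 'Met': 2.3, 'Cys+Met': 3.0}
```

The translated sequence contains 2 cysteines and 8 methionines; removing the initiator methionine leaves 7 methionines in 302 residues, which accounts for the lower mature values.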

Secondary structure:

>Translated Secondary Structure
MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVM
CCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHEECCCHHCCCCH
RDSLVASGADIVTVALRRVDLSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGR
HHHHHHCCCHHHHHHHHHHHHHCCCCCCEEECCCCCEEECCCCCCHHHHHHHHHHHHHCC
AATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTVLPYCSADPVLARRLEEAGCA
CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHEECCCEEEECCCCCHHHHHHHHHCCCE
TVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA
EEEEHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHCCCEEEEHHH
IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPG
HHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCHHHHHHCCCCCCCCCCC
GGG
CCC
>Mature Secondary Structure 
TQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVM
CCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHEECCCHHCCCCH
RDSLVASGADIVTVALRRVDLSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGR
HHHHHHCCCHHHHHHHHHHHHHCCCCCCEEECCCCCEEECCCCCCHHHHHHHHHHHHHCC
AATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTVLPYCSADPVLARRLEEAGCA
CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHEECCCEEEECCCCCHHHHHHHHHCCCE
TVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA
EEEEHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHCCCEEEEHHH
IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPG
HHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCHHHHHHCCCCCCCCCCC
GGG
CCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]

Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA