Definition | Frankia sp. EAN1pec chromosome, complete genome. |
---|---|
Accession | NC_009921 |
Length | 8,982,042 |
Click here to switch to the map view.
The map label for this gene is thiG
Identifier: 158313672
GI number: 158313672
Start: 2205245
End: 2206156
Strand: Direct
Name: thiG
Synonym: Franean1_1836
Alternate gene names: 158313672
Gene position: 2205245-2206156 (Clockwise)
Preceding gene: 158313669
Following gene: 158313673
Centisome position: 24.55
GC content: 74.89
Gene sequence:
>912_bases ATGACACAGCGGGTCACCGAGGTCCGACGCGAGCCGGACAAGTCCGATCATCCGGAGTCCGATCATTCGGAGTCCGATCA TCCGGAGCGCGGTCATCCCGACCCGTTCCGGATCGCCGGCACCGTCTACGCCAGCCGGCTCCTCGTCGGCACCGGCAAGT TCGCGAGCCATCCGGTCATGCGCGACAGCCTGGTCGCCTCGGGGGCGGACATCGTCACCGTCGCCCTGCGCCGGGTCGAC CTGAGCCGCGCGGGGGAGGGCGACGTGCTCGACTTCGTCCCGGCCGGCATGACGCTGCTGCCGAACACCTCCGGCGCGCA GGACGCGGCCGAGGCGCTGCGGCTGGCCCGGCTCGGCCGCGCGGCGACCGGGACGTCCCTGGTGAAGCTGGAGGTCACGC CGGATCCGCGCACCCTCGCGCCGGACCCGATCGAGACGCTGCGCGCCGCCGAGCTGATGGTCGCCGACGGGTTCACCGTG CTCCCGTACTGCTCGGCCGACCCGGTGCTGGCACGCCGGCTCGAGGAGGCCGGCTGCGCCACGGTGATGCCGCTGGGTAG CTGGATCGGTTCCAACCGCGGCCTGCGCACCCGCGACGCGATCGAGGCGATCGTGGAGACCGCCGGGGTCCCGGTGGTGG TGGACGCCGGCATCGGCGCGCCCTCCGACGCCGCCGAGGCGATGGAGATCGGGGCGGACGCGGTGCTCGTCAACACGGCG ATCGCGATCGCCGCCGACCCGGTCGCGATGGCCCGGGCCTTCGCGCTCGCGACCATCGCCGGGCGGATGGCCCACCTCGC CGGCAGGCCGCGGGCGGGCAGCGCCACCGTGGCCGAGGCGTCCTCTCCGCTCACCGGTTTCCTGGGCGCGGTACCCGGCG GCCTGCCCGGTCTGCCCGGCGGGGGCGGCTGA
Upstream 100 bases:
>100_bases ACCGGGGAGCCCGCGGACGCGGGCTGAGAGGCCGGCCAGACTGCCGCCGGCGACCCGCATACCCGATCGCGGTCATGCGC GCGTGGGGAGGAACCATCTG
Downstream 100 bases:
>100_bases TGGCCAGCCCGGCAGGGCTGTTCGCCCGCGAGCTCGCCGCGCTCGACATCCCGGCGCTCGCCCGTGTCTCGGTCGAGGCC GACGAGGCGCGGGTCGACGC
Product: thiazole synthase
Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]
Alternate protein names: NA
Number of amino acids: Translated: 303; Mature: 302
Protein sequence:
>303_residues MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVD LSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTV LPYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG
Sequences:
>Translated_303_residues MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVD LSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTV LPYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG >Mature_302_residues TQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVMRDSLVASGADIVTVALRRVDL SRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGRAATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTVL PYCSADPVLARRLEEAGCATVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTAI AIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPGGGG
Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5- phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H(2)S
COG id: COG2022
COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the thiG family
Homologues:
Organism=Escherichia coli, GI48994993, Length=255, Percent_Identity=58.8235294117647, Blast_Score=278, Evalue=2e-76,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): THIG_FRASN (A8LFH1)
Other databases:
- EMBL: CP000820 - RefSeq: YP_001506180.1 - ProteinModelPortal: A8LFH1 - SMR: A8LFH1 - GeneID: 5670238 - GenomeReviews: CP000820_GR - KEGG: fre:Franean1_1836 - HOGENOM: HBG296821 - OMA: VAIRRTN - ProtClustDB: PRK00208 - BioCyc: FSP1855:FRANEAN1_1836-MONOMER - GO: GO:0005737 - HAMAP: MF_00443 - InterPro: IPR013785 - InterPro: IPR008867 - Gene3D: G3DSA:3.20.20.70
Pfam domain/function: PF05690 ThiG; SSF110399 ThiG
EC number: NA
Molecular weight: Translated: 31014; Mature: 30882
Theoretical pI: Translated: 4.91; Mature: 4.91
Prosite motif: NA
Important sites: ACT_SITE 129-129 BINDING 190-190
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 2.6 %Met (Translated Protein) 3.3 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 2.3 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVM CCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHEECCCHHCCCCH RDSLVASGADIVTVALRRVDLSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGR HHHHHHCCCHHHHHHHHHHHHHCCCCCCEEECCCCCEEECCCCCCHHHHHHHHHHHHHCC AATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTVLPYCSADPVLARRLEEAGCA CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHEECCCEEEECCCCCHHHHHHHHHCCCE TVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA EEEEHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHCCCEEEEHHH IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPG HHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCHHHHHHCCCCCCCCCCC GGG CCC >Mature Secondary Structure TQRVTEVRREPDKSDHPESDHSESDHPERGHPDPFRIAGTVYASRLLVGTGKFASHPVM CCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHEECCCHHCCCCH RDSLVASGADIVTVALRRVDLSRAGEGDVLDFVPAGMTLLPNTSGAQDAAEALRLARLGR HHHHHHCCCHHHHHHHHHHHHHCCCCCCEEECCCCCEEECCCCCCHHHHHHHHHHHHHCC AATGTSLVKLEVTPDPRTLAPDPIETLRAAELMVADGFTVLPYCSADPVLARRLEEAGCA CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHEECCCEEEECCCCCHHHHHHHHHCCCE TVMPLGSWIGSNRGLRTRDAIEAIVETAGVPVVVDAGIGAPSDAAEAMEIGADAVLVNTA EEEEHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHCCCEEEEHHH IAIAADPVAMARAFALATIAGRMAHLAGRPRAGSATVAEASSPLTGFLGAVPGGLPGLPG HHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCHHHHHHCCCCCCCCCCC GGG CCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]
Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA