Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is thiG
Identifier: 15889824
GI number: 15889824
Start: 2537959
End: 2538732
Strand: Reverse
Name: thiG
Synonym: Atu2566
Alternate gene names: 15889824
Gene position: 2538732-2537959 (Counterclockwise)
Preceding gene: 15889825
Following gene: 159185269
Centisome position: 89.34
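The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the first transcribed base of the gene (the larger coordinate, since the gene lies on the reverse strand) expressed as a percentage of the chromosome length. That convention is inferred from the numbers in this record rather than stated by it, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 2_841_580          # "Length" field of the record
START, END = 2_537_959, 2_538_732  # "Start" and "End" fields

length = gene_length(START, END)
residues = length // 3 - 1         # one codon is the stop codon

print(length, residues)                          # 774 257
print(round(centisome(END, GENOME_LENGTH), 2))   # 89.34
```

Under these assumptions the result agrees with the 774-base gene sequence, the 257-residue protein, and the centisome value listed in the record.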
GC content: 61.76
Gene sequence:
>774_bases ATGCTGACGCTTTATGGCCGCGAGGTCTCGTCCCGCCTTCTGCTCGGCACGGCGCGTTATCCGTCTCCCGCTGTGCTTGC GGATGCGGTGCGGGCCAGCAATACGGATATTCTGACGATTTCGTTGCGACGGGAAATGGCAGGGGCCAAGAAGGGCGGTC AGTTTTTCGAGCTGATCCGCGAGCTGGATCGCCATATCCTGCCCAATACGGCTGGTTGCCACACCGCCAAGGAAGCGGTG CTGACGGCGAAGATGGCCCGCGAGGTATTCCGTACGGACTGGATCAAGCTGGAGGTCATTGGCCATCACGATACGTTGCA GCCCGATGTCTTTGCGCTTGTCGAGGCGGCAAAAATCCTGTGTGACGAGGGTTTCGAAGTCTTTCCCTATACGACGGACG ATTTGGTCGTGGCCGAGAAACTGCTGGAGGCTGGTTGCAGGGTGCTGATGCCCTGGTGCGCGCCGATCGGCAGCGCCATG GGGCCGCTCAATCTGACGGCGCTGAAATCGATGCGGGCGCGGTTTCCCGAAGTACCACTGATCGTGGATGCCGGTATCGG CCGGCCCTCCCATGCGGTGACCGTGATGGAGCTTGGTTACGATGCCGTTCTCCTCAACACGGCGGTTGCGGGTGCGGGCG ACCCAGTCGGCATGGCGGAGGCTTTTGCCCGCGCCATAGAGGCAGGCCATCAGGCGTATCTTTCGGGGCCGCTGGAGCCA CGAGACATGGCCGTTCCTTCGACCCCTGTTATCGGGACGGCGGTTTTCTCCTGA
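The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is an illustrative helper, not part of any database tooling, and the example uses only the first 30 bases of the gene, so it does not reproduce the 61.76 reported for the full 774 bases.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 30 bases of the thiG gene sequence shown above.
fragment = "ATGCTGACGCTTTATGGCCGCGAGGTCTCG"
print(gc_content(fragment))  # 60.0
```

Stripping whitespace first lets the function accept the sequence exactly as printed in the record, with its spaces every 80 bases.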
Upstream 100 bases:
>100_bases GTCAATGGCGATCTCGTGCATTCGGAGGAACGCGCCGGTTACGTGCTGAAGGCGTTCGACCGTGTCGAAATCCTTTCGCC GATGCAGGGAGGTTGAGACC
Downstream 100 bases:
>100_bases TCGCCGCTTTCCGCCAAGGTCGGGCATGTCACCTTTTGCGTTGACAGTGTCCCGTGGCGGCGGGATACCGGGAGTAAAGG GGAACAGCATGGCGGCACAC
Product: thiazole synthase
Products: 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate; 4-hydroxy-benzyl-alcohol; C1 of tyrosine; ThiS protein [C]
Alternate protein names: NA
Number of amino acids: Translated: 257; Mature: 257
Protein sequence:
>257_residues MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP RDMAVPSTPVIGTAVFS
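The protein sequence corresponds to a straightforward translation of the gene sequence, which is already given on the coding strand (it starts with ATG and ends with the stop codon TGA). The sketch below builds the standard codon table, whose codon assignments are the same ones used for bacteria, and translates until the first stop codon. `translate` and `CODON_TABLE` are illustrative names, and only short fragments from the two ends of the gene are translated here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTGACGCTTTATGGCCGCGAGGTCTCG"))  # MLTLYGREVS
print(translate("GTTTTCTCCTGA"))                    # VFS
```

The two fragments are the first 30 and last 12 bases of the gene, and their translations match the first ten and last three residues of the protein sequence above.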
Sequences:
>Translated_257_residues MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP RDMAVPSTPVIGTAVFS
>Mature_257_residues MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP RDMAVPSTPVIGTAVFS
Specific function: Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H(2)S.
COG id: COG2022
COG function: function code H; Uncharacterized enzyme of thiazole biosynthesis
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the thiG family
Homologues:
Organism=Escherichia coli, GI48994993, Length=252, Percent_Identity=39.6825396825397, Blast_Score=172, Evalue=2e-44
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): THIG_AGRT5 (Q8UCD2)
Other databases:
- EMBL: AE007869
- PIR: A97667
- PIR: AE2891
- RefSeq: NP_355505.1
- ProteinModelPortal: Q8UCD2
- SMR: Q8UCD2
- STRING: Q8UCD2
- GeneID: 1134604
- GenomeReviews: AE007869_GR
- KEGG: atu:Atu2566
- eggNOG: COG2022
- HOGENOM: HBG296821
- OMA: VAIRRTN
- PhylomeDB: Q8UCD2
- ProtClustDB: PRK00208
- BioCyc: ATUM176299-1:ATU2566-MONOMER
- GO: GO:0005737
- HAMAP: MF_00443
- InterPro: IPR013785
- InterPro: IPR008867
- Gene3D: G3DSA:3.20.20.70
Pfam domain/function: PF05690 ThiG; SSF110399 ThiG
EC number: NA
Molecular weight: Translated: 27582; Mature: 27582
Theoretical pI: Translated: 5.64; Mature: 5.64
Prosite motif: NA
Important sites: ACT_SITE 96-96 BINDING 157-157
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.6 %Cys (Translated Protein)
3.5 %Met (Translated Protein)
5.1 %Cys+Met (Translated Protein)
1.6 %Cys (Mature Protein)
3.5 %Met (Mature Protein)
5.1 %Cys+Met (Mature Protein)
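The Cys/Met percentages can be reproduced by counting residues in the protein sequence. The sketch below assumes the values are simple residue counts divided by the sequence length and rounded to one decimal place; `cys_met_content` is an illustrative helper name, and the sequence is the 257-residue protein from this record.

```python
# 257-residue ThiG protein sequence, copied from the record.
PROTEIN = (
    "MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIRELDRHILPNTAGCHTAKEAV"
    "LTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKILCDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAM"
    "GPLNLTALKSMRARFPEVPLIVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP"
    "RDMAVPSTPVIGTAVFS"
)


def cys_met_content(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


print(PROTEIN.count("C"), PROTEIN.count("M"))  # 4 9
print(cys_met_content(PROTEIN))                # (1.6, 3.5, 5.1)
```

The translated and mature proteins have the same sequence in this record, so a single calculation covers both sets of values.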
Secondary structure:
>Translated Secondary Structure
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIR
CEEEECCCHHCCEEEEECCCCCHHHHHHHHHCCCCCEEEEEEHHHHCCCCCCHHHHHHHH
ELDRHILPNTAGCHTAKEAVLTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKIL
HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHH
CDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAMGPLNLTALKSMRARFPEVPL
HHCCCEEECCCCHHHHHHHHHHHCCHHEEHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCE
IVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
EEECCCCCCCCEEEEECCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHCCCCCC
RDMAVPSTPVIGTAVFS
CCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MLTLYGREVSSRLLLGTARYPSPAVLADAVRASNTDILTISLRREMAGAKKGGQFFELIR
CEEEECCCHHCCEEEEECCCCCHHHHHHHHHCCCCCEEEEEEHHHHCCCCCCHHHHHHHH
ELDRHILPNTAGCHTAKEAVLTAKMAREVFRTDWIKLEVIGHHDTLQPDVFALVEAAKIL
HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHH
CDEGFEVFPYTTDDLVVAEKLLEAGCRVLMPWCAPIGSAMGPLNLTALKSMRARFPEVPL
HHCCCEEECCCCHHHHHHHHHHHCCHHEEHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCE
IVDAGIGRPSHAVTVMELGYDAVLLNTAVAGAGDPVGMAEAFARAIEAGHQAYLSGPLEP
EEECCCCCCCCEEEEECCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHCCCCCC
RDMAVPSTPVIGTAVFS
CCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: deoxyxylulose-5-phosphate; ThiS-COSH; L-tyrosine [C]
Specific reaction: deoxyxylulose-5-phosphate + ThiS-COSH + L-tyrosine = 4-methyl-5-(beta-hydroxyethyl)thiazole phosphate + 4-hydroxy-benzyl-alcohol + C1 of tyrosine + ThiS protein [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11743193; 11743194