Definition | Serratia proteamaculans 568 chromosome, complete genome. |
---|---|
Accession | NC_009832 |
Length | 5,448,853 |
The map label for this gene is thiM
Identifier: 157371813
GI number: 157371813
Start: 3953585
End: 3954379
Strand: Reverse
Name: thiM
Synonym: Spro_3578
Alternate gene names: 157371813
Gene position: 3954379-3953585 (Counterclockwise)
Preceding gene: 157371815
Following gene: 157371812
Centisome position: 72.57
GC content: 64.28
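The coordinate fields above are related by simple arithmetic. The sketch below recomputes the gene length and the centisome position from the values in this record. It assumes 1-based inclusive coordinates, and it assumes the centisome is taken at the 5' end of the gene (the End coordinate for a reverse-strand gene). The record does not state either convention, and the helper names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 5_448_853
START, END = 3_953_585, 3_954_379  # assumed 1-based, inclusive
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases for inclusive coordinates."""
    return end - start + 1


def five_prime(start: int, end: int, strand: str) -> int:
    """Coordinate of the gene's 5' end: End on the reverse strand, Start otherwise."""
    return end if strand.lower() == "reverse" else start


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
print(length)           # 795 bases
print(length // 3 - 1)  # 264 codons once the stop codon is excluded
print(round(centisome(five_prime(START, END, STRAND), GENOME_LENGTH), 2))  # 72.57
```

Under these assumptions the results agree with the 795-base gene sequence, the 264 amino acids, and the centisome position of 72.57 reported in this record. Using the Start coordinate instead gives 72.56.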
Gene sequence:
>795_bases ATGCTCGCTCGACCTGATGTTTTCCCCGGCGCCCGCGCCGCGGCCTGTTTAACCCAATTTAAACGCCAATCCCCATTGAT TCACTGCCTGACCAACGAAGTGGTGCAGGAGCTGACGGCTAACGTGTTGCTGGCGCTCGGTGCTTCCCCGGCGATGGTGG TTGAGCCGACCGAAGCGGCACAGTTCAGCCGTTTGGCAGACGCTTTGTTGATCAATATCGGCACCCTCAATGCATCACGC GCCGAATCCATGCTGGCGGCGATAGAAGCTGCCAATGCGGCAGGCACGCCCTGGACGCTGGATCCGGTCGCCGTCGGTGG GCTGGCGTATCGCACCGCCTTTGCCCAAAGTTTGCTGGGCGAAAAGCCGGCGGCAATCCGCGGCAATGCTTCTGAAATTA TGGCGCTGAGTGGCCTGCAGGCGAGTGGGCGCGGCGTCGACAGCGCTGACGATTCACTGGCGGCGTTACCGGCGGCGCGC GAATTGGCACGAAACAGTGGGGCCGTGGTGGCGGTGACCGGCGTGGTGGATTACATCACCGACGGCCAGCGTGACTGGGC GGTTGCCGGGGGCGACGTGCTGATGACGCGAGTGGTGGGTACCGGTTGCGCACTTTCGGCCGTGGTGGCGGCATTTTGCA GCCTGCCGGGCGATCGGTTGGACAACGTGGCGACCGCCTGCCGGGTGATGTCACATTGCGGTGAGATGGCCACGCGCCGC GCTGCCGGGCCGGGCAGTTTTACCCCGGCATTCCTCGACGCGCTGTACCAACTGCGCCCGGAGGATCTGCAATGA
Upstream 100 bases:
>100_bases TGAGATTTTACCCGTATTACCTGATCTGGATTATGCCAGCGTAGGGAAGTCTCGGTGCCCCAACCGGTACCCGCCTTCTT GAACGCCGGTTGGGAGGAAT
Downstream 100 bases:
>100_bases AACGGATTAACGCGTTGACCATCGCCGGTACCGATCCGAGCGGGGGGGCCGGCATCCAGGCCGACTTGAAAGCCTTCTCC GCGCTGGGGGCCTACGGTGC
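A GC content figure such as the one reported above can be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C among all bases and that the spaces in the sequence are only layout. The function name `gc_content` is illustrative, and the sketch has not been checked here against the full 795-base sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and letter case are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Small usage example on the first 20 bases of the gene sequence.
print(round(gc_content("ATGCTCGCTC GACCTGATGT"), 2))
```

Passing the full gene sequence, with the `>795_bases` header removed, would give the value to compare with the GC content field.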
Product: hydroxyethylthiazole kinase
Products: NA
Alternate protein names: 4-methyl-5-beta-hydroxyethylthiazole kinase; TH kinase; Thz kinase
Number of amino acids: Translated: 264; Mature: 264
Protein sequence:
>264_residues MLARPDVFPGARAAACLTQFKRQSPLIHCLTNEVVQELTANVLLALGASPAMVVEPTEAAQFSRLADALLINIGTLNASR AESMLAAIEAANAAGTPWTLDPVAVGGLAYRTAFAQSLLGEKPAAIRGNASEIMALSGLQASGRGVDSADDSLAALPAAR ELARNSGAVVAVTGVVDYITDGQRDWAVAGGDVLMTRVVGTGCALSAVVAAFCSLPGDRLDNVATACRVMSHCGEMATRR AAGPGSFTPAFLDALYQLRPEDLQ
Sequences:
>Translated_264_residues MLARPDVFPGARAAACLTQFKRQSPLIHCLTNEVVQELTANVLLALGASPAMVVEPTEAAQFSRLADALLINIGTLNASR AESMLAAIEAANAAGTPWTLDPVAVGGLAYRTAFAQSLLGEKPAAIRGNASEIMALSGLQASGRGVDSADDSLAALPAAR ELARNSGAVVAVTGVVDYITDGQRDWAVAGGDVLMTRVVGTGCALSAVVAAFCSLPGDRLDNVATACRVMSHCGEMATRR AAGPGSFTPAFLDALYQLRPEDLQ
>Mature_264_residues MLARPDVFPGARAAACLTQFKRQSPLIHCLTNEVVQELTANVLLALGASPAMVVEPTEAAQFSRLADALLINIGTLNASR AESMLAAIEAANAAGTPWTLDPVAVGGLAYRTAFAQSLLGEKPAAIRGNASEIMALSGLQASGRGVDSADDSLAALPAAR ELARNSGAVVAVTGVVDYITDGQRDWAVAGGDVLMTRVVGTGCALSAVVAAFCSLPGDRLDNVATACRVMSHCGEMATRR AAGPGSFTPAFLDALYQLRPEDLQ
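The translated protein follows from the gene sequence by reading codons from the first base up to the stop codon. The sketch below uses the standard genetic code, whose amino acid assignments are shared by the bacterial translation table. The names `CODON_TABLE` and `translate` are illustrative and are not taken from any tool used to build this record.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate frame 1 of a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks a codon with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven and last five codons of the gene sequence in this record.
print(translate("ATGCTCGCTCGACCTGATGTT"))  # MLARPDV
print(translate("GAGGATCTGCAATGA"))        # EDLQ
```

Both fragments match the start (MLARPDV) and end (EDLQ) of the protein sequence listed above.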
Specific function: Thiamine biosynthesis. [C]
COG id: COG2145
COG function: function code H; Hydroxyethylthiazole kinase, sugar kinase family
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the Thz kinase family
Homologues:
Organism=Escherichia coli, GI1788421, Length=256, Percent_Identity=61.71875, Blast_Score=310, Evalue=6e-86
Organism=Saccharomyces cerevisiae, GI6325042, Length=275, Percent_Identity=31.6363636363636, Blast_Score=107, Evalue=2e-24
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): THIM_SERP5 (A8GHT4)
Other databases:
- EMBL: CP000826
- RefSeq: YP_001479802.1
- ProteinModelPortal: A8GHT4
- SMR: A8GHT4
- STRING: A8GHT4
- GeneID: 5605594
- GenomeReviews: CP000826_GR
- KEGG: spe:Spro_3578
- eggNOG: COG2145
- HOGENOM: HBG351126
- OMA: AIRGNAG
- ProtClustDB: PRK09355
- BioCyc: SPRO399741:SPRO_3578-MONOMER
- HAMAP: MF_00228
- InterPro: IPR000417
- PANTHER: PTHR20857:SF14
- PIRSF: PIRSF000513
- PRINTS: PR01099
- TIGRFAMs: TIGR00694
Pfam domain/function: PF02110 HK
EC number: 2.7.1.50
Molecular weight: Translated: 27151; Mature: 27151
Theoretical pI: Translated: 4.72; Mature: 4.72
Prosite motif: NA
Important sites: BINDING 52-52; BINDING 127-127; BINDING 173-173; BINDING 200-200
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.3 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
4.9 %Cys+Met (Translated Protein)
2.3 %Cys (Mature Protein)
2.7 %Met (Mature Protein)
4.9 %Cys+Met (Mature Protein)
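The Cys/Met percentages can be recomputed by counting residues in the protein sequence. This is a minimal sketch, assuming each percentage is the residue count divided by the full sequence length and rounded to one decimal place. The function name `residue_percent` is illustrative.

```python
# Protein sequence copied from this record (264 residues).
PROTEIN = (
    "MLARPDVFPGARAAACLTQFKRQSPLIHCLTNEVVQELTANVLLALGASPAMVVEPTEAAQFSRLADALLINIGTLNASR"
    "AESMLAAIEAANAAGTPWTLDPVAVGGLAYRTAFAQSLLGEKPAAIRGNASEIMALSGLQASGRGVDSADDSLAALPAAR"
    "ELARNSGAVVAVTGVVDYITDGQRDWAVAGGDVLMTRVVGTGCALSAVVAAFCSLPGDRLDNVATACRVMSHCGEMATRR"
    "AAGPGSFTPAFLDALYQLRPEDLQ"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    hits = sum(protein.count(r) for r in residues)
    return 100.0 * hits / len(protein)


print(round(residue_percent(PROTEIN, "C"), 1))   # 2.3 (6 Cys)
print(round(residue_percent(PROTEIN, "M"), 1))   # 2.7 (7 Met)
print(round(residue_percent(PROTEIN, "CM"), 1))  # 4.9 (13 Cys+Met)
```

Because the translated and mature sequences in this record are identical, the same figures apply to both.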
Secondary structure:
>Translated Secondary Structure
MLARPDVFPGARAAACLTQFKRQSPLIHCLTNEVVQELTANVLLALGASPAMVVEPTEAA
CCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCHHHH
QFSRLADALLINIGTLNASRAESMLAAIEAANAAGTPWTLDPVAVGGLAYRTAFAQSLLG
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC
EKPAAIRGNASEIMALSGLQASGRGVDSADDSLAALPAARELARNSGAVVAVTGVVDYIT
CCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEHHHHHHHH
DGQRDWAVAGGDVLMTRVVGTGCALSAVVAAFCSLPGDRLDNVATACRVMSHCGEMATRR
CCCCCEEEECCHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
AAGPGSFTPAFLDALYQLRPEDLQ
CCCCCCCCHHHHHHHHHCCCCCCC
>Mature Secondary Structure
MLARPDVFPGARAAACLTQFKRQSPLIHCLTNEVVQELTANVLLALGASPAMVVEPTEAA
CCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCHHHH
QFSRLADALLINIGTLNASRAESMLAAIEAANAAGTPWTLDPVAVGGLAYRTAFAQSLLG
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC
EKPAAIRGNASEIMALSGLQASGRGVDSADDSLAALPAARELARNSGAVVAVTGVVDYIT
CCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEHHHHHHHH
DGQRDWAVAGGDVLMTRVVGTGCALSAVVAAFCSLPGDRLDNVATACRVMSHCGEMATRR
CCCCCEEEECCHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
AAGPGSFTPAFLDALYQLRPEDLQ
CCCCCCCCHHHHHHHHHCCCCCCC
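The secondary structure listing alternates blocks of residues with blocks of per-residue state letters. The sketch below separates the two tracks and summarises the state composition. It assumes the common convention that H is helix, E is strand and C is coil, which this record does not define. The function names are illustrative.

```python
from collections import Counter


def split_interleaved(text: str) -> tuple[str, str]:
    """Split alternating sequence/prediction blocks into two joined strings."""
    blocks = text.split()
    sequence = "".join(blocks[0::2])    # 1st, 3rd, 5th ... block: residues
    prediction = "".join(blocks[1::2])  # 2nd, 4th, 6th ... block: state letters
    if len(sequence) != len(prediction):
        raise ValueError("sequence and prediction lengths differ")
    return sequence, prediction


def state_percentages(prediction: str) -> dict[str, float]:
    """Percentage of residues in each predicted state."""
    counts = Counter(prediction)
    return {state: 100.0 * n / len(prediction) for state, n in counts.items()}


# Usage on the last block pair of the listing above.
seq, pred = split_interleaved("AAGPGSFTPAFLDALYQLRPEDLQ CCCCCCCCHHHHHHHHHCCCCCCC")
print(seq)
print(state_percentages(pred))
```

Applied to the whole listing, with the header line removed, this gives the helix, strand and coil fractions that could be set against the Structure class field.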
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA