Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is thiM

Identifier: 187736495

GI number: 187736495

Start: 2448658

End: 2449473

Strand: Direct

Name: thiM

Synonym: Amuc_2016

Alternate gene names: 187736495

Gene position: 2448658-2449473 (Clockwise)

Preceding gene: 187736494

Following gene: 187736496

Centisome position: 91.91
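
The coordinate fields above are enough to re-derive the gene length, the residue count and the centisome position. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming the centisome is the start coordinate expressed as a percentage of the genome length (the record does not state its definition). The function names are illustrative and not part of any database API.

```python
GENOME_LENGTH = 2_664_102            # "Length" field of the genome
START, END = 2_448_658, 2_449_473    # "Start" and "End" fields of the gene


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(START, END)
codons = length // 3        # codon count, including the terminal stop codon
residues = codons - 1       # the stop codon is not translated

print(length, residues, round(centisome(START, GENOME_LENGTH), 2))
# prints: 816 271 91.91
```

Under these assumptions the results agree with the 816-base gene sequence, the 271-residue protein and the centisome value reported in this record.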

GC content: 63.73

Gene sequence:

>816_bases
ATGCTTTCATCGACAGACCTTGTTCAAGCCGTTACCGCCGATTTGGGAAAAATCCGGGAAACGGCCCCGCTGGTTCTCTC
CCTGACCAATTCCGTCGTCCAGCCCCTGACGGCCAATCTCCTGCTGGCCATAGGCGCCGTCCCCGCCATGCTCAACGACG
CGGAAGAAGCGGTGGACATGCTCCGCAGCGGAACAGGCGCCCTGCTGGTCAACCTGGGCACCGTGACGCGTGAACAGGGA
GCCGCCATGCAAACGGCGGTGCGGGAAGCCAACCGGCTGAATATCCCCTGGGTGCTGGATCCTGTGGCCGTAGGGGCTCT
TTCCCTGCGCACGCGGCTGGCGGGGCAATTGAAGGAACAATCCCCCCGCATCATCCGCGGAAACGCTTCTGAAATCATGG
CCCTGGCCGGCTATTCCTCCGTCACGAAAGGGCCGGAAAGCACCAGCTCCAGCGCAGACGCCCTGCATGCGGCCAGGGAA
CTGGCCCTGCACACGGGGGCGGCCGTGCTCGTTACGGGGCGTACGGATTATTCCACTGACGGCCGCCAGGTAACCGCCAC
GGAAAACGGCCACGCCATGATGTCCCGGGTTACGGGCGTGGGCTGTTCCATGGGAGCCCTGTCCGCCGCCTGCGCCGCCG
TCTCCCCCACCCCCCTGCAGGCGGCCGTTTCCACAGCCGTACTCATGGGCATTGCCGGAGAAATGGCCTTTGAACAAAGC
CCCTCCCCCGGTTCCTTTGCCGTATCATTGCTGGACAGCCTTTACGCCCTTTCTCCGGAAGACGTTGTCCGCAGAGCGCG
CTTTCTTTCCCTTTGA
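
The GC content reported above can be checked directly against the gene sequence. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full 816-base coding sequence; `gc_content` is an illustrative helper name.

```python
# Gene sequence copied from the record above (816 bases).
GENE_SEQ = (
    "ATGCTTTCATCGACAGACCTTGTTCAAGCCGTTACCGCCGATTTGGGAAAAATCCGGGAAACGGCCCCGCTGGTTCTCTC"
    "CCTGACCAATTCCGTCGTCCAGCCCCTGACGGCCAATCTCCTGCTGGCCATAGGCGCCGTCCCCGCCATGCTCAACGACG"
    "CGGAAGAAGCGGTGGACATGCTCCGCAGCGGAACAGGCGCCCTGCTGGTCAACCTGGGCACCGTGACGCGTGAACAGGGA"
    "GCCGCCATGCAAACGGCGGTGCGGGAAGCCAACCGGCTGAATATCCCCTGGGTGCTGGATCCTGTGGCCGTAGGGGCTCT"
    "TTCCCTGCGCACGCGGCTGGCGGGGCAATTGAAGGAACAATCCCCCCGCATCATCCGCGGAAACGCTTCTGAAATCATGG"
    "CCCTGGCCGGCTATTCCTCCGTCACGAAAGGGCCGGAAAGCACCAGCTCCAGCGCAGACGCCCTGCATGCGGCCAGGGAA"
    "CTGGCCCTGCACACGGGGGCGGCCGTGCTCGTTACGGGGCGTACGGATTATTCCACTGACGGCCGCCAGGTAACCGCCAC"
    "GGAAAACGGCCACGCCATGATGTCCCGGGTTACGGGCGTGGGCTGTTCCATGGGAGCCCTGTCCGCCGCCTGCGCCGCCG"
    "TCTCCCCCACCCCCCTGCAGGCGGCCGTTTCCACAGCCGTACTCATGGGCATTGCCGGAGAAATGGCCTTTGAACAAAGC"
    "CCCTCCCCCGGTTCCTTTGCCGTATCATTGCTGGACAGCCTTTACGCCCTTTCTCCGGAAGACGTTGTCCGCAGAGCGCG"
    "CTTTCTTTCCCTTTGA"
)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


print(len(GENE_SEQ), round(gc_content(GENE_SEQ), 2))
# prints: 816 63.73
```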

Upstream 100 bases:

>100_bases
CCGCCATTTGCGGAGCGGAGAATCCGGAAACCGCCGCACGGGCTCTTGCCTGACGGACGTACCGCAGGAAATACCGGACA
ATCAAGCACTTTTTTACGCC

Downstream 100 bases:

>100_bases
CAGTCTTCTCTTCCCCGGAAAGCAACTCCGCTGCCATGCACACGGCCTGACCCGGAAATCTCCGGAAGGGGAAGCACATC
ACCAGGAGCTGCCCGGGAGA

Product: Hydroxyethylthiazole kinase

Products: NA

Alternate protein names: 4-methyl-5-beta-hydroxyethylthiazole kinase; TH kinase; Thz kinase

Number of amino acids: Translated: 271; Mature: 271

Protein sequence:

>271_residues
MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDMLRSGTGALLVNLGTVTREQG
AAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQSPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARE
LALHTGAAVLVTGRTDYSTDGRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS
PSPGSFAVSLLDSLYALSPEDVVRRARFLSL

Sequences:

>Translated_271_residues
MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDMLRSGTGALLVNLGTVTREQG
AAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQSPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARE
LALHTGAAVLVTGRTDYSTDGRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS
PSPGSFAVSLLDSLYALSPEDVVRRARFLSL
>Mature_271_residues
MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDMLRSGTGALLVNLGTVTREQG
AAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQSPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARE
LALHTGAAVLVTGRTDYSTDGRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS
PSPGSFAVSLLDSLYALSPEDVVRRARFLSL

Specific function: Thiamine biosynthesis. [C]

COG id: COG2145

COG function: function code H; Hydroxyethylthiazole kinase, sugar kinase family

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the Thz kinase family

Homologues:

Organism=Escherichia coli, GI1788421, Length=246, Percent_Identity=44.7154471544715, Blast_Score=190, Evalue=1e-49,
Organism=Saccharomyces cerevisiae, GI6325042, Length=275, Percent_Identity=29.0909090909091, Blast_Score=83, Evalue=6e-17,
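
The homologue lines above follow a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. The sketch below shows one way to turn such a line into a typed record. The parser is illustrative only: the field handling is inferred from the two lines in this record, not from a published format specification.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for token in line.split(","):
        token = token.strip()
        if not token:
            continue  # tolerate the trailing comma at the end of each line
        if "=" in token:
            key, value = token.split("=", 1)
            record[key.strip()] = value.strip()
        elif token.startswith("GI"):
            record["GI"] = token[2:]  # bare "GI1788421" token has no "="
    # Convert the numeric fields; these key names are taken from the record.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1788421, Length=246, "
        "Percent_Identity=44.7154471544715, Blast_Score=190, Evalue=1e-49,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
# prints: Escherichia coli 1788421 44.72
```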

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THIM_AKKM8 (B2UP55)

Other databases:

- EMBL:   CP001071
- RefSeq:   YP_001878607.1
- ProteinModelPortal:   B2UP55
- SMR:   B2UP55
- GeneID:   6275623
- GenomeReviews:   CP001071_GR
- KEGG:   amu:Amuc_2016
- HOGENOM:   HBG351126
- OMA:   AIRGNAG
- HAMAP:   MF_00228
- InterPro:   IPR000417
- PANTHER:   PTHR20857:SF14
- PIRSF:   PIRSF000513
- PRINTS:   PR01099

Pfam domain/function: PF02110 HK

EC number: 2.7.1.50

Molecular weight: Translated: 27898; Mature: 27898

Theoretical pI: Translated: 5.30; Mature: 5.30

Prosite motif: NA

Important sites: BINDING 50-50 BINDING 126-126 BINDING 172-172 BINDING 199-199
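
To see which residues the binding-site positions refer to, the positions can be looked up in the protein sequence. This is a minimal sketch, assuming the positions are 1-based indices into the 271-residue translated sequence; `site_residues` is an illustrative helper name.

```python
import re

# Protein sequence copied from the record above (271 residues).
PROTEIN = (
    "MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDMLRSGTGALLVNLGTVTREQG"
    "AAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQSPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARE"
    "LALHTGAAVLVTGRTDYSTDGRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS"
    "PSPGSFAVSLLDSLYALSPEDVVRRARFLSL"
)
SITES = "BINDING 50-50 BINDING 126-126 BINDING 172-172 BINDING 199-199"


def site_residues(protein: str, sites: str) -> dict:
    """Map each 1-based (start, end) site range to the residues it covers."""
    result = {}
    for start, end in re.findall(r"BINDING (\d+)-(\d+)", sites):
        start, end = int(start), int(end)
        result[(start, end)] = protein[start - 1:end]
    return result


for (start, end), residues in site_residues(PROTEIN, SITES).items():
    print(start, end, residues)
# prints:
# 50 50 M
# 126 126 R
# 172 172 T
# 199 199 G
```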

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
3.7 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
3.7 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
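
The Cys/Met percentages above can be reproduced by counting residues in the protein sequence. This is a minimal sketch, assuming each percentage is the residue count divided by the full sequence length, rounded to one decimal; `residue_percent` is an illustrative helper name. The translated and mature sequences in this record are identical, so one calculation covers both.

```python
# Protein sequence copied from the record above (271 residues).
PROTEIN = (
    "MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDMLRSGTGALLVNLGTVTREQG"
    "AAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQSPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARE"
    "LALHTGAAVLVTGRTDYSTDGRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS"
    "PSPGSFAVSLLDSLYALSPEDVVRRARFLSL"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


print(
    round(residue_percent(PROTEIN, "C"), 1),
    round(residue_percent(PROTEIN, "M"), 1),
    round(residue_percent(PROTEIN, "CM"), 1),
)
# prints: 0.7 3.7 4.4
```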

Secondary structure:

>Translated Secondary Structure
MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDM
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH
LRSGTGALLVNLGTVTREQGAAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQ
HHCCCCEEEEEECHHHHHHCHHHHHHHHHHHCCCCCEEECHHHHHHHHHHHHHHHHHHHC
SPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARELALHTGAAVLVTGRTDYSTD
CCCEEECCHHHHHHHHCCHHHCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCC
GRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS
CCEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCC
PSPGSFAVSLLDSLYALSPEDVVRRARFLSL
CCCHHHHHHHHHHHHCCCHHHHHHHHHHHCC
>Mature Secondary Structure
MLSSTDLVQAVTADLGKIRETAPLVLSLTNSVVQPLTANLLLAIGAVPAMLNDAEEAVDM
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH
LRSGTGALLVNLGTVTREQGAAMQTAVREANRLNIPWVLDPVAVGALSLRTRLAGQLKEQ
HHCCCCEEEEEECHHHHHHCHHHHHHHHHHHCCCCCEEECHHHHHHHHHHHHHHHHHHHC
SPRIIRGNASEIMALAGYSSVTKGPESTSSSADALHAARELALHTGAAVLVTGRTDYSTD
CCCEEECCHHHHHHHHCCHHHCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCC
GRQVTATENGHAMMSRVTGVGCSMGALSAACAAVSPTPLQAAVSTAVLMGIAGEMAFEQS
CCEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCC
PSPGSFAVSLLDSLYALSPEDVVRRARFLSL
CCCHHHHHHHHHHHHCCCHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA