Definition | Treponema pallidum subsp. pallidum SS14, complete genome. |
---|---|
Accession | NC_010741 |
Length | 1,139,457 |
Click here to switch to the map view.
The map label for this gene is glmS
Identifier: 189026085
GI number: 189026085
Start: 938722
End: 940629
Strand: Reverse
Name: glmS
Synonym: TPASS_0861
Alternate gene names: 189026085
Gene position: 940629-938722 (Counterclockwise)
Preceding gene: 189026086
Following gene: 189026081
Centisome position: 82.55
GC content: 56.87
Gene sequence:
>1908_bases ATGTGTGGAATCGTTGGGATGGTGGCGGGGCGTGATGTCTCAGGTCTCTTGCTTGAGGGGCTGCGTCGTCTTGAATACCG CGGGTATGATTCTGCAGGGATCGCTGTCGTTGGCTCGGACTGTGCGCTCCGCCTTTTACGCTGTGAGGGTCGTGTTCAGT CGCTCTGCGCGCTATTAGGTCAGAGTCCTCTTTGCGGGACCATGGGCATTGCGCACACCCGGTGGGCAACGCACGGCAAG CCGTGTGCAGCGAATGCGCATCCGCATTGTTCTGAATCCGTTGCGATAGTTCACAACGGTATTGTTGAAAACCATCGGTC TTTGCGTGAAATGCTAGTGACCCGTGGCTATTTTTTTCATTCACAAACTGATAGCGAGGTGCTCGCGCATCTTTTGCACT GGGAGTTGCGGTACACGGCGCACCTCCTGCTTGCGGTAAAAAAAGTGTTGACGCAGGTTCGGGGTACCTACGGGTTACTT TGCATGGACGCTGCATCTCCTGGGCGGTTGATTGCTGCGCGTTCTGGCAGTCCGTTGGCGGTAGGACTCGGCTGTGGGGA AAATTTTGTTACGTCAGACCCGCTTGCGCTTGCGCATGTTACGCAGCGTTTTTTGTATCTTGAGGAAGGAGACATCGCAG ACGTCCACCGAGATAGCGTGTGCGTCCACGACGCGCAGGGAAACGTTGTGGCGCGTCCTGTTGTTACGTATCAGATGCAG CTTTGTACGCAGGATAAGGGTACGCACCGTCACCATATGCATCAAGAAATATGGCAGCAGCCACACGCTATACGGCATAC GCTGAATGCCTATATGTCGTTTTCTTCGTCTTCTCGCGCACAGGTGCGGACGTTCGGGGAGGACCGGGTCTTGGACGGTA CCTCGTGTAAGACGTTTGAGCGGCTGTTTAGACGGATAACACGGGTGCGCATTATCGCGTGTGGTACCTCGTACCATGCA GGGTTAGTGGCGCGTTACTGGTTTGAGGCATTTGCAGGGGTGGGATGTCAGGTGGAGATTGCCTCAGAGTACCGCTATCG CACGAGCGTCGTGCACGCGCGTGAAATAGTACTGACGATTTCTCAGTCGGGAGAAACGGCTGACACCATTGCCGCACTGC GCCTGGCAAAAACGCAAGGGTATCTCTGTGCGATTGCCATTTGCAACGGGGCACGTTCCACGTTGGTGCGTGAGTCGGAC GCAGTGCTGCTCACCCACGCGGGGTCAGAAATAGGGGTTGCTTCAACGAAGTCTTTTACCACCCAGCTGGTGTGCTTGCT CGTGCTCACCCGCATGATTGCGCAGGCAAAAAAGATACTCACGCAGGAGCCCGAGGACGCGCTCTCTGCGGCGCTCCAGC GACTGCCTCAGGATGTGGAGCACGTGCTTGAGTGTGAGGCAGACGTTGCGCGGTGCGCGCGGCATTTTGTGCATGCGCAG CATGCGCTCTTTTTGGGGCGTGGGGAATTGTATCCGATTGCAATTGAATCGGCGCTCAAGCTGAAAGAAATTTCGTACAT CCATGCCGAAGCATACGCAGCGGGGGAGCTAAAGCATGGGCCCCTTGCATTAGTGGACGCGCAGATGCCGGTGGTTGCAA TCGCGCCGGCGTCACCTGGCGTCCTGTTTGAAAAAATGGCCTCAAACATCGAGGAAGTTCGCGCGCGCGGCGGGATGCTC TACATCTTTACGGACGTGCCGGAGCGTTTTGGCCCGGTATGCACGCCCGAGGCGGACGCGCCGGAAGGGGCGTGCAGTCA GATAGTCACCGTACCTTCGGTATCTCCCCTTACCGCGCCGATTTTTTACGCCGTACCGCTGCAGCTGTTGGCATACCACA TTGCCCTGCTCAAGGGGACTGACATTGACCAGCCGCGGAACCTGGCAAAATCGGTGACGGTCGAGTAA
Upstream 100 bases:
>100_bases CTCGTGGCTTGGTTGAGTCAGAATGGTTCCTACGCGTGCGTGTGTTCTGTGGGTTTGAGTTTAACGTTCTCCCGTGTGCT TTTGCACTGAAAAGGCGTCT
Downstream 100 bases:
>100_bases GGGGTCCGCCCCCACTTTCCGCCGGTGCCGGCAGCGTGGCACAGGACGGAGCAGATGATGCCGCGGCGGCGCGGCTAGAA AATGTTAAAATCGTGAACCT
Product: glucosamine--fructose-6-phosphate aminotransferase
Products: NA
Alternate protein names: D-fructose-6-phosphate amidotransferase; GFAT; Glucosamine-6-phosphate synthase; Hexosephosphate aminotransferase; L-glutamine-D-fructose-6-phosphate amidotransferase
Number of amino acids: Translated: 635; Mature: 635
Protein sequence:
>635_residues MCGIVGMVAGRDVSGLLLEGLRRLEYRGYDSAGIAVVGSDCALRLLRCEGRVQSLCALLGQSPLCGTMGIAHTRWATHGK PCAANAHPHCSESVAIVHNGIVENHRSLREMLVTRGYFFHSQTDSEVLAHLLHWELRYTAHLLLAVKKVLTQVRGTYGLL CMDAASPGRLIAARSGSPLAVGLGCGENFVTSDPLALAHVTQRFLYLEEGDIADVHRDSVCVHDAQGNVVARPVVTYQMQ LCTQDKGTHRHHMHQEIWQQPHAIRHTLNAYMSFSSSSRAQVRTFGEDRVLDGTSCKTFERLFRRITRVRIIACGTSYHA GLVARYWFEAFAGVGCQVEIASEYRYRTSVVHAREIVLTISQSGETADTIAALRLAKTQGYLCAIAICNGARSTLVRESD AVLLTHAGSEIGVASTKSFTTQLVCLLVLTRMIAQAKKILTQEPEDALSAALQRLPQDVEHVLECEADVARCARHFVHAQ HALFLGRGELYPIAIESALKLKEISYIHAEAYAAGELKHGPLALVDAQMPVVAIAPASPGVLFEKMASNIEEVRARGGML YIFTDVPERFGPVCTPEADAPEGACSQIVTVPSVSPLTAPIFYAVPLQLLAYHIALLKGTDIDQPRNLAKSVTVE
Sequences:
>Translated_635_residues MCGIVGMVAGRDVSGLLLEGLRRLEYRGYDSAGIAVVGSDCALRLLRCEGRVQSLCALLGQSPLCGTMGIAHTRWATHGK PCAANAHPHCSESVAIVHNGIVENHRSLREMLVTRGYFFHSQTDSEVLAHLLHWELRYTAHLLLAVKKVLTQVRGTYGLL CMDAASPGRLIAARSGSPLAVGLGCGENFVTSDPLALAHVTQRFLYLEEGDIADVHRDSVCVHDAQGNVVARPVVTYQMQ LCTQDKGTHRHHMHQEIWQQPHAIRHTLNAYMSFSSSSRAQVRTFGEDRVLDGTSCKTFERLFRRITRVRIIACGTSYHA GLVARYWFEAFAGVGCQVEIASEYRYRTSVVHAREIVLTISQSGETADTIAALRLAKTQGYLCAIAICNGARSTLVRESD AVLLTHAGSEIGVASTKSFTTQLVCLLVLTRMIAQAKKILTQEPEDALSAALQRLPQDVEHVLECEADVARCARHFVHAQ HALFLGRGELYPIAIESALKLKEISYIHAEAYAAGELKHGPLALVDAQMPVVAIAPASPGVLFEKMASNIEEVRARGGML YIFTDVPERFGPVCTPEADAPEGACSQIVTVPSVSPLTAPIFYAVPLQLLAYHIALLKGTDIDQPRNLAKSVTVE >Mature_635_residues MCGIVGMVAGRDVSGLLLEGLRRLEYRGYDSAGIAVVGSDCALRLLRCEGRVQSLCALLGQSPLCGTMGIAHTRWATHGK PCAANAHPHCSESVAIVHNGIVENHRSLREMLVTRGYFFHSQTDSEVLAHLLHWELRYTAHLLLAVKKVLTQVRGTYGLL CMDAASPGRLIAARSGSPLAVGLGCGENFVTSDPLALAHVTQRFLYLEEGDIADVHRDSVCVHDAQGNVVARPVVTYQMQ LCTQDKGTHRHHMHQEIWQQPHAIRHTLNAYMSFSSSSRAQVRTFGEDRVLDGTSCKTFERLFRRITRVRIIACGTSYHA GLVARYWFEAFAGVGCQVEIASEYRYRTSVVHAREIVLTISQSGETADTIAALRLAKTQGYLCAIAICNGARSTLVRESD AVLLTHAGSEIGVASTKSFTTQLVCLLVLTRMIAQAKKILTQEPEDALSAALQRLPQDVEHVLECEADVARCARHFVHAQ HALFLGRGELYPIAIESALKLKEISYIHAEAYAAGELKHGPLALVDAQMPVVAIAPASPGVLFEKMASNIEEVRARGGML YIFTDVPERFGPVCTPEADAPEGACSQIVTVPSVSPLTAPIFYAVPLQLLAYHIALLKGTDIDQPRNLAKSVTVE
Specific function: Catalyzes the first step in hexosamine metabolism, converting fructose-6P into glucosamine-6P using glutamine as a nitrogen source
COG id: COG0449
COG function: function code M; Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 SIS domains
Homologues:
Organism=Homo sapiens, GI4826742, Length=708, Percent_Identity=33.4745762711864, Blast_Score=372, Evalue=1e-103, Organism=Homo sapiens, GI205277386, Length=708, Percent_Identity=33.1920903954802, Blast_Score=371, Evalue=1e-102, Organism=Escherichia coli, GI1790167, Length=635, Percent_Identity=54.0157480314961, Blast_Score=679, Evalue=0.0, Organism=Escherichia coli, GI1788651, Length=173, Percent_Identity=29.4797687861272, Blast_Score=80, Evalue=4e-16, Organism=Caenorhabditis elegans, GI17539970, Length=727, Percent_Identity=32.1870701513067, Blast_Score=318, Evalue=8e-87, Organism=Caenorhabditis elegans, GI17532899, Length=454, Percent_Identity=35.2422907488987, Blast_Score=245, Evalue=4e-65, Organism=Caenorhabditis elegans, GI17532897, Length=454, Percent_Identity=35.2422907488987, Blast_Score=245, Evalue=4e-65, Organism=Saccharomyces cerevisiae, GI6322745, Length=474, Percent_Identity=33.7552742616034, Blast_Score=248, Evalue=1e-66, Organism=Saccharomyces cerevisiae, GI6323731, Length=454, Percent_Identity=29.295154185022, Blast_Score=173, Evalue=9e-44, Organism=Saccharomyces cerevisiae, GI6323730, Length=205, Percent_Identity=36.5853658536585, Blast_Score=128, Evalue=3e-30, Organism=Saccharomyces cerevisiae, GI6323958, Length=168, Percent_Identity=30.952380952381, Blast_Score=68, Evalue=4e-12, Organism=Drosophila melanogaster, GI21357745, Length=709, Percent_Identity=33.0042313117066, Blast_Score=362, Evalue=1e-100, Organism=Drosophila melanogaster, GI28573187, Length=278, Percent_Identity=25.8992805755396, Blast_Score=73, Evalue=5e-13,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): GLMS_TREPA (O83833)
Other databases:
- EMBL: AE000520 - PIR: E71272 - RefSeq: NP_219297.1 - ProteinModelPortal: O83833 - SMR: O83833 - GeneID: 2611074 - GenomeReviews: AE000520_GR - KEGG: tpa:TP0861 - NMPDR: fig|243276.1.peg.856 - TIGR: TP_0861 - HOGENOM: HBG645312 - OMA: LGIGENF - ProtClustDB: PRK00331 - BioCyc: TPAL243276:TP_0861-MONOMER - BRENDA: 2.6.1.16 - GO: GO:0005737 - HAMAP: MF_00164 - InterPro: IPR000583 - InterPro: IPR017932 - InterPro: IPR005855 - InterPro: IPR001347 - TIGRFAMs: TIGR01135
Pfam domain/function: PF00310 GATase_2; PF01380 SIS
EC number: =2.6.1.16
Molecular weight: Translated: 69182; Mature: 69182
Theoretical pI: Translated: 7.36; Mature: 7.36
Prosite motif: PS51278 GATASE_TYPE_2; PS51464 SIS; PS00443 GATASE_TYPE_II
Important sites: ACT_SITE 2-2 ACT_SITE 630-630
Signals:
None
Transmembrane regions:
None
Cys/Met content:
3.3 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 5.2 %Cys+Met (Translated Protein) 3.3 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 5.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MCGIVGMVAGRDVSGLLLEGLRRLEYRGYDSAGIAVVGSDCALRLLRCEGRVQSLCALLG CCCCEEECCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHCCHHHHHHHHHC QSPLCGTMGIAHTRWATHGKPCAANAHPHCSESVAIVHNGIVENHRSLREMLVTRGYFFH CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEE SQTDSEVLAHLLHWELRYTAHLLLAVKKVLTQVRGTYGLLCMDAASPGRLIAARSGSPLA CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCEEEECCCCCEE VGLGCGENFVTSDPLALAHVTQRFLYLEEGDIADVHRDSVCVHDAQGNVVARPVVTYQMQ EEECCCCCCCCCCCHHHHHHHHHHEEEECCCCCCCCCCCEEEECCCCCEEECHHHHHHHH LCTQDKGTHRHHMHQEIWQQPHAIRHTLNAYMSFSSSSRAQVRTFGEDRVLDGTSCKTFE HHHCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCHHEECCCCCCCCCCCHHHHH RLFRRITRVRIIACGTSYHAGLVARYWFEAFAGVGCQVEIASEYRYRTSVVHAREIVLTI HHHHHHHHEEEEEECCCHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHEEEEEEEEE SQSGETADTIAALRLAKTQGYLCAIAICNGARSTLVRESDAVLLTHAGSEIGVASTKSFT ECCCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHHEECCCCEEEEECCCCCCCCCCHHHH TQLVCLLVLTRMIAQAKKILTQEPEDALSAALQRLPQDVEHVLECEADVARCARHFVHAQ HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHCCHHHHHHHHHHHHHH HALFLGRGELYPIAIESALKLKEISYIHAEAYAAGELKHGPLALVDAQMPVVAIAPASPG HHEEECCCCEEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCEEEECCCCCC VLFEKMASNIEEVRARGGMLYIFTDVPERFGPVCTPEADAPEGACSQIVTVPSVSPLTAP HHHHHHHHHHHHHHHCCCEEEEEECCHHHHCCCCCCCCCCCCCHHHCEEECCCCCCCHHH IFYAVPLQLLAYHIALLKGTDIDQPRNLAKSVTVE HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCC >Mature Secondary Structure MCGIVGMVAGRDVSGLLLEGLRRLEYRGYDSAGIAVVGSDCALRLLRCEGRVQSLCALLG CCCCEEECCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHCCHHHHHHHHHC QSPLCGTMGIAHTRWATHGKPCAANAHPHCSESVAIVHNGIVENHRSLREMLVTRGYFFH CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEE SQTDSEVLAHLLHWELRYTAHLLLAVKKVLTQVRGTYGLLCMDAASPGRLIAARSGSPLA CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCEEEECCCCCEE VGLGCGENFVTSDPLALAHVTQRFLYLEEGDIADVHRDSVCVHDAQGNVVARPVVTYQMQ EEECCCCCCCCCCCHHHHHHHHHHEEEECCCCCCCCCCCEEEECCCCCEEECHHHHHHHH LCTQDKGTHRHHMHQEIWQQPHAIRHTLNAYMSFSSSSRAQVRTFGEDRVLDGTSCKTFE HHHCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCHHEECCCCCCCCCCCHHHHH RLFRRITRVRIIACGTSYHAGLVARYWFEAFAGVGCQVEIASEYRYRTSVVHAREIVLTI HHHHHHHHEEEEEECCCHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHEEEEEEEEE SQSGETADTIAALRLAKTQGYLCAIAICNGARSTLVRESDAVLLTHAGSEIGVASTKSFT ECCCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHHEECCCCEEEEECCCCCCCCCCHHHH TQLVCLLVLTRMIAQAKKILTQEPEDALSAALQRLPQDVEHVLECEADVARCARHFVHAQ HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHCCHHHHHHHHHHHHHH HALFLGRGELYPIAIESALKLKEISYIHAEAYAAGELKHGPLALVDAQMPVVAIAPASPG HHEEECCCCEEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCEEEECCCCCC VLFEKMASNIEEVRARGGMLYIFTDVPERFGPVCTPEADAPEGACSQIVTVPSVSPLTAP HHHHHHHHHHHHHHHCCCEEEEEECCHHHHCCCCCCCCCCCCCHHHCEEECCCCCCCHHH IFYAVPLQLLAYHIALLKGTDIDQPRNLAKSVTVE HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9665876