| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is glmU
Identifier: 113474420
GI number: 113474420
Start: 878411
End: 879826
Strand: Direct
Name: glmU
Synonym: Tery_0560
Alternate gene names: 113474420
Gene position: 878411-879826 (Clockwise)
Preceding gene: 113474419
Following gene: 113474421
Centisome position: 11.33
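The coordinates above can be cross-checked with simple arithmetic. The sketch below rests on two assumptions that this record does not state: coordinates are 1-based and inclusive, and the centisome position is the start coordinate expressed as a percentage of the chromosome length (7,750,108 bp). The function and variable names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 7_750_108          # chromosome length in bp
START, END = 878_411, 879_826      # gene coordinates

def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length

length = gene_length(START, END)
print(length)                                        # 1416
print(length // 3 - 1)                               # 471 codons, excluding the stop codon
print(round(centisome(START, GENOME_LENGTH), 2))     # 11.33
```

Under these assumptions the results agree with the listed gene sequence length (1416 bases), amino acid count (471), and centisome position (11.33).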
GC content: 39.12
Gene sequence:
>1416_bases ATGGTAGCGGTAGCAATTTTAGCGGCTGGACGTGGCACTAGAATGAAGTCAGACTTACCTAAGGTATTACACCAATTAGG TTCTTGTACTTTAGTCAAACGAGTTATCAAAAGTTGTGTTTCTATTCAACCATCAAAAATAATGGTAATAGTTGGTTATC GTGGGGGCCTTGTACAAAAGTCTCTCCTAGATAATAACAATAATATTGATAATGATAATACTCCTACACTAGAATTTGTA GAACAGACGGAACAATTGGGAACTGGTCATGCTATTCAACAATTATTACCTTATCTGAAAGATTTTTCTGAGGAGCTGCT GGTCTTAAATGGAGATGTACCATTATTACGACCCGAAACTATTAAGCAGTTAATTGATACCCATCAGCAAAATAAAAATT CAGCGACTATCTTGACTGCTAATCTTCCTAATCCTAAAGGTTACGGTAGAATATTCTGTAATACAAATAATTTTGTGACT CAAATAGTAGAAGAGCGAGACTGTACTGCTGCCCAAACAAAAAACCATCGCGTTAATGCCGGAGTTTACTGTTTTAACTG GCCTGCTTTAGCAAATATATTGCCTAAACTAAAAGCAGACAATGACCAACAAGAGTATTATTTAACAGATGTGGTTCCTC TTCTTGACCCTGTCATGGCAGTTGATGTGAATGACTATCAAGAAATTTTTGGTATTAATAACCGTAAACATTTAGCTAAA GCTCATGAAATTTTGCAGGTGCGGGTCAAAGATGATTGGATGGAGGCCGGAGTCACGTTGATAGACCCTGATAGTATTAC AATTGATGATACAGTTTTACTACAGCAGGATGTGATTGTTGAACCACAAACTCATATTCGGGGTTCGAGTATAATTGGTT CTGGTAGTCGCATTGGGCCGGGAAGTTTGATTGAAAATAGTCATATAGGTAAAAATACTTCGGTTTTGTACTCTGTGATT TCTGATAGTATGGTGGCCGACAATACTCGTATTGGTCCTTATGCACATTTACGGGGTGACTCTCAGGTGGGTTCTCACTG TCGAATTGGCAATTTTGTGGAATTGAAAAAAGCAACGGTTGGCGATCGCTCTAATGCAGCTCATTTGTCCTATTTGGGGG ACGCGACTTTGGGAGAAAAAGTAAATATTGGCGCTGGCACTATTACAGCTAATTATGATGGGGTGAAAAAGCATAAGACT AAAATAGGCGATCGCTCGAAAACTGGGTCAAATAGTGTACTGGTTGCTCCTGTAACTTTAGGAGAAGATGTGACAGTAGC TGCAGGTTCGGTAGTGACAAAGAATGTGGAAGATGATAGTTTAGTTATTGGTCGCGCTCGACAAGCTGTAAAAAAAGGTT GGAGGTTGAAGCAGTCCGATGAAAGTAAGAAAGAAGAAAATAAGTCATCGCCCTGA
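The GC content field can in principle be recomputed from the sequence above. The record does not say how its value was derived; the sketch below assumes the usual definition, (G + C) / total bases × 100, and the helper name `gc_content` is hypothetical. Remove the FASTA header (`>1416_bases`) before passing the sequence in, because the function only strips whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# Toy check on the first eight bases of the gene (ATGGTAGC): 4 of 8 are G or C.
print(gc_content("ATGG TAGC"))  # 50.0
```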
Upstream 100 bases:
>100_bases TACTCCCATAGCTCTAATTGTCAGTCAAATATGAGAATATAAATACTGTAGATTCGTGATTAAACGTTTCTCAATTCAGA AAGCAAATAATTCAAAAATT
Downstream 100 bases:
>100_bases ATAATTTTCAAGTTTATGATCAACCGCGAAACTTCAATAACTTATGAGATCATACATAGCCATCAGTAACGCCTAATTTT TTTTTAGATGTTGGTAGAAA
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Products: NA
Alternate protein names: UDP-N-acetylglucosamine pyrophosphorylase; N-acetylglucosamine-1-phosphate uridyltransferase; Glucosamine-1-phosphate N-acetyltransferase
Number of amino acids: Translated: 471; Mature: 471
Protein sequence:
>471_residues MVAVAILAAGRGTRMKSDLPKVLHQLGSCTLVKRVIKSCVSIQPSKIMVIVGYRGGLVQKSLLDNNNNIDNDNTPTLEFV EQTEQLGTGHAIQQLLPYLKDFSEELLVLNGDVPLLRPETIKQLIDTHQQNKNSATILTANLPNPKGYGRIFCNTNNFVT QIVEERDCTAAQTKNHRVNAGVYCFNWPALANILPKLKADNDQQEYYLTDVVPLLDPVMAVDVNDYQEIFGINNRKHLAK AHEILQVRVKDDWMEAGVTLIDPDSITIDDTVLLQQDVIVEPQTHIRGSSIIGSGSRIGPGSLIENSHIGKNTSVLYSVI SDSMVADNTRIGPYAHLRGDSQVGSHCRIGNFVELKKATVGDRSNAAHLSYLGDATLGEKVNIGAGTITANYDGVKKHKT KIGDRSKTGSNSVLVAPVTLGEDVTVAAGSVVTKNVEDDSLVIGRARQAVKKGWRLKQSDESKKEENKSSP
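The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard genetic code; the bacterial code (NCBI translation table 11) assigns the same amino acids to the codons and differs only in its permitted start codons. The names `CODON_TABLE` and `translate` are illustrative and do not come from this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 15 and last 12 bases of the gene sequence listed in this record.
print(translate("ATGGTAGCGGTAGCA"))  # MVAVA
print(translate("TCATCGCCCTGA"))     # SSP
```

Both fragments match the corresponding ends of the 471-residue protein (`MVAVA...` and `...SSP`).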
Sequences:
>Translated_471_residues MVAVAILAAGRGTRMKSDLPKVLHQLGSCTLVKRVIKSCVSIQPSKIMVIVGYRGGLVQKSLLDNNNNIDNDNTPTLEFV EQTEQLGTGHAIQQLLPYLKDFSEELLVLNGDVPLLRPETIKQLIDTHQQNKNSATILTANLPNPKGYGRIFCNTNNFVT QIVEERDCTAAQTKNHRVNAGVYCFNWPALANILPKLKADNDQQEYYLTDVVPLLDPVMAVDVNDYQEIFGINNRKHLAK AHEILQVRVKDDWMEAGVTLIDPDSITIDDTVLLQQDVIVEPQTHIRGSSIIGSGSRIGPGSLIENSHIGKNTSVLYSVI SDSMVADNTRIGPYAHLRGDSQVGSHCRIGNFVELKKATVGDRSNAAHLSYLGDATLGEKVNIGAGTITANYDGVKKHKT KIGDRSKTGSNSVLVAPVTLGEDVTVAAGSVVTKNVEDDSLVIGRARQAVKKGWRLKQSDESKKEENKSSP
>Mature_471_residues MVAVAILAAGRGTRMKSDLPKVLHQLGSCTLVKRVIKSCVSIQPSKIMVIVGYRGGLVQKSLLDNNNNIDNDNTPTLEFV EQTEQLGTGHAIQQLLPYLKDFSEELLVLNGDVPLLRPETIKQLIDTHQQNKNSATILTANLPNPKGYGRIFCNTNNFVT QIVEERDCTAAQTKNHRVNAGVYCFNWPALANILPKLKADNDQQEYYLTDVVPLLDPVMAVDVNDYQEIFGINNRKHLAK AHEILQVRVKDDWMEAGVTLIDPDSITIDDTVLLQQDVIVEPQTHIRGSSIIGSGSRIGPGSLIENSHIGKNTSVLYSVI SDSMVADNTRIGPYAHLRGDSQVGSHCRIGNFVELKKATVGDRSNAAHLSYLGDATLGEKVNIGAGTITANYDGVKKHKT KIGDRSKTGSNSVLVAPVTLGEDVTVAAGSVVTKNVEDDSLVIGRARQAVKKGWRLKQSDESKKEENKSSP
Specific function: Catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. Responsible for the acetylation of GlcN-1-P to give GlcNAc-1-P, and for the uridyl transfer from UTP to GlcNAc-1-P, which produces UDP-GlcNAc.
COG id: COG1207
COG function: function code M; N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: In the C-terminal section; belongs to the transferase hexapeptide repeat family
Homologues:
Organism=Escherichia coli, GI1790168, Length=459, Percent_Identity=41.3943355119826, Blast_Score=332, Evalue=5e-92
Organism=Drosophila melanogaster, GI21355443, Length=362, Percent_Identity=20.9944751381215, Blast_Score=66, Evalue=4e-11
Organism=Drosophila melanogaster, GI24644084, Length=362, Percent_Identity=20.9944751381215, Blast_Score=66, Evalue=4e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): GLMU_TRIEI (Q118R6)
Other databases:
- EMBL: CP000393
- RefSeq: YP_720481.1
- ProteinModelPortal: Q118R6
- SMR: Q118R6
- STRING: Q118R6
- GeneID: 4242950
- GenomeReviews: CP000393_GR
- KEGG: ter:Tery_0560
- NMPDR: fig|203124.1.peg.5859
- eggNOG: COG1207
- HOGENOM: HBG688195
- OMA: GSKVNHL
- PhylomeDB: Q118R6
- ProtClustDB: PRK14360
- BioCyc: TERY203124:TERY_0560-MONOMER
- GO: GO:0005737
- HAMAP: MF_01631
- InterPro: IPR005882
- InterPro: IPR005835
- InterPro: IPR011004
- TIGRFAMs: TIGR01173
Pfam domain/function: PF00483 NTP_transferase; SSF51161 Trimer_LpxA_like
EC number: 2.7.7.23; 2.3.1.157
Molecular weight: Translated: 51359; Mature: 51359
Theoretical pI: Translated: 7.36; Mature: 7.36
Prosite motif: PS00101 HEXAPEP_TRANSFERASES
Important sites: ACT_SITE 368-368; BINDING 82-82; BINDING 149-149; BINDING 164-164; BINDING 179-179; BINDING 392-392; BINDING 410-410; BINDING 428-428; BINDING 445-445
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
1.3 %Met (Translated Protein)
2.5 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
1.3 %Met (Mature Protein)
2.5 %Cys+Met (Mature Protein)
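The Cys/Met percentages can be recomputed from the protein sequence. The sketch below assumes the percentages are residue counts divided by the sequence length and rounded to one decimal place, which this record does not state; `residue_percent` is a hypothetical helper name. A manual count of the 471-residue sequence listed in this record gives 6 Cys and 6 Met, which is consistent with the values above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)

# Toy check on the first 20 residues of the protein: two Met, no Cys.
fragment = "MVAVAILAAGRGTRMKSDLP"
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "C"))   # 0.0

# Consistency check for the full protein: 6 Cys and 6 Met out of 471 residues.
print(round(100.0 * 6 / 471, 1))        # 1.3
print(round(100.0 * 12 / 471, 1))       # 2.5
```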
Secondary structure:
>Translated Secondary Structure
MVAVAILAAGRGTRMKSDLPKVLHQLGSCTLVKRVIKSCVSIQPSKIMVIVGYRGGLVQK
CEEEEEEECCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHH
SLLDNNNNIDNDNTPTLEFVEQTEQLGTGHAIQQLLPYLKDFSEELLVLNGDVPLLRPET
HHHCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH
IKQLIDTHQQNKNSATILTANLPNPKGYGRIFCNTNNFVTQIVEERDCTAAQTKNHRVNA
HHHHHHHHHCCCCCEEEEEEECCCCCCCEEEEECCCHHHHHHHHHCCCCHHHCCCCEEEC
GVYCFNWPALANILPKLKADNDQQEYYLTDVVPLLDPVMAVDVNDYQEIFGINNRKHLAK
CEEEECCHHHHHHHHHHCCCCCCCCEEHHHHHHHHCCEEEECCCHHHHHHCCCCHHHHHH
AHEILQVRVKDDWMEAGVTLIDPDSITIDDTVLLQQDVIVEPQTHIRGSSIIGSGSRIGP
HHHHEEEEECCHHHHCCCEEECCCCEEECCEEEEECCEEECCCHHCCCCCEECCCCCCCC
GSLIENSHIGKNTSVLYSVISDSMVADNTRIGPYAHLRGDSQVGSHCRIGNFVELKKATV
CCCCCCCCCCCCHHHHHHHHHCHHHCCCCCCCCEEEECCCCCCCCCCCCCCEEEEHHHCC
GDRSNAAHLSYLGDATLGEKVNIGAGTITANYDGVKKHKTKIGDRSKTGSNSVLVAPVTL
CCCCCCEEEEECCCCCCCCEEECCCCEEEECCCCHHHHHHHCCCCCCCCCCCEEEEEEEC
GEDVTVAAGSVVTKNVEDDSLVIGRARQAVKKGWRLKQSDESKKEENKSSP
CCCEEEECCCEEECCCCCCCEEHHHHHHHHHHCCCCCCCCHHHHHHCCCCC
>Mature Secondary Structure
MVAVAILAAGRGTRMKSDLPKVLHQLGSCTLVKRVIKSCVSIQPSKIMVIVGYRGGLVQK
CEEEEEEECCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHH
SLLDNNNNIDNDNTPTLEFVEQTEQLGTGHAIQQLLPYLKDFSEELLVLNGDVPLLRPET
HHHCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH
IKQLIDTHQQNKNSATILTANLPNPKGYGRIFCNTNNFVTQIVEERDCTAAQTKNHRVNA
HHHHHHHHHCCCCCEEEEEEECCCCCCCEEEEECCCHHHHHHHHHCCCCHHHCCCCEEEC
GVYCFNWPALANILPKLKADNDQQEYYLTDVVPLLDPVMAVDVNDYQEIFGINNRKHLAK
CEEEECCHHHHHHHHHHCCCCCCCCEEHHHHHHHHCCEEEECCCHHHHHHCCCCHHHHHH
AHEILQVRVKDDWMEAGVTLIDPDSITIDDTVLLQQDVIVEPQTHIRGSSIIGSGSRIGP
HHHHEEEEECCHHHHCCCEEECCCCEEECCEEEEECCEEECCCHHCCCCCEECCCCCCCC
GSLIENSHIGKNTSVLYSVISDSMVADNTRIGPYAHLRGDSQVGSHCRIGNFVELKKATV
CCCCCCCCCCCCHHHHHHHHHCHHHCCCCCCCCEEEECCCCCCCCCCCCCCEEEEHHHCC
GDRSNAAHLSYLGDATLGEKVNIGAGTITANYDGVKKHKTKIGDRSKTGSNSVLVAPVTL
CCCCCCEEEEECCCCCCCCEEECCCCEEEECCCCHHHHHHHCCCCCCCCCCCEEEEEEEC
GEDVTVAAGSVVTKNVEDDSLVIGRARQAVKKGWRLKQSDESKKEENKSSP
CCCEEEECCCEEECCCCCCCEEHHHHHHHHHHCCCCCCCCHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA