Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is guaA [H]
Identifier: 187735243
GI number: 187735243
Start: 873861
End: 875384
Strand: Direct
Name: guaA [H]
Synonym: Amuc_0738
Alternate gene names: 187735243
Gene position: 873861-875384 (Clockwise)
Preceding gene: 187735242
Following gene: 187735244
Centisome position: 32.8
GC content: 59.84
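The positional fields above are related by simple arithmetic, and a short check makes that explicit. The sketch below assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either convention, so both are assumptions. The function names are hypothetical.

```python
GENOME_LENGTH = 2_664_102        # "Length" field for NC_010655
START, END = 873_861, 875_384    # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length_bp = gene_length(START, END)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids in the translated product

print(length_bp, residues, round(centisome(START, GENOME_LENGTH), 1))
# prints: 1524 507 32.8
```

Under these assumptions the values agree with the 1524-base gene sequence, the 507-residue protein, and the centisome position of 32.8 reported in this record.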
Gene sequence:
>1524_bases ATGGACGACAAGCACCTCGTAGCCGTCATTGACTTCGGCTCCCAATACACCCAGCTCATCGTACGCCGCGTGCGCGAACT GGGCTACATGGCCAAGCTGTATGCGCTGGAAGACCTGGACCAGATTCACGAACCCGGCGCCGTCATCCTTTCCGGCGGCC CCAAAAGCACCACGGATGCGGACGCCCCGGACATTGACTTTGAATGGCTCCAAAGCCTGAATGTCCCCGTCCTGGGCGTG TGCTACGGCATGCAGCTGCTGAACATCAAGCACGGCGGCACCGTAAAAGCCAGCAACAAGCGGGAATACGGCCCCGCCGC CCTGCTGCCGGAAACCTGCGTGGGCCTGTACCGGGACATGTCCCCTTCCTCCCAGGTATGGATGAGCCATTCGGACACGG TGGACCATCTGGCGGAAGGCTGCCGCGTAATCGCCCGCAATGCGGAAGGCGTCCCCGTCTCCCTCCAATGGGGAGAAACC ACCTTCGGCATCCAATTTCATCCGGAAGTGACCCATTCCCATGAAGGGCGCACCATCCTGCGCAACTTCCTTTCCTGTGC GGCCAACCTTAAAAAGTTCGACATCGGAGACTTTAAAAGGGAACTCATCCGGGAAATCCGGGAACGCGTAGGCAACAGGC AGGTGGTCTGCGGCGTCTCCGGCGGCGTGGACAGCACCGTTCTGGCCGTTCTGTTGCATGAAGCGGGCGTGAACATGCGC GCCATCTTTGTGGATAACGGTCTGCTCCGCAAAAACGAGGCAGAGGAAGTGCGGGCCAATTTCGCCCGTATGAACGTGGA AATTGAAACGGTGGATGCCTCCGAACGCTTCCTGGCGGCTCTGGACGGAGAAAGCGATCCGGAAAAGAAACGCCGTATCA TCGGAGGCCTGTTCATCGACGTTTTCTGGGATGCCGTGGGAGACGCGGAAATGCTCGCCCAGGGCACCCTGTACCCGGAC GTGATCGAAAGCGCCTCCAACGCCAAATCCAAGGCCTCCGTCATCAAGACCCACCACAATCGCGTGGAACGCGTGCTGGA ACTCCAGGCGCAGGGGAAAGTACTGGAACCCCTGGCGGAACTGTTCAAGGACGAAGTTCGAGAACTGGGCGCCTCCATGG GCATCCCGCATGACATTCTGTGGCGCCACCCCTTCCCCGGCCCCGGCCTGGCCGTGCGCTGCCCCGGCGTAGTCACCAAA GAACGCCTGGACATCATCCGGGAATGCGATGCCATTTTCATCGGCAATCTGAAAAAATACGGCTGGTACGACAAAGTCTG GCAGGCCTATGCCGGCCTCATCCCTGTAAAAACGGTGGGAGTGAAGGGGGACGAACGCTCCTATGAATGGGCTACCAACC TGCGCGCCATCGTGTCGGAAGACGCCATGACGGCGGACTGGGTGGAACTCCCCCCGGCCCTGCTGAGGGAAACCAGCAAC CGCATCCTCAATGAAGTAAAGGGCATCAACCGCGTCCTTTACGACATTTCCACCAAGCCCCCGGCTTCCATTGAGTGGGA GTAA
Upstream 100 bases:
>100_bases GTTCGTTCAAATCACCTCCGGCGGGCTGAAGGAAAGCCATCCCCACGATATCACCATCACGGAAGAACCCGTCAATTATT CCTGTTAACTCCTCCCTCAC
Downstream 100 bases:
>100_bases AGCTTCCGCACCCTGTTGACGGCCGTTCAGCAGGGTGCCTCCGCTCCTCCTCTCCGGTAACAAAAATCAAGAAAGGAGAA AATAATGACTGAACTGGAAA
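The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is stored as in this record: a leading `>N_bases` token followed by space-separated chunks. The helper names are hypothetical, and the example input is only the first 12 bases of the gene, so it does not reproduce the 59.84 figure.

```python
def parse_record_sequence(field: str) -> str:
    """Drop a leading '>...' header token and join the remaining chunks."""
    tokens = field.split()
    if tokens and tokens[0].startswith(">"):
        tokens = tokens[1:]
    return "".join(tokens).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: the first 12 bases of the gene, split into two chunks.
example = ">12_bases ATGGAC GACAAG"
seq = parse_record_sequence(example)
print(len(seq), round(gc_percent(seq), 2))
# prints: 12 50.0
```

Passing the full gene sequence field to `parse_record_sequence` should yield a 1524-character string whose `gc_percent` can be compared with the reported value.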
Product: GMP synthase, large subunit
Products: NA
Alternate protein names: GMP synthetase; Glutamine amidotransferase [H]
Number of amino acids: Translated: 507; Mature: 507
Protein sequence:
>507_residues MDDKHLVAVIDFGSQYTQLIVRRVRELGYMAKLYALEDLDQIHEPGAVILSGGPKSTTDADAPDIDFEWLQSLNVPVLGV CYGMQLLNIKHGGTVKASNKREYGPAALLPETCVGLYRDMSPSSQVWMSHSDTVDHLAEGCRVIARNAEGVPVSLQWGET TFGIQFHPEVTHSHEGRTILRNFLSCAANLKKFDIGDFKRELIREIRERVGNRQVVCGVSGGVDSTVLAVLLHEAGVNMR AIFVDNGLLRKNEAEEVRANFARMNVEIETVDASERFLAALDGESDPEKKRRIIGGLFIDVFWDAVGDAEMLAQGTLYPD VIESASNAKSKASVIKTHHNRVERVLELQAQGKVLEPLAELFKDEVRELGASMGIPHDILWRHPFPGPGLAVRCPGVVTK ERLDIIRECDAIFIGNLKKYGWYDKVWQAYAGLIPVKTVGVKGDERSYEWATNLRAIVSEDAMTADWVELPPALLRETSN RILNEVKGINRVLYDISTKPPASIEWE
Sequences:
>Translated_507_residues MDDKHLVAVIDFGSQYTQLIVRRVRELGYMAKLYALEDLDQIHEPGAVILSGGPKSTTDADAPDIDFEWLQSLNVPVLGV CYGMQLLNIKHGGTVKASNKREYGPAALLPETCVGLYRDMSPSSQVWMSHSDTVDHLAEGCRVIARNAEGVPVSLQWGET TFGIQFHPEVTHSHEGRTILRNFLSCAANLKKFDIGDFKRELIREIRERVGNRQVVCGVSGGVDSTVLAVLLHEAGVNMR AIFVDNGLLRKNEAEEVRANFARMNVEIETVDASERFLAALDGESDPEKKRRIIGGLFIDVFWDAVGDAEMLAQGTLYPD VIESASNAKSKASVIKTHHNRVERVLELQAQGKVLEPLAELFKDEVRELGASMGIPHDILWRHPFPGPGLAVRCPGVVTK ERLDIIRECDAIFIGNLKKYGWYDKVWQAYAGLIPVKTVGVKGDERSYEWATNLRAIVSEDAMTADWVELPPALLRETSN RILNEVKGINRVLYDISTKPPASIEWE
>Mature_507_residues MDDKHLVAVIDFGSQYTQLIVRRVRELGYMAKLYALEDLDQIHEPGAVILSGGPKSTTDADAPDIDFEWLQSLNVPVLGV CYGMQLLNIKHGGTVKASNKREYGPAALLPETCVGLYRDMSPSSQVWMSHSDTVDHLAEGCRVIARNAEGVPVSLQWGET TFGIQFHPEVTHSHEGRTILRNFLSCAANLKKFDIGDFKRELIREIRERVGNRQVVCGVSGGVDSTVLAVLLHEAGVNMR AIFVDNGLLRKNEAEEVRANFARMNVEIETVDASERFLAALDGESDPEKKRRIIGGLFIDVFWDAVGDAEMLAQGTLYPD VIESASNAKSKASVIKTHHNRVERVLELQAQGKVLEPLAELFKDEVRELGASMGIPHDILWRHPFPGPGLAVRCPGVVTK ERLDIIRECDAIFIGNLKKYGWYDKVWQAYAGLIPVKTVGVKGDERSYEWATNLRAIVSEDAMTADWVELPPALLRETSN RILNEVKGINRVLYDISTKPPASIEWE
Specific function: Catalyzes the synthesis of GMP from XMP [H]
COG id: COG0519
COG function: function code F; GMP synthase, PP-ATPase domain/subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 GMP-binding domain [H]
Homologues:
Organism=Homo sapiens, GI4504035, Length=550, Percent_Identity=36.3636363636364, Blast_Score=297, Evalue=2e-80
Organism=Escherichia coli, GI1788854, Length=526, Percent_Identity=48.4790874524715, Blast_Score=453, Evalue=1e-128
Organism=Caenorhabditis elegans, GI133901714, Length=568, Percent_Identity=35.0352112676056, Blast_Score=290, Evalue=1e-78
Organism=Caenorhabditis elegans, GI71992710, Length=568, Percent_Identity=34.8591549295775, Blast_Score=290, Evalue=1e-78
Organism=Caenorhabditis elegans, GI71992717, Length=568, Percent_Identity=34.8591549295775, Blast_Score=290, Evalue=2e-78
Organism=Saccharomyces cerevisiae, GI6323873, Length=521, Percent_Identity=47.9846449136276, Blast_Score=431, Evalue=1e-121
Organism=Drosophila melanogaster, GI281365319, Length=551, Percent_Identity=34.6642468239564, Blast_Score=280, Evalue=1e-75
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006220
- InterPro: IPR011702
- InterPro: IPR017926
- InterPro: IPR000991
- InterPro: IPR001674
- InterPro: IPR004739
- InterPro: IPR022955
- InterPro: IPR022310
- InterPro: IPR014729 [H]
Pfam domain/function: PF00117 GATase; PF00958 GMP_synt_C; PF02540 NAD_synthase [H]
EC number: 6.3.5.2 [H]
Molecular weight: Translated: 56408; Mature: 56408
Theoretical pI: Translated: 5.40; Mature: 5.40
Prosite motif: PS00442 GATASE_TYPE_I
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein)
2.0 %Met (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.4 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MDDKHLVAVIDFGSQYTQLIVRRVRELGYMAKLYALEDLDQIHEPGAVILSGGPKSTTDA
CCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCC
DAPDIDFEWLQSLNVPVLGVCYGMQLLNIKHGGTVKASNKREYGPAALLPETCVGLYRDM
CCCCCCHHHHHHCCCCHHHHHHCHHEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHCC
SPSSQVWMSHSDTVDHLAEGCRVIARNAEGVPVSLQWGETTFGIQFHPEVTHSHEGRTIL
CCCCCEEECCCCCHHHHHHHHHEECCCCCCCEEEEEECCEEEEEEECCCCCCCCCCHHHH
RNFLSCAANLKKFDIGDFKRELIREIRERVGNRQVVCGVSGGVDSTVLAVLLHEAGVNMR
HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCEE
AIFVDNGLLRKNEAEEVRANFARMNVEIETVDASERFLAALDGESDPEKKRRIIGGLFID
EEEECCCCCCCCCHHHHHHHHHHEEEEEEEECCHHHHHHCCCCCCCHHHHHHHHHHHHHH
VFWDAVGDAEMLAQGTLYPDVIESASNAKSKASVIKTHHNRVERVLELQAQGKVLEPLAE
HHHHHCCCHHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH
LFKDEVRELGASMGIPHDILWRHPFPGPGLAVRCPGVVTKERLDIIRECDAIFIGNLKKY
HHHHHHHHHHHHCCCCHHHHCCCCCCCCCCEEECCCCCCHHHHHHHHHCCHHEECCCHHC
GWYDKVWQAYAGLIPVKTVGVKGDERSYEWATNLRAIVSEDAMTADWVELPPALLRETSN
CCHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHH
RILNEVKGINRVLYDISTKPPASIEWE
HHHHHHHHHHHHHHCCCCCCCCCCCCC
>Mature Secondary Structure
MDDKHLVAVIDFGSQYTQLIVRRVRELGYMAKLYALEDLDQIHEPGAVILSGGPKSTTDA
CCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCC
DAPDIDFEWLQSLNVPVLGVCYGMQLLNIKHGGTVKASNKREYGPAALLPETCVGLYRDM
CCCCCCHHHHHHCCCCHHHHHHCHHEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHCC
SPSSQVWMSHSDTVDHLAEGCRVIARNAEGVPVSLQWGETTFGIQFHPEVTHSHEGRTIL
CCCCCEEECCCCCHHHHHHHHHEECCCCCCCEEEEEECCEEEEEEECCCCCCCCCCHHHH
RNFLSCAANLKKFDIGDFKRELIREIRERVGNRQVVCGVSGGVDSTVLAVLLHEAGVNMR
HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCEE
AIFVDNGLLRKNEAEEVRANFARMNVEIETVDASERFLAALDGESDPEKKRRIIGGLFID
EEEECCCCCCCCCHHHHHHHHHHEEEEEEEECCHHHHHHCCCCCCCHHHHHHHHHHHHHH
VFWDAVGDAEMLAQGTLYPDVIESASNAKSKASVIKTHHNRVERVLELQAQGKVLEPLAE
HHHHHCCCHHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH
LFKDEVRELGASMGIPHDILWRHPFPGPGLAVRCPGVVTKERLDIIRECDAIFIGNLKKY
HHHHHHHHHHHHCCCCHHHHCCCCCCCCCCEEECCCCCCHHHHHHHHHCCHHEECCCHHC
GWYDKVWQAYAGLIPVKTVGVKGDERSYEWATNLRAIVSEDAMTADWVELPPALLRETSN
CCHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHH
RILNEVKGINRVLYDISTKPPASIEWE
HHHHHHHHHHHHHHCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA