Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is cobDQ [H]
Identifier: 187736170
GI number: 187736170
Start: 2040750
End: 2043326
Strand: Reverse
Name: cobDQ [H]
Synonym: Amuc_1683
Alternate gene names: 187736170
Gene position: 2043326-2040750 (Counterclockwise)
Preceding gene: 187736174
Following gene: 187736169
Centisome position: 76.7
GC content: 61.66
Gene sequence:
>2577_bases ATGAACGTTTTTTCACATGGCGGAGATTTAAAATCCCTGGCGGAGGATGCCTGCCGCCCGGAACGGGATATTTTGGACTT CAGCGTCAACCTGAGGCCGGAAGGTATGCCGGAGTTCATCGTTTCAGCATTGTGGAAGGCCATGGAAAACGCAGTACCCT ACCCTTCTCCGGATGCGGCGGATCTGCGGGAACTGGCGGCGGTTCATTACGGGCTTCCTTCCGGCTGCTTCGTATTTGGA AACGGGGCCAATGAACTCATCCACGCTCTTCCGCGCGCATTAAACCTTAAACAGGCCGTCATTCCGGAACCCGCTTTTTC CGAATACAGGCTGGCCTGCCTGCGCCACGGCACGGATATTCTTTCCATCCGGACGGAGGAACGGAATTCCTTCCTCCCGT CTCTTTGCCGTCTGGAGGAACAGGCCGCAGACGGAAGCGCTGTTTTTCTGGCAAATCCCAGCAATCCGTCCGGCGGCCTG CTGGACGCAGCGGCCCTGCACAGGGTTGTCCAGAGCCGCCCCGAAGTCCTCTGGATCATTGACGAATCTTTCATGGATTA CGCACAAGGAGCGGAATCGCTGCTCCATGAAGCAGCCCTCCTTCCCAACCTGGTGGTTCTGCGCTCCCTGACCAAATTTT ACGGCATGGCCGGCGTCCGGTGCGGTTTTTCCATCTGTGCCGCTCCGCTCGCGGAGCGGCTGCGGCGATCCCTGCCCGCC TGGAACGTGAATGCCTTTGCGACGGCAGCGGTGAAGGCCGTGCTGGCACAACCCTCCTCCTGGGCAGACAGGGAACGCGC CCGGAACCGGGAACGCCGGGACGACCTGTTCCGCAGGCTCTCCTCCCTGCCGGGCGCCGCCGTACTCCCGTCCGAGGCCA ACTTCCTGCTCTTCCGCCTGGCGGGGGCGCCTCATGGCCTGGCAGCCCGGCTCCTGAAAAAATACGGCATCGCACTGCGC GACTGTTCCAATTATCCGGGTCTGGAAACGGGCTGCTGGTTGCGCTCCGGCGTCCGCACGCCGGAGGAACACGCCCTGCT GGCGGAAGCTCTGCGCGCCGAACTGGCGGGAAACGGCCCCTCCATTATCCGTAAGGCTCCCAAACCGGCCCTGATGATTC AGGGCACCTGCTCCGATGCAGGAAAAAGCGTTCTCACGGCAGCCCTGTGCCGCATTTTCCTTCAGGACGGCTATCACGTG GCACCGTTCAAGGCGCAGAACATGGCTCTCAACTCCGGCGTAACTGCGCTGGGAGAGGAAATGGGCCGCGCCCAGCTGGT GCAGGCCCAGGCCTGCCGCATTGATCCGGATGCCAGAATGAACCCCATTCTTCTCAAGCCCCATTCCAATACCGGCTCCC AGGTGATCGTGATGGGGCGCCCCGTAGGCCGCATGGACGCGCGGGAATACTTCACGGCTAAAAGGCGCTTCTGGCCGGAC GTATGCAAAGCATACGATTCCCTGGCGGACGAATATGAACTCCTCTGCCTGGAAGGGGCCGGAAGCCCCGGAGAAATCAA TCTGAAATCAGCAGACGTGGTCAACATGAACATGGCCCGCTACGCGCGTGCCAGAGTCCTGCTCGCCGGGGACATTGACC GTGGCGGAGTGTACGCCTCCTTTCTGGGAACGTGGATGACATTCGCCCCGTGGGAAAAAGAACTGCTGGCGGGGTTCGTG GTCAACAAATTTCGAGGAGATCCGGATCTGCTGGCCCCGGCGCACAGCTACATGCGGAATCGTACGGGCAAGCCTGTGCT GGGCGTCATCCCGATGATGCGGGACATCAACATTCCGGAAGAAGACCGCGCCACGCTGCCCCCTGGCCACGGGGAGCACG GGAAACATGCGGATTGCCTGGATGTGGCCGTAGTCATGCCCGCCCACGTCTCCAACTTCACGGACTTCGCCCCTCTGGCG GCGGAGCCGGACGTCCGGCTCCGCCAGGTGCGGACACGGGAGGAATGGGGAAATCCGGACCTGGTCATTCTGCCCGGCAC TAAAAGCGTGGCTGCGGACCTGGCTTCTCTCCGTTCCGCCGGGCTGGAAGAACCCATCCGCCGTCATGCCGAAAAGGGAA AGTGGCTTCTGGGCGTCTGCGGAGGGCTGCAAATGCTGGGGACGGACATTCTGGACCCGCTGCACATGGAATCCCCGGAG GAGCGCACGCCAGGACTGGGGCTGCTGGAACTTTCCACCACCTTCTCTTCAGCCAAAACCCTGATCAACGTGCGCCGGGC AAGCACGCCGCTGCCTGTTCCCGCCGCAGGTTATGAAATCCACCATGGCGTCACCAGCCATCAAGAATCCAGCCCCCCCG TCATGTTCCGGGAAGACGGCTCCCCCTGCGGCTACGGCAAAGGCCGTATCTGGGCCACGTATCTGCACGGCATGCTGGAC GGAGACCAATTCCGCCGCGCATTCATCAACATGGTCAGGAAAGATTCAGGGCTGAAAGCCAACCCAGCCCTGCACACCGC CTATGACCTGGACGGAGCGCTGGACCGCCTGGCGGACGTGGTCCGGAAACATCTGGACCTGAAAACCATTTACCGAGCTC TCCAACTAAAACGCTGA
Upstream 100 bases:
>100_bases GCGAAGCTTCCGGCTACGTGGAATTCCGAACTTGCAAGGCGCTTCCTTTCTTTCTCCGGCGCGGGCAGCCCTTCGGTACG CCGGTTCCAATTACTGTGGC
Downstream 100 bases:
>100_bases CCCATGTTCCCGTGCCCGTTCATCCTGCCTGCGGCCTTCCTGCTGGATATTCTGGCGGGAGAACCGCCCAACAGGTTTCA TCCCGTCTGTCTGATCGGCC
Product: cobyric acid synthase CobQ
Products: NA
Alternate protein names: Putative threonine-phosphate decarboxylase; L-threonine-O-3-phosphate decarboxylase; Cobyric acid synthase [H]
Number of amino acids: Translated: 858; Mature: 858
Protein sequence:
>858_residues MNVFSHGGDLKSLAEDACRPERDILDFSVNLRPEGMPEFIVSALWKAMENAVPYPSPDAADLRELAAVHYGLPSGCFVFG NGANELIHALPRALNLKQAVIPEPAFSEYRLACLRHGTDILSIRTEERNSFLPSLCRLEEQAADGSAVFLANPSNPSGGL LDAAALHRVVQSRPEVLWIIDESFMDYAQGAESLLHEAALLPNLVVLRSLTKFYGMAGVRCGFSICAAPLAERLRRSLPA WNVNAFATAAVKAVLAQPSSWADRERARNRERRDDLFRRLSSLPGAAVLPSEANFLLFRLAGAPHGLAARLLKKYGIALR DCSNYPGLETGCWLRSGVRTPEEHALLAEALRAELAGNGPSIIRKAPKPALMIQGTCSDAGKSVLTAALCRIFLQDGYHV APFKAQNMALNSGVTALGEEMGRAQLVQAQACRIDPDARMNPILLKPHSNTGSQVIVMGRPVGRMDAREYFTAKRRFWPD VCKAYDSLADEYELLCLEGAGSPGEINLKSADVVNMNMARYARARVLLAGDIDRGGVYASFLGTWMTFAPWEKELLAGFV VNKFRGDPDLLAPAHSYMRNRTGKPVLGVIPMMRDINIPEEDRATLPPGHGEHGKHADCLDVAVVMPAHVSNFTDFAPLA AEPDVRLRQVRTREEWGNPDLVILPGTKSVAADLASLRSAGLEEPIRRHAEKGKWLLGVCGGLQMLGTDILDPLHMESPE ERTPGLGLLELSTTFSSAKTLINVRRASTPLPVPAAGYEIHHGVTSHQESSPPVMFREDGSPCGYGKGRIWATYLHGMLD GDQFRRAFINMVRKDSGLKANPALHTAYDLDGALDRLADVVRKHLDLKTIYRALQLKR
Sequences:
>Translated_858_residues MNVFSHGGDLKSLAEDACRPERDILDFSVNLRPEGMPEFIVSALWKAMENAVPYPSPDAADLRELAAVHYGLPSGCFVFG NGANELIHALPRALNLKQAVIPEPAFSEYRLACLRHGTDILSIRTEERNSFLPSLCRLEEQAADGSAVFLANPSNPSGGL LDAAALHRVVQSRPEVLWIIDESFMDYAQGAESLLHEAALLPNLVVLRSLTKFYGMAGVRCGFSICAAPLAERLRRSLPA WNVNAFATAAVKAVLAQPSSWADRERARNRERRDDLFRRLSSLPGAAVLPSEANFLLFRLAGAPHGLAARLLKKYGIALR DCSNYPGLETGCWLRSGVRTPEEHALLAEALRAELAGNGPSIIRKAPKPALMIQGTCSDAGKSVLTAALCRIFLQDGYHV APFKAQNMALNSGVTALGEEMGRAQLVQAQACRIDPDARMNPILLKPHSNTGSQVIVMGRPVGRMDAREYFTAKRRFWPD VCKAYDSLADEYELLCLEGAGSPGEINLKSADVVNMNMARYARARVLLAGDIDRGGVYASFLGTWMTFAPWEKELLAGFV VNKFRGDPDLLAPAHSYMRNRTGKPVLGVIPMMRDINIPEEDRATLPPGHGEHGKHADCLDVAVVMPAHVSNFTDFAPLA AEPDVRLRQVRTREEWGNPDLVILPGTKSVAADLASLRSAGLEEPIRRHAEKGKWLLGVCGGLQMLGTDILDPLHMESPE ERTPGLGLLELSTTFSSAKTLINVRRASTPLPVPAAGYEIHHGVTSHQESSPPVMFREDGSPCGYGKGRIWATYLHGMLD GDQFRRAFINMVRKDSGLKANPALHTAYDLDGALDRLADVVRKHLDLKTIYRALQLKR >Mature_858_residues MNVFSHGGDLKSLAEDACRPERDILDFSVNLRPEGMPEFIVSALWKAMENAVPYPSPDAADLRELAAVHYGLPSGCFVFG NGANELIHALPRALNLKQAVIPEPAFSEYRLACLRHGTDILSIRTEERNSFLPSLCRLEEQAADGSAVFLANPSNPSGGL LDAAALHRVVQSRPEVLWIIDESFMDYAQGAESLLHEAALLPNLVVLRSLTKFYGMAGVRCGFSICAAPLAERLRRSLPA WNVNAFATAAVKAVLAQPSSWADRERARNRERRDDLFRRLSSLPGAAVLPSEANFLLFRLAGAPHGLAARLLKKYGIALR DCSNYPGLETGCWLRSGVRTPEEHALLAEALRAELAGNGPSIIRKAPKPALMIQGTCSDAGKSVLTAALCRIFLQDGYHV APFKAQNMALNSGVTALGEEMGRAQLVQAQACRIDPDARMNPILLKPHSNTGSQVIVMGRPVGRMDAREYFTAKRRFWPD VCKAYDSLADEYELLCLEGAGSPGEINLKSADVVNMNMARYARARVLLAGDIDRGGVYASFLGTWMTFAPWEKELLAGFV VNKFRGDPDLLAPAHSYMRNRTGKPVLGVIPMMRDINIPEEDRATLPPGHGEHGKHADCLDVAVVMPAHVSNFTDFAPLA AEPDVRLRQVRTREEWGNPDLVILPGTKSVAADLASLRSAGLEEPIRRHAEKGKWLLGVCGGLQMLGTDILDPLHMESPE ERTPGLGLLELSTTFSSAKTLINVRRASTPLPVPAAGYEIHHGVTSHQESSPPVMFREDGSPCGYGKGRIWATYLHGMLD GDQFRRAFINMVRKDSGLKANPALHTAYDLDGALDRLADVVRKHLDLKTIYRALQLKR
Specific function: Catalyzes two activities which are involved in the adenosylcobalamin biosynthesis:decarboxylates L-threonine-O-3- phosphate to yield (R)-1-amino-2-propanol O-2-phosphate, the precursor for the linkage between the nucleotide loop and the corrin ring in cob
COG id: COG1492
COG function: function code H; Cobyric acid synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 GATase cobBQ-type domain [H]
Homologues:
Organism=Escherichia coli, GI1788332, Length=307, Percent_Identity=26.7100977198697, Blast_Score=83, Evalue=8e-17, Organism=Saccharomyces cerevisiae, GI6322075, Length=300, Percent_Identity=27.3333333333333, Blast_Score=90, Evalue=2e-18,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004839 - InterPro: IPR002586 - InterPro: IPR017929 - InterPro: IPR004459 - InterPro: IPR011698 - InterPro: IPR004838 - InterPro: IPR015424 - InterPro: IPR015421 - InterPro: IPR015422 [H]
Pfam domain/function: PF00155 Aminotran_1_2; PF01656 CbiA; PF07685 GATase_3 [H]
EC number: =4.1.1.81 [H]
Molecular weight: Translated: 93816; Mature: 93816
Theoretical pI: Translated: 7.57; Mature: 7.57
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein) 2.7 %Met (Translated Protein) 4.5 %Cys+Met (Translated Protein) 1.9 %Cys (Mature Protein) 2.7 %Met (Mature Protein) 4.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNVFSHGGDLKSLAEDACRPERDILDFSVNLRPEGMPEFIVSALWKAMENAVPYPSPDAA CCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCHH DLRELAAVHYGLPSGCFVFGNGANELIHALPRALNLKQAVIPEPAFSEYRLACLRHGTDI HHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHCCCCE LSIRTEERNSFLPSLCRLEEQAADGSAVFLANPSNPSGGLLDAAALHRVVQSRPEVLWII EEEEEHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEE DESFMDYAQGAESLLHEAALLPNLVVLRSLTKFYGMAGVRCGFSICAAPLAERLRRSLPA CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCC WNVNAFATAAVKAVLAQPSSWADRERARNRERRDDLFRRLSSLPGAAVLPSEANFLLFRL CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEEEEE AGAPHGLAARLLKKYGIALRDCSNYPGLETGCWLRSGVRTPEEHALLAEALRAELAGNGP CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCCCH SIIRKAPKPALMIQGTCSDAGKSVLTAALCRIFLQDGYHVAPFKAQNMALNSGVTALGEE HHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEECCCCCCCEEHHCCHHHHHHH MGRAQLVQAQACRIDPDARMNPILLKPHSNTGSQVIVMGRPVGRMDAREYFTAKRRFWPD HHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCHH VCKAYDSLADEYELLCLEGAGSPGEINLKSADVVNMNMARYARARVLLAGDIDRGGVYAS HHHHHHHHCCCEEEEEEECCCCCCEEEECCCCEECHHHHHHHHHEEEEEECCCCCCHHHH FLGTWMTFAPWEKELLAGFVVNKFRGDPDLLAPAHSYMRNRTGKPVLGVIPMMRDINIPE HHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCC EDRATLPPGHGEHGKHADCLDVAVVMPAHVSNFTDFAPLAAEPDVRLRQVRTREEWGNPD CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCC LVILPGTKSVAADLASLRSAGLEEPIRRHAEKGKWLLGVCGGLQMLGTDILDPLHMESPE EEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHCCEEEEHHCCHHHHHHHHCCCCCCCCCC ERTPGLGLLELSTTFSSAKTLINVRRASTPLPVPAAGYEIHHGVTSHQESSPPVMFREDG CCCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCEEEECCC SPCGYGKGRIWATYLHGMLDGDQFRRAFINMVRKDSGLKANPALHTAYDLDGALDRLADV CCCCCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCEEHHCCHHHHHHHHHH VRKHLDLKTIYRALQLKR HHHHCCHHHHHHHHHCCC >Mature Secondary Structure MNVFSHGGDLKSLAEDACRPERDILDFSVNLRPEGMPEFIVSALWKAMENAVPYPSPDAA CCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCHH DLRELAAVHYGLPSGCFVFGNGANELIHALPRALNLKQAVIPEPAFSEYRLACLRHGTDI HHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHCCCCE LSIRTEERNSFLPSLCRLEEQAADGSAVFLANPSNPSGGLLDAAALHRVVQSRPEVLWII EEEEEHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEE DESFMDYAQGAESLLHEAALLPNLVVLRSLTKFYGMAGVRCGFSICAAPLAERLRRSLPA CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCC WNVNAFATAAVKAVLAQPSSWADRERARNRERRDDLFRRLSSLPGAAVLPSEANFLLFRL CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEEEEE AGAPHGLAARLLKKYGIALRDCSNYPGLETGCWLRSGVRTPEEHALLAEALRAELAGNGP CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCCCH SIIRKAPKPALMIQGTCSDAGKSVLTAALCRIFLQDGYHVAPFKAQNMALNSGVTALGEE HHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEECCCCCCCEEHHCCHHHHHHH MGRAQLVQAQACRIDPDARMNPILLKPHSNTGSQVIVMGRPVGRMDAREYFTAKRRFWPD HHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCHH VCKAYDSLADEYELLCLEGAGSPGEINLKSADVVNMNMARYARARVLLAGDIDRGGVYAS HHHHHHHHCCCEEEEEEECCCCCCEEEECCCCEECHHHHHHHHHEEEEEECCCCCCHHHH FLGTWMTFAPWEKELLAGFVVNKFRGDPDLLAPAHSYMRNRTGKPVLGVIPMMRDINIPE HHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCC EDRATLPPGHGEHGKHADCLDVAVVMPAHVSNFTDFAPLAAEPDVRLRQVRTREEWGNPD CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCC LVILPGTKSVAADLASLRSAGLEEPIRRHAEKGKWLLGVCGGLQMLGTDILDPLHMESPE EEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHCCEEEEHHCCHHHHHHHHCCCCCCCCCC ERTPGLGLLELSTTFSSAKTLINVRRASTPLPVPAAGYEIHHGVTSHQESSPPVMFREDG CCCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCEEEECCC SPCGYGKGRIWATYLHGMLDGDQFRRAFINMVRKDSGLKANPALHTAYDLDGALDRLADV CCCCCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCEEHHCCHHHHHHHHHH VRKHLDLKTIYRALQLKR HHHHCCHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12712204 [H]