Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is glaA
Identifier: 187735954
GI number: 187735954
Start: 1753731
End: 1755521
Strand: Reverse
Name: glaA
Synonym: Amuc_1463
Alternate gene names: 187735954
Gene position: 1755521-1753731 (Counterclockwise)
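The coordinates above can be cross-checked against the sequence length and residue count reported elsewhere in this record. The sketch below assumes that Start and End are 1-based and inclusive, which the record does not state explicitly. The constant and function names are illustrative only.

```python
# Coordinates copied from the "Start" and "End" fields of this record.
GENE_START = 1753731
GENE_END = 1755521


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


length = gene_length(GENE_START, GENE_END)   # number of bases in the gene
codons = length // 3                         # codons, including the stop codon
residues = codons - 1                        # residues, excluding the stop codon

print(length, codons, residues)
```

Under that assumption the gene spans 1791 bases, or 597 codons. That agrees with the `>1791_bases` header of the gene sequence and with the 596 residues reported for the protein once the terminal stop codon is excluded.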
Preceding gene: 187735955
Following gene: 187735947
Centisome position: 65.9
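The record does not say how the centisome position is derived. A common convention, assumed here, expresses a coordinate as a percentage of the total genome length. The sketch uses the genome length from the header (2,664,102) and a hypothetical helper name.

```python
# Genome length copied from the "Length" field of this record.
GENOME_LENGTH = 2664102


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


# The gene lies on the reverse strand, so its 5' end is the larger coordinate.
five_prime = round(centisome(1755521, GENOME_LENGTH), 1)
three_prime = round(centisome(1753731, GENOME_LENGTH), 1)
print(five_prime, three_prime)
```

With this definition the reported value of 65.9 is reproduced by the 1755521 coordinate, while 1753731 rounds to 65.8. This suggests, but does not prove, that the value is computed from the 5' end of the reverse-strand gene.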
GC content: 55.11
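The GC content is presumably the percentage of G and C bases in the gene sequence, though the record does not define it. A minimal sketch of that calculation follows. The function name is illustrative, and whitespace is stripped because the sequence in this record is wrapped with spaces.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input only; paste the full gene sequence to compare against 55.11.
example = gc_content("ATGC ATGC")
print(example)
```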
Gene sequence:
>1791_bases ATGCAAAACCCCGTGGCCTCCCTGCTTTTTATTCTCGCAATGTTGACAGGGCCATGCCCGGCAGCGGATTATCCGGAACG GACGGAGCGCACGCAATCCGCAGGGAACCATGTCTGGCATATAGACCCGGACAAGGGAAATGACGGCAATCCGGGAACGG CCCCATCCACGGCGTGGAAAAGCATGGCGCCGGCCAACCGTCTCATCATGGCGCGGGGAGATACGCTCGTCATCCATCCG GGGGAACATGCCGTCTCCCTGGCGCTGATGGGGGAAGGTTCCAAACAGGCCCCGGTCACCATCCGGTTCATGCCCGGCAG GCATATCTTCAAACACGGTGCCCTGATGACCGGAAAACCGCAGATTTCCAACACCAACGATGCGCCCAACGAGCCCAAGG CCATGGCTATCCGCCTGATGGAAGCCAAAAACATACGCCTGGAGGGAAAGCCGGGAGCCACCGACATCCTGCTGGAAGGA AAAGCCATCTTTGTCTGCATGGAACATGCGGAAAACGTTTCCCTGAACGGCCTGGGTTTTGATTACCTGCATCCGACCAT GGGAGAATTTCTGGTCACGGAAGTTGAAGGGGATACCATGAAAGCAACTATTCCGGACGGAACCCTGTACACAGTGAAGG ATGGAAACCTGACCTGGCACGGACCCGGCTGGGAATTCCGCATGGGAGGGTATTCCAAAGTTTTCGATTCCGCCTCCGGC ACTTTCCAGGGCCGTTTCGACCCCGGGAAAACAGTCATCAGAGAATTGTCTCCCGGAAAAATCAGCATTACGTTCAAGGA AGGCTCACCAACCATGAAACCGGGCCAATCCTACCAGAACCGCAATACCCGGAGGGACTGCTGCGGCTTCTTTCAATACA GAAGCAAAAACATCCTCTGGAATAACTGCCATATTTACTATATGCACGGCATGGGCGTCGTTTCCCAGTTCTGCGAGAAC ATCATGTTCAGCCACTTGAAAATAGCCCCGCGCCCCCGTTCCCTCCGCACCAATTCCTCCTGGGCGGATAATCTGCACTT TTCCGGATGCCGGGGAAAAATCATCGTCAAAGATTGCGTGCTGGGAGCCTCCCATGATGACGCCGTTAATGTTCATGGAA CCCACTTGCGCATTATAGACAGGCCGGCCCCCAACAAAATCACCGTCCGGTTCATGCATCCCCAGACCTTCGGCTTTGAC GCCTTTGCCGCAGGAGACCGGATCGATTATGTTTCCTGCAACACGCTGGTACCTTATGCATCCAATACCGTTTCCGGCGT CAAACAACTCAATGAAAAGGAAATAGAACTTACATTGCAACATCCCAATCCCGGGAACATCCAGCCTGACGATGTTGTGG AAAACGTCACATGGACGCCGTCCGTCCACATCAGCAATACGGTATGCCGCCACATTCCCACCAGGGGCTTCCTGCTCACC ACTAGGAAGCCGGTGCTGGTTGAACGGTGCCGGTTTGAAAAAACGGGCATGCCCGCCATCCTGGTGGAAGATGACGCCTC CGGCTGGTATGAATCCGGCGTGGTCAGGAACATGACCATTTCCCGCAATACCTTTATCCAATGCGGAGAAGCCGTCATCC AGATCGTGCCGCACGCTCCACGGCCGGAAGGGGACGTTCACCGGAACATCACCATTACCGGAAACACGTTTGACCTCAAA AACGGAACCGCCATCCGCATTCGCCATACCGGTGACGTCAAGGCGGAGAAAAACACTTTCACCAAAGACGGGAAGAAAAT CCCTGAGGAAAAGGCGGTGGATATCCGGTAG
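The gene sequence is listed 5' to 3' in coding orientation: it begins with ATG and ends with TAG. It can therefore be translated directly, without reverse-complementing. The sketch below builds the standard codon table and translates up to the first stop codon. It assumes the standard assignments apply here, and the helper names are illustrative.

```python
from itertools import product

# Standard codon table, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six and last four codons of the gene sequence in this record.
n_term = translate("ATGCAAAACCCCGTGGCC")
c_term = translate("GATATCCGGTAG")
print(n_term, c_term)
```

The two fragments give `MQNPVA` and `DIR`, matching the start and end of the protein sequence listed in this record.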
Upstream 100 bases:
>100_bases GGGGCCTCCCAGGTGAGCCATGGAAAAGCTGCGGCTTTTTCTCCCGGCCCTGCAAGAATTATCCTTGTCCCGAAAACCGT GTTCCTGTAGGAGTTGCAGC
Downstream 100 bases:
>100_bases AAAACCTCTTTCTTACCGGTGCCGGGAAACATGCACCGTTACGGAAACGCTGTCCGTCCGCGGCAGAATAAACTTGCGCA GGGTAACAACCACGTGCCGC
Product: hypothetical protein
Products: NA
Alternate protein names: Exo-alpha-galactosidase A
Number of amino acids: Translated: 596; Mature: 596
Protein sequence:
>596_residues MQNPVASLLFILAMLTGPCPAADYPERTERTQSAGNHVWHIDPDKGNDGNPGTAPSTAWKSMAPANRLIMARGDTLVIHP GEHAVSLALMGEGSKQAPVTIRFMPGRHIFKHGALMTGKPQISNTNDAPNEPKAMAIRLMEAKNIRLEGKPGATDILLEG KAIFVCMEHAENVSLNGLGFDYLHPTMGEFLVTEVEGDTMKATIPDGTLYTVKDGNLTWHGPGWEFRMGGYSKVFDSASG TFQGRFDPGKTVIRELSPGKISITFKEGSPTMKPGQSYQNRNTRRDCCGFFQYRSKNILWNNCHIYYMHGMGVVSQFCEN IMFSHLKIAPRPRSLRTNSSWADNLHFSGCRGKIIVKDCVLGASHDDAVNVHGTHLRIIDRPAPNKITVRFMHPQTFGFD AFAAGDRIDYVSCNTLVPYASNTVSGVKQLNEKEIELTLQHPNPGNIQPDDVVENVTWTPSVHISNTVCRHIPTRGFLLT TRKPVLVERCRFEKTGMPAILVEDDASGWYESGVVRNMTISRNTFIQCGEAVIQIVPHAPRPEGDVHRNITITGNTFDLK NGTAIRIRHTGDVKAEKNTFTKDGKKIPEEKAVDIR
Sequences:
>Translated_596_residues MQNPVASLLFILAMLTGPCPAADYPERTERTQSAGNHVWHIDPDKGNDGNPGTAPSTAWKSMAPANRLIMARGDTLVIHP GEHAVSLALMGEGSKQAPVTIRFMPGRHIFKHGALMTGKPQISNTNDAPNEPKAMAIRLMEAKNIRLEGKPGATDILLEG KAIFVCMEHAENVSLNGLGFDYLHPTMGEFLVTEVEGDTMKATIPDGTLYTVKDGNLTWHGPGWEFRMGGYSKVFDSASG TFQGRFDPGKTVIRELSPGKISITFKEGSPTMKPGQSYQNRNTRRDCCGFFQYRSKNILWNNCHIYYMHGMGVVSQFCEN IMFSHLKIAPRPRSLRTNSSWADNLHFSGCRGKIIVKDCVLGASHDDAVNVHGTHLRIIDRPAPNKITVRFMHPQTFGFD AFAAGDRIDYVSCNTLVPYASNTVSGVKQLNEKEIELTLQHPNPGNIQPDDVVENVTWTPSVHISNTVCRHIPTRGFLLT TRKPVLVERCRFEKTGMPAILVEDDASGWYESGVVRNMTISRNTFIQCGEAVIQIVPHAPRPEGDVHRNITITGNTFDLK NGTAIRIRHTGDVKAEKNTFTKDGKKIPEEKAVDIR >Mature_596_residues MQNPVASLLFILAMLTGPCPAADYPERTERTQSAGNHVWHIDPDKGNDGNPGTAPSTAWKSMAPANRLIMARGDTLVIHP GEHAVSLALMGEGSKQAPVTIRFMPGRHIFKHGALMTGKPQISNTNDAPNEPKAMAIRLMEAKNIRLEGKPGATDILLEG KAIFVCMEHAENVSLNGLGFDYLHPTMGEFLVTEVEGDTMKATIPDGTLYTVKDGNLTWHGPGWEFRMGGYSKVFDSASG TFQGRFDPGKTVIRELSPGKISITFKEGSPTMKPGQSYQNRNTRRDCCGFFQYRSKNILWNNCHIYYMHGMGVVSQFCEN IMFSHLKIAPRPRSLRTNSSWADNLHFSGCRGKIIVKDCVLGASHDDAVNVHGTHLRIIDRPAPNKITVRFMHPQTFGFD AFAAGDRIDYVSCNTLVPYASNTVSGVKQLNEKEIELTLQHPNPGNIQPDDVVENVTWTPSVHISNTVCRHIPTRGFLLT TRKPVLVERCRFEKTGMPAILVEDDASGWYESGVVRNMTISRNTFIQCGEAVIQIVPHAPRPEGDVHRNITITGNTFDLK NGTAIRIRHTGDVKAEKNTFTKDGKKIPEEKAVDIR
Specific function: Alpha-galactosidase that specifically removes branched alpha-1,3-linked galactose residues present in blood group B antigens. Has no activity toward linear alpha-1,3-linked galactose residues
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 4 PbH1 repeats
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): GLAA_AKKM8 (B2UL12)
Other databases:
- EMBL: CP001071
- RefSeq: YP_001878066.1
- GeneID: 6275708
- GenomeReviews: CP001071_GR
- KEGG: amu:Amuc_1463
- OMA: LKLRFMH
- ProtClustDB: CLSK823424
- InterPro: IPR006626
- InterPro: IPR012334
- InterPro: IPR011050
- Gene3D: G3DSA:2.160.20.10
- SMART: SM00710
Pfam domain/function: SSF51126 Pectin_lyas_like
EC number: 3.2.1.22
Molecular weight: Translated: 65953; Mature: 65953
Theoretical pI: Translated: 8.43; Mature: 8.43
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein)
3.4 %Met (Translated Protein)
5.4 %Cys+Met (Translated Protein)
2.0 %Cys (Mature Protein)
3.4 %Met (Mature Protein)
5.4 %Cys+Met (Mature Protein)
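The Cys/Met percentages are presumably residue counts divided by the protein length, though the record does not say so. A minimal sketch of that calculation, with an illustrative function name, is shown on the first 20 residues of the protein sequence only.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any residue in `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues)
    return 100.0 * hits / len(seq)


# First 20 residues of the protein in this record (not the full sequence).
fragment = "MQNPVASLLFILAMLTGPCP"
cys = residue_percent(fragment, "C")
met = residue_percent(fragment, "M")
cys_met = residue_percent(fragment, "CM")
print(cys, met, cys_met)
```

Running the same function on the full 596-residue sequence should allow a comparison with the 2.0, 3.4 and 5.4 values listed above.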
Secondary structure:
>Translated Secondary Structure
MQNPVASLLFILAMLTGPCPAADYPERTERTQSAGNHVWHIDPDKGNDGNPGTAPSTAWK
CCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHH
SMAPANRLIMARGDTLVIHPGEHAVSLALMGEGSKQAPVTIRFMPGRHIFKHGALMTGKP
HCCCCCEEEEEECCEEEEECCCCEEEEEEECCCCCCCCEEEEECCCCHHHHCCCEEECCC
QISNTNDAPNEPKAMAIRLMEAKNIRLEGKPGATDILLEGKAIFVCMEHAENVSLNGLGF
CCCCCCCCCCCCCEEEEEEEECCCEEEECCCCCEEEEECCCEEEEEEECCCCCEECCCCC
DYLHPTMGEFLVTEVEGDTMKATIPDGTLYTVKDGNLTWHGPGWEFRMGGYSKVFDSASG
HHHCCCCCCEEEEEECCCEEEEECCCCEEEEEECCCEEECCCCEEEECCCHHHHHCCCCC
TFQGRFDPGKTVIRELSPGKISITFKEGSPTMKPGQSYQNRNTRRDCCGFFQYRSKNILW
EEEECCCCHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHCCCCHHHHHHHHHHCCCCEEE
NNCHIYYMHGMGVVSQFCENIMFSHLKIAPRPRSLRTNSSWADNLHFSGCRGKIIVKDCV
ECEEEEEEECHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCCEECCCCCEEEEEEEE
LGASHDDAVNVHGTHLRIIDRPAPNKITVRFMHPQTFGFDAFAAGDRIDYVSCNTLVPYA
ECCCCCCEEEECCEEEEEEECCCCCEEEEEEECCCCCCCCCCCCCCEEEEEECCEECCCC
SNTVSGVKQLNEKEIELTLQHPNPGNIQPDDVVENVTWTPSVHISNTVCRHIPTRGFLLT
CCHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHCCCCCCCEEECCHHHHHCCCCCEEEE
TRKPVLVERCRFEKTGMPAILVEDDASGWYESGVVRNMTISRNTFIQCGEAVIQIVPHAP
CCCCHHHHHHCCCCCCCCEEEEECCCCCHHHCCEEEEEEECCCHHHHHCCHHHEECCCCC
RPEGDVHRNITITGNTFDLKNGTAIRIRHTGDVKAEKNTFTKDGKKIPEEKAVDIR
CCCCCCEEEEEEECCEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MQNPVASLLFILAMLTGPCPAADYPERTERTQSAGNHVWHIDPDKGNDGNPGTAPSTAWK
CCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHH
SMAPANRLIMARGDTLVIHPGEHAVSLALMGEGSKQAPVTIRFMPGRHIFKHGALMTGKP
HCCCCCEEEEEECCEEEEECCCCEEEEEEECCCCCCCCEEEEECCCCHHHHCCCEEECCC
QISNTNDAPNEPKAMAIRLMEAKNIRLEGKPGATDILLEGKAIFVCMEHAENVSLNGLGF
CCCCCCCCCCCCCEEEEEEEECCCEEEECCCCCEEEEECCCEEEEEEECCCCCEECCCCC
DYLHPTMGEFLVTEVEGDTMKATIPDGTLYTVKDGNLTWHGPGWEFRMGGYSKVFDSASG
HHHCCCCCCEEEEEECCCEEEEECCCCEEEEEECCCEEECCCCEEEECCCHHHHHCCCCC
TFQGRFDPGKTVIRELSPGKISITFKEGSPTMKPGQSYQNRNTRRDCCGFFQYRSKNILW
EEEECCCCHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHCCCCHHHHHHHHHHCCCCEEE
NNCHIYYMHGMGVVSQFCENIMFSHLKIAPRPRSLRTNSSWADNLHFSGCRGKIIVKDCV
ECEEEEEEECHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCCEECCCCCEEEEEEEE
LGASHDDAVNVHGTHLRIIDRPAPNKITVRFMHPQTFGFDAFAAGDRIDYVSCNTLVPYA
ECCCCCCEEEECCEEEEEEECCCCCEEEEEEECCCCCCCCCCCCCCEEEEEECCEECCCC
SNTVSGVKQLNEKEIELTLQHPNPGNIQPDDVVENVTWTPSVHISNTVCRHIPTRGFLLT
CCHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHCCCCCCCEEECCHHHHHCCCCCEEEE
TRKPVLVERCRFEKTGMPAILVEDDASGWYESGVVRNMTISRNTFIQCGEAVIQIVPHAP
CCCCHHHHHHCCCCCCCCEEEEECCCCCHHHCCEEEEEEECCCHHHHHCCHHHEECCCCC
RPEGDVHRNITITGNTFDLKNGTAIRIRHTGDVKAEKNTFTKDGKKIPEEKAVDIR
CCCCCCEEEEEEECCEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA