Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is nahA [H]

Identifier: 187735370

GI number: 187735370

Start: 1036915

End: 1038564

Strand: Reverse

Name: nahA [H]

Synonym: Amuc_0868

Alternate gene names: 187735370

Gene position: 1038564-1036915 (Counterclockwise)

Preceding gene: 187735372

Following gene: 187735369

Centisome position: 38.98
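
The coordinates above are enough to reproduce the gene length and the centisome value. The sketch below assumes, since the record does not say so, that the centisome position is the first transcribed base expressed as a percentage of genome length; for a reverse-strand gene that is the larger coordinate. The function names are illustrative and not part of any database API.

```python
GENOME_LENGTH = 2_664_102          # "Length" field of NC_010655
START, END = 1_036_915, 1_038_564  # "Start" and "End" fields (1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Assumed definition: first transcribed base as a percentage of the genome."""
    first_base = end if strand == "Reverse" else start
    return round(100 * first_base / genome_length, 2)


print(gene_length(START, END))                          # 1650
print(centisome(START, END, "Reverse", GENOME_LENGTH))  # 38.98
```

Under this assumption the result matches the listed value of 38.98, and 1650 bases correspond to 550 codons, i.e. 549 residues plus a stop codon.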

GC content: 58.36
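
GC content is conventionally the percentage of G and C bases in a sequence. The following is a minimal sketch under that assumption; `gc_percent` is a hypothetical helper, and the record does not state how its own figure was computed. Applied to the gene sequence listed in this record, it should give a value comparable to the one above if the same definition was used.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# First four codons of the gene sequence, used only as a short demonstration.
print(gc_percent("ATGATTTCCAAG"))  # 33.33
```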

Gene sequence:

>1650_bases
ATGATTTCCAAGTGTACCTTTTCCGCCACGGTTTTCAGCCTGTTTTCCCTTTGCTGGGGCGCCCCATCCTCTCCGGTTCT
CGAAGCGCCCCATACCATTCCCCTGCCCGCCGCCATGCGCGTCCAAACCGGAGAAAGCGGGTTTTCCCTGAAAAACGGCG
TCAGGCTCCCGGAAAAAAATCCTCTTTCCAGGCAGGCGGAACGGATTTTCCGCGACAACGGGATCAACACGGCCCTGGTT
AAAAACAACGCGGACATCATCTTTACGGAAGACGCTTCCCTGGGCAGGGAAGGCTACCGCCTTGCCGTAACGCCGGATTC
CATCTCCATTGCCTCCGGTTCCGTGAACGGAACCCTGTATGCCCTTCAATCCCTCGTTCAAAGCATCGCTGCCGACAAAA
ACGGAGCTCCGGCCCTGCCCCGGATGGACGTAAAAGACCAGCCCCGCTTTTCATGGCGGGGCCTGATGGTAGACAGCTGC
CGCCACATGATGCCCGTGCGGGACATCAAAAAAGTGCTGGACCTGATGGAACGGTATAAATTCAACACCCTGCACTGGCA
CCTGACGGACGACCAGGGGTGGCGTCTCCCAATCGCCAAGTACCCCAGGCTGACAACCGTGGGAGGCGCCCGGGCTCAAT
CCCCCGTCATCGGCAACCGCAATAAGGGAGACGGCATCCCCTACTCCGGCCATTACACCGCAGATGAAATCCGGGATGTG
GTGCGGTACGCCAGAGACCGGGGCATTACCGTCATTCCGGAAGTGGAAATGCCAGGCCATGCCTCCGCAGCCATCGCCGC
CTATCCGGAACTGGGGAATACGGACATCCCGGGTTATGAGCCTAGGGTGCAGGAAACCTGGGGCGTGCACTCCTATACCT
TCTCCCCCACGGAAAAAACCTTCCGTTTTCTGGAAGACGTCATTGATGAAATATGCGCCCTGTTCCCGGACAGCCCCTAC
ATCCACATCGGAGGGGATGAAGCGCCCAAGAATCAGTGGAAACAGTCCCCCACGGCCCAGCGGGTCATGAAGGACAACGG
CCTGGCCAATGAACACGAGCTCCAGAGCTACTTCATCCGCCGCGTGGAAAAAATGATCAATAACCGCGGAAAAAGGCTCA
TTGGCTGGGATGAAATCCAGGAAGGGGGCCTTTCCCCCACCGCTACCATGATGGTTTGGCGCAGCCAAATGCCGCACATC
GCCGCACAAGCCCTGGCTCAAGGCAACGATATTGTGATGACGCCCAACAGCCACCTGTACTTTGACTATGACCAGGGGCC
CGGAAAACCCGCTGCCCCCGAATACGAGACGATTAATAACAATCAGCTGACCTGGCAGCATGTTTACGGACTGGAACCGG
TGCCTCAGGGAACGCCCCGGGAACGGGAAAAGCAGGTGCTGGGCTGCCAGGCGAACATCTGGACGGAATATATCCCGAAC
CTGCCGAAATGGGAATACCATGTCTTCCCCCGCGCCCTGGCGCTGGCGGAAGTTGCCTGGACCCCGCAGGAGCTAAAAAA
TGAGAAAGATTTCCGTAAACGCCTCGACCGCCAGCTTCCCTTCCTGGACGCCCGCGGCGTCAATTACAAAAGACCGGACA
ATGGAGCCCCCGCACAGCCGAAGGCCGTCATTACGCGGGAACGCCGTTAA

Upstream 100 bases:

>100_bases
ACGGAACAACCGCTATTTTCATGTGCCGGTTTTCTCTGCATTTGCAAAAATAACCTCGAAACGGAATTTTCCCGGAAAAT
GCCGCGTATTTACGTACGCT

Downstream 100 bases:

>100_bases
GCACGGCAGACGGGTTCCTGCGGAGCCTCCTTCCGGAGGCCGCATTATGGGCTAGCCAACCTTCATGAAAACCATTACAT
TGGCGGCATTATGAATTTGG

Product: Beta-N-acetylhexosaminidase

Products: NA

Alternate protein names: Beta-GlcNAcase; Beta-N-acetylhexosaminidase; Beta-NAHase; N-acetyl-beta-glucosaminidase [H]

Number of amino acids: Translated: 549; Mature: 549

Protein sequence:

>549_residues
MISKCTFSATVFSLFSLCWGAPSSPVLEAPHTIPLPAAMRVQTGESGFSLKNGVRLPEKNPLSRQAERIFRDNGINTALV
KNNADIIFTEDASLGREGYRLAVTPDSISIASGSVNGTLYALQSLVQSIAADKNGAPALPRMDVKDQPRFSWRGLMVDSC
RHMMPVRDIKKVLDLMERYKFNTLHWHLTDDQGWRLPIAKYPRLTTVGGARAQSPVIGNRNKGDGIPYSGHYTADEIRDV
VRYARDRGITVIPEVEMPGHASAAIAAYPELGNTDIPGYEPRVQETWGVHSYTFSPTEKTFRFLEDVIDEICALFPDSPY
IHIGGDEAPKNQWKQSPTAQRVMKDNGLANEHELQSYFIRRVEKMINNRGKRLIGWDEIQEGGLSPTATMMVWRSQMPHI
AAQALAQGNDIVMTPNSHLYFDYDQGPGKPAAPEYETINNNQLTWQHVYGLEPVPQGTPREREKQVLGCQANIWTEYIPN
LPKWEYHVFPRALALAEVAWTPQELKNEKDFRKRLDRQLPFLDARGVNYKRPDNGAPAQPKAVITRERR
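
The protein sequence follows from the gene sequence by reading it codon by codon from the ATG start. Below is a minimal sketch assuming the standard codon assignments (the bacterial code, table 11, uses the same assignments and differs only in permitted start codons). `translate` and `CODON_TABLE` are illustrative names, not the database's own tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG ordering of the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene plus its terminal stop codon (TAA).
print(translate("ATGATTTCCAAGTGTACCTAA"))  # MISKCT
```

The output matches the first six residues of the protein sequence above.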

Sequences:

>Translated_549_residues
MISKCTFSATVFSLFSLCWGAPSSPVLEAPHTIPLPAAMRVQTGESGFSLKNGVRLPEKNPLSRQAERIFRDNGINTALV
KNNADIIFTEDASLGREGYRLAVTPDSISIASGSVNGTLYALQSLVQSIAADKNGAPALPRMDVKDQPRFSWRGLMVDSC
RHMMPVRDIKKVLDLMERYKFNTLHWHLTDDQGWRLPIAKYPRLTTVGGARAQSPVIGNRNKGDGIPYSGHYTADEIRDV
VRYARDRGITVIPEVEMPGHASAAIAAYPELGNTDIPGYEPRVQETWGVHSYTFSPTEKTFRFLEDVIDEICALFPDSPY
IHIGGDEAPKNQWKQSPTAQRVMKDNGLANEHELQSYFIRRVEKMINNRGKRLIGWDEIQEGGLSPTATMMVWRSQMPHI
AAQALAQGNDIVMTPNSHLYFDYDQGPGKPAAPEYETINNNQLTWQHVYGLEPVPQGTPREREKQVLGCQANIWTEYIPN
LPKWEYHVFPRALALAEVAWTPQELKNEKDFRKRLDRQLPFLDARGVNYKRPDNGAPAQPKAVITRERR
>Mature_549_residues
MISKCTFSATVFSLFSLCWGAPSSPVLEAPHTIPLPAAMRVQTGESGFSLKNGVRLPEKNPLSRQAERIFRDNGINTALV
KNNADIIFTEDASLGREGYRLAVTPDSISIASGSVNGTLYALQSLVQSIAADKNGAPALPRMDVKDQPRFSWRGLMVDSC
RHMMPVRDIKKVLDLMERYKFNTLHWHLTDDQGWRLPIAKYPRLTTVGGARAQSPVIGNRNKGDGIPYSGHYTADEIRDV
VRYARDRGITVIPEVEMPGHASAAIAAYPELGNTDIPGYEPRVQETWGVHSYTFSPTEKTFRFLEDVIDEICALFPDSPY
IHIGGDEAPKNQWKQSPTAQRVMKDNGLANEHELQSYFIRRVEKMINNRGKRLIGWDEIQEGGLSPTATMMVWRSQMPHI
AAQALAQGNDIVMTPNSHLYFDYDQGPGKPAAPEYETINNNQLTWQHVYGLEPVPQGTPREREKQVLGCQANIWTEYIPN
LPKWEYHVFPRALALAEVAWTPQELKNEKDFRKRLDRQLPFLDARGVNYKRPDNGAPAQPKAVITRERR

Specific function: Unknown

COG id: COG3525

COG function: function code G; N-acetyl-beta-hexosaminidase

Gene ontology:

Cell location: Cell outer membrane; Lipid-anchor (Probable) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 20 family [H]

Homologues:

Organism=Homo sapiens, GI4504373, Length=439, Percent_Identity=29.3849658314351, Blast_Score=185, Evalue=1e-46,
Organism=Homo sapiens, GI189181666, Length=442, Percent_Identity=28.7330316742081, Blast_Score=178, Evalue=1e-44,
Organism=Caenorhabditis elegans, GI17569815, Length=430, Percent_Identity=28.3720930232558, Blast_Score=152, Evalue=6e-37,
Organism=Drosophila melanogaster, GI24657468, Length=519, Percent_Identity=26.7822736030828, Blast_Score=129, Evalue=6e-30,
Organism=Drosophila melanogaster, GI17647501, Length=519, Percent_Identity=26.7822736030828, Blast_Score=129, Evalue=6e-30,
Organism=Drosophila melanogaster, GI281365639, Length=519, Percent_Identity=26.7822736030828, Blast_Score=129, Evalue=7e-30,
Organism=Drosophila melanogaster, GI24657474, Length=519, Percent_Identity=26.7822736030828, Blast_Score=129, Evalue=7e-30,
Organism=Drosophila melanogaster, GI45551090, Length=448, Percent_Identity=27.4553571428571, Blast_Score=120, Evalue=3e-27,
Organism=Drosophila melanogaster, GI24653074, Length=448, Percent_Identity=27.4553571428571, Blast_Score=120, Evalue=3e-27,
Organism=Drosophila melanogaster, GI17933586, Length=476, Percent_Identity=23.9495798319328, Blast_Score=117, Evalue=3e-26,
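
The homologue entries share a fixed comma-separated layout, so they can be read into structured records for sorting or filtering. The sketch below is one possible parser; `parse_homologue` is a hypothetical helper, and the field handling is inferred only from the lines shown here.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI number without the prefix
    # Convert the numeric fields; the key names are those used in the record.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Homo sapiens, GI4504373, Length=439, "
        "Percent_Identity=29.3849658314351, Blast_Score=185, Evalue=1e-46,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Homo sapiens 4504373 29.4
```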

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR015882
- InterPro:   IPR001540
- InterPro:   IPR015883
- InterPro:   IPR017853
- InterPro:   IPR013781
- InterPro:   IPR011658 [H]

Pfam domain/function: PF00728 Glyco_hydro_20; PF02838 Glyco_hydro_20b [H]

EC number: 3.2.1.52 [H]

Molecular weight: Translated: 61831; Mature: 61831

Theoretical pI: Translated: 8.20; Mature: 8.20

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
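
These percentages are residue counts divided by the sequence length. A minimal sketch follows, assuming one-decimal rounding as in the figures above; `cys_met_content` is an illustrative name. A hand count of the 549-residue sequence in this record gives 5 Cys and 14 Met, so the demonstration uses a synthetic stand-in with those counts rather than the full sequence.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys, Met and Cys+Met residues, rounded to one decimal."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100 * protein.count("C") / n
    met = 100 * protein.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),  # sum taken before rounding
    }


# Synthetic 549-residue stand-in: 5 Cys, 14 Met, remainder Ala (assumption).
stand_in = "C" * 5 + "M" * 14 + "A" * 530
print(cys_met_content(stand_in))
# {'Cys': 0.9, 'Met': 2.6, 'Cys+Met': 3.5}
```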

Secondary structure:

>Translated Secondary Structure
MISKCTFSATVFSLFSLCWGAPSSPVLEAPHTIPLPAAMRVQTGESGFSLKNGVRLPEKN
CCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCCCHHCCCCCCCCC
PLSRQAERIFRDNGINTALVKNNADIIFTEDASLGREGYRLAVTPDSISIASGSVNGTLY
CHHHHHHHHHHHCCCCEEEEECCCCEEEECCCCCCCCCCEEEECCCCEEEECCCCCHHHH
ALQSLVQSIAADKNGAPALPRMDVKDQPRFSWRGLMVDSCRHMMPVRDIKKVLDLMERYK
HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHCCCHHHHHHHHHHHHHCC
FNTLHWHLTDDQGWRLPIAKYPRLTTVGGARAQSPVIGNRNKGDGIPYSGHYTADEIRDV
CCEEEEEEECCCCCCCCHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
VRYARDRGITVIPEVEMPGHASAAIAAYPELGNTDIPGYEPRVQETWGVHSYTFSPTEKT
HHHHHHCCCEEEECCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHCCCCEEECCCHHHH
FRFLEDVIDEICALFPDSPYIHIGGDEAPKNQWKQSPTAQRVMKDNGLANEHELQSYFIR
HHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHCCHHHHHHHHHCCCCCHHHHHHHHHH
RVEKMINNRGKRLIGWDEIQEGGLSPTATMMVWRSQMPHIAAQALAQGNDIVMTPNSHLY
HHHHHHHCCCCEEECHHHHHHCCCCCHHHHHHHHHHCCHHHHHHHHCCCCEEECCCCCEE
FDYDQGPGKPAAPEYETINNNQLTWQHVYGLEPVPQGTPREREKQVLGCQANIWTEYIPN
EEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHCCCHHHHHHHCCC
LPKWEYHVFPRALALAEVAWTPQELKNEKDFRKRLDRQLPFLDARGVNYKRPDNGAPAQP
CCCCEEEHHHHHHHHHHHHCCHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
KAVITRERR
CHHCCCCCC
>Mature Secondary Structure
MISKCTFSATVFSLFSLCWGAPSSPVLEAPHTIPLPAAMRVQTGESGFSLKNGVRLPEKN
CCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCCCHHCCCCCCCCC
PLSRQAERIFRDNGINTALVKNNADIIFTEDASLGREGYRLAVTPDSISIASGSVNGTLY
CHHHHHHHHHHHCCCCEEEEECCCCEEEECCCCCCCCCCEEEECCCCEEEECCCCCHHHH
ALQSLVQSIAADKNGAPALPRMDVKDQPRFSWRGLMVDSCRHMMPVRDIKKVLDLMERYK
HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHCCCHHHHHHHHHHHHHCC
FNTLHWHLTDDQGWRLPIAKYPRLTTVGGARAQSPVIGNRNKGDGIPYSGHYTADEIRDV
CCEEEEEEECCCCCCCCHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
VRYARDRGITVIPEVEMPGHASAAIAAYPELGNTDIPGYEPRVQETWGVHSYTFSPTEKT
HHHHHHCCCEEEECCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHCCCCEEECCCHHHH
FRFLEDVIDEICALFPDSPYIHIGGDEAPKNQWKQSPTAQRVMKDNGLANEHELQSYFIR
HHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHCCHHHHHHHHHCCCCCHHHHHHHHHH
RVEKMINNRGKRLIGWDEIQEGGLSPTATMMVWRSQMPHIAAQALAQGNDIVMTPNSHLY
HHHHHHHCCCCEEECHHHHHHCCCCCHHHHHHHHHHCCHHHHHHHHCCCCEEECCCCCEE
FDYDQGPGKPAAPEYETINNNQLTWQHVYGLEPVPQGTPREREKQVLGCQANIWTEYIPN
EEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHCCCHHHHHHHCCC
LPKWEYHVFPRALALAEVAWTPQELKNEKDFRKRLDRQLPFLDARGVNYKRPDNGAPAQP
CCCCEEEHHHHHHHHHHHHCCHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
KAVITRERR
CHHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 7881557; 12949112 [H]