Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is 187736052

Identifier: 187736052

GI number: 187736052

Start: 1878236

End: 1881109

Strand: Reverse
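
A gene on the reverse strand is read from the complement of the forward strand, in the opposite direction. The sketch below shows that operation on a short fragment. The helper name `reverse_complement` is hypothetical. The nine-base input is illustrative: it is what the forward strand would read if the gene sequence listed in this record (which ends in `CCTTTTTAA`) is the coding strand, which is an assumption.

```python
# Hypothetical helper: map each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string containing only A/C/G/T."""
    return seq.translate(_COMPLEMENT)[::-1]


# Illustrative forward-strand fragment (assumption, see the note above).
forward_fragment = "TTAAAAAGG"
print(reverse_complement(forward_fragment))  # prints CCTTTTTAA
```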

Name: 187736052

Synonym: Amuc_1562

Alternate gene names: NA

Gene position: 1881109-1878236 (Counterclockwise)
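
The gene length follows from the two coordinates, assuming they are 1-based and inclusive (an assumption, though it agrees with the 2874-base sequence listed in this record). A minimal sketch, with the hypothetical function name `gene_length`:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates.

    The order of the two coordinates does not matter, so the same call
    works for forward- and reverse-strand genes.
    """
    return abs(end - start) + 1


# Coordinates as listed in this record.
print(gene_length(1878236, 1881109))  # prints 2874
```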

Preceding gene: 187736053

Following gene: 187736051

Centisome position: 70.61
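
The centisome position expresses a coordinate as a percentage of the genome length. The record does not state which coordinate is used; the sketch below assumes it is the gene's 5' coordinate (1881109 for this reverse-strand gene), which reproduces the listed value with the genome length of 2,664,102. The function name `centisome` is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * position / genome_length, 2)


# Assumption: the 5' end of the gene (the larger coordinate on the reverse strand).
print(centisome(1881109, 2664102))  # prints 70.61
```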

GC content: 60.13
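
GC content is the percentage of bases that are G or C. The sketch below is one way to compute it; the function name `gc_percent` is hypothetical, and the example input is only the first twelve bases of the gene sequence, so its result is not the whole-gene value listed above.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, rounded to 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First twelve bases of the gene sequence, used as a small example.
print(gc_percent("ATGAACCGATTC"))  # prints 41.67
```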

Gene sequence:

>2874_bases
ATGAACCGATTCTTTCTATCCTGGGACAAACCGGCCTGCCGTGCTGTAGCGGAACGCCTGCTTTCCCTGGGAAATGGCTT
CCACCGGCATCTGGTGCTGGTTCCCACACGGGAATCCGGAAGGCAGCTGAGGGAATACCTTGCCTCCATTTCCCGTACAC
AGGCCATCTTTGCTCCCCAAGTCATTCCGGCAGACCAGTTCCCCCGGATGGAAGAAAAGGAAGAAACGGCGTCCGCTCCG
GAAGAACTGGCCGGGTGGCTCCTGGCCCTGGGGAAAACGCCTCACCGCCTTTATCCCCGCCTGTTTCCACGCACCATGCC
GGAGGATTTTTCCAGCATGCTGGAAATGGCCGGCAGCCTGCAAAACCTGAGGCACGCCATGGCGAACCAGGGCGTCTCCT
GCATCATGGCCCACCATGCCTGTGCGGGCAGAGACGAACGCTGGAAGGACATGGAAAGGCTGGAGGGACAATGCACGCAA
CAACTGGAAAGCTGGAAGCTTGAAGACAGGACCGCCATGAAGGCGGAAGCCCCGCCCCGGCTGCTGAATTCCCTGAGGGA
AACAGGTGGAAACATTATTCTGGCCTGCACAGCGGAGGTGCCGCCCCCCCTGCGCCATGCCCTCCAGCATGCGGAAAGAA
ATGGCGTACCCGTCCAGATATGGATACATGCGCCGGAAGAAGAAGCCGCCTCCTTTGATTCCTGGGGATGCCCTCTGCCG
GAAGAGTGGTCCCGGCGCCCCATCCCAATCCGCGACGGTCAAATCAGAATTGCGGCCAACCCGGCGCGGCTGGCTGAAGA
AACGTGCCGCATCATCGCCCGCACAGCGGAAGGAGGAACGCCGGACATCGCCCTGGGCGTTTGCGATCCTGACATGAACG
TCGCCCTGGACGCAGCTCTGCGCCAATACGGCTGGGGCCTCCACAATCCGGAAGGGAAGCCTTTTGCCGGAAGCGGCGTC
ATGGACCTGCTCCGCAACCTTCGGCGGGCGCTGGAAGAAAAGGGGACGGCGCGCCCCGTCTATTCCCTGGCGCGCTCCTC
CCTGCTCTGCGCCTCCCTGGGAATAAAAGGCCAGCAGGGTTGCTGCGCCGCGCTGGACAAAATCCAGCAGAAATTCCTTC
CGGAAACGGAAGAATACCTGCTTTCCAGGCTGAAAGAGGCTTATCCCGGCGCCTTTCCTTCCATCCAGGCCATTCTGGAA
TGGCGGAACCGGATGGCGGAACCGGGAATGCTGGGGGAACGGCTGATGGAATGGTTCCCGGCTTTGGCAACCATCTATGG
CCCGGAGACGGAAGCCATGGAAATGTTCCATTCCTGCCTTTCCGGGCTCATGCGTCTGCAACAGCGCTCCACCGCGTTTT
CCAGCCCGGAGACGGCTCTCCTGCTCCTGATGAAATGCCTCCAGCCGCTGCGTGTCAGAAACAGGAGAAAAGCTCATGCG
GCGCTGGATTCCCTGGGCTGGATGGAAGTTCACTTCCGCCCGGAAAGAAACCTCATTCTCACGGGACTAAACGAAGGGAC
CGTACCGGAAGGAGGCGTTTCCGACCAGTTTATGCCGGAGGAACTGCGGGAAACCCTGGGCATTGATTCCTTTAACCGGA
AAAAAGCCCGGGACAGTTTTCTGCTGACCGCCCTGCTCCATTCCCGCGAACGGGAAGGCAGTCTCACCATCCTTCTCTCC
CGGACCAGCAGCAGGAACGATCCTCTGACCCCTTCCTCCCTGCTCATGCGCTGCCCGGAGGCGGAGCTGCCGCACCGCGT
AGAACGGCTTTTCCAGGAAATCAGCGATGTGCCCGCTCCCCTGCCCTACCAGCGCGGAAACTGGCATCTTCAACCCGCGG
AAGGCTGGAAGACCGCTGCGGACATCGGCGCGATGGCGCCCGGCTACAAAAACCCGTGGAAGGAAAAAGGGCTGGCCTTC
TCCCCCTCCGTCCTCAAAAGGTTCCTGGCCTGCCCCATGCGCTTCTGGATGCGGGAGGCGCTGCACATGAATGAGGAAGA
ATTCCTGCCGGACAAAGAGGACATGGCCGTCAACGAGCTGGGCACCATGCTGCACGACGTGCTGGAATGTTTCTGCCGGG
AACACGCCGTCCTGAAAGACGGCATGAACGCAGCCTCCTTTCAGAACGCTATCACGGAAATACTGGAACAAACCTTCCGG
AAACAATACGGCCCCTCCCCCCTCATGCCCCTGCTGCTGCAAAAACGCTCCATGGAACAGAGGCTTTCCGTTTACGCCGT
TCAGCATTTGCAGGCCCTTCAGGAAGGATGGTCCTGCATTGCCTTTGAGCACCAGGTAGAAAACTGGATACTGGGCGGCT
TCCCCATGAAATTCCGCATTGACCGCATTGACCGCCATGCGGACGGGCACATACGGGTCATTGACTACAAAACGGGAGCG
GCCTCCTCCTGTGAAAAAAAGCATCTGGACCCGCTCGGACGGCCGGACGCCCTGCCCCTGCTTTCCCCAGCCCTGCATCC
CTACACCAAACGCCTGAAAAACGGGAAACTGGCGCACGCGCGCTGGAAAGACCTGCAGCTACCCGTTTATGTGCTGTGGG
CCTCGGAAACCTACGGGGGCCATCCCTCTGCGGCTTATTACGCCCTTCCCGCCAATCCCATGGACATCGGAATCTCCTCC
TGGGACACCCTCCACGACACCATGCCCGGGTATGAGGAATGCGCGCTGGACAGCGCCTGTTCCTGGACGGTGGAGCTGAT
GAAACTTCTTCATGAAGGACGCGGTCTCATCACTGCGGAGGAACTGGGCTGGACACCGCCCTCTTATGACGTATTCAAGG
ACCTGATGACTTCCCGGAGGGAAAGCCTGCAGGATCTGCTGGGCCTCCGCCTCACCCCCCACCTTCCTTTTTAA

Upstream 100 bases:

>100_bases
CAATAATGAAAAAAATAGATGGGAAATCCCGCCATCCTGATGTAGCATTCCCAGGTAATTTTCCGCTTTCCCCCTTTCCG
GGGAATCCTGCTTCCCTGCC

Downstream 100 bases:

>100_bases
GATGCCGCCCCTGACCAACATGCTGATCTCCGCTTCCGCCGGAACGGGAAAAACCTACCAGCTCTCCCTGCGTTTCCTGG
GGTTGCTGGCGCTGAACAGC

Product: ATP-dependent nuclease subunit B-like protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 957; Mature: 957
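
The residue count is consistent with the 2874-base gene sequence: 2874 bases form 958 codons, and the final codon (TAA) is a stop codon that encodes no residue. A minimal sketch of that arithmetic, with the hypothetical function name `protein_length`:

```python
def protein_length(cds_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon, if present, does not encode a residue.
    return codons - 1 if has_stop else codons


print(protein_length(2874))  # prints 957
```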

Protein sequence:

>957_residues
MNRFFLSWDKPACRAVAERLLSLGNGFHRHLVLVPTRESGRQLREYLASISRTQAIFAPQVIPADQFPRMEEKEETASAP
EELAGWLLALGKTPHRLYPRLFPRTMPEDFSSMLEMAGSLQNLRHAMANQGVSCIMAHHACAGRDERWKDMERLEGQCTQ
QLESWKLEDRTAMKAEAPPRLLNSLRETGGNIILACTAEVPPPLRHALQHAERNGVPVQIWIHAPEEEAASFDSWGCPLP
EEWSRRPIPIRDGQIRIAANPARLAEETCRIIARTAEGGTPDIALGVCDPDMNVALDAALRQYGWGLHNPEGKPFAGSGV
MDLLRNLRRALEEKGTARPVYSLARSSLLCASLGIKGQQGCCAALDKIQQKFLPETEEYLLSRLKEAYPGAFPSIQAILE
WRNRMAEPGMLGERLMEWFPALATIYGPETEAMEMFHSCLSGLMRLQQRSTAFSSPETALLLLMKCLQPLRVRNRRKAHA
ALDSLGWMEVHFRPERNLILTGLNEGTVPEGGVSDQFMPEELRETLGIDSFNRKKARDSFLLTALLHSREREGSLTILLS
RTSSRNDPLTPSSLLMRCPEAELPHRVERLFQEISDVPAPLPYQRGNWHLQPAEGWKTAADIGAMAPGYKNPWKEKGLAF
SPSVLKRFLACPMRFWMREALHMNEEEFLPDKEDMAVNELGTMLHDVLECFCREHAVLKDGMNAASFQNAITEILEQTFR
KQYGPSPLMPLLLQKRSMEQRLSVYAVQHLQALQEGWSCIAFEHQVENWILGGFPMKFRIDRIDRHADGHIRVIDYKTGA
ASSCEKKHLDPLGRPDALPLLSPALHPYTKRLKNGKLAHARWKDLQLPVYVLWASETYGGHPSAAYYALPANPMDIGISS
WDTLHDTMPGYEECALDSACSWTVELMKLLHEGRGLITAEELGWTPPSYDVFKDLMTSRRESLQDLLGLRLTPHLPF
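
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below uses the standard genetic code; bacterial translation table 11 assigns the same amino acids and differs only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical, and the inputs are short fragments copied from the start and end of the gene sequence, not the full gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAACCGATTCTTTCTATCC"))  # first 21 bases, prints MNRFFLS
print(translate("CACCTTCCTTTTTAA"))        # last 15 bases, prints HLPF
```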

Sequences:

>Translated_957_residues
Identical to the protein sequence above.
>Mature_957_residues
Identical to the protein sequence above.

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 107986; Mature: 107986

Theoretical pI: Translated: 6.79; Mature: 6.79

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.2% Cys     (Translated Protein)
3.7% Met     (Translated Protein)
5.9% Cys+Met (Translated Protein)
2.2% Cys     (Mature Protein)
3.7% Met     (Mature Protein)
5.9% Cys+Met (Mature Protein)
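
Percentages of this kind are residue counts divided by the protein length. The sketch below shows the calculation; the function name `residue_percent` is hypothetical, and the example input is only the first 20 residues of the protein, so its results differ from the whole-protein values listed above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("protein sequence is empty")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * hits / len(protein), 1)


# First 20 residues of the protein sequence, used as a small example.
fragment = "MNRFFLSWDKPACRAVAERL"
print(residue_percent(fragment, "C"))   # prints 5.0
print(residue_percent(fragment, "M"))   # prints 5.0
print(residue_percent(fragment, "CM"))  # prints 10.0
```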

Secondary structure:

>Translated Secondary Structure
MNRFFLSWDKPACRAVAERLLSLGNGFHRHLVLVPTRESGRQLREYLASISRTQAIFAPQ
CCCEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHCCC
VIPADQFPRMEEKEETASAPEELAGWLLALGKTPHRLYPRLFPRTMPEDFSSMLEMAGSL
CCCCCCCCCCCHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHHHHHH
QNLRHAMANQGVSCIMAHHACAGRDERWKDMERLEGQCTQQLESWKLEDRTAMKAEAPPR
HHHHHHHHHCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCCCHH
LLNSLRETGGNIILACTAEVPPPLRHALQHAERNGVPVQIWIHAPEEEAASFDSWGCPLP
HHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEECCHHHHCCCCCCCCCCC
EEWSRRPIPIRDGQIRIAANPARLAEETCRIIARTAEGGTPDIALGVCDPDMNVALDAAL
HHHCCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHH
RQYGWGLHNPEGKPFAGSGVMDLLRNLRRALEEKGTARPVYSLARSSLLCASLGIKGQQG
HHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCC
CCAALDKIQQKFLPETEEYLLSRLKEAYPGAFPSIQAILEWRNRMAEPGMLGERLMEWFP
HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHH
ALATIYGPETEAMEMFHSCLSGLMRLQQRSTAFSSPETALLLLMKCLQPLRVRNRRKAHA
HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
ALDSLGWMEVHFRPERNLILTGLNEGTVPEGGVSDQFMPEELRETLGIDSFNRKKARDSF
HHHHCCCEEEEEECCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHH
LLTALLHSREREGSLTILLSRTSSRNDPLTPSSLLMRCPEAELPHRVERLFQEISDVPAP
HHHHHHHHCCCCCCEEEEEECCCCCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC
LPYQRGNWHLQPAEGWKTAADIGAMAPGYKNPWKEKGLAFSPSVLKRFLACPMRFWMREA
CCCCCCCEEECCCCCCHHHHHHHCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHHHHHHHH
LHMNEEEFLPDKEDMAVNELGTMLHDVLECFCREHAVLKDGMNAASFQNAITEILEQTFR
HCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH
KQYGPSPLMPLLLQKRSMEQRLSVYAVQHLQALQEGWSCIAFEHQVENWILGGFPMKFRI
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHEECCCCCEEEH
DRIDRHADGHIRVIDYKTGAASSCEKKHLDPLGRPDALPLLSPALHPYTKRLKNGKLAHA
HHHHCCCCCCEEEEEECCCCCCCCHHHCCCCCCCCCCCCHHCHHHHHHHHHHCCCCCCCC
RWKDLQLPVYVLWASETYGGHPSAAYYALPANPMDIGISSWDTLHDTMPGYEECALDSAC
HHCCCCCCEEEEEECCCCCCCCCCEEEEECCCCHHCCCCCHHHHHHCCCCHHHHHCCHHH
SWTVELMKLLHEGRGLITAEELGWTPPSYDVFKDLMTSRRESLQDLLGLRLTPHLPF
HHHHHHHHHHHCCCCCEEHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
Identical to the translated secondary structure above.

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA