Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is rnr [H]
Identifier: 187735416
GI number: 187735416
Start: 1091430
End: 1093700
Strand: Reverse
Name: rnr [H]
Synonym: Amuc_0915
Alternate gene names: 187735416
Gene position: 1093700-1091430 (Counterclockwise)
Preceding gene: 187735418
Following gene: 187735415
Centisome position: 41.05
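The coordinate fields above are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. It assumes that Start/End are 1-based inclusive coordinates and that the centisome position is taken from the gene's first base on the reverse strand (the End coordinate). The record states neither convention; they are assumptions that happen to reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the record; the conventions used below are assumptions.
GENOME_LENGTH = 2_664_102          # "Length" of NC_010655
START, END = 1_091_430, 1_093_700  # "Start" / "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)


length_bp = gene_length(START, END)
codons = length_bp // 3
residues = codons - 1              # the final codon is the stop codon
pos = centisome(END, GENOME_LENGTH)

print(length_bp, residues, pos)
```

Using the Start coordinate instead of End in `centisome` gives a slightly different percentage, which is why the reverse-strand convention is assumed here.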
GC content: 60.19
Gene sequence:
>2271_bases ATGAATAATTCCCTTAAAGACCGCCTGATCCGTCACATGGAAGACGGACATTACGAACCTCAGAGCAAATCTGAACTGGC TCGCGCCCTGAACGTGGATTCCCGGCAGAAGCTGGATTTCCGCGCCCTGGTGGACCAGATGGAGGAGGAAGGAAAACTCG TGCGGCTGCAAAAGGGCAGGTACGCTTTAAAACGGGAGCGCCGGAACCTGGTGCATGGCATGATCCGCATCCTGCGCTCC GGCAAGATTCTTTTTCTCCCCAGAAAAGGGGATCCCGCCGCTGCGGCTCTGGGATGGGACACGGAAGCCGTTCCGGAACT GGAACTTAAACCCAACCATCTGGGCACCGCCCTGGACGGGGACCGAGTGGCCGTGCGCGTGGAACGCAAGGCAGCCAGGG GACGGAGGAATATCCGCAGAAATCGCTTTTCTTCCCCGGATGCCGACATGAAAGCCCGCGTGGAGGAAGTGACGGAACGC GCGCGCTCCCGATGGCTTGGCGTCTTCCGCACCGGAAAAAACAAGCCGGGCCGCGTGCTGGGGGACGGCGTAAGCTCTCC GTCATCCATTGAACTGGCGGAAAAACCCGCCATGGAGGTGCTTCCCGGCCAACTGGTTTCCGTGGAGCCCATAACCTGCG GTGAGGAAAAGAAAGCCCCGCGCGGGAAAATCGTGGAGGTGCTCGGCTATCCGGACGAGCCTCACGTGGACATGGAAGCC GTTATCAGAAAATATGGCCTTTCCGTGGAATTCCCCGCCTCCGTGCTCCGGGAGCTGGAAACGCTGCCTCAAACACCTTC CCCCGGGGAACTCGCCCGCCGGGAAGACTGGACGGACCGTACCGTCATCACCATTGACCCGGCAAGCGCCAGGGATTTTG ACGACGCCATTTCCATCACGGCCACTCCGTCCGGCTGGACGCTGGCCGTTCACATTGCGGACGTCTCCCATTTCGTCAAG CCCGGCGGCTCGCTGGATGGGGAAGCCCTGCGCCGCGGCAACTCTACCTACCTGCCGGACCGCGTTCTTCCGATGCTCCC GCACCGCCTTTCCGACGATTTGTGCAGCCTGCGGCCGGATGTCGTGCGCCTGACAAAAGTATGCGAAATGAAATTTGATA AAAAGGGGAAGATGCTCCGTGCCCGCTTTGCTGACGCTTTCATCCGCAGCAAGGCGCGCCTGACCTACCAGGAGGCCTTC GCCATGCTGAAAGGAAACGACAAAGGGGAAGTTCCCTCCACGGTGCGGGAAGCCTGGAATCTGGCCTCCATTCTGCGCCG CAACCGCTACGCCAAGGGAGCCCTGGATCTGGACTTCCCGGAAGTGAGAGCCGTTATGGACAAGGACGGCCGCGTGACGG GCATCATCACGGAAGAGTACGATGAAAGCCATCAGCTCATTGAAGAGTGCATGCTGGCCGCCAACGAAGCCGTGGCCCTG GCCCTGAAAAACGGGAACCGTCCCACCATCTACCGTGTCCACGAAGAACCGGATTCCGCCAAACTTTTTGAATTCGGCCA GTTGTGCAGACTGTACGGGCATCCCGTGCACGATATCGACCAACGCCAGTACCTGAATGAACTGATGAAATCCATCAAGG GTTCCCCGGACGAGCAATTGCTTAAGCTGGCGCTCCTGAAAAGCTTGATGAGGGCCAGGTACGACACGGAGCCCCTGGGC CACTACGGCCTGGCCACCCCCAATTACTGCCACTTCACCAGCCCCATCCGCCGTTACGCGGATCTGGTGGTACACCGCTC CCTGAATCCCCTGCTGGCCAACCCTCCCAAAGGAGCTAAAGGAGCAGGCAGTACAGGCAGGCTGGAGGAAGACGCGGAAC ACATTTCAGAAACGGAACGCATTTCCGCCAGCGCGGAAAAGGACGCGAACCGGATGAAGCTCTTCGAATGGCTGGAAGGC 
CAGTGCTACGCGGATCATCCGGAAGTGCACGAAGCTCTGGTTACGGAAACGCGCCACTTCGGCGTTTTGCTGGAAATCCC GCGTCTCCAGATTAAGGGGCTGGTCAAACCGGACAAGCTTCCCGGCGGCCGCTGGGTGTATGAGGCGTTTGCCAGCCGCT GGAAAAACGACCACGGTTCCGTGCTGTGCGCAGGGCTGCGCGTTCCCGTCATTCCCGTGAAAGTGGACAGGGAGCAGCAG TGGGCGGATTTCGCCATCGTCTCACGGGAGAAACCGCGGCAAACGGGAAAAACGGCCTTCCCCACCAAACAGGAGAAGGG CATATCCGGCCGCGGGCGGCGGAACCGCTGA
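The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the value is simply the percentage of G and C bases over the whole sequence; the record does not state how its figure was derived. The helper name `gc_content` is hypothetical, and the demo input is only the first 80-base chunk of the sequence above, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C bases; whitespace between chunks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100 * gc / len(bases), 2)


# First 80-base chunk of the gene sequence, for illustration only.
first_chunk = (
    "ATGAATAATTCCCTTAAAGACCGCCTGATCCGTCACATGGAAGACGGACATTACGAACCTCAGAGCAAATCTGAACTGGC"
)
print(gc_content(first_chunk))
```

To check the reported figure, pass the complete 2271-base sequence with the `>2271_bases` header removed.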
Upstream 100 bases:
>100_bases GAACGCGGCATGCACCGCCCGCGCCTTGAAGCGGCGCGAAAACCTTCTTATGATGGCATCCATGGGAATGGGAAACTCCT TTTCCCGGAACTTCCGTAAA
Downstream 100 bases:
>100_bases CGGCAATACGCCCGTATATGGAAACAATCCCATCCGGAAATCAAATCCGATTCTCCCTGCAACAGAATCAAATCTACATT TTCCTACACAATTTTTCTTG
Product: VacB and RNase II family 3'-5' exoribonuclease
Products: NA
Alternate protein names: RNase R; VacB protein homolog [H]
Number of amino acids: Translated: 756; Mature: 756
Protein sequence:
>756_residues MNNSLKDRLIRHMEDGHYEPQSKSELARALNVDSRQKLDFRALVDQMEEEGKLVRLQKGRYALKRERRNLVHGMIRILRS GKILFLPRKGDPAAAALGWDTEAVPELELKPNHLGTALDGDRVAVRVERKAARGRRNIRRNRFSSPDADMKARVEEVTER ARSRWLGVFRTGKNKPGRVLGDGVSSPSSIELAEKPAMEVLPGQLVSVEPITCGEEKKAPRGKIVEVLGYPDEPHVDMEA VIRKYGLSVEFPASVLRELETLPQTPSPGELARREDWTDRTVITIDPASARDFDDAISITATPSGWTLAVHIADVSHFVK PGGSLDGEALRRGNSTYLPDRVLPMLPHRLSDDLCSLRPDVVRLTKVCEMKFDKKGKMLRARFADAFIRSKARLTYQEAF AMLKGNDKGEVPSTVREAWNLASILRRNRYAKGALDLDFPEVRAVMDKDGRVTGIITEEYDESHQLIEECMLAANEAVAL ALKNGNRPTIYRVHEEPDSAKLFEFGQLCRLYGHPVHDIDQRQYLNELMKSIKGSPDEQLLKLALLKSLMRARYDTEPLG HYGLATPNYCHFTSPIRRYADLVVHRSLNPLLANPPKGAKGAGSTGRLEEDAEHISETERISASAEKDANRMKLFEWLEG QCYADHPEVHEALVTETRHFGVLLEIPRLQIKGLVKPDKLPGGRWVYEAFASRWKNDHGSVLCAGLRVPVIPVKVDREQQ WADFAIVSREKPRQTGKTAFPTKQEKGISGRGRRNR
Sequences:
>Translated_756_residues MNNSLKDRLIRHMEDGHYEPQSKSELARALNVDSRQKLDFRALVDQMEEEGKLVRLQKGRYALKRERRNLVHGMIRILRS GKILFLPRKGDPAAAALGWDTEAVPELELKPNHLGTALDGDRVAVRVERKAARGRRNIRRNRFSSPDADMKARVEEVTER ARSRWLGVFRTGKNKPGRVLGDGVSSPSSIELAEKPAMEVLPGQLVSVEPITCGEEKKAPRGKIVEVLGYPDEPHVDMEA VIRKYGLSVEFPASVLRELETLPQTPSPGELARREDWTDRTVITIDPASARDFDDAISITATPSGWTLAVHIADVSHFVK PGGSLDGEALRRGNSTYLPDRVLPMLPHRLSDDLCSLRPDVVRLTKVCEMKFDKKGKMLRARFADAFIRSKARLTYQEAF AMLKGNDKGEVPSTVREAWNLASILRRNRYAKGALDLDFPEVRAVMDKDGRVTGIITEEYDESHQLIEECMLAANEAVAL ALKNGNRPTIYRVHEEPDSAKLFEFGQLCRLYGHPVHDIDQRQYLNELMKSIKGSPDEQLLKLALLKSLMRARYDTEPLG HYGLATPNYCHFTSPIRRYADLVVHRSLNPLLANPPKGAKGAGSTGRLEEDAEHISETERISASAEKDANRMKLFEWLEG QCYADHPEVHEALVTETRHFGVLLEIPRLQIKGLVKPDKLPGGRWVYEAFASRWKNDHGSVLCAGLRVPVIPVKVDREQQ WADFAIVSREKPRQTGKTAFPTKQEKGISGRGRRNR >Mature_756_residues MNNSLKDRLIRHMEDGHYEPQSKSELARALNVDSRQKLDFRALVDQMEEEGKLVRLQKGRYALKRERRNLVHGMIRILRS GKILFLPRKGDPAAAALGWDTEAVPELELKPNHLGTALDGDRVAVRVERKAARGRRNIRRNRFSSPDADMKARVEEVTER ARSRWLGVFRTGKNKPGRVLGDGVSSPSSIELAEKPAMEVLPGQLVSVEPITCGEEKKAPRGKIVEVLGYPDEPHVDMEA VIRKYGLSVEFPASVLRELETLPQTPSPGELARREDWTDRTVITIDPASARDFDDAISITATPSGWTLAVHIADVSHFVK PGGSLDGEALRRGNSTYLPDRVLPMLPHRLSDDLCSLRPDVVRLTKVCEMKFDKKGKMLRARFADAFIRSKARLTYQEAF AMLKGNDKGEVPSTVREAWNLASILRRNRYAKGALDLDFPEVRAVMDKDGRVTGIITEEYDESHQLIEECMLAANEAVAL ALKNGNRPTIYRVHEEPDSAKLFEFGQLCRLYGHPVHDIDQRQYLNELMKSIKGSPDEQLLKLALLKSLMRARYDTEPLG HYGLATPNYCHFTSPIRRYADLVVHRSLNPLLANPPKGAKGAGSTGRLEEDAEHISETERISASAEKDANRMKLFEWLEG QCYADHPEVHEALVTETRHFGVLLEIPRLQIKGLVKPDKLPGGRWVYEAFASRWKNDHGSVLCAGLRVPVIPVKVDREQQ WADFAIVSREKPRQTGKTAFPTKQEKGISGRGRRNR
Specific function: 3'-5' exoribonuclease that participates in an essential cell function. Acts nonspecifically on poly(A), poly(U) and ribosomal RNAs [H]
COG id: COG0557
COG function: function code K; Exoribonuclease R
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 S1 motif domain [H]
Homologues:
Organism=Homo sapiens, GI190014623, Length=513, Percent_Identity=30.0194931773879, Blast_Score=199, Evalue=1e-50
Organism=Homo sapiens, GI190014625, Length=513, Percent_Identity=30.0194931773879, Blast_Score=198, Evalue=2e-50
Organism=Homo sapiens, GI134288890, Length=400, Percent_Identity=34, Blast_Score=189, Evalue=7e-48
Organism=Homo sapiens, GI19115966, Length=504, Percent_Identity=30.1587301587302, Blast_Score=165, Evalue=1e-40
Organism=Homo sapiens, GI219521928, Length=504, Percent_Identity=30.1587301587302, Blast_Score=165, Evalue=2e-40
Organism=Homo sapiens, GI156105695, Length=422, Percent_Identity=27.7251184834123, Blast_Score=89, Evalue=2e-17
Organism=Homo sapiens, GI156105693, Length=421, Percent_Identity=28.2660332541568, Blast_Score=89, Evalue=2e-17
Organism=Escherichia coli, GI87082383, Length=736, Percent_Identity=30.4347826086957, Blast_Score=294, Evalue=1e-80
Organism=Escherichia coli, GI1787542, Length=459, Percent_Identity=29.8474945533769, Blast_Score=148, Evalue=2e-36
Organism=Caenorhabditis elegans, GI17553506, Length=410, Percent_Identity=31.7073170731707, Blast_Score=178, Evalue=1e-44
Organism=Caenorhabditis elegans, GI212645896, Length=511, Percent_Identity=27.2015655577299, Blast_Score=165, Evalue=8e-41
Organism=Saccharomyces cerevisiae, GI6324552, Length=494, Percent_Identity=28.3400809716599, Blast_Score=182, Evalue=3e-46
Organism=Saccharomyces cerevisiae, GI6320499, Length=497, Percent_Identity=23.1388329979879, Blast_Score=92, Evalue=3e-19
Organism=Drosophila melanogaster, GI24649634, Length=402, Percent_Identity=31.3432835820896, Blast_Score=186, Evalue=5e-47
Organism=Drosophila melanogaster, GI19922976, Length=378, Percent_Identity=31.2169312169312, Blast_Score=140, Evalue=3e-33
Organism=Drosophila melanogaster, GI24654597, Length=378, Percent_Identity=31.2169312169312, Blast_Score=140, Evalue=3e-33
Organism=Drosophila melanogaster, GI24654592, Length=378, Percent_Identity=31.2169312169312, Blast_Score=140, Evalue=3e-33
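The homologue entries follow a regular `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be pulled into structured records for sorting or filtering. The following is a minimal sketch, assuming every entry has exactly these six fields in this order. The function name `parse_homologues` and the dictionary keys are hypothetical, and the sample text contains only two of the entries listed above.

```python
import re

# One homologue entry; whitespace (including line breaks) between fields is tolerated.
ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in ENTRY.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


# Two entries copied from the list above.
sample = (
    "Organism=Homo sapiens, GI190014623, Length=513, "
    "Percent_Identity=30.0194931773879, Blast_Score=199, Evalue=1e-50, "
    "Organism=Escherichia coli, GI87082383, Length=736, "
    "Percent_Identity=30.4347826086957, Blast_Score=294, Evalue=1e-80,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"], best["score"])
```

Sorting by `score` (or by ascending `evalue`) puts the closest homologue first.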
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011129
- InterPro: IPR012340
- InterPro: IPR016027
- InterPro: IPR003029
- InterPro: IPR022967
- InterPro: IPR013223
- InterPro: IPR001900
- InterPro: IPR022966
- InterPro: IPR004476
- InterPro: IPR011805 [H]
Pfam domain/function: PF08206 OB_RNB; PF00773 RNB; PF00575 S1 [H]
EC number: 3.1.-.- [C]
Molecular weight: Translated: 85158; Mature: 85158
Theoretical pI: Translated: 9.51; Mature: 9.51
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNNSLKDRLIRHMEDGHYEPQSKSELARALNVDSRQKLDFRALVDQMEEEGKLVRLQKGR CCCCHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEEEECCH YALKRERRNLVHGMIRILRSGKILFLPRKGDPAAAALGWDTEAVPELELKPNHLGTALDG HHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCC DRVAVRVERKAARGRRNIRRNRFSSPDADMKARVEEVTERARSRWLGVFRTGKNKPGRVL CEEEEEEHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEE GDGVSSPSSIELAEKPAMEVLPGQLVSVEPITCGEEKKAPRGKIVEVLGYPDEPHVDMEA CCCCCCCCCCCHHHCCCHHHCCCCEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCHHH VIRKYGLSVEFPASVLRELETLPQTPSPGELARREDWTDRTVITIDPASARDFDDAISIT HHHHHCCCCCCHHHHHHHHHHCCCCCCCCHHHHCCCCCCCEEEEECCCCCCCCCCCEEEE ATPSGWTLAVHIADVSHFVKPGGSLDGEALRRGNSTYLPDRVLPMLPHRLSDDLCSLRPD ECCCCCEEEEEEEHHHHHHCCCCCCCHHHHHCCCCCCCCCHHHHHCCHHHHHHHHHCCHH VVRLTKVCEMKFDKKGKMLRARFADAFIRSKARLTYQEAFAMLKGNDKGEVPSTVREAWN HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH LASILRRNRYAKGALDLDFPEVRAVMDKDGRVTGIITEEYDESHQLIEECMLAANEAVAL HHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHCCCEEEE ALKNGNRPTIYRVHEEPDSAKLFEFGQLCRLYGHPVHDIDQRQYLNELMKSIKGSPDEQL EEECCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHH LKLALLKSLMRARYDTEPLGHYGLATPNYCHFTSPIRRYADLVVHRSLNPLLANPPKGAK HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCC GAGSTGRLEEDAEHISETERISASAEKDANRMKLFEWLEGQCYADHPEVHEALVTETRHF CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHHHHHC GVLLEIPRLQIKGLVKPDKLPGGRWVYEAFASRWKNDHGSVLCAGLRVPVIPVKVDREQQ EEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEECCCCCEEEEEECCCHH WADFAIVSREKPRQTGKTAFPTKQEKGISGRGRRNR HHHHHEECCCCCHHCCCCCCCCCHHCCCCCCCCCCC >Mature Secondary Structure MNNSLKDRLIRHMEDGHYEPQSKSELARALNVDSRQKLDFRALVDQMEEEGKLVRLQKGR CCCCHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEEEECCH YALKRERRNLVHGMIRILRSGKILFLPRKGDPAAAALGWDTEAVPELELKPNHLGTALDG HHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCC DRVAVRVERKAARGRRNIRRNRFSSPDADMKARVEEVTERARSRWLGVFRTGKNKPGRVL CEEEEEEHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEE 
GDGVSSPSSIELAEKPAMEVLPGQLVSVEPITCGEEKKAPRGKIVEVLGYPDEPHVDMEA CCCCCCCCCCCHHHCCCHHHCCCCEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCHHH VIRKYGLSVEFPASVLRELETLPQTPSPGELARREDWTDRTVITIDPASARDFDDAISIT HHHHHCCCCCCHHHHHHHHHHCCCCCCCCHHHHCCCCCCCEEEEECCCCCCCCCCCEEEE ATPSGWTLAVHIADVSHFVKPGGSLDGEALRRGNSTYLPDRVLPMLPHRLSDDLCSLRPD ECCCCCEEEEEEEHHHHHHCCCCCCCHHHHHCCCCCCCCCHHHHHCCHHHHHHHHHCCHH VVRLTKVCEMKFDKKGKMLRARFADAFIRSKARLTYQEAFAMLKGNDKGEVPSTVREAWN HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH LASILRRNRYAKGALDLDFPEVRAVMDKDGRVTGIITEEYDESHQLIEECMLAANEAVAL HHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHCCCEEEE ALKNGNRPTIYRVHEEPDSAKLFEFGQLCRLYGHPVHDIDQRQYLNELMKSIKGSPDEQL EEECCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHH LKLALLKSLMRARYDTEPLGHYGLATPNYCHFTSPIRRYADLVVHRSLNPLLANPPKGAK HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCC GAGSTGRLEEDAEHISETERISASAEKDANRMKLFEWLEGQCYADHPEVHEALVTETRHF CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHHHHHC GVLLEIPRLQIKGLVKPDKLPGGRWVYEAFASRWKNDHGSVLCAGLRVPVIPVKVDREQQ EEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEECCCCCEEEEEECCCHH WADFAIVSREKPRQTGKTAFPTKQEKGISGRGRRNR HHHHHEECCCCCHHCCCCCCCCCHHCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9384377 [H]