Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is dinB
Identifier: 187735823
GI number: 187735823
Start: 1606901
End: 1607980
Strand: Reverse
Name: dinB
Synonym: Amuc_1330
Alternate gene names: 187735823
Gene position: 1607980-1606901 (Counterclockwise)
Preceding gene: 187735846
Following gene: 187735822
Centisome position: 60.36
GC content: 58.98
Gene sequence:
>1080_bases ATGAACCAGAGAAAAATCATCCATGTGGATATGGATGCCTTTTACGCATCCATAGAACAGCGGGACCATCCCGAATACCG CGGCAAGCCCATCGCCGTAGGCAGGCCGGAAATGCGCGGCGTGGTGGCGGCGGCCAGTTATGAGGCGCGCCGTTTCGGAG TGCGTTCCGCCATGCCTTCCATGAAGGCTCTCAAGCTTTGCCCCCATCTGATTTTCACCCGCAACCGCATGGATGTGTAC AAGGCCGTCTCCGCGCAGATACACGCCATTTTCCACCGTTACACAGATCTGGTGGAACCCCTTTCCCTGGATGAAGCCTT TCTGGACGTCACGGAAAACAAGCCGGGCATTCCGCTGGCCGTCGACATTGCGAGGAGGATTAAGAAGGAAATCCGCCGGG AACTTCACCTGACGGCCTCCGCCGGCGTTTCCTACAATAAATTCCTGGCAAAAATCGCTTCCGATTACCGTAAGCCGGAC GGGCTGTTCACGATCCATCCATCCCGGGCGGAAAAATTCATCGCGGCACTTCCCATTGAAGCTTTCTGGGGAGTCGGGCA CGCCACCGCCGAACGCATGCGCGCCCTTTCCATCACCAACGGGGCGCAGCTCCGGGCACGGGACAAAGACTTCCTGGTAA GGCATTTCGGCAAAACAGGAGCCATCTTCTACAACTTCGCCCGCGGTGTGGACGACCGCCCTGTGGAACCTTCCCGCATG CGCAAATCCGTGGGTTGTGAAGAAACCTACCGGGAAAACGTCACCAGGGCGGAAGCGCTGGAACAACGCCTCCCCCTGCT GGCGGAAGAACTCGCGGGGCGGCTGGCCCGTTCCGGCTTCCGGGGAAACACCCTTACCCTGAAGGTTAAGTTCCCGGACT TTGTCCAGAAGACCCGCTGCGCGACCGTTCCGGAAATCCTGACGGAGAAAGAAGGAATTCTCCCCCTGGCCCGCACCCTG ATGGAAGAACTGGATTCCGGGGACCGTACATTCCGCCTTCTGGGGCTGTCCGTCTCCCATCCCCAGGAAGAACAGCGGCA GGGCATCTGGGAACAGCTCTGGCTGGAGCTGGAGTATTAA
Upstream 100 bases:
>100_bases AGCTTTTCCTTCCAACTGGACTTGCAGGATAAGTCGGATTATGATGGGGCATCCCATTCCCTCATGTTTCCCGCCCCATC AGGAATCATGAAGCGCTCCC
Downstream 100 bases:
>100_bases CTCTTCCATAGACTGCCATATTTTCAGGACTCTCCCCTTAAAAACGGCAATACCCACAAGATATCGTTTCAGGAATGAGC AAAACCCGCTCCAACCTGGT
Product: DNA-directed DNA polymerase
Products: NA
Alternate protein names: Pol IV
Number of amino acids: Translated: 359; Mature: 359
Protein sequence:
>359_residues MNQRKIIHVDMDAFYASIEQRDHPEYRGKPIAVGRPEMRGVVAAASYEARRFGVRSAMPSMKALKLCPHLIFTRNRMDVY KAVSAQIHAIFHRYTDLVEPLSLDEAFLDVTENKPGIPLAVDIARRIKKEIRRELHLTASAGVSYNKFLAKIASDYRKPD GLFTIHPSRAEKFIAALPIEAFWGVGHATAERMRALSITNGAQLRARDKDFLVRHFGKTGAIFYNFARGVDDRPVEPSRM RKSVGCEETYRENVTRAEALEQRLPLLAEELAGRLARSGFRGNTLTLKVKFPDFVQKTRCATVPEILTEKEGILPLARTL MEELDSGDRTFRLLGLSVSHPQEEQRQGIWEQLWLELEY
Sequences:
>Translated_359_residues MNQRKIIHVDMDAFYASIEQRDHPEYRGKPIAVGRPEMRGVVAAASYEARRFGVRSAMPSMKALKLCPHLIFTRNRMDVY KAVSAQIHAIFHRYTDLVEPLSLDEAFLDVTENKPGIPLAVDIARRIKKEIRRELHLTASAGVSYNKFLAKIASDYRKPD GLFTIHPSRAEKFIAALPIEAFWGVGHATAERMRALSITNGAQLRARDKDFLVRHFGKTGAIFYNFARGVDDRPVEPSRM RKSVGCEETYRENVTRAEALEQRLPLLAEELAGRLARSGFRGNTLTLKVKFPDFVQKTRCATVPEILTEKEGILPLARTL MEELDSGDRTFRLLGLSVSHPQEEQRQGIWEQLWLELEY >Mature_359_residues MNQRKIIHVDMDAFYASIEQRDHPEYRGKPIAVGRPEMRGVVAAASYEARRFGVRSAMPSMKALKLCPHLIFTRNRMDVY KAVSAQIHAIFHRYTDLVEPLSLDEAFLDVTENKPGIPLAVDIARRIKKEIRRELHLTASAGVSYNKFLAKIASDYRKPD GLFTIHPSRAEKFIAALPIEAFWGVGHATAERMRALSITNGAQLRARDKDFLVRHFGKTGAIFYNFARGVDDRPVEPSRM RKSVGCEETYRENVTRAEALEQRLPLLAEELAGRLARSGFRGNTLTLKVKFPDFVQKTRCATVPEILTEKEGILPLARTL MEELDSGDRTFRLLGLSVSHPQEEQRQGIWEQLWLELEY
Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by polIV. Exhibits
COG id: COG0389
COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 umuC domain
Homologues:
Organism=Homo sapiens, GI84043967, Length=342, Percent_Identity=31.2865497076023, Blast_Score=162, Evalue=6e-40, Organism=Homo sapiens, GI7706681, Length=343, Percent_Identity=31.1953352769679, Blast_Score=162, Evalue=6e-40, Organism=Homo sapiens, GI154350220, Length=324, Percent_Identity=27.4691358024691, Blast_Score=118, Evalue=1e-26, Organism=Homo sapiens, GI5729982, Length=349, Percent_Identity=27.5071633237822, Blast_Score=114, Evalue=1e-25, Organism=Homo sapiens, GI7705344, Length=107, Percent_Identity=48.5981308411215, Blast_Score=111, Evalue=1e-24, Organism=Escherichia coli, GI1786425, Length=350, Percent_Identity=44, Blast_Score=257, Evalue=7e-70, Organism=Escherichia coli, GI1787432, Length=280, Percent_Identity=27.1428571428571, Blast_Score=99, Evalue=3e-22, Organism=Caenorhabditis elegans, GI193205700, Length=417, Percent_Identity=28.0575539568345, Blast_Score=137, Evalue=6e-33, Organism=Caenorhabditis elegans, GI17537959, Length=304, Percent_Identity=27.6315789473684, Blast_Score=121, Evalue=7e-28, Organism=Caenorhabditis elegans, GI193205702, Length=363, Percent_Identity=25.6198347107438, Blast_Score=86, Evalue=2e-17, Organism=Caenorhabditis elegans, GI115534089, Length=123, Percent_Identity=35.7723577235772, Blast_Score=78, Evalue=7e-15, Organism=Saccharomyces cerevisiae, GI6324921, Length=207, Percent_Identity=26.0869565217391, Blast_Score=72, Evalue=2e-13, Organism=Drosophila melanogaster, GI21355641, Length=316, Percent_Identity=29.746835443038, Blast_Score=124, Evalue=8e-29, Organism=Drosophila melanogaster, GI24644984, Length=316, Percent_Identity=29.746835443038, Blast_Score=124, Evalue=8e-29, Organism=Drosophila melanogaster, GI19923006, Length=340, Percent_Identity=26.1764705882353, Blast_Score=120, Evalue=1e-27, Organism=Drosophila melanogaster, GI24668444, Length=118, Percent_Identity=36.4406779661017, Blast_Score=83, Evalue=3e-16,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): DPO4_AKKM8 (B2UKN1)
Other databases:
- EMBL: CP001071 - RefSeq: YP_001877935.1 - ProteinModelPortal: B2UKN1 - SMR: B2UKN1 - GeneID: 6275839 - GenomeReviews: CP001071_GR - KEGG: amu:Amuc_1330 - HOGENOM: HBG734504 - OMA: HRIFRRY - ProtClustDB: CLSK823933 - GO: GO:0005737 - HAMAP: MF_01113 - InterPro: IPR017962 - InterPro: IPR017961 - InterPro: IPR001126 - InterPro: IPR017963 - InterPro: IPR022880 - Gene3D: G3DSA:3.30.1490.100 - PANTHER: PTHR11076
Pfam domain/function: PF00817 IMS; SSF100879 DNA_pol_Y-fam_little_finger
EC number: =2.7.7.7
Molecular weight: Translated: 40892; Mature: 40892
Theoretical pI: Translated: 9.81; Mature: 9.81
Prosite motif: PS50173 UMUC
Important sites: ACT_SITE 105-105
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein) 2.5 %Met (Translated Protein) 3.3 %Cys+Met (Translated Protein) 0.8 %Cys (Mature Protein) 2.5 %Met (Mature Protein) 3.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNQRKIIHVDMDAFYASIEQRDHPEYRGKPIAVGRPEMRGVVAAASYEARRFGVRSAMPS CCCCEEEEEEHHHHHHHHHHCCCCCCCCCCEECCCCHHHHEEHHHHHHHHHHHHHHHCCH MKALKLCPHLIFTRNRMDVYKAVSAQIHAIFHRYTDLVEPLSLDEAFLDVTENKPGIPLA HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCHH VDIARRIKKEIRRELHLTASAGVSYNKFLAKIASDYRKPDGLFTIHPSRAEKFIAALPIE HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHCCHH AFWGVGHATAERMRALSITNGAQLRARDKDFLVRHFGKTGAIFYNFARGVDDRPVEPSRM HHHCCCHHHHHHHHHHCCCCCCEEEECCHHHHHHHCCCCCHHHHHHHCCCCCCCCCHHHH RKSVGCEETYRENVTRAEALEQRLPLLAEELAGRLARSGFRGNTLTLKVKFPDFVQKTRC HHHCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECHHHHHHHHH ATVPEILTEKEGILPLARTLMEELDSGDRTFRLLGLSVSHPQEEQRQGIWEQLWLELEY HHHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHCCHHHHHHEECCC >Mature Secondary Structure MNQRKIIHVDMDAFYASIEQRDHPEYRGKPIAVGRPEMRGVVAAASYEARRFGVRSAMPS CCCCEEEEEEHHHHHHHHHHCCCCCCCCCCEECCCCHHHHEEHHHHHHHHHHHHHHHCCH MKALKLCPHLIFTRNRMDVYKAVSAQIHAIFHRYTDLVEPLSLDEAFLDVTENKPGIPLA HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCHH VDIARRIKKEIRRELHLTASAGVSYNKFLAKIASDYRKPDGLFTIHPSRAEKFIAALPIE HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHCCHH AFWGVGHATAERMRALSITNGAQLRARDKDFLVRHFGKTGAIFYNFARGVDDRPVEPSRM HHHCCCHHHHHHHHHHCCCCCCEEEECCHHHHHHHCCCCCHHHHHHHCCCCCCCCCHHHH RKSVGCEETYRENVTRAEALEQRLPLLAEELAGRLARSGFRGNTLTLKVKFPDFVQKTRC HHHCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECHHHHHHHHH ATVPEILTEKEGILPLARTLMEELDSGDRTFRLLGLSVSHPQEEQRQGIWEQLWLELEY HHHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHCCHHHHHHEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA