Definition Clostridium botulinum A str. ATCC 3502, complete genome.
Accession NC_009495
Length 3,886,916

Click here to switch to the map view.

The map label for this gene is dinB [H]

Identifier: 148380777

GI number: 148380777

Start: 2987038

End: 2988087

Strand: Reverse

Name: dinB [H]

Synonym: CBO2818

Alternate gene names: 148380777

Gene position: 2988087-2987038 (Counterclockwise)

Preceding gene: 148380778

Following gene: 148380774

Centisome position: 76.88

GC content: 28.57

Gene sequence:

>1050_bases
ATGGATAGAGTTATTATACACGTTGATATGGATGCATTTTTTGCTTCTGTAGAACAAATGGATAATAAAAAACTTAAGGG
TAAACCTGTAATAGTAGGTGGAATAGGAGAAAGAGGAGTAGTTTCTACTGCATCTTATGAGGCAAGAAAATATGGGGTAC
ATTCAGCTATGCCTGTTTTTATAGCTAAAAAGAGATGCCCTTTTGGCATATATGTACCTGGAAGGCATGAGAGATATAAG
GAAGTTTCAAACAATATATTTAAAATATTATATAAGGTTACTCCTATTATTGAGCCTGTATCCATAGATGAAGCATATTT
AGATATCACCAATATTAAAGAAGAACCTATTAAAGTGGCAGAACATATAAAAAAGAGAGTTAAAAATGAAATGGGATTAA
CCTTATCTGTAGGAATTTCATATAATAAATTTCTAGCAAAGCTTGCTTCTGATTGGAATAAACCAGATGGTATAAAAATA
ATAAGACAAGATATGATTCCAAGCATACTTATGCCACTTCCTATAAACAAGATTCATGGAATAGGTAAAAAGTCAGTGGA
AAAATTAAATAACATAGGTATTTATACTGTGGAAGATATGCATAAGCTATCTAGAGACTTTTGTATAGAGTATTTTGGAA
AATTTGGAGTAGAAATATACGAGAGAATAAGAGGAATAGATTATAGAGAAGTAAAAGTTAGTAGAGAAAGAAAGTCCATA
GGGAAGGAAATAACCCTTAAAAAGGATATAACAAATAAAGAGGAAATGAAAAAATATCTTTTGGACTTTTCAAATAAAGT
ATCTACTAATTTGTATAAAAGAAATACTGGAGGAAAAACTGTTACTGTAAAAATAAAAACTTCTAATTTTCAAACTCATA
CAAGGAGTAAAACCGTAAATGATTATATAAGGGATAAAGATGAAATTTATAATATAGCCTGTGATATATTAGAACATATA
AATTTTAAGGAACCTATAAGACTTATAGGGCTTACTGTATCAAATTTAGGAGAAAATAAAATTAAGCAATTGACCTTTAA
TATTTATTAA

Upstream 100 bases:

>100_bases
AATGTTAAAAACATCTGTATTTAACTTTTGGAGATTTTGATATAATTATATGTAAGCTATATTAAAGTGTAAATTATAGT
TCAGAGAGGAATTCATATTT

Downstream 100 bases:

>100_bases
GTTAAATAAATTTCAGTAAATATAACTAAAAATAAATATTAATTTAAAGTTTACACAAAACTGTAATTGATTTATAAATA
ATATAAAATATATTGTAAAA

Product: DNA polymerase IV

Products: NA

Alternate protein names: Pol IV [H]

Number of amino acids: Translated: 349; Mature: 349

Protein sequence:

>349_residues
MDRVIIHVDMDAFFASVEQMDNKKLKGKPVIVGGIGERGVVSTASYEARKYGVHSAMPVFIAKKRCPFGIYVPGRHERYK
EVSNNIFKILYKVTPIIEPVSIDEAYLDITNIKEEPIKVAEHIKKRVKNEMGLTLSVGISYNKFLAKLASDWNKPDGIKI
IRQDMIPSILMPLPINKIHGIGKKSVEKLNNIGIYTVEDMHKLSRDFCIEYFGKFGVEIYERIRGIDYREVKVSRERKSI
GKEITLKKDITNKEEMKKYLLDFSNKVSTNLYKRNTGGKTVTVKIKTSNFQTHTRSKTVNDYIRDKDEIYNIACDILEHI
NFKEPIRLIGLTVSNLGENKIKQLTFNIY

Sequences:

>Translated_349_residues
MDRVIIHVDMDAFFASVEQMDNKKLKGKPVIVGGIGERGVVSTASYEARKYGVHSAMPVFIAKKRCPFGIYVPGRHERYK
EVSNNIFKILYKVTPIIEPVSIDEAYLDITNIKEEPIKVAEHIKKRVKNEMGLTLSVGISYNKFLAKLASDWNKPDGIKI
IRQDMIPSILMPLPINKIHGIGKKSVEKLNNIGIYTVEDMHKLSRDFCIEYFGKFGVEIYERIRGIDYREVKVSRERKSI
GKEITLKKDITNKEEMKKYLLDFSNKVSTNLYKRNTGGKTVTVKIKTSNFQTHTRSKTVNDYIRDKDEIYNIACDILEHI
NFKEPIRLIGLTVSNLGENKIKQLTFNIY
>Mature_349_residues
MDRVIIHVDMDAFFASVEQMDNKKLKGKPVIVGGIGERGVVSTASYEARKYGVHSAMPVFIAKKRCPFGIYVPGRHERYK
EVSNNIFKILYKVTPIIEPVSIDEAYLDITNIKEEPIKVAEHIKKRVKNEMGLTLSVGISYNKFLAKLASDWNKPDGIKI
IRQDMIPSILMPLPINKIHGIGKKSVEKLNNIGIYTVEDMHKLSRDFCIEYFGKFGVEIYERIRGIDYREVKVSRERKSI
GKEITLKKDITNKEEMKKYLLDFSNKVSTNLYKRNTGGKTVTVKIKTSNFQTHTRSKTVNDYIRDKDEIYNIACDILEHI
NFKEPIRLIGLTVSNLGENKIKQLTFNIY

Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by polIV. Exhibits

COG id: COG0389

COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 umuC domain [H]

Homologues:

Organism=Homo sapiens, GI84043967, Length=338, Percent_Identity=29.8816568047337, Blast_Score=135, Evalue=7e-32,
Organism=Homo sapiens, GI7706681, Length=339, Percent_Identity=29.4985250737463, Blast_Score=129, Evalue=3e-30,
Organism=Homo sapiens, GI154350220, Length=320, Percent_Identity=28.125, Blast_Score=124, Evalue=2e-28,
Organism=Homo sapiens, GI5729982, Length=365, Percent_Identity=27.9452054794521, Blast_Score=115, Evalue=8e-26,
Organism=Homo sapiens, GI7705344, Length=115, Percent_Identity=46.0869565217391, Blast_Score=110, Evalue=3e-24,
Organism=Escherichia coli, GI1786425, Length=352, Percent_Identity=39.4886363636364, Blast_Score=234, Evalue=7e-63,
Organism=Escherichia coli, GI1787432, Length=295, Percent_Identity=26.4406779661017, Blast_Score=101, Evalue=6e-23,
Organism=Caenorhabditis elegans, GI17537959, Length=327, Percent_Identity=27.217125382263, Blast_Score=103, Evalue=1e-22,
Organism=Caenorhabditis elegans, GI193205700, Length=285, Percent_Identity=31.9298245614035, Blast_Score=97, Evalue=1e-20,
Organism=Caenorhabditis elegans, GI193205702, Length=285, Percent_Identity=31.9298245614035, Blast_Score=97, Evalue=1e-20,
Organism=Caenorhabditis elegans, GI115534089, Length=350, Percent_Identity=26.8571428571429, Blast_Score=87, Evalue=1e-17,
Organism=Saccharomyces cerevisiae, GI6324921, Length=272, Percent_Identity=24.2647058823529, Blast_Score=79, Evalue=1e-15,
Organism=Drosophila melanogaster, GI19923006, Length=339, Percent_Identity=24.7787610619469, Blast_Score=119, Evalue=4e-27,
Organism=Drosophila melanogaster, GI21355641, Length=303, Percent_Identity=27.7227722772277, Blast_Score=114, Evalue=1e-25,
Organism=Drosophila melanogaster, GI24644984, Length=303, Percent_Identity=27.7227722772277, Blast_Score=114, Evalue=1e-25,
Organism=Drosophila melanogaster, GI24668444, Length=256, Percent_Identity=27.734375, Blast_Score=70, Evalue=2e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017962
- InterPro:   IPR017961
- InterPro:   IPR001126
- InterPro:   IPR017963
- InterPro:   IPR022880 [H]

Pfam domain/function: PF00817 IMS [H]

EC number: =2.7.7.7 [H]

Molecular weight: Translated: 40129; Mature: 40129

Theoretical pI: Translated: 10.05; Mature: 10.05

Prosite motif: PS50173 UMUC

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MDRVIIHVDMDAFFASVEQMDNKKLKGKPVIVGGIGERGVVSTASYEARKYGVHSAMPVF
CCEEEEEEEHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCEECCCHHHHHHCCCCCCHHE
IAKKRCPFGIYVPGRHERYKEVSNNIFKILYKVTPIIEPVSIDEAYLDITNIKEEPIKVA
EECCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHEEHHCCCCHHHHHH
EHIKKRVKNEMGLTLSVGISYNKFLAKLASDWNKPDGIKIIRQDMIPSILMPLPINKIHG
HHHHHHHHHHCCEEEEECCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCHHHHC
IGKKSVEKLNNIGIYTVEDMHKLSRDFCIEYFGKFGVEIYERIRGIDYREVKVSRERKSI
CCHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC
GKEITLKKDITNKEEMKKYLLDFSNKVSTNLYKRNTGGKTVTVKIKTSNFQTHTRSKTVN
CCEEEEECCCCCHHHHHHHHHHCCCHHHHHHEEECCCCEEEEEEEEECCCCCHHHHHHHH
DYIRDKDEIYNIACDILEHINFKEPIRLIGLTVSNLGENKIKQLTFNIY
HHHCCHHHHHHHHHHHHHHCCCCCCHHEEEEEHHHCCCHHHHHEEEECC
>Mature Secondary Structure
MDRVIIHVDMDAFFASVEQMDNKKLKGKPVIVGGIGERGVVSTASYEARKYGVHSAMPVF
CCEEEEEEEHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCEECCCHHHHHHCCCCCCHHE
IAKKRCPFGIYVPGRHERYKEVSNNIFKILYKVTPIIEPVSIDEAYLDITNIKEEPIKVA
EECCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHEEHHCCCCHHHHHH
EHIKKRVKNEMGLTLSVGISYNKFLAKLASDWNKPDGIKIIRQDMIPSILMPLPINKIHG
HHHHHHHHHHCCEEEEECCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCHHHHC
IGKKSVEKLNNIGIYTVEDMHKLSRDFCIEYFGKFGVEIYERIRGIDYREVKVSRERKSI
CCHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC
GKEITLKKDITNKEEMKKYLLDFSNKVSTNLYKRNTGGKTVTVKIKTSNFQTHTRSKTVN
CCEEEEECCCCCHHHHHHHHHHCCCHHHHHHEEECCCCEEEEEEEEECCCCCHHHHHHHH
DYIRDKDEIYNIACDILEHINFKEPIRLIGLTVSNLGENKIKQLTFNIY
HHHCCHHHHHHHHHHHHHHCCCCCCHHEEEEEHHHCCCHHHHHEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11997336 [H]