Definition: Rhodopseudomonas palustris HaA2, complete genome.
Accession: NC_007778
Length: 5,331,656

The map label for this gene is dinB1 [H]

Identifier: 86749530

GI number: 86749530

Start: 2763188

End: 2764477

Strand: Reverse

Name: dinB1 [H]

Synonym: RPB_2410

Alternate gene names: 86749530

Gene position: 2764477-2763188 (Counterclockwise)

Preceding gene: 86749531

Following gene: 86749528

Centisome position: 51.85
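
The coordinate fields above are related by simple arithmetic. The following Python sketch recomputes the gene length and the centisome position from the Start, End and Length values in this record. The function names are illustrative only, and the convention used for the centisome (the gene's first transcribed base, which is the End coordinate on the reverse strand, as a percentage of genome length) is an assumption that happens to reproduce the listed 51.85; the database's own definition is not stated in this record.

```python
GENOME_LENGTH = 5_331_656  # Length field of NC_007778


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


# Values from this record (reverse strand, so the gene begins at 2764477).
length = gene_length(2763188, 2764477)
print(length)              # 1290
print(length // 3 - 1)     # 429 codons, excluding the stop codon
print(centisome(2764477))  # 51.85
```

The 1290 bases correspond to 430 codons, that is 429 amino acids plus one stop codon, which agrees with the protein length given further down.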

GC content: 69.22

Gene sequence:

>1290_bases
GTGAATTCGGCCTCGCCGCCCGGACCGCTGTGCTTCTGCCGCGATTGCCTCGCCGACAACGCGACGGCCGCGCGGCGCTG
CGCCGCCTGCGGTTCCCCGCGGCTGGCCCGGCATCGTGCGCTTCCGGCGCTCACCCTCGCCCATATCGACTGCGATGCGT
TCTACGCGACCGTCGAGAAGCGCGACAATCCGGAACTCGCCGACCGCCCGGTGATCATCGGCGGCGGCAAACGCGGCGTG
GTCTCGGCGGCCTGCTACATCGCGCGGACCTTCGGTGTGCGCTCGGCGATGCCGATGTTCAAGGCGCTGGCGCTGTGCCC
GTCGGCCGCCGTGGTCCGGCCCGACATGGCGAAATACGTCCGCGTCGGCCGCGAGGTCCGCCAGGCAATGCTGCAGCTCA
CGCCGCTGGTCGAGCCGCTGTCGATCGACGAGGCGTTTCTCGACCTGTCCGGCACCGAACGGATGCACGGCACCATCGCC
GCCAAGGTGCTGGCGCGGTTCGCCCGCGACATCGAGCGCGACGTCGGCATCACCGTCTCGGTCGGGCTGTCCTGCAACAA
GTTCCTCGCCAAGATCACCTCCGATCTCGACAAGCCGCGCGGCTTTGCGACGCTCGATCAGGACGACGCCCGGGCGATGC
TCGGCCCGCGCCCGGTCGGCTTCATCTTCGGCGTCGGACCCGCGACCGCGGCACGGATCGCGCAGCATGGCTTCCGCACC
ATCGCCGATCTGCAGAAGGCCGACGAGATCGAGCTGATGCGGCAGTTCGGCGACGAGGGCCGGCGGCTGTGGCGGCTCGC
CCGCGGCATCGACGACCGCAAGGTCGTGCCGGATCGCGGCGCCAAGTCGATCTCCAACGAGACCACGTTCGAGACCGACA
TCCGCGATTTCGAAACGCTGGAACGGATCCTGTGGCGATTGTCCGAGAAGGTGTCGTCGCGTCTCAAAGGCGCAGCGCTG
GCCGGCTCGACGATCACCCTGAAGCTGAAAACCGGCGACTTCCGCCAGCGCACCCGCTCGCAGACGATTCACGCGCCGAC
CCAGCTCGCAGGACGAATCTTCGCGATCTCGCGCGACATGCTCGCGAAGGAAATCGACGGCACCGCGTTCCGCCTGATCG
GCACCGGCGTCAGCGCACTGGCGCCGGGCTCCACGGCCGGCGACACCGACATGATCGATCGCCGCTCCGCGACAGCCGAG
CGCGCCATCGACGATCTGCGCAAGAAATTCGGCGCCGCCGCGGTGATCAGGGGGCTCGCTTACGACGGCCCGGACAAGCC
GCGGGGATGA
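
The GC content field can be checked against the gene sequence with a few lines of Python. This is a minimal sketch: `gc_content` is a hypothetical helper, not part of any tool used to build this record, and it assumes the sequence contains only unambiguous A, C, G and T characters plus optional line breaks.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence above.
print(round(gc_content("GTGAATTCGGCCTCGCCGCC"), 2))  # 70.0
```

Applied to the full 1290-base sequence, a listed value of 69.22 would correspond to 893 G or C bases, since 893 / 1290 is approximately 0.6922.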

Upstream 100 bases:

>100_bases
CGACCGTGAAGGCGTTCGCAAATTCGGTCGAGCTCGACCCGACGAATGTCGCGGCCGCGCTGCAGGTGCTGGAAGACAAA
CCCTGGGAGCGCGACACGCC

Downstream 100 bases:

>100_bases
CTCACTTTCGTCTTTAGCGCCGAGACATCAACAACCACCGTCATGCCCGGGCTTGTCCCGGGCATCCACGTGTTTGTTAC
GTGGACAGGCCCAAGACGTG

Product: DNA polymerase IV

Products: NA

Alternate protein names: Pol IV 1 [H]

Number of amino acids: Translated: 429; Mature: 429

Protein sequence:

>429_residues
MNSASPPGPLCFCRDCLADNATAARRCAACGSPRLARHRALPALTLAHIDCDAFYATVEKRDNPELADRPVIIGGGKRGV
VSAACYIARTFGVRSAMPMFKALALCPSAAVVRPDMAKYVRVGREVRQAMLQLTPLVEPLSIDEAFLDLSGTERMHGTIA
AKVLARFARDIERDVGITVSVGLSCNKFLAKITSDLDKPRGFATLDQDDARAMLGPRPVGFIFGVGPATAARIAQHGFRT
IADLQKADEIELMRQFGDEGRRLWRLARGIDDRKVVPDRGAKSISNETTFETDIRDFETLERILWRLSEKVSSRLKGAAL
AGSTITLKLKTGDFRQRTRSQTIHAPTQLAGRIFAISRDMLAKEIDGTAFRLIGTGVSALAPGSTAGDTDMIDRRSATAE
RAIDDLRKKFGAAAVIRGLAYDGPDKPRG
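
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below uses the standard genetic code and makes one simplifying assumption that is labeled in the code: the first codon is always read as methionine, which is how the GTG start codon of this gene yields the leading M. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence given 5'->3' on the coding strand."""
    dna = "".join(cds.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: the initiator codon (here GTG) is read as Met.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First ten codons of the gene sequence above, followed by a stop codon.
print(translate("GTGAATTCGGCCTCGCCGCCCGGACCGCTGTGA"))  # MNSASPPGPL
```

Because the gene sequence in this record is already given in coding orientation (it begins with GTG and ends with TGA), no reverse complement is needed before calling `translate`, even though the gene lies on the reverse strand.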

Sequences:

>Translated_429_residues
MNSASPPGPLCFCRDCLADNATAARRCAACGSPRLARHRALPALTLAHIDCDAFYATVEKRDNPELADRPVIIGGGKRGV
VSAACYIARTFGVRSAMPMFKALALCPSAAVVRPDMAKYVRVGREVRQAMLQLTPLVEPLSIDEAFLDLSGTERMHGTIA
AKVLARFARDIERDVGITVSVGLSCNKFLAKITSDLDKPRGFATLDQDDARAMLGPRPVGFIFGVGPATAARIAQHGFRT
IADLQKADEIELMRQFGDEGRRLWRLARGIDDRKVVPDRGAKSISNETTFETDIRDFETLERILWRLSEKVSSRLKGAAL
AGSTITLKLKTGDFRQRTRSQTIHAPTQLAGRIFAISRDMLAKEIDGTAFRLIGTGVSALAPGSTAGDTDMIDRRSATAE
RAIDDLRKKFGAAAVIRGLAYDGPDKPRG
>Mature_429_residues
MNSASPPGPLCFCRDCLADNATAARRCAACGSPRLARHRALPALTLAHIDCDAFYATVEKRDNPELADRPVIIGGGKRGV
VSAACYIARTFGVRSAMPMFKALALCPSAAVVRPDMAKYVRVGREVRQAMLQLTPLVEPLSIDEAFLDLSGTERMHGTIA
AKVLARFARDIERDVGITVSVGLSCNKFLAKITSDLDKPRGFATLDQDDARAMLGPRPVGFIFGVGPATAARIAQHGFRT
IADLQKADEIELMRQFGDEGRRLWRLARGIDDRKVVPDRGAKSISNETTFETDIRDFETLERILWRLSEKVSSRLKGAAL
AGSTITLKLKTGDFRQRTRSQTIHAPTQLAGRIFAISRDMLAKEIDGTAFRLIGTGVSALAPGSTAGDTDMIDRRSATAE
RAIDDLRKKFGAAAVIRGLAYDGPDKPRG

Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by polIV. Exhibits

COG id: COG0389

COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 umuC domain [H]

Homologues:

Organism=Homo sapiens, GI84043967, Length=271, Percent_Identity=28.7822878228782, Blast_Score=108, Evalue=1e-23
Organism=Homo sapiens, GI7706681, Length=271, Percent_Identity=28.7822878228782, Blast_Score=108, Evalue=1e-23
Organism=Homo sapiens, GI7705344, Length=106, Percent_Identity=46.2264150943396, Blast_Score=104, Evalue=2e-22
Organism=Homo sapiens, GI154350220, Length=306, Percent_Identity=25.4901960784314, Blast_Score=96, Evalue=5e-20
Organism=Escherichia coli, GI1786425, Length=341, Percent_Identity=37.2434017595308, Blast_Score=206, Evalue=2e-54
Organism=Escherichia coli, GI1787432, Length=304, Percent_Identity=25.9868421052632, Blast_Score=83, Evalue=4e-17
Organism=Caenorhabditis elegans, GI193205700, Length=393, Percent_Identity=30.7888040712468, Blast_Score=130, Evalue=1e-30
Organism=Caenorhabditis elegans, GI193205702, Length=345, Percent_Identity=28.1159420289855, Blast_Score=91, Evalue=1e-18
Organism=Caenorhabditis elegans, GI17537959, Length=302, Percent_Identity=25.1655629139073, Blast_Score=76, Evalue=4e-14
Organism=Caenorhabditis elegans, GI115534089, Length=118, Percent_Identity=37.2881355932203, Blast_Score=70, Evalue=3e-12
Organism=Drosophila melanogaster, GI19923006, Length=345, Percent_Identity=24.0579710144928, Blast_Score=106, Evalue=4e-23
Organism=Drosophila melanogaster, GI21355641, Length=312, Percent_Identity=27.5641025641026, Blast_Score=97, Evalue=3e-20
Organism=Drosophila melanogaster, GI24644984, Length=312, Percent_Identity=27.5641025641026, Blast_Score=97, Evalue=3e-20
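
The homologue lines use a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal Python sketch for turning one such line into a record is shown below; `parse_homologue` and the `GI` dictionary key are names chosen for illustration, and the parser tolerates an optional trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dict."""
    fields = [f.strip() for f in line.strip().rstrip(",").split(",")]
    record = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI token without a key
    # Convert the numeric fields that are present.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786425, Length=341, "
    "Percent_Identity=37.2434017595308, Blast_Score=206, Evalue=2e-54,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])
# Escherichia coli 1786425 206
```

With the lines parsed this way, sorting by `Blast_Score` or filtering by `Evalue` is straightforward; in the list above the Escherichia coli hit GI1786425 has the highest score (206).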

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017962
- InterPro:   IPR017961
- InterPro:   IPR001126
- InterPro:   IPR017963
- InterPro:   IPR022880 [H]

Pfam domain/function: PF00817 IMS [H]

EC number: 2.7.7.7 [H]

Molecular weight: Translated: 46525; Mature: 46525
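
A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below follows that approach. The mass table holds widely used average residue masses in daltons, but it is an assumption that the tool behind this record used the same values, so small differences from the listed 46525 would not be surprising. The name `molecular_weight` is illustrative.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    residues = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in residues) + WATER


# Last nine residues of the protein sequence in this record.
print(round(molecular_weight("YDGPDKPRG"), 1))
```

Passing the full 429-residue sequence to `molecular_weight` gives an estimate that can be compared with the Translated and Mature values, which are identical here because no signal peptide is predicted.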

Theoretical pI: Translated: 10.09; Mature: 10.09

Prosite motif: PS50173 UMUC

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.1 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
2.1 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
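
The percentages above are residue counts divided by the protein length. The following Python sketch shows the calculation; `residue_percent` is a hypothetical helper. Counting the 429-residue sequence in this record by hand gives 9 Cys and 10 Met, which reproduces the listed values.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in sequence if aa in residues.upper())
    return 100.0 * hits / len(sequence)


# Arithmetic for this record: 9 Cys and 10 Met in 429 residues.
print(round(100 * 9 / 429, 1))         # 2.1
print(round(100 * 10 / 429, 1))        # 2.3
print(round(100 * (9 + 10) / 429, 1))  # 4.4

# The same calculation on the first 20 residues of the protein.
print(round(residue_percent("MNSASPPGPLCFCRDCLADN", "C"), 1))  # 15.0
```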

Secondary structure:

>Translated Secondary Structure
MNSASPPGPLCFCRDCLADNATAARRCAACGSPRLARHRALPALTLAHIDCDAFYATVEK
CCCCCCCCCHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHCCCHHHHHHCCHHHHHHHHHC
RDNPELADRPVIIGGGKRGVVSAACYIARTFGVRSAMPMFKALALCPSAAVVRPDMAKYV
CCCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCHHHHHHH
RVGREVRQAMLQLTPLVEPLSIDEAFLDLSGTERMHGTIAAKVLARFARDIERDVGITVS
HHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEE
VGLSCNKFLAKITSDLDKPRGFATLDQDDARAMLGPRPVGFIFGVGPATAARIAQHGFRT
ECCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCEEEEECCHHHHHHHHHHHHHH
IADLQKADEIELMRQFGDEGRRLWRLARGIDDRKVVPDRGAKSISNETTFETDIRDFETL
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHCCCCCCCHHHHHHHHHH
ERILWRLSEKVSSRLKGAALAGSTITLKLKTGDFRQRTRSQTIHAPTQLAGRIFAISRDM
HHHHHHHHHHHHHHHCCCEEECCEEEEEEECCHHHHHHHHCCCCCHHHHHHHHHHHHHHH
LAKEIDGTAFRLIGTGVSALAPGSTAGDTDMIDRRSATAERAIDDLRKKFGAAAVIRGLA
HHHHCCCCEEEEECCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YDGPDKPRG
CCCCCCCCC
>Mature Secondary Structure
MNSASPPGPLCFCRDCLADNATAARRCAACGSPRLARHRALPALTLAHIDCDAFYATVEK
CCCCCCCCCHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHCCCHHHHHHCCHHHHHHHHHC
RDNPELADRPVIIGGGKRGVVSAACYIARTFGVRSAMPMFKALALCPSAAVVRPDMAKYV
CCCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCHHHHHHH
RVGREVRQAMLQLTPLVEPLSIDEAFLDLSGTERMHGTIAAKVLARFARDIERDVGITVS
HHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEE
VGLSCNKFLAKITSDLDKPRGFATLDQDDARAMLGPRPVGFIFGVGPATAARIAQHGFRT
ECCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCEEEEECCHHHHHHHHHHHHHH
IADLQKADEIELMRQFGDEGRRLWRLARGIDDRKVVPDRGAKSISNETTFETDIRDFETL
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHCCCCCCCHHHHHHHHHH
ERILWRLSEKVSSRLKGAALAGSTITLKLKTGDFRQRTRSQTIHAPTQLAGRIFAISRDM
HHHHHHHHHHHHHHHCCCEEECCEEEEEEECCHHHHHHHHCCCCCHHHHHHHHHHHHHHH
LAKEIDGTAFRLIGTGVSALAPGSTAGDTDMIDRRSATAERAIDDLRKKFGAAAVIRGLA
HHHHCCCCEEEEECCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YDGPDKPRG
CCCCCCCCC
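
The secondary structure blocks alternate a line of residues with a line of predicted states. Assuming the usual three-state convention (H for helix, E for strand, C for coil), which this record does not define explicitly, the state lines can be summarized as percentages. `ss_composition` is an illustrative name.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a prediction string."""
    cleaned = "".join(states.split()).upper()
    if not cleaned:
        raise ValueError("empty state string")
    counts = Counter(cleaned)
    return {s: round(100.0 * counts[s] / len(cleaned), 1) for s in "HEC"}


# State line for the final nine residues (YDGPDKPRG) of this record.
print(ss_composition("CCCCCCCCC"))  # {'H': 0.0, 'E': 0.0, 'C': 100.0}
```

To summarize the whole prediction, concatenate every second line of the block (the state lines) and pass the result to `ss_composition`.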

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11481430 [H]