Definition: Methanopyrus kandleri AV19, complete genome.
Accession: NC_003551
Length: 1,694,969

The map label for this gene is PolB [H]

Identifier: 20094475

GI number: 20094475

Start: 1000589

End: 1003081

Strand: Direct

Name: PolB [H]

Synonym: MK1039

Alternate gene names: 20094475

Gene position: 1000589-1003081 (Clockwise)

Preceding gene: 20094474

Following gene: 20094476

Centisome position: 59.03
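
The coordinate fields above are linked by simple arithmetic. The following is a minimal sketch, assuming the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state the formula, and the variable names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 1_694_969   # Length
START = 1_000_589           # Start
END = 1_003_081             # End

# Coordinates are inclusive, so the gene spans END - START + 1 bases.
gene_length = END - START + 1

# Assumed formula: start coordinate as a percentage of the genome length.
centisome = START / GENOME_LENGTH * 100

print(gene_length)           # 2493
print(round(centisome, 2))   # 59.03
```

Under this assumption the result agrees with the reported centisome position, and the computed span agrees with the 2493-base gene sequence header.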

GC content: 62.98
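
A GC content figure of this kind is usually the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute it. The function name `gc_content` is hypothetical, and the record does not say which tool produced the reported value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks, normalise case
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq) * 100


# First 20 bases of the gene sequence listed in this record.
print(round(gc_content("TTGCTCCGTACAGTGTGGGT"), 2))  # 55.0
```

Passing the full 2493-base sequence, with or without its line breaks, should give a value comparable to the one reported here.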

Gene sequence:

>2493_bases
TTGCTCCGTACAGTGTGGGTAGATTACGCCAGGAAAGGCGAACCCGATGTCATCTTAGTCGGTCGGCGGGAGGACGGGAA
CCCAGCTGCCCTCGTCGTTAAGGGGTTTCGTCCCTACTTCTACGCGGAGGTGGAGGACGGGTTCGATCCGTCCGAGGTGG
AGCGTCTGAGCGGTGTGGTGGAGGTCGAAGAAGTCCTGTTGGAGCACCCCTACGGCGGCGACCGGGTGGAGCTCCTACGG
ATCGTCGCCACGTACCCCAAGGTCGTCCCCAAACTGCGCGAGCAGGTCAAGAAGCTGGACGGCGTGAAGGAGGTCTACGA
GGCGGACATCCCCTTCGTGCGCCGTGCCGCCGTCGACCTCAACTTGCCGCCGGCGTCCGAGGTCGACGTCTCCGACCTGG
ACACGGGGTCCTGGTCCGGACTTCCCGCGTACTTCGCGGACGTCGAGGACGCCCGTGAGTTGGACCACCGCCCCTACCCG
ATCGAGGACCTCGTCGTCGCCAGCTTCGACCTCGAGGTGCTGGCGGAACCCGGAACGACGATCAAGGGAGCCTCGGGCCC
CATCATCGCCATCAGCTTCGCGTACAGTACGCCTGACGGGGAGCGCCGTAACTACGTGATTACCTGGAAAGGAGAGGACG
AGTCGTTCGAAGTGGACGGGGTGGAAACCGAGGTCATCGTGTGTAGGTCGGAAGCCGCCGCGCTGCGGAGGTTCTTCGAC
GAGTTCCGCCGCGTTGATCCCGACGTGGTGTTCACATACAACGGGGACGAGTTCGACCTGCCGTACCTGCAACACCGGGC
CGGGAAGCTGGGTATCGACGTCTCGCCCCTGGCGCGGCCGGCGGGCAAGCGTGGGATCATACTGAAGCACGGGGGCGGCC
GTTATGCCTCCGATATTTTCGGACGGGCCCACGTCGACCTGTACCACACGGCGAGGAAGAACCTCAAGCTGGAACGCTTC
ACGCTCGAGGAGGCCGTCAAGGACGTGCTCGGTGTAGAGAAGGAGGAGATGGAGCTAGCCGACATCAACGAGGCCTGGAA
GCGCGGTAATCTGGACGAACTGATGAGGTACTCGGCTGAGGATGCCCACTACACCTTAGAGTTGGGGCTCGAGCTGGCGC
AGGTCGAGTTGGAGCTCTCCTACCTGACACGGCTGCCGCTGCCGGATGCGACCCGCTTCAGCTTCGGACAGCTCGCGGAG
TGGAGGGCCATCTACAAGGCGCGCCAGGAGGACATCCTGGTACCGAACAAACCGACTCGAGATGAGTACAAGCGGCGGCG
TCGCAAGGCGTACAAGGGCGCGATAGTGTTCGAGCCCGAGATAGGGCTCCACGAGAACGTGGTCTGCGTGGACTTCGCCA
GCCTGTACCCCAACGTGATGGTCGCGCACAATATCTCGCCCGACACGTTCGACTGCGACTGTTGTCCGCGTGTGACCGTC
GAGGAGGTGGACGACCCCACGGACGCGACCGTGGCACCGGACGTGGGCCACAAGTTCTGCAAGCGGCGTAAGGGGTTCTT
CCCCCGGCTCGTAGAGGGACTCATCGAACGCCGGCGTGAGCTCAAGCGCCGCCTCCGGAAGCTCGATACCGAGTCGCATC
CTCACGAGGCTAAGATCCTCGACGTACGACAGCAGGCGTACAAGGTCCTGGCCAACAGCTACTACGGTTACATGGGCTGG
GCTAACGCGCGCTGGTTCTGCCGCGAGTGTGCCGAGAGTGTTACCGCTTGGGGTCGCTACTACATCAGCGAGGTTCGAAG
GATCGCGGAGGAGAAGTACGGGTTGAAGGTCGTGTACGGGGACACCGACTCGCTGTTCGTGAAGCTGCCCGACGCGGACT
TGGAGGAAACCATCGAGCGGGTGAAGGAGTTCCTGAAAGAGGTCAACGGCCGCCTCCCCGTGGAACTAGAGCTGGAGGAC
GCCTACAAGAGGATCCTGTTCGTGACCAAGAAGAAGTACGCCGGGTACACCGAGGACGGGAAGATCGTTACGAAAGGTCT
GGAGGTGGTCCGACGGGATTGGGCGCCTATCGCCAGGGAGACGCAGCGTCGAGTCTTGAAGCGGATCCTAGCCGACAACG
ACCCGGAGGCGGCGCTGAAGGAGATCCATGAGGTCCTCGAGAGGCTGAAGTCGGGCGACGTCGACATCGACGAGCTCGCG
GTCACGTCCCAGCTCACGAAGAAGCCCTCGGAGTACGTTCAGAAGGGTCCCCACGTCAGGGCCGCGCTACGGCTCGCTCG
ACACCTCGGAGTGGAGCCCGAACCGGGTACCATCGTGAGGTACGTCATCGTCCGCGGTCCCGGTAGCGTCAGCGACAAGG
CGTACCCGGTGGAACTGGTGCGGGAAGAGGGGAAAGAGCCCGACGTCGATTACTACATCGAGCACCAGATACTACCGGCC
GTGGAGCGGATCATGCGGGCGATAGGTTATTCCCGCGGGCAGATCGTCGGTGAGACGGCCTCACAGAAGACGCTGGATCA
GTTCTTCGGCTGA

Upstream 100 bases:

>100_bases
CCCAGTTACCTGTGCACGATAGTCGCCCTCCGGCCCTGAGATACCAGGGCGGGGGCCGGGTTTTTACCCGGTGACTACCC
GGGAGGTTCGGGGGCGAGGG

Downstream 100 bases:

>100_bases
CGGGATGACGAGAGCACCCCTAAGTGATCGGTGATCGGCTTCACCCTGGCCGACCGTGGGTGTTCTCCTTGCGAGTGGCC
ATCCTGGACGGGTACACGGA

Product: B family DNA polymerase

Products: NA

Alternate protein names: Pfu polymerase [H]

Number of amino acids: Translated: 830; Mature: 830

Protein sequence:

>830_residues
MLRTVWVDYARKGEPDVILVGRREDGNPAALVVKGFRPYFYAEVEDGFDPSEVERLSGVVEVEEVLLEHPYGGDRVELLR
IVATYPKVVPKLREQVKKLDGVKEVYEADIPFVRRAAVDLNLPPASEVDVSDLDTGSWSGLPAYFADVEDARELDHRPYP
IEDLVVASFDLEVLAEPGTTIKGASGPIIAISFAYSTPDGERRNYVITWKGEDESFEVDGVETEVIVCRSEAAALRRFFD
EFRRVDPDVVFTYNGDEFDLPYLQHRAGKLGIDVSPLARPAGKRGIILKHGGGRYASDIFGRAHVDLYHTARKNLKLERF
TLEEAVKDVLGVEKEEMELADINEAWKRGNLDELMRYSAEDAHYTLELGLELAQVELELSYLTRLPLPDATRFSFGQLAE
WRAIYKARQEDILVPNKPTRDEYKRRRRKAYKGAIVFEPEIGLHENVVCVDFASLYPNVMVAHNISPDTFDCDCCPRVTV
EEVDDPTDATVAPDVGHKFCKRRKGFFPRLVEGLIERRRELKRRLRKLDTESHPHEAKILDVRQQAYKVLANSYYGYMGW
ANARWFCRECAESVTAWGRYYISEVRRIAEEKYGLKVVYGDTDSLFVKLPDADLEETIERVKEFLKEVNGRLPVELELED
AYKRILFVTKKKYAGYTEDGKIVTKGLEVVRRDWAPIARETQRRVLKRILADNDPEAALKEIHEVLERLKSGDVDIDELA
VTSQLTKKPSEYVQKGPHVRAALRLARHLGVEPEPGTIVRYVIVRGPGSVSDKAYPVELVREEGKEPDVDYYIEHQILPA
VERIMRAIGYSRGQIVGETASQKTLDQFFG
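
The gene sequence begins with TTG, yet the protein begins with M, and 2493 bases give 831 codons (830 residues plus the stop codon). The sketch below illustrates this relationship. It assumes the standard genetic code and treats an initial ATG, GTG, or TTG as methionine, which is common for prokaryotic start codons. The record itself does not state which translation table was used, and the names `translate` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these codons are read as Met when they open the reading frame.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("TTGCTCCGTACAGTGTGGGTAGATTACGCC"))  # MLRTVWVDYA
print(translate("GATCAGTTCTTCGGCTGA"))              # DQFFG
print(2493 // 3 - 1)                                # 830
```

The two fragments reproduce the first ten and last five residues of the protein sequence above.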

Sequences:

>Translated_830_residues
MLRTVWVDYARKGEPDVILVGRREDGNPAALVVKGFRPYFYAEVEDGFDPSEVERLSGVVEVEEVLLEHPYGGDRVELLR
IVATYPKVVPKLREQVKKLDGVKEVYEADIPFVRRAAVDLNLPPASEVDVSDLDTGSWSGLPAYFADVEDARELDHRPYP
IEDLVVASFDLEVLAEPGTTIKGASGPIIAISFAYSTPDGERRNYVITWKGEDESFEVDGVETEVIVCRSEAAALRRFFD
EFRRVDPDVVFTYNGDEFDLPYLQHRAGKLGIDVSPLARPAGKRGIILKHGGGRYASDIFGRAHVDLYHTARKNLKLERF
TLEEAVKDVLGVEKEEMELADINEAWKRGNLDELMRYSAEDAHYTLELGLELAQVELELSYLTRLPLPDATRFSFGQLAE
WRAIYKARQEDILVPNKPTRDEYKRRRRKAYKGAIVFEPEIGLHENVVCVDFASLYPNVMVAHNISPDTFDCDCCPRVTV
EEVDDPTDATVAPDVGHKFCKRRKGFFPRLVEGLIERRRELKRRLRKLDTESHPHEAKILDVRQQAYKVLANSYYGYMGW
ANARWFCRECAESVTAWGRYYISEVRRIAEEKYGLKVVYGDTDSLFVKLPDADLEETIERVKEFLKEVNGRLPVELELED
AYKRILFVTKKKYAGYTEDGKIVTKGLEVVRRDWAPIARETQRRVLKRILADNDPEAALKEIHEVLERLKSGDVDIDELA
VTSQLTKKPSEYVQKGPHVRAALRLARHLGVEPEPGTIVRYVIVRGPGSVSDKAYPVELVREEGKEPDVDYYIEHQILPA
VERIMRAIGYSRGQIVGETASQKTLDQFFG
>Mature_830_residues
MLRTVWVDYARKGEPDVILVGRREDGNPAALVVKGFRPYFYAEVEDGFDPSEVERLSGVVEVEEVLLEHPYGGDRVELLR
IVATYPKVVPKLREQVKKLDGVKEVYEADIPFVRRAAVDLNLPPASEVDVSDLDTGSWSGLPAYFADVEDARELDHRPYP
IEDLVVASFDLEVLAEPGTTIKGASGPIIAISFAYSTPDGERRNYVITWKGEDESFEVDGVETEVIVCRSEAAALRRFFD
EFRRVDPDVVFTYNGDEFDLPYLQHRAGKLGIDVSPLARPAGKRGIILKHGGGRYASDIFGRAHVDLYHTARKNLKLERF
TLEEAVKDVLGVEKEEMELADINEAWKRGNLDELMRYSAEDAHYTLELGLELAQVELELSYLTRLPLPDATRFSFGQLAE
WRAIYKARQEDILVPNKPTRDEYKRRRRKAYKGAIVFEPEIGLHENVVCVDFASLYPNVMVAHNISPDTFDCDCCPRVTV
EEVDDPTDATVAPDVGHKFCKRRKGFFPRLVEGLIERRRELKRRLRKLDTESHPHEAKILDVRQQAYKVLANSYYGYMGW
ANARWFCRECAESVTAWGRYYISEVRRIAEEKYGLKVVYGDTDSLFVKLPDADLEETIERVKEFLKEVNGRLPVELELED
AYKRILFVTKKKYAGYTEDGKIVTKGLEVVRRDWAPIARETQRRVLKRILADNDPEAALKEIHEVLERLKSGDVDIDELA
VTSQLTKKPSEYVQKGPHVRAALRLARHLGVEPEPGTIVRYVIVRGPGSVSDKAYPVELVREEGKEPDVDYYIEHQILPA
VERIMRAIGYSRGQIVGETASQKTLDQFFG

Specific function: In addition to polymerase activity, this DNA polymerase exhibits 3' to 5' exonuclease activity [H]

COG id: COG0417

COG function: function code L; DNA polymerase elongation subunit (family B)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the DNA polymerase type-B family [H]

Homologues:

Organism=Homo sapiens, GI156616275, Length=908, Percent_Identity=28.9647577092511, Blast_Score=253, Evalue=8e-67,
Organism=Homo sapiens, GI106507301, Length=711, Percent_Identity=26.7229254571027, Blast_Score=211, Evalue=3e-54,
Organism=Homo sapiens, GI153792012, Length=773, Percent_Identity=25.614489003881, Blast_Score=152, Evalue=1e-36,
Organism=Escherichia coli, GI1786246, Length=650, Percent_Identity=27.5384615384615, Blast_Score=142, Evalue=1e-34,
Organism=Caenorhabditis elegans, GI17559488, Length=914, Percent_Identity=25.9299781181619, Blast_Score=212, Evalue=8e-55,
Organism=Caenorhabditis elegans, GI32565317, Length=542, Percent_Identity=28.9667896678967, Blast_Score=171, Evalue=2e-42,
Organism=Caenorhabditis elegans, GI86563326, Length=296, Percent_Identity=29.0540540540541, Blast_Score=104, Evalue=2e-22,
Organism=Saccharomyces cerevisiae, GI6320101, Length=863, Percent_Identity=26.9988412514484, Blast_Score=246, Evalue=9e-66,
Organism=Saccharomyces cerevisiae, GI6324227, Length=661, Percent_Identity=27.9878971255673, Blast_Score=203, Evalue=1e-52,
Organism=Saccharomyces cerevisiae, GI6325090, Length=660, Percent_Identity=23.4848484848485, Blast_Score=136, Evalue=1e-32,
Organism=Saccharomyces cerevisiae, GI6324067, Length=340, Percent_Identity=27.3529411764706, Blast_Score=74, Evalue=9e-14,
Organism=Drosophila melanogaster, GI24664937, Length=879, Percent_Identity=27.872582480091, Blast_Score=251, Evalue=2e-66,
Organism=Drosophila melanogaster, GI24648780, Length=653, Percent_Identity=28.3307810107198, Blast_Score=194, Evalue=2e-49,
Organism=Drosophila melanogaster, GI24586371, Length=437, Percent_Identity=27.9176201372998, Blast_Score=129, Evalue=9e-30,
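
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below shows one way to turn such a line into a typed record. The function name `parse_homologue` and the choice of dictionary keys are illustrative and not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in (f.strip() for f in line.split(",")):
        if not field:
            continue  # skip the empty field left by a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier, e.g. GI156616275
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI156616275, Length=908, "
    "Percent_Identity=28.9647577092511, Blast_Score=253, Evalue=8e-67,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

With records in this form, the hits can be sorted or filtered, for example by E-value or by organism.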

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR006172
- InterPro:   IPR017964
- InterPro:   IPR006133
- InterPro:   IPR006134
- InterPro:   IPR004578
- InterPro:   IPR023211
- InterPro:   IPR012337 [H]

Pfam domain/function: PF00136 DNA_pol_B; PF03104 DNA_pol_B_exo [H]

EC number: 2.7.7.7 [H]

Molecular weight: Translated: 94562; Mature: 94562

Theoretical pI: Translated: 5.14; Mature: 5.14

Prosite motif: PS00116 DNA_POLYMERASE_B

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
0.7 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
1.7 %Cys+Met (Mature Protein)
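
These percentages are residue counts divided by the protein length. The sketch below assumes rounding to one decimal place, which matches the precision shown, and the function name `residue_percent` is hypothetical. Counts of 8 Cys and 6 Met among 830 residues would reproduce the figures above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    count = sum(protein.count(r) for r in residues)
    return round(count / len(protein) * 100, 1)


# Short illustration using the first ten residues of the protein.
print(residue_percent("MLRTVWVDYA", "M"))   # 10.0

# Arithmetic for the whole protein, assuming 8 Cys and 6 Met in 830 residues.
print(round(8 / 830 * 100, 1))    # 1.0
print(round(6 / 830 * 100, 1))    # 0.7
print(round(14 / 830 * 100, 1))   # 1.7
```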

Secondary structure:

>Translated Secondary Structure
MLRTVWVDYARKGEPDVILVGRREDGNPAALVVKGFRPYFYAEVEDGFDPSEVERLSGVV
CCEEEEEHHHCCCCCCEEEEECCCCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHHH
EVEEVLLEHPYGGDRVELLRIVATYPKVVPKLREQVKKLDGVKEVYEADIPFVRRAAVDL
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHEEEC
NLPPASEVDVSDLDTGSWSGLPAYFADVEDARELDHRPYPIEDLVVASFDLEVLAEPGTT
CCCCCCCCCCCCCCCCCCCCCCHHHHCHHHHHHCCCCCCCHHHHEEEECCEEEEECCCCE
IKGASGPIIAISFAYSTPDGERRNYVITWKGEDESFEVDGVETEVIVCRSEAAALRRFFD
EECCCCCEEEEEEEECCCCCCCCCEEEEECCCCCCEEECCCCEEEEEECCHHHHHHHHHH
EFRRVDPDVVFTYNGDEFDLPYLQHRAGKLGIDVSPLARPAGKRGIILKHGGGRYASDIF
HHHCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCCHHCCCCCCCCEEEECCCCCHHHHHH
GRAHVDLYHTARKNLKLERFTLEEAVKDVLGVEKEEMELADINEAWKRGNLDELMRYSAE
CCHHHHHHHHHHHCCEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHCCCC
DAHYTLELGLELAQVELELSYLTRLPLPDATRFSFGQLAEWRAIYKARQEDILVPNKPTR
CCEEEEEECCEEEHHEEEHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEECCCCCCH
DEYKRRRRKAYKGAIVFEPEIGLHENVVCVDFASLYPNVMVAHNISPDTFDCDCCPRVTV
HHHHHHHHHHHCCCEEECCCCCCCCCEEEEEHHHHCCCEEEEECCCCCCCCCCCCCCCCH
EEVDDPTDATVAPDVGHKFCKRRKGFFPRLVEGLIERRRELKRRLRKLDTESHPHEAKIL
HHCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHH
DVRQQAYKVLANSYYGYMGWANARWFCRECAESVTAWGRYYISEVRRIAEEKYGLKVVYG
HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEC
DTDSLFVKLPDADLEETIERVKEFLKEVNGRLPVELELEDAYKRILFVTKKKYAGYTEDG
CCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHEEHHHHCCCCCCC
KIVTKGLEVVRRDWAPIARETQRRVLKRILADNDPEAALKEIHEVLERLKSGDVDIDELA
CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHH
VTSQLTKKPSEYVQKGPHVRAALRLARHLGVEPEPGTIVRYVIVRGPGSVSDKAYPVELV
HHHHHHCCCHHHHHCCCCHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHH
REEGKEPDVDYYIEHQILPAVERIMRAIGYSRGQIVGETASQKTLDQFFG
HHCCCCCCCCEEEHHHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHCC
>Mature Secondary Structure
MLRTVWVDYARKGEPDVILVGRREDGNPAALVVKGFRPYFYAEVEDGFDPSEVERLSGVV
CCEEEEEHHHCCCCCCEEEEECCCCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHHH
EVEEVLLEHPYGGDRVELLRIVATYPKVVPKLREQVKKLDGVKEVYEADIPFVRRAAVDL
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHEEEC
NLPPASEVDVSDLDTGSWSGLPAYFADVEDARELDHRPYPIEDLVVASFDLEVLAEPGTT
CCCCCCCCCCCCCCCCCCCCCCHHHHCHHHHHHCCCCCCCHHHHEEEECCEEEEECCCCE
IKGASGPIIAISFAYSTPDGERRNYVITWKGEDESFEVDGVETEVIVCRSEAAALRRFFD
EECCCCCEEEEEEEECCCCCCCCCEEEEECCCCCCEEECCCCEEEEEECCHHHHHHHHHH
EFRRVDPDVVFTYNGDEFDLPYLQHRAGKLGIDVSPLARPAGKRGIILKHGGGRYASDIF
HHHCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCCHHCCCCCCCCEEEECCCCCHHHHHH
GRAHVDLYHTARKNLKLERFTLEEAVKDVLGVEKEEMELADINEAWKRGNLDELMRYSAE
CCHHHHHHHHHHHCCEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHCCCC
DAHYTLELGLELAQVELELSYLTRLPLPDATRFSFGQLAEWRAIYKARQEDILVPNKPTR
CCEEEEEECCEEEHHEEEHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEECCCCCCH
DEYKRRRRKAYKGAIVFEPEIGLHENVVCVDFASLYPNVMVAHNISPDTFDCDCCPRVTV
HHHHHHHHHHHCCCEEECCCCCCCCCEEEEEHHHHCCCEEEEECCCCCCCCCCCCCCCCH
EEVDDPTDATVAPDVGHKFCKRRKGFFPRLVEGLIERRRELKRRLRKLDTESHPHEAKIL
HHCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHH
DVRQQAYKVLANSYYGYMGWANARWFCRECAESVTAWGRYYISEVRRIAEEKYGLKVVYG
HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEC
DTDSLFVKLPDADLEETIERVKEFLKEVNGRLPVELELEDAYKRILFVTKKKYAGYTEDG
CCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHEEHHHHCCCCCCC
KIVTKGLEVVRRDWAPIARETQRRVLKRILADNDPEAALKEIHEVLERLKSGDVDIDELA
CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHH
VTSQLTKKPSEYVQKGPHVRAALRLARHLGVEPEPGTIVRYVIVRGPGSVSDKAYPVELV
HHHHHHCCCHHHHHCCCCHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHH
REEGKEPDVDYYIEHQILPAVERIMRAIGYSRGQIVGETASQKTLDQFFG
HHCCCCCCCCEEEHHHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHCC
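
The secondary-structure blocks alternate a line of residues with a line of per-residue states. The sketch below shows one way to rejoin the two tracks and summarise the states. It assumes the usual convention that H is helix, E is strand, and C is coil, which the record does not spell out, and the function names are illustrative.

```python
from collections import Counter


def pair_structure(lines):
    """Split alternating residue/state lines into two aligned strings."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    residues = "".join(lines[0::2])  # 1st, 3rd, 5th ... lines
    states = "".join(lines[1::2])    # 2nd, 4th, 6th ... lines
    if len(residues) != len(states):
        raise ValueError("residue and state tracks differ in length")
    return residues, states


def state_fractions(states: str) -> dict:
    """Fraction of positions assigned to each state letter."""
    counts = Counter(states)
    return {s: round(n / len(states), 3) for s, n in counts.items()}


# First ten columns of the translated secondary structure block above.
residues, states = pair_structure(["MLRTVWVDYA", "CCEEEEEHHH"])
print(state_fractions(states))  # {'C': 0.2, 'E': 0.5, 'H': 0.3}
```

Feeding in all lines of a block, without the `>` header line, gives the composition for the whole protein.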

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8441634; 1762925; 1579479 [H]