Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

The map label for this gene is yqjH [H]

Identifier: 49187049

GI number: 49187049

Start: 3984440

End: 3985678

Strand: Reverse

Name: yqjH [H]

Synonym: BAS4051

Alternate gene names: 49187049

Gene position: 3985678-3984440 (Counterclockwise)

Preceding gene: 49187050

Following gene: 49187047

Centisome position: 76.23
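
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is an illustration, not the database's documented method. It assumes 1-based inclusive coordinates, and it assumes the centisome value is the gene's first base (the higher coordinate, because the gene is on the reverse strand) expressed as a percentage of the chromosome length. The function names are hypothetical.

```python
GENOME_LENGTH = 5_228_663          # chromosome length from the record header
START, END = 3_984_440, 3_985_678  # Start/End fields of this record


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of chromosome length (assumed definition)."""
    return 100.0 * position / genome_length


# Reverse strand: the gene's first base is the higher coordinate.
length = gene_length(START, END)
position = round(centisome(END, GENOME_LENGTH), 2)
print(length, position)
```

Under these assumptions the results agree with the 1239-base gene sequence and the centisome position of 76.23 listed in this record.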

GC content: 37.29

Gene sequence:

>1239_bases
ATGCGAGAAATGTATCCAAAAAAAGGTCGTGTTATTTTACATGTAGATATGAATTGTTTTTTCGCATCTGTTGAAATTGC
TCATGACTCATCATTACAAGGGAAGCCGTTAGCGGTTGCTGGAAATGAAAAAGAAAGAAAAGGAATTATCATAACATGTA
GTTATGAGGCGAGAGAATATGGAATACGTACAACGATGCCACTTTGGGAAGCGAAAAGGTTATGTCCGCAATTAATTGTA
AGGCGCCCTAATTTTACATTATATCGCGAAGCTTCATTTCAAATGTTTCAAATTCTTTCCCGTTTTACGGAAAAAATACA
ACCAGTCTCTATAGATGAGGGGTATTTAGATATTACAGACTGTTACGCACTTGGTTCACCTCTTGAAATAGCAAAGATGA
TTCAGCAGGCGTTATTAACAGAATTGCAGCTTCCGTGTAGCATCGGGATTGCTCCGAACCTTTTCCTAGCAAAAACAGCT
TCTGATATGAAAAAGCCTCTCGGTATTACCGTGCTCCGAAAACGAGATATTCCAGAAATGATTTGGCCACTTCCGGTTGG
AGCGATGCATGGAATTGGAGAAAAAACAGCTGAGAAATTGAATGATATTCATATACAAACCATTGAGCAGTTAGCAAAAG
GAAACGAACATATCATTCGCGCTAAAATTGGAAAGCATGGTGTTGATTTACAAAGGCGGGCAAAAGGTATGGATGATAGG
GAAGTTGATCCGAGTCAAATGGGTCAACATAAAAGTGTTGGTAACTCGATGACTTTTTCAAAGGATATGGATGAAGAGAA
AGAATTACTTGATATGTTACAAAGGCTATCAAAATCAGTGAGTAAAAGATTACAAAAGCGAACTCTTGTCAGCTATAATA
TTCAAATTATGATTAAATATCATGATAGGCGGACAGTAACGAGGAGTAAGCAATTGAAAAATGCCATTTGGGAAGAACGA
GATATTTTTCAAGCAGCCTCTCGTTTATGGAAGCAACATTGGGACGGTGATTCCGTTCGTTTATTAGGTGTTACAGCTAC
TGAAATAGAGTGGAAGACTGAATCGGTGAAACAATTAGATTTGTTTTCATTTGAAGAAGATGCGAAAGAAGAGCCGCTAC
TTGCTGTCATTGATCAAATTAATGATAAGTATGGAATGCCGCTTTTACAACGAGGTAGTCAATTATTACGTAAGCAAGAA
AAGTCTTTTCAGCAAAAATTAGAAAGTAAGTTTATGTAG
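
Because the gene lies on the reverse strand, the sequence above is the coding strand, i.e. the reverse complement of the chromosome's forward strand over the same interval. The following minimal sketch shows how the GC content field and a reverse complement could be computed. The helper names are hypothetical, and the demo uses only the first 12 bases of the gene rather than the full sequence.

```python
# Translation table that maps each base to its complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


prefix = "ATGCGAGAAATG"  # first 12 bases of the gene sequence
print(round(gc_content(prefix), 2))
print(reverse_complement(prefix))
```

Applying `gc_content` to the full 1239-base sequence is the intended use; the short prefix only keeps the example compact.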

Upstream 100 bases:

>100_bases
TAGCAATAAATATGATGGAAAAGAACAAGTAATAATTATAATTATTGTGCAAAATTTGCTATAATAAAAGAACGAACGTT
CTGATTAGGGGGTGAGTACT

Downstream 100 bases:

>100_bases
AAGGTGTCATGAAGCTGTACATATCATGACACTTTTTATTTTTATAAAAAACAAATTGTCGTTTTTGCACTACAAGCAAA
ACAATTATGTAATAACTCAA

Product: DNA polymerase IV

Products: NA

Alternate protein names: Pol IV [H]

Number of amino acids: Translated: 412; Mature: 412

Protein sequence:

>412_residues
MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGIIITCSYEAREYGIRTTMPLWEAKRLCPQLIV
RRPNFTLYREASFQMFQILSRFTEKIQPVSIDEGYLDITDCYALGSPLEIAKMIQQALLTELQLPCSIGIAPNLFLAKTA
SDMKKPLGITVLRKRDIPEMIWPLPVGAMHGIGEKTAEKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDR
EVDPSQMGQHKSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKYHDRRTVTRSKQLKNAIWEER
DIFQAASRLWKQHWDGDSVRLLGVTATEIEWKTESVKQLDLFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQE
KSFQQKLESKFM
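
The protein sequence follows from the gene sequence by reading codons from the first ATG to the stop codon. A minimal sketch of that translation is given below. It uses the amino acid assignments of the standard genetic code, which the bacterial code (NCBI table 11) shares; the `translate` helper and its `to_stop` parameter are hypothetical names for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA sequence in frame 1."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last three codons of the gene sequence in this record.
first_codons = "ATG CGA GAA ATG TAT CCA AAA AAA GGT CGT".replace(" ", "")
last_codons = "TTT ATG TAG".replace(" ", "")
print(translate(first_codons), translate(last_codons))
```

The two fragments translate to the first ten residues (MREMYPKKGR) and the last two residues (FM) of the protein sequence above; 1239 bases give 413 codons, of which the final one is the stop codon, leaving 412 residues.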

Sequences:

>Translated_412_residues
MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGIIITCSYEAREYGIRTTMPLWEAKRLCPQLIV
RRPNFTLYREASFQMFQILSRFTEKIQPVSIDEGYLDITDCYALGSPLEIAKMIQQALLTELQLPCSIGIAPNLFLAKTA
SDMKKPLGITVLRKRDIPEMIWPLPVGAMHGIGEKTAEKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDR
EVDPSQMGQHKSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKYHDRRTVTRSKQLKNAIWEER
DIFQAASRLWKQHWDGDSVRLLGVTATEIEWKTESVKQLDLFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQE
KSFQQKLESKFM
>Mature_412_residues
MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGIIITCSYEAREYGIRTTMPLWEAKRLCPQLIV
RRPNFTLYREASFQMFQILSRFTEKIQPVSIDEGYLDITDCYALGSPLEIAKMIQQALLTELQLPCSIGIAPNLFLAKTA
SDMKKPLGITVLRKRDIPEMIWPLPVGAMHGIGEKTAEKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDR
EVDPSQMGQHKSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKYHDRRTVTRSKQLKNAIWEER
DIFQAASRLWKQHWDGDSVRLLGVTATEIEWKTESVKQLDLFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQE
KSFQQKLESKFM

Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by Pol IV. Exhibits

COG id: COG0389

COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 umuC domain [H]

Homologues:

Organism=Homo sapiens, GI84043967, Length=333, Percent_Identity=28.2282282282282, Blast_Score=126, Evalue=4e-29
Organism=Homo sapiens, GI7706681, Length=334, Percent_Identity=28.1437125748503, Blast_Score=125, Evalue=5e-29
Organism=Homo sapiens, GI154350220, Length=330, Percent_Identity=27.2727272727273, Blast_Score=113, Evalue=4e-25
Organism=Homo sapiens, GI7705344, Length=111, Percent_Identity=40.5405405405405, Blast_Score=97, Evalue=2e-20
Organism=Homo sapiens, GI5729982, Length=124, Percent_Identity=41.1290322580645, Blast_Score=93, Evalue=5e-19
Organism=Escherichia coli, GI1786425, Length=351, Percent_Identity=35.3276353276353, Blast_Score=218, Evalue=7e-58
Organism=Escherichia coli, GI1787432, Length=417, Percent_Identity=24.7002398081535, Blast_Score=88, Evalue=8e-19
Organism=Caenorhabditis elegans, GI17537959, Length=300, Percent_Identity=24.6666666666667, Blast_Score=101, Evalue=9e-22
Organism=Caenorhabditis elegans, GI115534089, Length=122, Percent_Identity=35.2459016393443, Blast_Score=80, Evalue=1e-15
Organism=Caenorhabditis elegans, GI193205700, Length=115, Percent_Identity=31.304347826087, Blast_Score=75, Evalue=5e-14
Organism=Drosophila melanogaster, GI19923006, Length=352, Percent_Identity=28.9772727272727, Blast_Score=131, Evalue=8e-31
Organism=Drosophila melanogaster, GI21355641, Length=348, Percent_Identity=27.2988505747126, Blast_Score=107, Evalue=2e-23
Organism=Drosophila melanogaster, GI24644984, Length=348, Percent_Identity=27.2988505747126, Blast_Score=107, Evalue=2e-23
Organism=Drosophila melanogaster, GI24668444, Length=124, Percent_Identity=39.5161290322581, Blast_Score=87, Evalue=3e-17
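
Each homologue line is a comma-separated list of `key=value` pairs plus a bare GI identifier. As a rough sketch of how such a line could be turned into a typed record, the hypothetical `parse_homologue` helper below splits on commas, tolerates an optional trailing comma, and converts the numeric fields. It assumes organism names never contain commas.

```python
# Fields to convert from text, with the type assumed for each.
NUMERIC_FIELDS = {
    "Length": int,
    "Blast_Score": int,
    "Percent_Identity": float,
    "Evalue": float,
}


def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for item in line.split(","):
        item = item.strip()
        if not item:
            continue  # empty piece left by a trailing comma
        if "=" in item:
            key, value = item.split("=", 1)
            record[key] = value
        elif item.startswith("GI"):
            record["GI"] = item[2:]  # keep the identifier as text
    for key, cast in NUMERIC_FIELDS.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786425, Length=351, "
    "Percent_Identity=35.3276353276353, Blast_Score=218, Evalue=7e-58"
)
print(hit["Organism"], hit["Blast_Score"], hit["Evalue"])
```

Parsed records can then be sorted by `Blast_Score` or filtered by `Evalue`; in this list the Escherichia coli hit GI1786425 has both the highest score (218) and the lowest E-value (7e-58).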

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017962
- InterPro:   IPR017961
- InterPro:   IPR001126
- InterPro:   IPR017963
- InterPro:   IPR022880 [H]

Pfam domain/function: PF00817 IMS [H]

EC number: 2.7.7.7 [H]

Molecular weight: Translated: 47451; Mature: 47451

Theoretical pI: Translated: 9.37; Mature: 9.37

Prosite motif: PS50173 UMUC

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
4.1 %Met     (Translated Protein)
5.3 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
4.1 %Met     (Mature Protein)
5.3 %Cys+Met (Mature Protein)
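
The percentages above are simple residue counts divided by the protein length. The sketch below recomputes them from the 412-residue sequence given in this record; `residue_percent` is a hypothetical helper name, and rounding to one decimal place is assumed to match the record's display.

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGIIITCSYEAREYGIRTTMPLWEAKRLCPQLIV"
    "RRPNFTLYREASFQMFQILSRFTEKIQPVSIDEGYLDITDCYALGSPLEIAKMIQQALLTELQLPCSIGIAPNLFLAKTA"
    "SDMKKPLGITVLRKRDIPEMIWPLPVGAMHGIGEKTAEKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDR"
    "EVDPSQMGQHKSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKYHDRRTVTRSKQLKNAIWEER"
    "DIFQAASRLWKQHWDGDSVRLLGVTATEIEWKTESVKQLDLFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQE"
    "KSFQQKLESKFM"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


print(residue_percent(PROTEIN, "C"))   # cysteine
print(residue_percent(PROTEIN, "M"))   # methionine
print(residue_percent(PROTEIN, "CM"))  # cysteine plus methionine
```

The sequence contains 5 cysteines and 17 methionines, which gives the 1.2, 4.1 and 5.3 percent values listed above. The translated and mature figures coincide because both forms have the same 412 residues.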

Secondary structure:

>Translated Secondary Structure
MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGIIITCSYEAREY
CCCCCCCCCCEEEEEECCEEEEEEEEECCCCCCCCCEEEECCCCCCCCEEEEEECHHHHC
GIRTTMPLWEAKRLCPQLIVRRPNFTLYREASFQMFQILSRFTEKIQPVSIDEGYLDITD
CCCCCCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHCCCCCCCCCCEEHHH
CYALGSPLEIAKMIQQALLTELQLPCSIGIAPNLFLAKTASDMKKPLGITVLRKRDIPEM
HHHCCCHHHHHHHHHHHHHHHHCCCEECCCCCCHHHHHCHHHHHCCCCEEEEECCCCHHH
IWPLPVGAMHGIGEKTAEKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDR
HCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHCCCCCC
EVDPSQMGQHKSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKY
CCCHHHCCCHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEE
HDRRTVTRSKQLKNAIWEERDIFQAASRLWKQHWDGDSVRLLGVTATEIEWKTESVKQLD
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEEECHHHHHHHH
LFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQEKSFQQKLESKFM
HHCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGIIITCSYEAREY
CCCCCCCCCCEEEEEECCEEEEEEEEECCCCCCCCCEEEECCCCCCCCEEEEEECHHHHC
GIRTTMPLWEAKRLCPQLIVRRPNFTLYREASFQMFQILSRFTEKIQPVSIDEGYLDITD
CCCCCCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHCCCCCCCCCCEEHHH
CYALGSPLEIAKMIQQALLTELQLPCSIGIAPNLFLAKTASDMKKPLGITVLRKRDIPEM
HHHCCCHHHHHHHHHHHHHHHHCCCEECCCCCCHHHHHCHHHHHCCCCEEEEECCCCHHH
IWPLPVGAMHGIGEKTAEKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDR
HCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHCCCCCC
EVDPSQMGQHKSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKY
CCCHHHCCCHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEE
HDRRTVTRSKQLKNAIWEERDIFQAASRLWKQHWDGDSVRLLGVTATEIEWKTESVKQLD
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEEECHHHHHHHH
LFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQEKSFQQKLESKFM
HHCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12721630 [H]