Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

Click here to switch to the map view.

The map label for this gene is dinP [H]

Identifier: 29142919

GI number: 29142919

Start: 2612961

End: 2614016

Strand: Reverse

Name: dinP [H]

Synonym: t2537

Alternate gene names: 29142919

Gene position: 2614016-2612961 (Counterclockwise)

Preceding gene: 29142921

Following gene: 29142918

Centisome position: 54.55

GC content: 51.42

Gene sequence:

>1056_bases
ATGCGCAAAATCATACATGTTGATATGGACTGTTTTTTTGCCGCGGTAGAGATGCGTGATAATCCGGCGTTACGCGATAT
CCCCATTGCCATTGGCGGGAGTCGTGAGCGAAGAGGCGTTATCAGTACCGCCAACTATCCTGCGCGCCAGTTTGGCGTAC
GCAGCGCGATGCCGACTGCCATGGCGCTAAAGCTGTGCCCCCATCTCACCTTACTGCCTGGCCGCTTTGACGCCTATAAA
GAGGCTTCGCGGCATGTACGTGACATCTTTTCCCGCTACACTTCACTGATTGAACCGCTCTCGCTGGATGAAGCCTGGCT
GGATGTGACTGACAGCCCGCACTGTTATGGTTCGGCAACTCTGATCGCCAGGGAGATTCGTCAGACTATTTTCAATGAAC
TGCAACTCACCGCCTCCGCCGGCGTTGCGCCAGTGAAATTTCTGGCGAAGATCGCCTCCGATCTCAATAAACCGAACGGA
CAATATGTTATTACGCCAGCCGATGTACCCGACTTTCTTAAAACGTTGCCGTTAGCCAAAATTCCAGGCGTGGGGAAAGT
TTCCGCCGCCAAACTGGAAAATATGGGGTTAAGAACCTGCGGGGATATTCAACAGTGCGATCTGGCGATGCTTCTCAAAC
GCTTTGGTAAATTTGGCCGGGTATTATGGGAGCGTAGTCAGGGTATTGACGAGCGTGATGTTAACAGCGAAAGGCTGCGT
AAATCGGTAGGCGTTGAGCGTACTCTGGCGGAGGATATTCATGAATGGTCGGACTGCGAGGCCATTATTGAACATTTATA
TCCGGAACTGGAGCGTCGTCTGGCTATAGTAAAGCCCGATTTACTGATTGCCCGACAGGGCGTTAAATTAAAATTTAATG
ATTTTCAGCAAACCACTCAGGAACATGTCTGGCCGCAATTAAATAAAGAAGACTTAATTACCACCGCCCGTAAAACCTGG
GACGAGCGTCGCGGCGAGCGCGGCGTACGGTTGGTGGGGCTACACGTAACGCTGCTCGACCCTCAGTTGGAACGACAGTT
GGTGTTAGGGTTATAA

Upstream 100 bases:

>100_bases
TCGTCGTCATACCTTCGAATATGTCGCATTTTTAACGCGAATGTCTCTTTGCGCGCAATAAAACATACTGTACACTTAAA
CAGTATTGTGGGGGCGAGCG

Downstream 100 bases:

>100_bases
GTCAGGCTATTAAGATTTTATCACTATTACGATTGCGCCATCTAAATTTCAGTATTACTATACCCGCGTTTTCTCATTCT
CAATAACTGCAAAGGAGGTT

Product: DNA polymerase IV

Products: NA

Alternate protein names: Pol IV [H]

Number of amino acids: Translated: 351; Mature: 351

Protein sequence:

>351_residues
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARQFGVRSAMPTAMALKLCPHLTLLPGRFDAYK
EASRHVRDIFSRYTSLIEPLSLDEAWLDVTDSPHCYGSATLIAREIRQTIFNELQLTASAGVAPVKFLAKIASDLNKPNG
QYVITPADVPDFLKTLPLAKIPGVGKVSAAKLENMGLRTCGDIQQCDLAMLLKRFGKFGRVLWERSQGIDERDVNSERLR
KSVGVERTLAEDIHEWSDCEAIIEHLYPELERRLAIVKPDLLIARQGVKLKFNDFQQTTQEHVWPQLNKEDLITTARKTW
DERRGERGVRLVGLHVTLLDPQLERQLVLGL

Sequences:

>Translated_351_residues
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARQFGVRSAMPTAMALKLCPHLTLLPGRFDAYK
EASRHVRDIFSRYTSLIEPLSLDEAWLDVTDSPHCYGSATLIAREIRQTIFNELQLTASAGVAPVKFLAKIASDLNKPNG
QYVITPADVPDFLKTLPLAKIPGVGKVSAAKLENMGLRTCGDIQQCDLAMLLKRFGKFGRVLWERSQGIDERDVNSERLR
KSVGVERTLAEDIHEWSDCEAIIEHLYPELERRLAIVKPDLLIARQGVKLKFNDFQQTTQEHVWPQLNKEDLITTARKTW
DERRGERGVRLVGLHVTLLDPQLERQLVLGL
>Mature_351_residues
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARQFGVRSAMPTAMALKLCPHLTLLPGRFDAYK
EASRHVRDIFSRYTSLIEPLSLDEAWLDVTDSPHCYGSATLIAREIRQTIFNELQLTASAGVAPVKFLAKIASDLNKPNG
QYVITPADVPDFLKTLPLAKIPGVGKVSAAKLENMGLRTCGDIQQCDLAMLLKRFGKFGRVLWERSQGIDERDVNSERLR
KSVGVERTLAEDIHEWSDCEAIIEHLYPELERRLAIVKPDLLIARQGVKLKFNDFQQTTQEHVWPQLNKEDLITTARKTW
DERRGERGVRLVGLHVTLLDPQLERQLVLGL

Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by polIV. Exhibits

COG id: COG0389

COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair

Gene ontology:

Cell location: Cytoplasm [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 umuC domain [H]

Homologues:

Organism=Homo sapiens, GI84043967, Length=330, Percent_Identity=30.6060606060606, Blast_Score=166, Evalue=4e-41,
Organism=Homo sapiens, GI7705344, Length=355, Percent_Identity=32.9577464788732, Blast_Score=162, Evalue=5e-40,
Organism=Homo sapiens, GI7706681, Length=331, Percent_Identity=31.1178247734139, Blast_Score=156, Evalue=3e-38,
Organism=Homo sapiens, GI154350220, Length=251, Percent_Identity=32.2709163346614, Blast_Score=135, Evalue=7e-32,
Organism=Homo sapiens, GI5729982, Length=353, Percent_Identity=29.7450424929178, Blast_Score=115, Evalue=8e-26,
Organism=Escherichia coli, GI1786425, Length=351, Percent_Identity=91.1680911680912, Blast_Score=667, Evalue=0.0,
Organism=Escherichia coli, GI1787432, Length=217, Percent_Identity=25.3456221198157, Blast_Score=74, Evalue=1e-14,
Organism=Caenorhabditis elegans, GI193205700, Length=403, Percent_Identity=33.2506203473945, Blast_Score=176, Evalue=2e-44,
Organism=Caenorhabditis elegans, GI17537959, Length=286, Percent_Identity=28.3216783216783, Blast_Score=117, Evalue=1e-26,
Organism=Caenorhabditis elegans, GI193205702, Length=348, Percent_Identity=29.5977011494253, Blast_Score=109, Evalue=2e-24,
Organism=Caenorhabditis elegans, GI115534089, Length=163, Percent_Identity=35.5828220858896, Blast_Score=74, Evalue=1e-13,
Organism=Saccharomyces cerevisiae, GI6324921, Length=208, Percent_Identity=30.2884615384615, Blast_Score=84, Evalue=4e-17,
Organism=Drosophila melanogaster, GI19923006, Length=327, Percent_Identity=30.8868501529052, Blast_Score=165, Evalue=4e-41,
Organism=Drosophila melanogaster, GI21355641, Length=378, Percent_Identity=29.3650793650794, Blast_Score=117, Evalue=9e-27,
Organism=Drosophila melanogaster, GI24644984, Length=378, Percent_Identity=29.3650793650794, Blast_Score=117, Evalue=9e-27,
Organism=Drosophila melanogaster, GI24668444, Length=119, Percent_Identity=36.1344537815126, Blast_Score=77, Evalue=2e-14,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017962
- InterPro:   IPR017961
- InterPro:   IPR001126
- InterPro:   IPR017963
- InterPro:   IPR022880 [H]

Pfam domain/function: PF00817 IMS [H]

EC number: =2.7.7.7 [H]

Molecular weight: Translated: 39658; Mature: 39658

Theoretical pI: Translated: 8.37; Mature: 8.37

Prosite motif: PS50173 UMUC

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARQFGVRSAMPTA
CCCEEEECHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCEECCCCCHHHHHHHHCCHHH
MALKLCPHLTLLPGRFDAYKEASRHVRDIFSRYTSLIEPLSLDEAWLDVTDSPHCYGSAT
HHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCCHHH
LIAREIRQTIFNELQLTASAGVAPVKFLAKIASDLNKPNGQYVITPADVPDFLKTLPLAK
HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHCCCCC
IPGVGKVSAAKLENMGLRTCGDIQQCDLAMLLKRFGKFGRVLWERSQGIDERDVNSERLR
CCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH
KSVGVERTLAEDIHEWSDCEAIIEHLYPELERRLAIVKPDLLIARQGVKLKFNDFQQTTQ
HHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHEECCCCEEEEHHHHHHHH
EHVWPQLNKEDLITTARKTWDERRGERGVRLVGLHVTLLDPQLERQLVLGL
HHCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECHHHHHHHHCCC
>Mature Secondary Structure
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARQFGVRSAMPTA
CCCEEEECHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCEECCCCCHHHHHHHHCCHHH
MALKLCPHLTLLPGRFDAYKEASRHVRDIFSRYTSLIEPLSLDEAWLDVTDSPHCYGSAT
HHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCCHHH
LIAREIRQTIFNELQLTASAGVAPVKFLAKIASDLNKPNGQYVITPADVPDFLKTLPLAK
HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHCCCCC
IPGVGKVSAAKLENMGLRTCGDIQQCDLAMLLKRFGKFGRVLWERSQGIDERDVNSERLR
CCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH
KSVGVERTLAEDIHEWSDCEAIIEHLYPELERRLAIVKPDLLIARQGVKLKFNDFQQTTQ
HHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHEECCCCEEEEHHHHHHHH
EHVWPQLNKEDLITTARKTWDERRGERGVRLVGLHVTLLDPQLERQLVLGL
HHCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEECHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA