Definition | Streptococcus pyogenes M1 GAS chromosome, complete genome. |
---|---|
Accession | NC_002737 |
Length | 1,852,441 |
Click here to switch to the map view.
The map label for this gene is dinP [H]
Identifier: 15675671
GI number: 15675671
Start: 1531489
End: 1532583
Strand: Reverse
Name: dinP [H]
Synonym: SPy_1846
Alternate gene names: 15675671
Gene position: 1532583-1531489 (Counterclockwise)
Preceding gene: 15675673
Following gene: 15675666
Centisome position: 82.73
GC content: 41.55
Gene sequence:
>1095_bases ATGCTTATTTTTCCACTGATTAATGACACGTCACGAAAAATCATCCATATTGACATGGATGCCTTTTTTGCTGCAGTTGA GGAAAGGGATAACCCTGCTTTAAAAGGAAAGCCTGTTGTGATTGGGAAAGATCCAAGAGAAACAGGTGGTCGCGGAGTTG TTTCCACTTGTAATTACGAAGCGAGAAAATATGGCATTCATTCGGCCATGAGCTCTAAGGAAGCTTATGAGCGTTGTCCC AAAGCCATTTTTATTTCAGGAAATTATGAAAAGTATCGAACAGTTGGAGACCAGATCCGCCGTATTTTTAAGCGTTATAC TGATGTGGTAGAGCCTATGTCCATTGACGAGGCTTACCTTGATGTGACTGATAATAAGTTGGGGATTAAGTCAGCCGTCA AAATAGCCAAGCTGATTCAGCATGATATCTGGAAAGAAGTAGGATTGACCTGTTCAGCAGGTGTGTCTTATAACAAATTT TTGGCTAAATTGGCTAGTGATTTTGAAAAACCTCATGGCCTCACTCTAGTCTTGAAAGAAGATGCCCTGTGCTTTTTAGC CAAACTCCCCATTGAAAAGTTTCATGGTGTTGGTAAAAAATCAGTTAAAAAACTGCATGACATGGGGATTTATACAGGAC AGGATTTGTTGGCAGTTCCTGAAATGACCTTGATTGATCATTTTGGTCGGTTTGGTTTTGACCTTTACCGTAAAGCGAGA GGCATCAGCAATTCTCCTGTCAAGTATGATCGGATACGCAAGTCAATTGGCAGTGAGAGAACCTACGCTAAACTGCTTTA TCAAGAAACAGACATCAAGGCAGAGATCAGTAAAAATGTTAAGCGCGTGGCCGCTCTCTTACAAGACCATAAAAAGTTAG GCAAGACCATTGTGCTCAAAGTGCGTTATGCTGATTTTACCACCTTGACAAAACGTGTCACCTTGCCAGAATTAACCAGA AATGCCGCACAAATTGAGCAAGTAGCTGGGGATATTTTTGACAGCTTAAGCGAAAATCCTGCTGGTATCCGCCTGTTAGG GGTAACCATGACCAATTTGGAGGATAAGGTAGCTGATATTTCCTTGGACCTATAG
Upstream 100 bases:
>100_bases CTTTTATAATACAGATTGTTGAAACAGTCACAAACTGTATTAGAATGTATTTTTTTGGCTTTAGATAGCCTAAAAGGTAT AATAGAGAAAGGAGGGACTT
Downstream 100 bases:
>100_bases GAGAAAGAGTGCTCTATCCGTCCGCCTCATTTTGGAGAAAAAAGAAGCTGATTAGGGATTTCCCTAATCAGCTTCTTTGG TCCATTATTTTATTACAGCA
Product: DNA polymerase IV
Products: NA
Alternate protein names: Pol IV [H]
Number of amino acids: Translated: 364; Mature: 364
Protein sequence:
>364_residues MLIFPLINDTSRKIIHIDMDAFFAAVEERDNPALKGKPVVIGKDPRETGGRGVVSTCNYEARKYGIHSAMSSKEAYERCP KAIFISGNYEKYRTVGDQIRRIFKRYTDVVEPMSIDEAYLDVTDNKLGIKSAVKIAKLIQHDIWKEVGLTCSAGVSYNKF LAKLASDFEKPHGLTLVLKEDALCFLAKLPIEKFHGVGKKSVKKLHDMGIYTGQDLLAVPEMTLIDHFGRFGFDLYRKAR GISNSPVKYDRIRKSIGSERTYAKLLYQETDIKAEISKNVKRVAALLQDHKKLGKTIVLKVRYADFTTLTKRVTLPELTR NAAQIEQVAGDIFDSLSENPAGIRLLGVTMTNLEDKVADISLDL
Sequences:
>Translated_364_residues MLIFPLINDTSRKIIHIDMDAFFAAVEERDNPALKGKPVVIGKDPRETGGRGVVSTCNYEARKYGIHSAMSSKEAYERCP KAIFISGNYEKYRTVGDQIRRIFKRYTDVVEPMSIDEAYLDVTDNKLGIKSAVKIAKLIQHDIWKEVGLTCSAGVSYNKF LAKLASDFEKPHGLTLVLKEDALCFLAKLPIEKFHGVGKKSVKKLHDMGIYTGQDLLAVPEMTLIDHFGRFGFDLYRKAR GISNSPVKYDRIRKSIGSERTYAKLLYQETDIKAEISKNVKRVAALLQDHKKLGKTIVLKVRYADFTTLTKRVTLPELTR NAAQIEQVAGDIFDSLSENPAGIRLLGVTMTNLEDKVADISLDL >Mature_364_residues MLIFPLINDTSRKIIHIDMDAFFAAVEERDNPALKGKPVVIGKDPRETGGRGVVSTCNYEARKYGIHSAMSSKEAYERCP KAIFISGNYEKYRTVGDQIRRIFKRYTDVVEPMSIDEAYLDVTDNKLGIKSAVKIAKLIQHDIWKEVGLTCSAGVSYNKF LAKLASDFEKPHGLTLVLKEDALCFLAKLPIEKFHGVGKKSVKKLHDMGIYTGQDLLAVPEMTLIDHFGRFGFDLYRKAR GISNSPVKYDRIRKSIGSERTYAKLLYQETDIKAEISKNVKRVAALLQDHKKLGKTIVLKVRYADFTTLTKRVTLPELTR NAAQIEQVAGDIFDSLSENPAGIRLLGVTMTNLEDKVADISLDL
Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by polIV. Exhibits
COG id: COG0389
COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair
Gene ontology:
Cell location: Cytoplasm (Probable) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 umuC domain [H]
Homologues:
Organism=Homo sapiens, GI154350220, Length=343, Percent_Identity=28.5714285714286, Blast_Score=126, Evalue=4e-29, Organism=Homo sapiens, GI84043967, Length=405, Percent_Identity=25.679012345679, Blast_Score=122, Evalue=7e-28, Organism=Homo sapiens, GI7706681, Length=406, Percent_Identity=25.615763546798, Blast_Score=121, Evalue=8e-28, Organism=Homo sapiens, GI5729982, Length=321, Percent_Identity=27.1028037383178, Blast_Score=107, Evalue=2e-23, Organism=Homo sapiens, GI7705344, Length=113, Percent_Identity=44.2477876106195, Blast_Score=106, Evalue=3e-23, Organism=Escherichia coli, GI1786425, Length=349, Percent_Identity=40.6876790830946, Blast_Score=244, Evalue=8e-66, Organism=Escherichia coli, GI1787432, Length=303, Percent_Identity=25.0825082508251, Blast_Score=84, Evalue=1e-17, Organism=Caenorhabditis elegans, GI193205700, Length=419, Percent_Identity=29.1169451073986, Blast_Score=143, Evalue=1e-34, Organism=Caenorhabditis elegans, GI17537959, Length=390, Percent_Identity=25.6410256410256, Blast_Score=100, Evalue=2e-21, Organism=Caenorhabditis elegans, GI193205702, Length=349, Percent_Identity=26.647564469914, Blast_Score=91, Evalue=1e-18, Organism=Caenorhabditis elegans, GI115534089, Length=123, Percent_Identity=36.5853658536585, Blast_Score=82, Evalue=4e-16, Organism=Saccharomyces cerevisiae, GI6324921, Length=208, Percent_Identity=26.9230769230769, Blast_Score=69, Evalue=1e-12, Organism=Drosophila melanogaster, GI19923006, Length=415, Percent_Identity=24.578313253012, Blast_Score=124, Evalue=8e-29, Organism=Drosophila melanogaster, GI21355641, Length=337, Percent_Identity=26.7062314540059, Blast_Score=107, Evalue=1e-23, Organism=Drosophila melanogaster, GI24644984, Length=337, Percent_Identity=26.7062314540059, Blast_Score=107, Evalue=1e-23, Organism=Drosophila melanogaster, GI24668444, Length=127, Percent_Identity=32.2834645669291, Blast_Score=80, Evalue=2e-15,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR017962 - InterPro: IPR017961 - InterPro: IPR001126 - InterPro: IPR017963 - InterPro: IPR022880 [H]
Pfam domain/function: PF00817 IMS [H]
EC number: =2.7.7.7 [H]
Molecular weight: Translated: 40865; Mature: 40865
Theoretical pI: Translated: 9.67; Mature: 9.67
Prosite motif: PS50173 UMUC
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 3.0 %Cys+Met (Translated Protein) 1.1 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLIFPLINDTSRKIIHIDMDAFFAAVEERDNPALKGKPVVIGKDPRETGGRGVVSTCNYE CEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCCEEECCCH ARKYGIHSAMSSKEAYERCPKAIFISGNYEKYRTVGDQIRRIFKRYTDVVEPMSIDEAYL HHHHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHE DVTDNKLGIKSAVKIAKLIQHDIWKEVGLTCSAGVSYNKFLAKLASDFEKPHGLTLVLKE EECCCCCCHHHHHHHHHHHHHHHHHHHCCEEECCCCHHHHHHHHHHHHCCCCCEEEEECC DALCFLAKLPIEKFHGVGKKSVKKLHDMGIYTGQDLLAVPEMTLIDHFGRFGFDLYRKAR CCHHHHHHCCHHHHHCCCHHHHHHHHHCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHHC GISNSPVKYDRIRKSIGSERTYAKLLYQETDIKAEISKNVKRVAALLQDHKKLGKTIVLK CCCCCCCHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEE VRYADFTTLTKRVTLPELTRNAAQIEQVAGDIFDSLSENPAGIRLLGVTMTNLEDKVADI EEECCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCHHHHHHHC SLDL EECC >Mature Secondary Structure MLIFPLINDTSRKIIHIDMDAFFAAVEERDNPALKGKPVVIGKDPRETGGRGVVSTCNYE CEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCCEEECCCH ARKYGIHSAMSSKEAYERCPKAIFISGNYEKYRTVGDQIRRIFKRYTDVVEPMSIDEAYL HHHHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHE DVTDNKLGIKSAVKIAKLIQHDIWKEVGLTCSAGVSYNKFLAKLASDFEKPHGLTLVLKE EECCCCCCHHHHHHHHHHHHHHHHHHHCCEEECCCCHHHHHHHHHHHHCCCCCEEEEECC DALCFLAKLPIEKFHGVGKKSVKKLHDMGIYTGQDLLAVPEMTLIDHFGRFGFDLYRKAR CCHHHHHHCCHHHHHCCCHHHHHHHHHCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHHC GISNSPVKYDRIRKSIGSERTYAKLLYQETDIKAEISKNVKRVAALLQDHKKLGKTIVLK CCCCCCCHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEE VRYADFTTLTKRVTLPELTRNAAQIEQVAGDIFDSLSENPAGIRLLGVTMTNLEDKVADI EEECCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCHHHHHHHC SLDL EECC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12122206; 12799345 [H]