Definition | Hyphomonas neptunium ATCC 15444 chromosome, complete genome. |
---|---|
Accession | NC_008358 |
Length | 3,705,021 |
The map label for this gene is dinB [H]
Identifier: 114798874
GI number: 114798874
Start: 2387408
End: 2388682
Strand: Direct
Name: dinB [H]
Synonym: HNE_2286
Alternate gene names: 114798874
Gene position: 2387408-2388682 (Clockwise)
Preceding gene: 114798587
Following gene: 114798769
Centisome position: 64.44
GC content: 63.69
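The positional fields above are linked by simple arithmetic, which the following minimal sketch reproduces. It assumes that coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. That second assumption is inferred from the numbers in this record, not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_705_021   # Length
GENE_START = 2_387_408      # Start
GENE_END = 2_388_682        # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length.

    Assumption: the record derives the centisome position from the
    start coordinate; this is inferred, not documented.
    """
    return round(100.0 * start / genome_length, 2)


print(gene_length(GENE_START, GENE_END))      # 1275
print(centisome(GENE_START, GENOME_LENGTH))   # 64.44
```

Both results agree with the record: the gene sequence is labelled 1275 bases and the centisome position is given as 64.44.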
Gene sequence:
>1275_bases ATGTCGAGTCTTTGCCGCGATTGCCAGTACAGGTCCGAGGCGGAACTCACCGCCTGCCCCGCCTGTGGCAGCCGCAGGAT CGTCTCTCACGCCGCCCTTTTCAGCCTGACAATCGCGCATGTGGACTGTGACGCCTTCTACGCTTCTGTAGAAAAGCGAG ATAATCCAACGCTTAAGGACAAGCCCCTGATCGTGGGCGGAGGCCAGCGGGGCGTGGTGACGACCTGCTGCTACATTGCC CGGCTTTATGGCGTGCGCTCGGCCATGCCCATGTTCAAGGCACTGAAAGCCTGCCCGGACGCGGTTGTGGTGCCGCCAGA CTTCCGGAAATACCGGGAAGCGGCCAAAGCGATTCGCGCCGAAATGGACCATCTCACGCCCCTCGTGCAGCAGGTTTCCA TCGATGAGGCCTATCTGGACCTGTCCGGAACGGACCGCCTCCACGGCGCCCCGCCGGCTGCCAGCCTCGCCCGGCTGGCC AAAAACGTGGAACGGGAAGTGGGCGTGACGGTCTCCGTCGGCCTCTCCTCCAACAAATTCCTCGCCAAGACGGCCAGCGA ACTCGACAAGCCGCGCGGCTTTGCCATCATCGCGCCGGAGGAGGCCGAAGCCTTCCTGGCACCCCATCCGGTCGGCTTCC TGCACGGGGTTGGCCCGAAATTCGCCGAGTCGCTTAACAAGGACGGCTTCTACACCATTGAGGACATCCAGAAGGCGGAC CTTAAAAGCCTGATCCGGCGCTATGGCGAAACCGGCGACTGGCTCAAACGCTGCGCCCATGGCCGAGACAACCGGAAAGT GAACCCGCATGAAGACCGCAAATCGGTCTCCAGCGAGACGACCTTCTTCGAGGACACTGCCGACATCGGCATTCTGGAGG ATCACCTCTGGCGCCTCAGCGTGAAAACCGCTGACCGGGCCAAGGCCGAAGGCGTCTCAGGCCGCGTGGTGACGCTGAAA CTGAAAACCTCAGACTTCCACCCGCTGACCCGCCGCCTGTCCCTCACCGAGCCGACACAGCTGGCCCAGGTGGTGTTCCG GGCGTCCCGGCCCCTGCTGCTGAAGGAAGCCACCGGCCGCGTGAAATACCGGCTGATCGGCGTTGGCCTGTCAGACCTGT CGGACTTCCGCGCTGATGGCACCGATCTCATCGACCCCAAAGTCGCCAAACGCGCCGCCGCCGAACGCGCCTCCGACCAG GCCCGCGCCAAGTTCGGCACCGGCGCTGTCGTGACCGGACGAGCTGCAAAATACATCCGGAAACCTGAGGGCTGA
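The GC content field can be recomputed from a sequence such as the one above. The sketch below is one straightforward way to do it, assuming GC content means the percentage of G and C bases over all bases, with whitespace between sequence blocks ignored. The function name `gc_content` is hypothetical, and the short test sequences are made up for illustration.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (for example the spaces between 80-base blocks) is
    removed before counting, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# Illustrative sequences only, not taken from the record.
print(gc_content("ATGC"))      # 50.0
print(gc_content("GGCC AT"))   # 66.67
```

Passing the full 1275-base gene sequence (with or without the block spaces) to the same function is how the reported value could be checked.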
Upstream 100 bases:
>100_bases GCGCCAGCTTCAGCTATACTGCAAATTATTAACGCGCCTTTACGCCTGCCGCAGGGCGTGTTTGCGGCGACGGCGGGAAC ATGGCAGGAACACGTCTGTC
Downstream 100 bases:
>100_bases CTCTGCCTTTGCGGCAGGTTAGGACGGGCGCGAAGACTTTACAGGAGCTGCCCCCATGTCCACGCCCGAAGCCCGCCTTG AAGCCCTTGGCATCGTTCTG
Product: DNA polymerase IV
Products: NA
Alternate protein names: Pol IV [H]
Number of amino acids: Translated: 424; Mature: 423
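The two residue counts follow from the gene length. As a rough sketch: a 1275-base coding sequence holds 425 codons, the last of which is the stop codon (the gene sequence above ends in TGA), leaving 424 translated residues. The mature count of 423 is consistent with removal of the initiator methionine, which matches the mature sequence listed in this record starting at the second residue. The helper name below is hypothetical.

```python
from typing import Tuple


def protein_lengths(cds_bases: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumptions: the coding sequence includes one terminal stop codon,
    and the mature protein differs only by loss of the initiator Met.
    """
    if cds_bases <= 0 or cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator methionine removed
    return translated, mature


print(protein_lengths(1275))   # (424, 423)
```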
Protein sequence:
>424_residues MSSLCRDCQYRSEAELTACPACGSRRIVSHAALFSLTIAHVDCDAFYASVEKRDNPTLKDKPLIVGGGQRGVVTTCCYIA RLYGVRSAMPMFKALKACPDAVVVPPDFRKYREAAKAIRAEMDHLTPLVQQVSIDEAYLDLSGTDRLHGAPPAASLARLA KNVEREVGVTVSVGLSSNKFLAKTASELDKPRGFAIIAPEEAEAFLAPHPVGFLHGVGPKFAESLNKDGFYTIEDIQKAD LKSLIRRYGETGDWLKRCAHGRDNRKVNPHEDRKSVSSETTFFEDTADIGILEDHLWRLSVKTADRAKAEGVSGRVVTLK LKTSDFHPLTRRLSLTEPTQLAQVVFRASRPLLLKEATGRVKYRLIGVGLSDLSDFRADGTDLIDPKVAKRAAAERASDQ ARAKFGTGAVVTGRAAKYIRKPEG
Sequences:
>Translated_424_residues MSSLCRDCQYRSEAELTACPACGSRRIVSHAALFSLTIAHVDCDAFYASVEKRDNPTLKDKPLIVGGGQRGVVTTCCYIA RLYGVRSAMPMFKALKACPDAVVVPPDFRKYREAAKAIRAEMDHLTPLVQQVSIDEAYLDLSGTDRLHGAPPAASLARLA KNVEREVGVTVSVGLSSNKFLAKTASELDKPRGFAIIAPEEAEAFLAPHPVGFLHGVGPKFAESLNKDGFYTIEDIQKAD LKSLIRRYGETGDWLKRCAHGRDNRKVNPHEDRKSVSSETTFFEDTADIGILEDHLWRLSVKTADRAKAEGVSGRVVTLK LKTSDFHPLTRRLSLTEPTQLAQVVFRASRPLLLKEATGRVKYRLIGVGLSDLSDFRADGTDLIDPKVAKRAAAERASDQ ARAKFGTGAVVTGRAAKYIRKPEG
>Mature_423_residues SSLCRDCQYRSEAELTACPACGSRRIVSHAALFSLTIAHVDCDAFYASVEKRDNPTLKDKPLIVGGGQRGVVTTCCYIAR LYGVRSAMPMFKALKACPDAVVVPPDFRKYREAAKAIRAEMDHLTPLVQQVSIDEAYLDLSGTDRLHGAPPAASLARLAK NVEREVGVTVSVGLSSNKFLAKTASELDKPRGFAIIAPEEAEAFLAPHPVGFLHGVGPKFAESLNKDGFYTIEDIQKADL KSLIRRYGETGDWLKRCAHGRDNRKVNPHEDRKSVSSETTFFEDTADIGILEDHLWRLSVKTADRAKAEGVSGRVVTLKL KTSDFHPLTRRLSLTEPTQLAQVVFRASRPLLLKEATGRVKYRLIGVGLSDLSDFRADGTDLIDPKVAKRAAAERASDQA RAKFGTGAVVTGRAAKYIRKPEG
Specific function: Poorly processive, error-prone DNA polymerase involved in untargeted mutagenesis. Copies undamaged DNA at stalled replication forks, which arise in vivo from mismatched or misaligned primer ends. These misaligned primers can be extended by polIV. Exhibits
COG id: COG0389
COG function: function code L; Nucleotidyltransferase/DNA polymerase involved in DNA repair
Gene ontology:
Cell location: Cytoplasm (Probable) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 umuC domain [H]
Homologues:
Organism | GI | Length | Percent_Identity | Blast_Score | Evalue |
---|---|---|---|---|---|
Homo sapiens | 84043967 | 338 | 27.810650887574 | 114 | 1e-25 |
Homo sapiens | 7705344 | 109 | 47.7064220183486 | 111 | 1e-24 |
Homo sapiens | 7706681 | 339 | 27.4336283185841 | 110 | 2e-24 |
Homo sapiens | 154350220 | 287 | 29.616724738676 | 109 | 6e-24 |
Homo sapiens | 5729982 | 138 | 34.0579710144928 | 69 | 6e-12 |
Escherichia coli | 1786425 | 251 | 42.6294820717131 | 189 | 3e-49 |
Escherichia coli | 1787432 | 393 | 23.6641221374046 | 79 | 5e-16 |
Caenorhabditis elegans | 193205700 | 393 | 30.2798982188295 | 127 | 9e-30 |
Caenorhabditis elegans | 193205702 | 345 | 28.4057971014493 | 85 | 7e-17 |
Caenorhabditis elegans | 17537959 | 196 | 27.5510204081633 | 80 | 1e-15 |
Caenorhabditis elegans | 115534089 | 121 | 34.7107438016529 | 69 | 4e-12 |
Saccharomyces cerevisiae | 6324921 | 221 | 27.1493212669683 | 75 | 2e-14 |
Drosophila melanogaster | 21355641 | 282 | 30.8510638297872 | 116 | 2e-26 |
Drosophila melanogaster | 24644984 | 282 | 30.8510638297872 | 116 | 2e-26 |
Drosophila melanogaster | 19923006 | 347 | 25.6484149855908 | 105 | 7e-23 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR017962
- InterPro: IPR017961
- InterPro: IPR001126
- InterPro: IPR017963
- InterPro: IPR022880 [H]
Pfam domain/function: PF00817 IMS [H]
EC number: 2.7.7.7 [H]
Molecular weight: Translated: 46381; Mature: 46249
Theoretical pI: Translated: 9.44; Mature: 9.44
Prosite motif: PS50173 UMUC
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
0.7 %Met (Mature Protein)
2.8 %Cys+Met (Mature Protein)
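The Cys/Met percentages above are plain residue frequencies. The following sketch shows one way to compute such a value, assuming the percentage is the count of the chosen residues divided by the total residue count and rounded to one decimal place. The function name `residue_percent` and the toy sequence are illustrative, not part of the record.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues.

    Whitespace between sequence blocks is ignored and matching is
    case-insensitive. The result is rounded to one decimal place.
    """
    protein = "".join(seq.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    hits = sum(1 for aa in protein if aa in wanted)
    return round(100.0 * hits / len(protein), 1)


# Toy sequence for illustration: 5 residues, 2 Cys and 1 Met.
toy = "MACCK"
print(residue_percent(toy, "C"))    # 40.0
print(residue_percent(toy, "M"))    # 20.0
print(residue_percent(toy, "CM"))   # 60.0
```

The protein sequence listed in this record contains 9 Cys and 4 Met in 424 residues, which gives 2.1, 0.9 and 3.1 percent. Dropping the initiator Met leaves 9 Cys and 3 Met in 423 residues, which gives 2.1, 0.7 and 2.8 percent, in agreement with the values reported for the mature protein.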
Secondary structure:
>Translated Secondary Structure
MSSLCRDCQYRSEAELTACPACGSRRIVSHAALFSLTIAHVDCDAFYASVEKRDNPTLKD
CCHHHHHHCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC
KPLIVGGGQRGVVTTCCYIARLYGVRSAMPMFKALKACPDAVVVPPDFRKYREAAKAIRA
CCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHH
EMDHLTPLVQQVSIDEAYLDLSGTDRLHGAPPAASLARLAKNVEREVGVTVSVGLSSNKF
HHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCH
LAKTASELDKPRGFAIIAPEEAEAFLAPHPVGFLHGVGPKFAESLNKDGFYTIEDIQKAD
HHHHHHHHCCCCCEEEECCCCCCCEECCCCCHHHHCCCHHHHHHCCCCCCEEHHHHHHHH
LKSLIRRYGETGDWLKRCAHGRDNRKVNPHEDRKSVSSETTFFEDTADIGILEDHLWRLS
HHHHHHHHCCCHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCEECCCCCCCHHHHHHEEEE
VKTADRAKAEGVSGRVVTLKLKTSDFHPLTRRLSLTEPTQLAQVVFRASRPLLLKEATGR
ECCCHHHHHCCCCCEEEEEEEECCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEECCCCC
VKYRLIGVGLSDLSDFRADGTDLIDPKVAKRAAAERASDQARAKFGTGAVVTGRAAKYIR
EEEEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHC
KPEG
CCCC
>Mature Secondary Structure
SSLCRDCQYRSEAELTACPACGSRRIVSHAALFSLTIAHVDCDAFYASVEKRDNPTLKD
CHHHHHHCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC
KPLIVGGGQRGVVTTCCYIARLYGVRSAMPMFKALKACPDAVVVPPDFRKYREAAKAIRA
CCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHH
EMDHLTPLVQQVSIDEAYLDLSGTDRLHGAPPAASLARLAKNVEREVGVTVSVGLSSNKF
HHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCH
LAKTASELDKPRGFAIIAPEEAEAFLAPHPVGFLHGVGPKFAESLNKDGFYTIEDIQKAD
HHHHHHHHCCCCCEEEECCCCCCCEECCCCCHHHHCCCHHHHHHCCCCCCEEHHHHHHHH
LKSLIRRYGETGDWLKRCAHGRDNRKVNPHEDRKSVSSETTFFEDTADIGILEDHLWRLS
HHHHHHHHCCCHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCEECCCCCCCHHHHHHEEEE
VKTADRAKAEGVSGRVVTLKLKTSDFHPLTRRLSLTEPTQLAQVVFRASRPLLLKEATGR
ECCCHHHHHCCCCCEEEEEEEECCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEECCCCC
VKYRLIGVGLSDLSDFRADGTDLIDPKVAKRAAAERASDQARAKFGTGAVVTGRAAKYIR
EEEEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHC
KPEG
CCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11259647 [H]