Definition | Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome. |
---|---|
Accession | NC_009972 |
Length | 6,346,587 |
The map label for this gene is dinG [H]
Identifier: 159899742
GI number: 159899742
Start: 4077728
End: 4080511
Strand: Direct
Name: dinG [H]
Synonym: Haur_3224
Alternate gene names: 159899742
Gene position: 4077728-4080511 (Clockwise)
Preceding gene: 159899737
Following gene: 159899744
Centisome position: 64.25
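The coordinate fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the chromosome length. That definition is an assumption that happens to reproduce the listed value; the record does not state it. The function names are illustrative only.

```python
# Values copied from the record (Length, Start, End).
GENOME_LENGTH = 6_346_587
GENE_START = 4_077_728
GENE_END = 4_080_511


def gene_length(start, end):
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start, genome_length):
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))               # 2784
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 64.25
```

The computed length of 2784 matches the `>2784_bases` header of the gene sequence.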
GC content: 51.04
Gene sequence:
>2784_bases GTGGAACCGATATATATCGCCTTGGATTTGGAAACAACTGGCTTAGAACCAGGGCGCGATGAGATTATTGAAGTTGGGGC AGTTAAATTTCGAGGCAATGAGGTTCTCGAAACCTACCAAACCTTGGTCAAACCCAAACAGGTGCTGCCGATTAAAATCG CCCGTTTAACAGGCATCGATGCCCATGAATTGACCACGGCTCCTACATTTAATAGTATTGGTGGCCAATTAGCCAAATTT CTCAAAAGCTACCCCATTATTGGCCATTCGGTCGATAACGATTTGCGCTTTTTGCAACAACAAGGGCTAAAAGTTACCCA GCCGCATTACGATACCTTTGATCTGGCGACGCTGTTGATTCCCCAATTGCCCAATTACTCGCTTTCAACGATTGCCGAAC ATTTGCAAATTCAACACCCTGATGCCCACCGCGCCTTGGCCGATGCCGAGGCCAGTCGCTTGGTGTTTAGCGCATTGCTC GATAAATTAGCCGAATTATCGGCTGCCGAATTGCATAGCATCGCTCAAACCACCCAAAAATTGCAGTGGCCGCTGGCCAA ACTATTTGGCGAAATTGCCAAACGCCGCGTGCAAACGCTCTGGCAAGCGCCGATCGAATTTCAACCGAAACCACTGGTGC GCCCGGTGGCCCTCGAACCAACCGGCAATCAGCAAGAACTCGATGCCCAAGCAATTGGCGCGATGTTTGGCGCAGATGGT GGGTTTAGCCGCATGTTCCCAGGCTATGAGCCACGTCAGCCCCAAATTGAGATGACCGAGGCAATCGCCGAAGCACTCAA TCAAGGCGATACCTTGATGATCGAAGCGCCAACTGGCACTGGCAAAAGTTTGGCTTACCTCGTGCCCGCTGCTCAATGGG CACGCCAGCGCGGCGAACGAGTGGTCATCTCAACCAACACGATCAATCTCCAAGATCAGCTTTGCTCCAAAGATATTCCT ACCGTGCAAGCGTTGTTGGCCGAGCAACCCGACCAATGGCCCGCTTTGCGAGCGGTGCAACTCAAAGGCCGCAGCAATTA CCTATGTTTGAAGCGCTATGAATCCTTTCGTGCCCACCCCGACCACAACGAAGATCAGACCCGTGGGTTGTTGAAGCTCC AACTTTGGCTGCCCTCGACCAACAGCGGCGACCGCGCTGAATTGATGTTGATTCAAGGCGAGCAGCAAGTTTGGAACAAT GTTAATGTTGATCCCGACCAATGTTTGCGCCAACGCTGCTCGCTCTACAACGAATGTTTCTTCTTCAAAGCCCGCGCTGA AGCTGAAAATGCCCATATCGTGGTGGCGAACCATGCCTTGTTGATGTCGGATGTCAAATCGCCTGGGATTTTGCCACGCT ACGATCATTTGATCATCGACGAAGCGCATAATCTCGAAGATGTGGCGACTGATCAGTTGGGCTTTACGATTTCACAACAT AGCCTAACTGGTTTGCTCAATGATATGCATAGCGCTGGCGGTGTGCGTTTGGCGGGTGGCGTGCTCAACGAATGGAGCCA AATCTTCCGTTTGAGCACCGTTGATCATAAAGAGCAGCGCAAACTCGAAGATCTCAGCGCCGATTTGCGACCAAATGTTG ATAAAGCCCGCGAAGCGGCCCAGCAATTATTCAGCATTTTCAACGATATTATGGCCAAAGATCGCAGTGTGACCCAATAC GATCCCCAATTGCGGATCACCAGCAAAGTGCGTCGCCACACCGAATGGACTCAAGTTGAGCAAACATGGGAAAACTTGAG CATCAATTTGCGCAAGCTGGGCGATGGCTTTGGCAAGCTCCAAGCAATTTTGGATAATCTCGAAGGCCGCGATATCAATG GCTACGATGATTTGGTGATGCGGGTCAAGGGTATGGTCAATGCTTGCACTGAATTACAACGCCAATTTGATGTGGTAATT 
TATGGCAATGAAGAAACCGTCGCATGGCTGACTGCCGATCAACGTCGCCGCGAATTGTTGGTGCAGGCTGCGCCAATTCA TGTTGGGCCATTGCTCACTGAAGATTTATGGCTGAAAAAACGCGCCAGCATCTTGGTTTCGGCGACGCTTTCGGTCAGCA ACAGCTTCGATTACCCCAAACAGCGCTTGGGCTTGGACGAAGCCACGACGATGCAACTCGATTCGCCCTTCGATTACAGC AAATCAACCTTAATCTATTTGCCAACCGATATGCCCGAACCCAACGAGCGCAATTATCAACGGGCCATGGAAGATGCCCT GATCAATTTATGCAAAGCGACTGGCGGGCGCACTTTGGCACTGTTTACCGCCAATGCCTCGCTGAAACAAACCTATCATG GCATTAGCGAAAGCCTTGAGCAAGCCGATATTTCGACCTTGGCCCAAGGCATGGATGGCTCACGCCGCTCGTTGATCCAG CGCTTCAAATCTGACCCACGCACGGTTTTGTTGGGCACAGCCTCGTTTTGGGAAGGTGTTGATGTGGTTGGCGATGCTTT GAGCGTACTGGTGATTACCAAATTGCCCTTTAGCGTGCCAAATGATCCGGTGTTTTCGGCGCGATCTGAGGGCTTTGATG ATGCTTTTGCTGAATATTCAGTGCCGCAGGCAATTTTGCGTTTCAAGCAAGGCTTTGGCCGCTTGATTCGCTCCAAAGAT GATCGCGGGATTGTGGTGGTGCTTGATCGGCGCTTGCTTAGCAAAAATTATGGGCGGCAATTCCTCGAATCATTGCCCGA TTGCACGATTCAACGCAAGCCGCTCGCCGAATTGGCAACAACGGCTGCTCGTTGGTTGGTTTAA
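The GC content field reports the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming this page's layout in which a `>` header token and space-separated sequence blocks share one record. `gc_percent` is a hypothetical helper, and the short input is just the first ten bases of the gene, not the full sequence.

```python
def gc_percent(record):
    """Return the percentage of G and C bases in a FASTA-like record."""
    # Drop header tokens (those starting with '>') and join the sequence blocks.
    blocks = [tok for tok in record.split() if not tok.startswith(">")]
    seq = "".join(blocks).upper()
    if not seq:
        raise ValueError("no sequence found")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene, split into two blocks as the page does.
print(gc_percent(">10_bases GTGGA ACCGA"))  # 60.0
```

Passing the complete gene record to the same function should give a value comparable to the GC content field.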
Upstream 100 bases:
>100_bases CCAAGGTTTGATAAACAGCAAATCGACTTGTTTTGCCGCTTAGAACAGGTATAATGGCAAAACATTTGAGCGACATCGAG CTTGCTAGCGAGGAGCATCT
Downstream 100 bases:
>100_bases TCATATCTCTTAATATTTTATATTCCCTCACCCCCTAGCCCCCTCTCCCGCCCAGCGAGGAGAGGGGGAACCAGCTCATT ATGATGGAGGGAACGCCCCT
Product: DNA polymerase III subunit epsilon
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 927; Mature: 927
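The residue count follows from the gene length: 2784 bases make 928 codons, and the final codon (TAA) is a stop, which leaves 927 amino acids. The gene also begins with GTG, which is read as methionine when it serves as a bacterial start codon, consistent with the leading M of the protein. A small sketch of the length check, using a hypothetical function name:

```python
def expected_residues(cds_length_nt, includes_stop=True):
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_nt // 3
    # The terminal stop codon does not add a residue.
    return codons - 1 if includes_stop else codons


print(expected_residues(2784))  # 927
```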
Protein sequence:
>927_residues MEPIYIALDLETTGLEPGRDEIIEVGAVKFRGNEVLETYQTLVKPKQVLPIKIARLTGIDAHELTTAPTFNSIGGQLAKF LKSYPIIGHSVDNDLRFLQQQGLKVTQPHYDTFDLATLLIPQLPNYSLSTIAEHLQIQHPDAHRALADAEASRLVFSALL DKLAELSAAELHSIAQTTQKLQWPLAKLFGEIAKRRVQTLWQAPIEFQPKPLVRPVALEPTGNQQELDAQAIGAMFGADG GFSRMFPGYEPRQPQIEMTEAIAEALNQGDTLMIEAPTGTGKSLAYLVPAAQWARQRGERVVISTNTINLQDQLCSKDIP TVQALLAEQPDQWPALRAVQLKGRSNYLCLKRYESFRAHPDHNEDQTRGLLKLQLWLPSTNSGDRAELMLIQGEQQVWNN VNVDPDQCLRQRCSLYNECFFFKARAEAENAHIVVANHALLMSDVKSPGILPRYDHLIIDEAHNLEDVATDQLGFTISQH SLTGLLNDMHSAGGVRLAGGVLNEWSQIFRLSTVDHKEQRKLEDLSADLRPNVDKAREAAQQLFSIFNDIMAKDRSVTQY DPQLRITSKVRRHTEWTQVEQTWENLSINLRKLGDGFGKLQAILDNLEGRDINGYDDLVMRVKGMVNACTELQRQFDVVI YGNEETVAWLTADQRRRELLVQAAPIHVGPLLTEDLWLKKRASILVSATLSVSNSFDYPKQRLGLDEATTMQLDSPFDYS KSTLIYLPTDMPEPNERNYQRAMEDALINLCKATGGRTLALFTANASLKQTYHGISESLEQADISTLAQGMDGSRRSLIQ RFKSDPRTVLLGTASFWEGVDVVGDALSVLVITKLPFSVPNDPVFSARSEGFDDAFAEYSVPQAILRFKQGFGRLIRSKD DRGIVVVLDRRLLSKNYGRQFLESLPDCTIQRKPLAELATTAARWLV
Sequences:
>Translated_927_residues MEPIYIALDLETTGLEPGRDEIIEVGAVKFRGNEVLETYQTLVKPKQVLPIKIARLTGIDAHELTTAPTFNSIGGQLAKF LKSYPIIGHSVDNDLRFLQQQGLKVTQPHYDTFDLATLLIPQLPNYSLSTIAEHLQIQHPDAHRALADAEASRLVFSALL DKLAELSAAELHSIAQTTQKLQWPLAKLFGEIAKRRVQTLWQAPIEFQPKPLVRPVALEPTGNQQELDAQAIGAMFGADG GFSRMFPGYEPRQPQIEMTEAIAEALNQGDTLMIEAPTGTGKSLAYLVPAAQWARQRGERVVISTNTINLQDQLCSKDIP TVQALLAEQPDQWPALRAVQLKGRSNYLCLKRYESFRAHPDHNEDQTRGLLKLQLWLPSTNSGDRAELMLIQGEQQVWNN VNVDPDQCLRQRCSLYNECFFFKARAEAENAHIVVANHALLMSDVKSPGILPRYDHLIIDEAHNLEDVATDQLGFTISQH SLTGLLNDMHSAGGVRLAGGVLNEWSQIFRLSTVDHKEQRKLEDLSADLRPNVDKAREAAQQLFSIFNDIMAKDRSVTQY DPQLRITSKVRRHTEWTQVEQTWENLSINLRKLGDGFGKLQAILDNLEGRDINGYDDLVMRVKGMVNACTELQRQFDVVI YGNEETVAWLTADQRRRELLVQAAPIHVGPLLTEDLWLKKRASILVSATLSVSNSFDYPKQRLGLDEATTMQLDSPFDYS KSTLIYLPTDMPEPNERNYQRAMEDALINLCKATGGRTLALFTANASLKQTYHGISESLEQADISTLAQGMDGSRRSLIQ RFKSDPRTVLLGTASFWEGVDVVGDALSVLVITKLPFSVPNDPVFSARSEGFDDAFAEYSVPQAILRFKQGFGRLIRSKD DRGIVVVLDRRLLSKNYGRQFLESLPDCTIQRKPLAELATTAARWLV >Mature_927_residues MEPIYIALDLETTGLEPGRDEIIEVGAVKFRGNEVLETYQTLVKPKQVLPIKIARLTGIDAHELTTAPTFNSIGGQLAKF LKSYPIIGHSVDNDLRFLQQQGLKVTQPHYDTFDLATLLIPQLPNYSLSTIAEHLQIQHPDAHRALADAEASRLVFSALL DKLAELSAAELHSIAQTTQKLQWPLAKLFGEIAKRRVQTLWQAPIEFQPKPLVRPVALEPTGNQQELDAQAIGAMFGADG GFSRMFPGYEPRQPQIEMTEAIAEALNQGDTLMIEAPTGTGKSLAYLVPAAQWARQRGERVVISTNTINLQDQLCSKDIP TVQALLAEQPDQWPALRAVQLKGRSNYLCLKRYESFRAHPDHNEDQTRGLLKLQLWLPSTNSGDRAELMLIQGEQQVWNN VNVDPDQCLRQRCSLYNECFFFKARAEAENAHIVVANHALLMSDVKSPGILPRYDHLIIDEAHNLEDVATDQLGFTISQH SLTGLLNDMHSAGGVRLAGGVLNEWSQIFRLSTVDHKEQRKLEDLSADLRPNVDKAREAAQQLFSIFNDIMAKDRSVTQY DPQLRITSKVRRHTEWTQVEQTWENLSINLRKLGDGFGKLQAILDNLEGRDINGYDDLVMRVKGMVNACTELQRQFDVVI YGNEETVAWLTADQRRRELLVQAAPIHVGPLLTEDLWLKKRASILVSATLSVSNSFDYPKQRLGLDEATTMQLDSPFDYS KSTLIYLPTDMPEPNERNYQRAMEDALINLCKATGGRTLALFTANASLKQTYHGISESLEQADISTLAQGMDGSRRSLIQ RFKSDPRTVLLGTASFWEGVDVVGDALSVLVITKLPFSVPNDPVFSARSEGFDDAFAEYSVPQAILRFKQGFGRLIRSKD DRGIVVVLDRRLLSKNYGRQFLESLPDCTIQRKPLAELATTAARWLV
Specific function: Probable helicase involved in DNA repair and perhaps also replication [H]
COG id: COG1199
COG function: function code KL; Rad3-related DNA helicases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 helicase C-terminal domain [H]
Homologues:
Organism=Escherichia coli, GI1787018, Length=717, Percent_Identity=27.8940027894003, Blast_Score=219, Evalue=5e-58
Organism=Escherichia coli, GI1788110, Length=315, Percent_Identity=35.2380952380952, Blast_Score=171, Evalue=2e-43
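Each homologue entry is a run of comma-separated `key=value` fields that starts with `Organism=`, plus a bare `GI…` identifier. The sketch below shows one way such a field could be split into per-hit dictionaries. The parser and its key names are assumptions made for illustration and are not part of the source database.

```python
HOMOLOGUES = (
    "Organism=Escherichia coli, GI1787018, Length=717, "
    "Percent_Identity=27.8940027894003, Blast_Score=219, Evalue=5e-58, "
    "Organism=Escherichia coli, GI1788110, Length=315, "
    "Percent_Identity=35.2380952380952, Blast_Score=171, Evalue=2e-43,"
)


def parse_homologues(text):
    """Split a Homologues field into one dict per BLAST hit."""
    hits = []
    # Every hit begins with 'Organism=', so that marker separates the entries.
    for chunk in text.split("Organism=")[1:]:
        fields = [f.strip() for f in chunk.split(",") if f.strip()]
        hit = {"organism": fields[0]}
        for field in fields[1:]:
            if "=" in field:
                key, value = field.split("=", 1)
                hit[key] = value
            elif field.startswith("GI"):
                hit["gi"] = field[2:]  # bare GI number without the prefix
        hits.append(hit)
    return hits


for hit in parse_homologues(HOMOLOGUES):
    print(hit["gi"], hit["Length"], round(float(hit["Percent_Identity"]), 1), hit["Evalue"])
```

Values are kept as strings so that nothing is lost in conversion; callers can cast `Length`, `Blast_Score`, or `Evalue` as needed.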
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014001
- InterPro: IPR006054
- InterPro: IPR006310
- InterPro: IPR006055
- InterPro: IPR013520
- InterPro: IPR014013
- InterPro: IPR006555
- InterPro: IPR001650
- InterPro: IPR006935
- InterPro: IPR012337 [H]
Pfam domain/function: PF00929 Exonuc_X-T; PF04851 ResIII [H]
EC number: 3.6.4.12 [H]
Molecular weight: Translated: 103847; Mature: 103847
Theoretical pI: Translated: 5.77; Mature: 5.77
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
2.5 %Cys+Met (Mature Protein)
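These percentages are the share of cysteine and methionine residues in the 927-residue protein. The listed values are consistent with 8 Cys and 15 Met residues; those counts are back-calculated here and are not stated in the record. The helper below is a hypothetical sketch of the composition calculation, shown on a toy sequence rather than the real protein.

```python
def residue_percent(sequence, residues):
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = "".join(sequence.split()).upper()  # tolerate spaces between blocks
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return 100.0 * hits / len(seq)


# Toy input, not the real protein: 1 Cys and 1 Met in 4 residues.
print(residue_percent("MCAA", "C"))   # 25.0
print(residue_percent("MCAA", "CM"))  # 50.0

# Back-calculation against the record (the counts are assumptions, see above).
PROTEIN_LENGTH = 927
for label, count in [("Cys", 8), ("Met", 15), ("Cys+Met", 23)]:
    print(label, round(100.0 * count / PROTEIN_LENGTH, 1))  # 0.9, 1.6, 2.5
```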
Secondary structure:
>Translated Secondary Structure
MEPIYIALDLETTGLEPGRDEIIEVGAVKFRGNEVLETYQTLVKPKQVLPIKIARLTGID
CCCEEEEEEEECCCCCCCHHHHEEECEEEECCHHHHHHHHHHHCCCCCCCEEEEECCCCC
AHELTTAPTFNSIGGQLAKFLKSYPIIGHSVDNDLRFLQQQGLKVTQPHYDTFDLATLLI
HHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCEEECCCCCHHHHHHHHH
PQLPNYSLSTIAEHLQIQHPDAHRALADAEASRLVFSALLDKLAELSAAELHSIAQTTQK
CCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LQWPLAKLFGEIAKRRVQTLWQAPIEFQPKPLVRPVALEPTGNQQELDAQAIGAMFGADG
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHCCCC
GFSRMFPGYEPRQPQIEMTEAIAEALNQGDTLMIEAPTGTGKSLAYLVPAAQWARQRGER
CCHHCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCEEEEHHHHHHHHHCCCE
VVISTNTINLQDQLCSKDIPTVQALLAEQPDQWPALRAVQLKGRSNYLCLKRYESFRAHP
EEEEECEECHHHHHHCCCCHHHHHHHHCCCCCCCCCEEEEECCCCCEEEEEHHHHHCCCC
DHNEDQTRGLLKLQLWLPSTNSGDRAELMLIQGEQQVWNNVNVDPDQCLRQRCSLYNECF
CCCCHHHCCEEEEEEECCCCCCCCCEEEEEEECCHHHHCCCCCCHHHHHHHHHHHHHHHE
FFKARAEAENAHIVVANHALLMSDVKSPGILPRYDHLIIDEAHNLEDVATDQLGFTISQH
EEEEECCCCCCEEEEECCHHHHHHCCCCCCCCCCCCEEEECCCCHHHHHHHHHCCEEHHH
SLTGLLNDMHSAGGVRLAGGVLNEWSQIFRLSTVDHKEQRKLEDLSADLRPNVDKAREAA
HHHHHHHHHHCCCCEEEECHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCHHHHHHHH
QQLFSIFNDIMAKDRSVTQYDPQLRITSKVRRHTEWTQVEQTWENLSINLRKLGDGFGKL
HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEHHHCCHHHHH
QAILDNLEGRDINGYDDLVMRVKGMVNACTELQRQFDVVIYGNEETVAWLTADQRRRELL
HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEECHHHHHHHH
VQAAPIHVGPLLTEDLWLKKRASILVSATLSVSNSFDYPKQRLGLDEATTMQLDSPFDYS
HHCCCEEECCHHHHHHHHHHHHHEEEEEEEECCCCCCCCHHHCCCCCCCEEECCCCCCCC
KSTLIYLPTDMPEPNERNYQRAMEDALINLCKATGGRTLALFTANASLKQTYHGISESLE
CCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHH
QADISTLAQGMDGSRRSLIQRFKSDPRTVLLGTASFWEGVDVVGDALSVLVITKLPFSVP
HHHHHHHHHCCCCHHHHHHHHHHCCCCEEEEECHHHHCCHHHHHHHHHHHHHHHCCCCCC
NDPVFSARSEGFDDAFAEYSVPQAILRFKQGFGRLIRSKDDRGIVVVLDRRLLSKNYGRQ
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHH
FLESLPDCTIQRKPLAELATTAARWLV
HHHHCCCCCCCCCCHHHHHHHHHHHCC
>Mature Secondary Structure
MEPIYIALDLETTGLEPGRDEIIEVGAVKFRGNEVLETYQTLVKPKQVLPIKIARLTGID
CCCEEEEEEEECCCCCCCHHHHEEECEEEECCHHHHHHHHHHHCCCCCCCEEEEECCCCC
AHELTTAPTFNSIGGQLAKFLKSYPIIGHSVDNDLRFLQQQGLKVTQPHYDTFDLATLLI
HHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCEEECCCCCHHHHHHHHH
PQLPNYSLSTIAEHLQIQHPDAHRALADAEASRLVFSALLDKLAELSAAELHSIAQTTQK
CCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LQWPLAKLFGEIAKRRVQTLWQAPIEFQPKPLVRPVALEPTGNQQELDAQAIGAMFGADG
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHCCCC
GFSRMFPGYEPRQPQIEMTEAIAEALNQGDTLMIEAPTGTGKSLAYLVPAAQWARQRGER
CCHHCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCEEEEHHHHHHHHHCCCE
VVISTNTINLQDQLCSKDIPTVQALLAEQPDQWPALRAVQLKGRSNYLCLKRYESFRAHP
EEEEECEECHHHHHHCCCCHHHHHHHHCCCCCCCCCEEEEECCCCCEEEEEHHHHHCCCC
DHNEDQTRGLLKLQLWLPSTNSGDRAELMLIQGEQQVWNNVNVDPDQCLRQRCSLYNECF
CCCCHHHCCEEEEEEECCCCCCCCCEEEEEEECCHHHHCCCCCCHHHHHHHHHHHHHHHE
FFKARAEAENAHIVVANHALLMSDVKSPGILPRYDHLIIDEAHNLEDVATDQLGFTISQH
EEEEECCCCCCEEEEECCHHHHHHCCCCCCCCCCCCEEEECCCCHHHHHHHHHCCEEHHH
SLTGLLNDMHSAGGVRLAGGVLNEWSQIFRLSTVDHKEQRKLEDLSADLRPNVDKAREAA
HHHHHHHHHHCCCCEEEECHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCHHHHHHHH
QQLFSIFNDIMAKDRSVTQYDPQLRITSKVRRHTEWTQVEQTWENLSINLRKLGDGFGKL
HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEHHHCCHHHHH
QAILDNLEGRDINGYDDLVMRVKGMVNACTELQRQFDVVIYGNEETVAWLTADQRRRELL
HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEECHHHHHHHH
VQAAPIHVGPLLTEDLWLKKRASILVSATLSVSNSFDYPKQRLGLDEATTMQLDSPFDYS
HHCCCEEECCHHHHHHHHHHHHHEEEEEEEECCCCCCCCHHHCCCCCCCEEECCCCCCCC
KSTLIYLPTDMPEPNERNYQRAMEDALINLCKATGGRTLALFTANASLKQTYHGISESLE
CCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHH
QADISTLAQGMDGSRRSLIQRFKSDPRTVLLGTASFWEGVDVVGDALSVLVITKLPFSVP
HHHHHHHHHCCCCHHHHHHHHHHCCCCEEEEECHHHHCCHHHHHHHHHHHHHHHCCCCCC
NDPVFSARSEGFDDAFAEYSVPQAILRFKQGFGRLIRSKDDRGIVVVLDRRLLSKNYGRQ
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHH
FLESLPDCTIQRKPLAELATTAARWLV
HHHHCCCCCCCCCCHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8760912; 9384377 [H]