| Definition | Hyphomonas neptunium ATCC 15444 chromosome, complete genome. |
|---|---|
| Accession | NC_008358 |
| Length | 3,705,021 |
The map label for this gene is parE [H]
Identifier: 114798323
GI number: 114798323
Start: 988841
End: 990838
Strand: Reverse
Name: parE [H]
Synonym: HNE_0976
Alternate gene names: 114798323
Gene position: 990838-988841 (Counterclockwise)
Preceding gene: 114799557
Following gene: 114799895
Centisome position: 26.74
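The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the centisome value is the gene's 5' coordinate (990838 on the reverse strand) expressed as a percentage of the chromosome length. That assumption reproduces the listed 26.74, but the record does not document the formula.

```python
# Values copied from the record's Length, Start and End fields.
GENOME_LENGTH = 3_705_021
START, END = 988_841, 990_838


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length (assumed formula)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))            # 1998
print(centisome(END, GENOME_LENGTH))      # 26.74
```

The 1998-base result agrees with the header of the gene sequence listed in this record.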
GC content: 63.46
Gene sequence:
>1998_bases ATGTCCTCAGCGCGGCAGATGAAGCAAGACGACCTGCTGGCGGGCACAGCGGGCTATTCCGCCAAGGACATCGAGGTCCT CGAAGGCCTTGAGCCCGTCCGCAAGCGCCCCGGCATGTATATCGGCGGCACCGATGAGCGCGCCTGGCACCACCTCTTCG CTGAAGTCCTCGACAACGCCATGGACGAAGCCGTTGCCGGCTATGCCAACCGCATCGACATCAAGCTCGACGCTGACGGC TTCCTCACCGTGGTCGACAATGGCCGCGGCATCCCGATCGACCCGCACCCGAAATTCCCCAAGAAATCAGCGCTCGAAGT CATCATGACGACGCTCCATTCGGGCGGCAAATTCTCCGACAAAGCCTACTCCACCGCCGGAGGCCTCCACGGCGTCGGCA TCTCGGTCGTCAACGCCCTGTCTGAACTCGTCGAAGTCGAAGTCGCCCGCGACCGCACGCTCTACCGCCAGGATTTCTCG CGCGGCCTGCCGCTCGGCAAGCTCCAGAAGGTCGGCGCTGCGCCCAACCGGCGCGGCACCTCCGTGCGCTTCAAGCCCGA CTTCCAGATCTTCGGCGACAAGCTCCGCTGGCGCCCCCAGCGCCTCTTCCAGATGGCGCGCTCGAAGGCTTACCTCTTCC GCGGCGTCGAAGTGCGCTGGTCCTGCGATCCCGAGCTTCTTCCCGAGGACTCCAAAGTCCCCGCCGAAGCCGTCCTCTCC TATCCCAATGGCCTGGCCGACCAGCTCACCGAAGTCTTCGGCGACAAGTCCACCATCACCGAGACCCCCTTCACCGGCCT CGTTGACATGGGCGCCGAGGGTAAGGTCGAATGGGCCATCGCCTGGACCCAGGCCGGCTTTGGCGAGGCAGACGGCTTCG CCCGCTCCTATTGTAACACCATCCCGACGCCCGAAGGCGGCACCCACGAAGCCGGCTTCCGCTCGGCCATCACCAAGGGC ATCCGCAATTTCGGAGACCTGACCGGCAACAAGAAAGCCGCCGAAGTCACCGCCGAGGATGTCATGGGCCATTCCGGCCT CCTCCTCTCGGTCTTCATCCGTGGCCCTGAATTCGTCGGCCAGACCAAGGACAAGCTCTCCTCCACCCACGCCTTCCGCT TGGTGGAAAACGCGGTACGGGACCACTTCGACCATTGGCTCGCCGGCTCCCCCAAGGAAGCAAACAAGCTGCTCGGCTGG GCCATCGACCGCGCCGATGAACGCGCCAAGCGCCGCAAGGCCAAGGAAATCAGCCGAAAGTCTGCCACCAAGAAACTCCG CCTCCCCGGCAAACTCGCCGATTGCTCGGCTAAAGGCCCGGAAGGCACCGAACTCTTCCTCGTCGAAGGCGACTCGGCCG GCGGCTCCGCCAAACAGGCCCGCAACCGCGAGACGCAGGCGATCCTCCCCCTGCGCGGCAAGATCCTCAACGTCGAAAGC GCTTCAGACGACAAGCTGATGGGCAACCAGGAACTGGCCGACCTCTCCCTCGCCCTCGGCACAGAACTCGGTCGCAAGTT CAACATCGACGATCTGCGCTATGAGCGCATCATCATCATGACCGATGCGGACGTCGACGGGGCCCACATCGCCGCCCTGC TCATCACCTTCTTCTACCGGCTCACCCCCGGCCTGATCGAAAGCGGGCGGCTCTATCTCGCCCTGCCGCCCCTGTTCAAA CTCTCGAACAAGGGCAACATCCATTACGCCATGGATGACGCCGACCGGGCCCGGATCATGAAGGAACACTTCAAGGGCAA CCAGAAGGTCGAGATGACGCGCTTCAAGGGTCTGGGCGAAATGAACCCCGCCCAGCTCAAAGAAACCACCATGAACCCGT CCAGCCGCACGCTGGCGCGCGTCACGCTCCCGGCGGCAATGGATGATCTCGAAATCAATCCCGCCGACCTCATCAACACC 
CTGATGGGCAAAAAAGCCGAACTCCGCTTCCGCTTCATCCAGGAAAACGCCGCCTTCGTCGAAGAGCTCGACATCTAG
Upstream 100 bases:
>100_bases AGAAAGCCGAAGGTTAATGCGCCGTTAAGGCTTTAACCTCCTATCTGCCACGACAGTTCCGCCGGGTCATCGGCGGCTTT CAGTGACGGAGAAGCAGTCC
Downstream 100 bases:
>100_bases GCTCAGGCTCACCGCGCCCTCTCCCCACAGCGTTTGCGCGTGGCTGCGGCAAGCACCAACCCAGTCAGCGAACGCGCCGT CAGGTCTGCCGGATGAATTT
Product: DNA topoisomerase IV subunit B
Products: NA
Alternate protein names: Topoisomerase IV subunit B [H]
Number of amino acids: Translated: 665; Mature: 664
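The two residue counts follow from the gene length. The sketch below uses a hypothetical helper, `residue_counts`, and assumes the last codon is a stop codon (the gene sequence ends in TAG) and that the mature form differs only by removal of the initiator methionine, which matches the Mature sequence starting at SSARQ.

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence length includes one terminal stop codon and that
    the mature protein lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(residue_counts(1998))  # (665, 664)
```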
Protein sequence:
>665_residues MSSARQMKQDDLLAGTAGYSAKDIEVLEGLEPVRKRPGMYIGGTDERAWHHLFAEVLDNAMDEAVAGYANRIDIKLDADG FLTVVDNGRGIPIDPHPKFPKKSALEVIMTTLHSGGKFSDKAYSTAGGLHGVGISVVNALSELVEVEVARDRTLYRQDFS RGLPLGKLQKVGAAPNRRGTSVRFKPDFQIFGDKLRWRPQRLFQMARSKAYLFRGVEVRWSCDPELLPEDSKVPAEAVLS YPNGLADQLTEVFGDKSTITETPFTGLVDMGAEGKVEWAIAWTQAGFGEADGFARSYCNTIPTPEGGTHEAGFRSAITKG IRNFGDLTGNKKAAEVTAEDVMGHSGLLLSVFIRGPEFVGQTKDKLSSTHAFRLVENAVRDHFDHWLAGSPKEANKLLGW AIDRADERAKRRKAKEISRKSATKKLRLPGKLADCSAKGPEGTELFLVEGDSAGGSAKQARNRETQAILPLRGKILNVES ASDDKLMGNQELADLSLALGTELGRKFNIDDLRYERIIIMTDADVDGAHIAALLITFFYRLTPGLIESGRLYLALPPLFK LSNKGNIHYAMDDADRARIMKEHFKGNQKVEMTRFKGLGEMNPAQLKETTMNPSSRTLARVTLPAAMDDLEINPADLINT LMGKKAELRFRFIQENAAFVEELDI
Sequences:
>Translated_665_residues MSSARQMKQDDLLAGTAGYSAKDIEVLEGLEPVRKRPGMYIGGTDERAWHHLFAEVLDNAMDEAVAGYANRIDIKLDADG FLTVVDNGRGIPIDPHPKFPKKSALEVIMTTLHSGGKFSDKAYSTAGGLHGVGISVVNALSELVEVEVARDRTLYRQDFS RGLPLGKLQKVGAAPNRRGTSVRFKPDFQIFGDKLRWRPQRLFQMARSKAYLFRGVEVRWSCDPELLPEDSKVPAEAVLS YPNGLADQLTEVFGDKSTITETPFTGLVDMGAEGKVEWAIAWTQAGFGEADGFARSYCNTIPTPEGGTHEAGFRSAITKG IRNFGDLTGNKKAAEVTAEDVMGHSGLLLSVFIRGPEFVGQTKDKLSSTHAFRLVENAVRDHFDHWLAGSPKEANKLLGW AIDRADERAKRRKAKEISRKSATKKLRLPGKLADCSAKGPEGTELFLVEGDSAGGSAKQARNRETQAILPLRGKILNVES ASDDKLMGNQELADLSLALGTELGRKFNIDDLRYERIIIMTDADVDGAHIAALLITFFYRLTPGLIESGRLYLALPPLFK LSNKGNIHYAMDDADRARIMKEHFKGNQKVEMTRFKGLGEMNPAQLKETTMNPSSRTLARVTLPAAMDDLEINPADLINT LMGKKAELRFRFIQENAAFVEELDI >Mature_664_residues SSARQMKQDDLLAGTAGYSAKDIEVLEGLEPVRKRPGMYIGGTDERAWHHLFAEVLDNAMDEAVAGYANRIDIKLDADGF LTVVDNGRGIPIDPHPKFPKKSALEVIMTTLHSGGKFSDKAYSTAGGLHGVGISVVNALSELVEVEVARDRTLYRQDFSR GLPLGKLQKVGAAPNRRGTSVRFKPDFQIFGDKLRWRPQRLFQMARSKAYLFRGVEVRWSCDPELLPEDSKVPAEAVLSY PNGLADQLTEVFGDKSTITETPFTGLVDMGAEGKVEWAIAWTQAGFGEADGFARSYCNTIPTPEGGTHEAGFRSAITKGI RNFGDLTGNKKAAEVTAEDVMGHSGLLLSVFIRGPEFVGQTKDKLSSTHAFRLVENAVRDHFDHWLAGSPKEANKLLGWA IDRADERAKRRKAKEISRKSATKKLRLPGKLADCSAKGPEGTELFLVEGDSAGGSAKQARNRETQAILPLRGKILNVESA SDDKLMGNQELADLSLALGTELGRKFNIDDLRYERIIIMTDADVDGAHIAALLITFFYRLTPGLIESGRLYLALPPLFKL SNKGNIHYAMDDADRARIMKEHFKGNQKVEMTRFKGLGEMNPAQLKETTMNPSSRTLARVTLPAAMDDLEINPADLINTL MGKKAELRFRFIQENAAFVEELDI
Specific function: Topoisomerase IV is essential for chromosome segregation. It relaxes supercoiled DNA. Performs the decatenation events required during the replication of a circular DNA molecule [H]
COG id: COG0187
COG function: function code L; Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 Toprim domain [H]
Homologues:
- Organism=Homo sapiens, GI19913406, Length=625, Percent_Identity=25.28, Blast_Score=140, Evalue=4e-33
- Organism=Homo sapiens, GI19913408, Length=627, Percent_Identity=24.5614035087719, Blast_Score=134, Evalue=3e-31
- Organism=Escherichia coli, GI1789408, Length=650, Percent_Identity=42.9230769230769, Blast_Score=470, Evalue=1e-133
- Organism=Escherichia coli, GI48994957, Length=563, Percent_Identity=43.5168738898757, Blast_Score=410, Evalue=1e-115
- Organism=Caenorhabditis elegans, GI17535065, Length=582, Percent_Identity=22.8522336769759, Blast_Score=123, Evalue=4e-28
- Organism=Caenorhabditis elegans, GI212645845, Length=552, Percent_Identity=24.0942028985507, Blast_Score=116, Evalue=4e-26
- Organism=Caenorhabditis elegans, GI212645657, Length=228, Percent_Identity=26.3157894736842, Blast_Score=71, Evalue=2e-12
- Organism=Saccharomyces cerevisiae, GI6324241, Length=650, Percent_Identity=26, Blast_Score=120, Evalue=9e-28
- Organism=Drosophila melanogaster, GI17136538, Length=629, Percent_Identity=25.5961844197138, Blast_Score=121, Evalue=2e-27
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594
- InterPro: IPR020568
- InterPro: IPR014721
- InterPro: IPR001241
- InterPro: IPR013759
- InterPro: IPR002288
- InterPro: IPR013506
- InterPro: IPR013760
- InterPro: IPR018522
- InterPro: IPR005737
- InterPro: IPR006171 [H]
Pfam domain/function: PF00204 DNA_gyraseB; PF00986 DNA_gyraseB_C; PF02518 HATPase_c; PF01751 Toprim [H]
EC number: 5.99.1.-
Molecular weight: Translated: 73224; Mature: 73093
Theoretical pI: Translated: 7.45; Mature: 7.45
Prosite motif: PS00177 TOPOISOMERASE_II
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.4 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
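Percentages of this kind can be obtained by counting residues in a one-letter protein sequence. The sketch below is only an illustration: `residue_percent` is a hypothetical helper, and for brevity it is applied to the first 20 residues of the translated sequence, so its output is not comparable to the whole-protein values listed above.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    hits = sum(sequence.count(r) for r in residues)
    return round(100.0 * hits / len(sequence), 1)


# First 20 residues of the translated protein, used only as a small example.
n_terminus = "MSSARQMKQDDLLAGTAGYS"

print(residue_percent(n_terminus, "C"))   # 0.0
print(residue_percent(n_terminus, "M"))   # 10.0
print(residue_percent(n_terminus, "CM"))  # 10.0
```

Running the same function on the full Translated and Mature sequences would be the way to check the six values in this field.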
Secondary structure:
>Translated Secondary Structure MSSARQMKQDDLLAGTAGYSAKDIEVLEGLEPVRKRPGMYIGGTDERAWHHLFAEVLDNA CCCHHHHCHHCCEECCCCCCCCHHHHHHCCHHHHHCCCEEECCCCHHHHHHHHHHHHHHH MDEAVAGYANRIDIKLDADGFLTVVDNGRGIPIDPHPKFPKKSALEVIMTTLHSGGKFSD HHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCC KAYSTAGGLHGVGISVVNALSELVEVEVARDRTLYRQDFSRGLPLGKLQKVGAAPNRRGT HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCC SVRFKPDFQIFGDKLRWRPQRLFQMARSKAYLFRGVEVRWSCDPELLPEDSKVPAEAVLS EEEECCCHHHHCCHHCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCHHHHHH YPNGLADQLTEVFGDKSTITETPFTGLVDMGAEGKVEWAIAWTQAGFGEADGFARSYCNT CCCCHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCEEEEEEEECCCCCCCCHHHHHHHCC IPTPEGGTHEAGFRSAITKGIRNFGDLTGNKKAAEVTAEDVMGHSGLLLSVFIRGPEFVG CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCEEEEEECCCHHHC QTKDKLSSTHAFRLVENAVRDHFDHWLAGSPKEANKLLGWAIDRADERAKRRKAKEISRK CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH SATKKLRLPGKLADCSAKGPEGTELFLVEGDSAGGSAKQARNRETQAILPLRGKILNVES HHHHEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHCCCCCEEEECCCCEEEEECC ASDDKLMGNQELADLSLALGTELGRKFNIDDLRYERIIIMTDADVDGAHIAALLITFFYR CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHH LTPGLIESGRLYLALPPLFKLSNKGNIHYAMDDADRARIMKEHFKGNQKVEMTRFKGLGE HCCCHHHCCCEEEEECCHHEECCCCCEEEEECCHHHHHHHHHHCCCCCEEEEHHHCCCCC MNPAQLKETTMNPSSRTLARVTLPAAMDDLEINPADLINTLMGKKAELRFRFIQENAAFV CCHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHCCCHHHEEEEHHHCHHHH EELDI HHCCC >Mature Secondary Structure SSARQMKQDDLLAGTAGYSAKDIEVLEGLEPVRKRPGMYIGGTDERAWHHLFAEVLDNA CCHHHHCHHCCEECCCCCCCCHHHHHHCCHHHHHCCCEEECCCCHHHHHHHHHHHHHHH MDEAVAGYANRIDIKLDADGFLTVVDNGRGIPIDPHPKFPKKSALEVIMTTLHSGGKFSD HHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCC KAYSTAGGLHGVGISVVNALSELVEVEVARDRTLYRQDFSRGLPLGKLQKVGAAPNRRGT HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCC SVRFKPDFQIFGDKLRWRPQRLFQMARSKAYLFRGVEVRWSCDPELLPEDSKVPAEAVLS EEEECCCHHHHCCHHCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCHHHHHH YPNGLADQLTEVFGDKSTITETPFTGLVDMGAEGKVEWAIAWTQAGFGEADGFARSYCNT 
CCCCHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCEEEEEEEECCCCCCCCHHHHHHHCC IPTPEGGTHEAGFRSAITKGIRNFGDLTGNKKAAEVTAEDVMGHSGLLLSVFIRGPEFVG CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCEEEEEECCCHHHC QTKDKLSSTHAFRLVENAVRDHFDHWLAGSPKEANKLLGWAIDRADERAKRRKAKEISRK CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH SATKKLRLPGKLADCSAKGPEGTELFLVEGDSAGGSAKQARNRETQAILPLRGKILNVES HHHHEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHCCCCCEEEECCCCEEEEECC ASDDKLMGNQELADLSLALGTELGRKFNIDDLRYERIIIMTDADVDGAHIAALLITFFYR CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHH LTPGLIESGRLYLALPPLFKLSNKGNIHYAMDDADRARIMKEHFKGNQKVEMTRFKGLGE HCCCHHHCCCEEEEECCHHEECCCCCEEEEECCHHHHHHHHHHCCCCCEEEEHHHCCCCC MNPAQLKETTMNPSSRTLARVTLPAAMDDLEINPADLINTLMGKKAELRFRFIQENAAFV CCHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHCCCHHHEEEEHHHCHHHH EELDI HHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11259647; 9426128 [H]