Definition | Rickettsia akari str. Hartford, complete genome. |
---|---|
Accession | NC_009881 |
Length | 1,231,060 |
The map label for this gene is topA [H]
Identifier: 157825575
GI number: 157825575
Start: 443524
End: 445854
Strand: Reverse
Name: topA [H]
Synonym: A1C_02440
Alternate gene names: 157825575
Gene position: 445854-443524 (Counterclockwise)
Preceding gene: 157825576
Following gene: 157825574
Centisome position: 36.22
GC content: 34.62
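The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own method: the function names are hypothetical, and it assumes that the centisome position is the gene's 5' coordinate (the End value, since the gene is on the reverse strand) expressed as a percentage of the genome length. That assumption reproduces the listed 36.22. The `gc_percent` helper shows how a GC content figure is conventionally computed from a sequence string.

```python
GENOME_LENGTH = 1_231_060          # "Length" field of NC_009881
START, END = 443_524, 445_854      # "Start" / "End" fields (reverse strand)


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature given 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    seq = seq.upper()
    return round(100.0 * (seq.count("G") + seq.count("C")) / len(seq), 2)


print(gene_length(START, END))            # 2331 bases
print(gene_length(START, END) // 3)       # 777 codons: 776 residues + stop
print(centisome(END, GENOME_LENGTH))      # 36.22
```

The 2331-base length agrees with the header of the gene sequence, and 777 codons agree with the 776-residue product plus a stop codon.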
Gene sequence:
>2331_bases ATGAAATTAGTAATAGTAGAATCGCCAGCAAAGGCAAAAACGATAAATAAATATTTAGGTGATGAGTTTAAGGTCATTGC ATCATTCGGTCACATCAGAGATTTACCTTCAAAAAAAGGCTCGGTATTGCCTGATGAAAATTTTGCAATGAAATACGATA TTTCCGATAAAGCCAGTAAATATGTAGATGCCATAGTTAAAGATGCTAAAAAAGCTGATTCAGTATATCTTGCAACCGAT CCTGATCGTGAAGGTGAATCTATCTCATGGCATGTTGCAGAGGTAATAAAAGAAAAGAATAAAGTTAAATCCGATGATTT TTTCAAAAGAGTAGCCTTTAATGAGATTACTAAAAAAGCAATTATTCATGCCGTTGAGAACCCTAGGAAACTTGACACTA ATTTAGTAAATGCCCAACAAGCAAGAAGAGCTTTAGACTATTTAGTTGGCTTTACCCTTTCGCCTCTTTTATGGCGTAAG TTACCGGGATGTAAATCGGCGGGACGTGTACAATCCGTGGCTCTGCGATTAATATGTGAGCGAGAAGATAAAATAGAGCG TTTTAAGTCAGAAGAATATTGGGATATTAGCATCAAAATACAAAGTAGTAATAATGAACTATTTACTGCTCAATTGACTC ATGTAAACGATCAGAAATTAGAAAAATTTTCAATTATTAACGCAAAAGACGCCAAAGATTTAACTGAAAAATTAAAATCT CAAAAATTTCACGTTGATAAGATAGAAAAGAAACAACAAAAACGTGCACCGCAACCTCCTTTTATTACTTCATCACTACA ACAAGAAGCAGCAAGAAAATTAGGTTTTAGTGCTAAAAAGACTATGCAAATAGCACAAAAACTTTATGAGGGCGTTGACA TAGGTAAGGAAACTATAGGACTTATTACCTATATGAGAACCGACGGCGTTACATTATCAAATGACGCAATAGCAGATATA CGTAAGTTAATTGATACAAGCTATGGCAATAAATATTTACCTAGTAGCCCTAGAATTTACAAATCCAAAGTAAAAAACGC TCAAGAAGCTCATGAGGCGATAAGACCGACAAATATCACTTATACTCCCGATAACTTAAAAGAACTGCTAGAAAAGGATT ATTATAAGCTTTATGAGTTGATTTGGAAAAGAACTGTAGCCTGCCAAATGGAAAATGTGATAATGGATTTGGTGGTTGCA AGTTTAGCTTCGGAAAATAAAGAATATTTAGCAAAAGCAACCGGATCGACTATCACCTTTGATGGATTTTATAAAATTTA TCGTGAAAGTGTAGACGATGAAGATGAAGAAGAAAATAAAATGCTACCGCCTTTAAAAGAACAAGAATCGCTTAAAACTA AAGAAATTATTCCGAATCAGCATTTTACTGAACCACCTCCAAGATATTCGGAGGCAAGTTTAGTGAAAAAACTTGAAGAG CTTGGGATCGGTCGCCCTTCGACATATGCCAGTATTTTATCGGTTTTACAAGATCGCAAATATGTAACGCTCGAAAAAAA GCGTTTTATACCTGAAGAGCTTGGACGTCTAGTAACGGTATTTTTGGTTGGGTTTTTCAGAAAATATGTAGAATATGATT TTACTGCAGGTCTTGAAAACGAGTTAGATGAAATAGCAGCGGGCAAGCTTGAGTGGAAAGCCGCATTAAATAATTTTTGG AGTGGTTTTAACAATAATATTAAATCGGTAAACGAACAAAAAATAACCGAGATTATTAGCTATGTACAGAAAGCACTTGA TTATCATTTATTCGGTGAAAATAAAGAATCTAAAGTTTGTCCTTCATGTAAAACAGGTGAGCTTAGCTTAAAGCTCGGTA AGTTCGGGGCATTTTTAGCATGTAGTAATTACCCTGAATGTACTTTTAGAAAATCTATTGTTAGCGGTAACGATAATAAC 
GAAAATGAAGCCGAACTTTCCGCTATTCCTAATGGAAATAAAATTTTAGGCACTGATAAAGACGGGATAGAAATATATCT GAAAAAAGGACCTTACGGACCTTATATTCAACACGGCGAACAAGAAGGAAAAGTAAAGCCAAAACGTAGCCCCATACCTG CTATCTTGAATCAAAACGACATCACACTTGAGATTGCATTAAAGCTTCTAAGCCTACCGCTTAAAATCGGTATTCATAAA GATAGCGGCGAGGAAATTATGATAGGATACGGTAAATTCGGTCCTTACATAAAATATATGAGTAAGTTTATCTCTATACC TAAAAAATATGATTTTCTAAATTTAAGCTTAGATGATGCGATGAAGCTAATTGAAGATAATAATGCGAAGTTAGAAACGA CACAGGGGTAA
Upstream 100 bases:
>100_bases TACCGCTTCCTATTATATATACGTAATATTAGAATTAGAACTCGCCGGCAAAACAATACTTCACACCGGCAATAAAATAT CATTAGTTTATAATAAATAG
Downstream 100 bases:
>100_bases TCGTTATAATACAGTGGCTTGACAAGCGTTGTTGCATAGCTCGGTTTTCCAGCATTACGAGAAGCAACTGCAAGTTTCGA CGAAGCAATCCAGTAAAAAA
Product: DNA topoisomerase I
Products: NA
Alternate protein names: DNA topoisomerase I; Omega-protein; Relaxing enzyme; Swivelase; Untwisting enzyme [H]
Number of amino acids: Translated: 776; Mature: 776
Protein sequence:
>776_residues MKLVIVESPAKAKTINKYLGDEFKVIASFGHIRDLPSKKGSVLPDENFAMKYDISDKASKYVDAIVKDAKKADSVYLATD PDREGESISWHVAEVIKEKNKVKSDDFFKRVAFNEITKKAIIHAVENPRKLDTNLVNAQQARRALDYLVGFTLSPLLWRK LPGCKSAGRVQSVALRLICEREDKIERFKSEEYWDISIKIQSSNNELFTAQLTHVNDQKLEKFSIINAKDAKDLTEKLKS QKFHVDKIEKKQQKRAPQPPFITSSLQQEAARKLGFSAKKTMQIAQKLYEGVDIGKETIGLITYMRTDGVTLSNDAIADI RKLIDTSYGNKYLPSSPRIYKSKVKNAQEAHEAIRPTNITYTPDNLKELLEKDYYKLYELIWKRTVACQMENVIMDLVVA SLASENKEYLAKATGSTITFDGFYKIYRESVDDEDEEENKMLPPLKEQESLKTKEIIPNQHFTEPPPRYSEASLVKKLEE LGIGRPSTYASILSVLQDRKYVTLEKKRFIPEELGRLVTVFLVGFFRKYVEYDFTAGLENELDEIAAGKLEWKAALNNFW SGFNNNIKSVNEQKITEIISYVQKALDYHLFGENKESKVCPSCKTGELSLKLGKFGAFLACSNYPECTFRKSIVSGNDNN ENEAELSAIPNGNKILGTDKDGIEIYLKKGPYGPYIQHGEQEGKVKPKRSPIPAILNQNDITLEIALKLLSLPLKIGIHK DSGEEIMIGYGKFGPYIKYMSKFISIPKKYDFLNLSLDDAMKLIEDNNAKLETTQG
Sequences:
>Translated_776_residues MKLVIVESPAKAKTINKYLGDEFKVIASFGHIRDLPSKKGSVLPDENFAMKYDISDKASKYVDAIVKDAKKADSVYLATD PDREGESISWHVAEVIKEKNKVKSDDFFKRVAFNEITKKAIIHAVENPRKLDTNLVNAQQARRALDYLVGFTLSPLLWRK LPGCKSAGRVQSVALRLICEREDKIERFKSEEYWDISIKIQSSNNELFTAQLTHVNDQKLEKFSIINAKDAKDLTEKLKS QKFHVDKIEKKQQKRAPQPPFITSSLQQEAARKLGFSAKKTMQIAQKLYEGVDIGKETIGLITYMRTDGVTLSNDAIADI RKLIDTSYGNKYLPSSPRIYKSKVKNAQEAHEAIRPTNITYTPDNLKELLEKDYYKLYELIWKRTVACQMENVIMDLVVA SLASENKEYLAKATGSTITFDGFYKIYRESVDDEDEEENKMLPPLKEQESLKTKEIIPNQHFTEPPPRYSEASLVKKLEE LGIGRPSTYASILSVLQDRKYVTLEKKRFIPEELGRLVTVFLVGFFRKYVEYDFTAGLENELDEIAAGKLEWKAALNNFW SGFNNNIKSVNEQKITEIISYVQKALDYHLFGENKESKVCPSCKTGELSLKLGKFGAFLACSNYPECTFRKSIVSGNDNN ENEAELSAIPNGNKILGTDKDGIEIYLKKGPYGPYIQHGEQEGKVKPKRSPIPAILNQNDITLEIALKLLSLPLKIGIHK DSGEEIMIGYGKFGPYIKYMSKFISIPKKYDFLNLSLDDAMKLIEDNNAKLETTQG >Mature_776_residues MKLVIVESPAKAKTINKYLGDEFKVIASFGHIRDLPSKKGSVLPDENFAMKYDISDKASKYVDAIVKDAKKADSVYLATD PDREGESISWHVAEVIKEKNKVKSDDFFKRVAFNEITKKAIIHAVENPRKLDTNLVNAQQARRALDYLVGFTLSPLLWRK LPGCKSAGRVQSVALRLICEREDKIERFKSEEYWDISIKIQSSNNELFTAQLTHVNDQKLEKFSIINAKDAKDLTEKLKS QKFHVDKIEKKQQKRAPQPPFITSSLQQEAARKLGFSAKKTMQIAQKLYEGVDIGKETIGLITYMRTDGVTLSNDAIADI RKLIDTSYGNKYLPSSPRIYKSKVKNAQEAHEAIRPTNITYTPDNLKELLEKDYYKLYELIWKRTVACQMENVIMDLVVA SLASENKEYLAKATGSTITFDGFYKIYRESVDDEDEEENKMLPPLKEQESLKTKEIIPNQHFTEPPPRYSEASLVKKLEE LGIGRPSTYASILSVLQDRKYVTLEKKRFIPEELGRLVTVFLVGFFRKYVEYDFTAGLENELDEIAAGKLEWKAALNNFW SGFNNNIKSVNEQKITEIISYVQKALDYHLFGENKESKVCPSCKTGELSLKLGKFGAFLACSNYPECTFRKSIVSGNDNN ENEAELSAIPNGNKILGTDKDGIEIYLKKGPYGPYIQHGEQEGKVKPKRSPIPAILNQNDITLEIALKLLSLPLKIGIHK DSGEEIMIGYGKFGPYIKYMSKFISIPKKYDFLNLSLDDAMKLIEDNNAKLETTQG
Specific function: The reaction catalyzed by topoisomerases leads to the conversion of one topological isomer of DNA to another [H]
COG id: COG0550
COG function: function code L; Topoisomerase IA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 Toprim domain [H]
Homologues:
Organism=Homo sapiens, GI4507635, Length=582, Percent_Identity=23.1958762886598, Blast_Score=108, Evalue=2e-23
Organism=Escherichia coli, GI1787529, Length=651, Percent_Identity=42.0890937019969, Blast_Score=484, Evalue=1e-138
Organism=Escherichia coli, GI1788061, Length=573, Percent_Identity=24.0837696335079, Blast_Score=106, Evalue=7e-24
Organism=Caenorhabditis elegans, GI32563869, Length=559, Percent_Identity=24.865831842576, Blast_Score=134, Evalue=2e-31
Organism=Caenorhabditis elegans, GI17555378, Length=544, Percent_Identity=27.0220588235294, Blast_Score=130, Evalue=2e-30
Organism=Saccharomyces cerevisiae, GI6323263, Length=509, Percent_Identity=24.1650294695481, Blast_Score=108, Evalue=4e-24
Organism=Drosophila melanogaster, GI24585251, Length=610, Percent_Identity=23.1147540983607, Blast_Score=116, Evalue=7e-26
Organism=Drosophila melanogaster, GI24640096, Length=469, Percent_Identity=23.454157782516, Blast_Score=94, Evalue=3e-19
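Each homologue entry follows the same field order (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so the list can be parsed mechanically. The sketch below is one possible way to do that with a regular expression; the function name and dictionary keys are hypothetical, and the sample string holds only the first two entries from this record.

```python
import re

# Pattern mirrors the field order used in the Homologues entries.
ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in ENTRY.finditer(text):
        hits.append({
            "organism": m["organism"],
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI4507635, Length=582, "
    "Percent_Identity=23.1958762886598, Blast_Score=108, Evalue=2e-23, "
    "Organism=Escherichia coli, GI1787529, Length=651, "
    "Percent_Identity=42.0890937019969, Blast_Score=484, Evalue=1e-138,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["score"])  # Escherichia coli 484
```

Ranking by Blast_Score is only one choice; sorting by Evalue or Percent_Identity works the same way on the parsed dictionaries.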
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003601
- InterPro: IPR013497
- InterPro: IPR013824
- InterPro: IPR013825
- InterPro: IPR000380
- InterPro: IPR003602
- InterPro: IPR013498
- InterPro: IPR005733
- InterPro: IPR006171 [H]
Pfam domain/function: PF01131 Topoisom_bac; PF01751 Toprim; PF01396 zf-C4_Topoisom [H]
EC number: 5.99.1.2 [H]
Molecular weight: Translated: 88086; Mature: 88086
Theoretical pI: Translated: 8.83; Mature: 8.83
Prosite motif: PS00396 TOPOISOMERASE_I_PROK
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9% Cys (Translated Protein)
1.3% Met (Translated Protein)
2.2% Cys+Met (Translated Protein)
0.9% Cys (Mature Protein)
1.3% Met (Mature Protein)
2.2% Cys+Met (Mature Protein)
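The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows; the function name is hypothetical, and the example runs on only the first 20 residues of the protein sequence, so its output differs from the full-length figures above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given residues."""
    protein = protein.upper()
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(protein), 1)


fragment = "MKLVIVESPAKAKTINKYLG"  # first 20 residues of the 776-residue product

print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 5.0
```

For the full 776-residue sequence, 0.9% corresponds to roughly 7 cysteines and 1.3% to roughly 10 methionines.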
Secondary structure:
>Translated Secondary Structure MKLVIVESPAKAKTINKYLGDEFKVIASFGHIRDLPSKKGSVLPDENFAMKYDISDKASK CEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCHHHHH YVDAIVKDAKKADSVYLATDPDREGESISWHVAEVIKEKNKVKSDDFFKRVAFNEITKKA HHHHHHHHHHHCCCEEEEECCCCCCCCEEHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH IIHAVENPRKLDTNLVNAQQARRALDYLVGFTLSPLLWRKLPGCKSAGRVQSVALRLICE HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHC REDKIERFKSEEYWDISIKIQSSNNELFTAQLTHVNDQKLEKFSIINAKDAKDLTEKLKS CHHHHHHHCCCCEEEEEEEEECCCCEEEEEEEECCCHHHHHHHEECCCCCHHHHHHHHHH QKFHVDKIEKKQQKRAPQPPFITSSLQQEAARKLGFSAKKTMQIAQKLYEGVDIGKETIG CCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHH LITYMRTDGVTLSNDAIADIRKLIDTSYGNKYLPSSPRIYKSKVKNAQEAHEAIRPTNIT HHEEEECCCEEECCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCHHHHHHHHCCCCCE YTPDNLKELLEKDYYKLYELIWKRTVACQMENVIMDLVVASLASENKEYLAKATGSTITF ECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCEEEE DGFYKIYRESVDDEDEEENKMLPPLKEQESLKTKEIIPNQHFTEPPPRYSEASLVKKLEE HHHHHHHHHHCCCCCCHHCCCCCCCCHHHCCHHHHCCCCCCCCCCCCCCCHHHHHHHHHH LGIGRPSTYASILSVLQDRKYVTLEKKRFIPEELGRLVTVFLVGFFRKYVEYDFTAGLEN HCCCCCHHHHHHHHHHHCCCEEEEHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH ELDEIAAGKLEWKAALNNFWSGFNNNIKSVNEQKITEIISYVQKALDYHLFGENKESKVC HHHHHHCCCHHHHHHHHHHHHHHCCCHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC PSCKTGELSLKLGKFGAFLACSNYPECTFRKSIVSGNDNNENEAELSAIPNGNKILGTDK CCCCCCCEEEEECCCCEEEEECCCCCCHHHHHHHCCCCCCCCCCCEEECCCCCEEECCCC DGIEIYLKKGPYGPYIQHGEQEGKVKPKRSPIPAILNQNDITLEIALKLLSLPLKIGIHK CCEEEEEECCCCCCHHCCCCCCCCCCCCCCCCCHHCCCCCCHHHHHHHHHHCCHHEEEEC DSGEEIMIGYGKFGPYIKYMSKFISIPKKYDFLNLSLDDAMKLIEDNNAKLETTQG CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEECCHHHHHHHHHCCCCEEEECCC >Mature Secondary Structure MKLVIVESPAKAKTINKYLGDEFKVIASFGHIRDLPSKKGSVLPDENFAMKYDISDKASK CEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCHHHHH YVDAIVKDAKKADSVYLATDPDREGESISWHVAEVIKEKNKVKSDDFFKRVAFNEITKKA HHHHHHHHHHHCCCEEEEECCCCCCCCEEHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH IIHAVENPRKLDTNLVNAQQARRALDYLVGFTLSPLLWRKLPGCKSAGRVQSVALRLICE 
HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHC REDKIERFKSEEYWDISIKIQSSNNELFTAQLTHVNDQKLEKFSIINAKDAKDLTEKLKS CHHHHHHHCCCCEEEEEEEEECCCCEEEEEEEECCCHHHHHHHEECCCCCHHHHHHHHHH QKFHVDKIEKKQQKRAPQPPFITSSLQQEAARKLGFSAKKTMQIAQKLYEGVDIGKETIG CCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHH LITYMRTDGVTLSNDAIADIRKLIDTSYGNKYLPSSPRIYKSKVKNAQEAHEAIRPTNIT HHEEEECCCEEECCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCHHHHHHHHCCCCCE YTPDNLKELLEKDYYKLYELIWKRTVACQMENVIMDLVVASLASENKEYLAKATGSTITF ECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCEEEE DGFYKIYRESVDDEDEEENKMLPPLKEQESLKTKEIIPNQHFTEPPPRYSEASLVKKLEE HHHHHHHHHHCCCCCCHHCCCCCCCCHHHCCHHHHCCCCCCCCCCCCCCCHHHHHHHHHH LGIGRPSTYASILSVLQDRKYVTLEKKRFIPEELGRLVTVFLVGFFRKYVEYDFTAGLEN HCCCCCHHHHHHHHHHHCCCEEEEHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH ELDEIAAGKLEWKAALNNFWSGFNNNIKSVNEQKITEIISYVQKALDYHLFGENKESKVC HHHHHHCCCHHHHHHHHHHHHHHCCCHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC PSCKTGELSLKLGKFGAFLACSNYPECTFRKSIVSGNDNNENEAELSAIPNGNKILGTDK CCCCCCCEEEEECCCCEEEEECCCCCCHHHHHHHCCCCCCCCCCCEEECCCCCEEECCCC DGIEIYLKKGPYGPYIQHGEQEGKVKPKRSPIPAILNQNDITLEIALKLLSLPLKIGIHK CCEEEEEECCCCCCHHCCCCCCCCCCCCCCCCCHHCCCCCCHHHHHHHHHHCCHHEEEEC DSGEEIMIGYGKFGPYIKYMSKFISIPKKYDFLNLSLDDAMKLIEDNNAKLETTQG CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEECCHHHHHHHHHCCCCEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11557893 [H]