Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is topA [H]
Identifier: 15888629
GI number: 15888629
Start: 1294043
End: 1296721
Strand: Reverse
Name: topA [H]
Synonym: Atu1304
Alternate gene names: 15888629
Gene position: 1296721-1294043 (Counterclockwise)
Preceding gene: 15888630
Following gene: 15888628
Centisome position: 45.63
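The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that the centisome position is the gene's first transcribed base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the replicon length; the record does not state this definition, it is inferred from the numbers. All function and variable names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_841_580          # Length field of NC_003062
START, END = 1_294_043, 1_296_721  # Start / End fields
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature given 1-based coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the replicon.

    Assumption: for a reverse-strand gene the first base is the higher
    coordinate, which is what 'Gene position: 1296721-1294043' suggests.
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length


length_bp = gene_length(START, END)
residues = length_bp // 3 - 1      # one codon is the stop codon
position = round(centisome(START, END, STRAND, GENOME_LENGTH), 2)

print(length_bp, residues, position)
```

Under these assumptions the result agrees with the 2679-base gene sequence, the 892-residue protein, and the listed centisome position.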
GC content: 60.13
Gene sequence:
>2679_bases ATGAATGTCGTTGTCGTAGAATCTCCTGCCAAAGCCAAGACGATCAACAAGTATCTTGGTTCAGGCTATAAGGTTCTTGC ATCCTTTGGTCACGTCAGAGACCTTCCCGCCAAGGACGGATCGGTGCTGCCCGACCAGGATTTCGAAATGTCCTGGGAAG TCGACAGCGCATCTGCCAAACGCATGAAGGATATTGCCGATGCGGTAAAATCCTCTGATGGCCTTTTTCTCGCAACCGAC CCTGATCGCGAAGGTGAGGCCATTTCCTGGCACGTTCTCGATCTTTTGAAGAAGAAGCGCGTTCTCGGCGACAAGCCGGT GAAGCGCGTGGTCTTCAACGCCATCACCAAGAAGGCGGTTCTGGACGCCATGGCGAACCCGCGCGACATCGACGTGCCGC TGGTCGACGCCTATCTCGCCCGCCGTGCACTCGATTATCTCGTCGGTTTCAACCTCTCTCCCGTTCTGTGGCGCAAGCTG CCGGGCGCGCGTTCCGCCGGCCGTGTGCAGTCGGTTGCGCTGCGTCTTGTCTGCGACCGTGAATCCGAGATCGAGCGTTT CGTCTCCGAAGAATATTGGAACATCAGCGCGCTTTTGAAGACACCGCGCGGTGACGAGTTCGAGGCGAAGCTGGTTTCGG CCGACGGCAAGCGGCTGCAAAGCCGCGGGATCAAGACCGGCGAAGATGCAAACCGGCTGAAGGCGCTGCTGGAAGGCGCG ACCTATGTTGTCGACACGGTCGAGGCGAAGCCCGTCAAGCGTAATCCCGGCCCGCCCTTCACCACCTCGACGCTTCAGCA GGCTGCGTCTTCGCGCATGGGCTTTGGTGCCTCGCGGACGATGCAGGTGGCGCAGAAGCTTTATGAAGGCATCGATATCG GCGGAGAAACCGTCGGCCTTATCACCTATATGCGTACCGACGGCGTACAGATAGCACCGGAGGCGATCGATGCCGCCCGC CTCGCCATCGGCGAGCAGTTCGGTGAGCGTTACGTGCCGGAAAAGGCGCGTTTCTACTCCACCAAGGCCAAGAACGCCCA GGAAGCGCACGAAGCGATCCGCCCGACTGATTTCACGCGCACACCGGACCAGGTGAAGCGTTATCTCGATGCCGACCAGC TGCGCCTTTACGACCTGATCTGGAAGCGCGGTATTGCCAGCCAGATGGCGTCCGCAGAGATCGAGCGAACAACGGTCGAA ATCCTCGCGGACAAGAACGGTGAAAAGGCCGGCCTGCGCGCCGTCGGTTCCGTCATCCGGTTCGACGGCTTCATTGCCGC CTATACCGACCAGAAGGAAGATGGCGAGCAGAGCGACGATGGTGACGATGAAGGCCGTCTGCCGCAGATCAATGCGCGCG AAAATCTCGCCAAGCAGAAGATCAATGCCAGCCAGCATTTCACCGAACCGCCGCCGCGTTATTCGGAAGCCTCGCTGATC AAGAAGATGGAAGAGCTCGGCATCGGCCGCCCTTCCACCTATGCCGCGACGCTGAAGACGCTGAGCGACCGCGAATATAT CGTCATAGACAAGCGCAAGCTTATCCCGCATTCACGCGGACGACTGGTGACGGCTTTCCTGGAGAGCTTTTTCACCAAAT ATGTCGAATACGATTTCACCGCAGCACTCGAAGAAAAGCTCGACCGCATTTCCGCCGGCGAGCTGGACTGGAAGCAGGTG CTGCGCGATTTCTGGAAGGATTTCTTCGCGCAGATCGAAGATACCAAGGAACTGCGCGTCACCAACGTGCTCGACGCGTT GAACGAGGTTCTGGCACCGCTGGTCTTCCCCAAGCGCGAGGATGGATCGGATCCGCGCATCTGTCAGGTTTGCGGCACCG GCAATCTTTCGCTGAAGCTCGGCAAATACGGCGCTTTCGTCGGTTGCTCCAACTATCCGGAATGCAATTACACCCGCCAG 
CTCACCTCCGATGGTGCGGAAGCGGATGCTGCGGCCTCTAACGAGCCGAAAGCACTCGGTGCCGATCCGATGACCGGCGA GGAGCTCACGCTTCGCTCCGGCCGTTTCGGACCCTATATCCAGCGCGGCGACGGCAAGGAAGCCAAGCGTTCATCGCTGC CCAAGGGTTGGAAGCCTGAGGATATCGACCACGAAAAGGCGCTGGCGCTCATCAACCTGCCGCGCGATATCGGCAAGCAT CCGGAAACTGGCAAAATGATCTCTGCCGGTCTCGGCCGTTACGGACCCTTCCTTCTGCATGACGGTTCCTATGCGAACCT CGAAAGCATCGAAGACGTGTTCTCGATCGGCCTCAACCGCGCCGTGACCGTCATTGCCGAAAAGCAAGCCAAGGGACCGG GCCGCAGCGGCACGCCGGCGGCGCTGAAAGAACTGGGCGACCATCCCGATGGCGGTGCCATCACCGTCCGCGACGGGCGT TATGGCGCTTACGTCAACTGGGGCAAGGTCAATGCCACCATTCCGAAGGGGCAGGACCCGGCCTCAGTGACTCTGGACGA GTCGCTGGTGCTGATCGCCGAGCGCATCGCCAAGACCGGCACCGGCGGCAAGCCCGCCAAGGCCAAGAAGACCACCGCTA AGAAGGCCGATGGTGAGGCAGCGGCAAAGCCGAAAGCCACCAAGGCCAAGGCGGCAACCAAGAGCAAGGCGGCCGCCAAG CCGAAGGCAGCGGCCAAAACCAAGAAGGCAGCAGAGTGA
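The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is stored as in this record: a `>..._bases` header token followed by blocks of bases separated by whitespace. The helper names `clean_sequence` and `gc_content` are hypothetical, and the short example string is only the first twelve bases of the gene, not the full sequence.

```python
def clean_sequence(raw: str) -> str:
    """Drop a leading '>header' token and all whitespace from a sequence field."""
    tokens = raw.split()
    if tokens and tokens[0].startswith(">"):
        tokens = tokens[1:]
    return "".join(tokens).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input in the record's layout: the first 12 bases of the gene.
example = ">12_bases ATGAAT GTCGTT"
seq = clean_sequence(example)
print(seq, round(gc_content(seq), 2))
```

Passing the full gene sequence field to `clean_sequence` and then to `gc_content` should give a value comparable to the GC content listed in the record.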
Upstream 100 bases:
>100_bases GCGGGCATTACCTGTCCATGTCGAATTATAAAGAAGGCCAATTGCGGCACCGCGCCGCGACCGGGGCCACGTTAACGGAC CAGTTCCAGAGAAGAACAAT
Downstream 100 bases:
>100_bases GCAAGGCGCCACGTCAGAGACAGGGTTCCGCCAGTGACGGTTTCGGACGCACCGGGCGCGGCAAGCGGGTGAGCGAAGGT GAAGGCATCATCCATGGCGA
Product: DNA topoisomerase I
Products: NA
Alternate protein names: DNA topoisomerase I; Omega-protein; Relaxing enzyme; Swivelase; Untwisting enzyme [H]
Number of amino acids: Translated: 892; Mature: 892
Protein sequence:
>892_residues MNVVVVESPAKAKTINKYLGSGYKVLASFGHVRDLPAKDGSVLPDQDFEMSWEVDSASAKRMKDIADAVKSSDGLFLATD PDREGEAISWHVLDLLKKKRVLGDKPVKRVVFNAITKKAVLDAMANPRDIDVPLVDAYLARRALDYLVGFNLSPVLWRKL PGARSAGRVQSVALRLVCDRESEIERFVSEEYWNISALLKTPRGDEFEAKLVSADGKRLQSRGIKTGEDANRLKALLEGA TYVVDTVEAKPVKRNPGPPFTTSTLQQAASSRMGFGASRTMQVAQKLYEGIDIGGETVGLITYMRTDGVQIAPEAIDAAR LAIGEQFGERYVPEKARFYSTKAKNAQEAHEAIRPTDFTRTPDQVKRYLDADQLRLYDLIWKRGIASQMASAEIERTTVE ILADKNGEKAGLRAVGSVIRFDGFIAAYTDQKEDGEQSDDGDDEGRLPQINARENLAKQKINASQHFTEPPPRYSEASLI KKMEELGIGRPSTYAATLKTLSDREYIVIDKRKLIPHSRGRLVTAFLESFFTKYVEYDFTAALEEKLDRISAGELDWKQV LRDFWKDFFAQIEDTKELRVTNVLDALNEVLAPLVFPKREDGSDPRICQVCGTGNLSLKLGKYGAFVGCSNYPECNYTRQ LTSDGAEADAAASNEPKALGADPMTGEELTLRSGRFGPYIQRGDGKEAKRSSLPKGWKPEDIDHEKALALINLPRDIGKH PETGKMISAGLGRYGPFLLHDGSYANLESIEDVFSIGLNRAVTVIAEKQAKGPGRSGTPAALKELGDHPDGGAITVRDGR YGAYVNWGKVNATIPKGQDPASVTLDESLVLIAERIAKTGTGGKPAKAKKTTAKKADGEAAAKPKATKAKAATKSKAAAK PKAAAKTKKAAE
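The protein sequence corresponds to a direct translation of the gene sequence, which the record already gives in coding orientation (it begins with ATG even though the gene lies on the reverse strand). The sketch below illustrates this with the standard genetic code, whose codon assignments are shared with the bacterial translation table; the `translate` helper is a hypothetical name, and the test inputs are the first 36 and last 9 bases of the gene sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGAATGTCGTTGTCGTAGAATCTCCTGCCAAAGCC"  # start of the gene
last_codons = "GCAGAGTGA"                              # end of the gene, with stop
print(translate(first_codons), translate(last_codons))
```

The two fragments translate to the beginning (`MNVVVVESPAKA`) and the end (`AE`) of the protein sequence shown above.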
Sequences:
>Translated_892_residues MNVVVVESPAKAKTINKYLGSGYKVLASFGHVRDLPAKDGSVLPDQDFEMSWEVDSASAKRMKDIADAVKSSDGLFLATD PDREGEAISWHVLDLLKKKRVLGDKPVKRVVFNAITKKAVLDAMANPRDIDVPLVDAYLARRALDYLVGFNLSPVLWRKL PGARSAGRVQSVALRLVCDRESEIERFVSEEYWNISALLKTPRGDEFEAKLVSADGKRLQSRGIKTGEDANRLKALLEGA TYVVDTVEAKPVKRNPGPPFTTSTLQQAASSRMGFGASRTMQVAQKLYEGIDIGGETVGLITYMRTDGVQIAPEAIDAAR LAIGEQFGERYVPEKARFYSTKAKNAQEAHEAIRPTDFTRTPDQVKRYLDADQLRLYDLIWKRGIASQMASAEIERTTVE ILADKNGEKAGLRAVGSVIRFDGFIAAYTDQKEDGEQSDDGDDEGRLPQINARENLAKQKINASQHFTEPPPRYSEASLI KKMEELGIGRPSTYAATLKTLSDREYIVIDKRKLIPHSRGRLVTAFLESFFTKYVEYDFTAALEEKLDRISAGELDWKQV LRDFWKDFFAQIEDTKELRVTNVLDALNEVLAPLVFPKREDGSDPRICQVCGTGNLSLKLGKYGAFVGCSNYPECNYTRQ LTSDGAEADAAASNEPKALGADPMTGEELTLRSGRFGPYIQRGDGKEAKRSSLPKGWKPEDIDHEKALALINLPRDIGKH PETGKMISAGLGRYGPFLLHDGSYANLESIEDVFSIGLNRAVTVIAEKQAKGPGRSGTPAALKELGDHPDGGAITVRDGR YGAYVNWGKVNATIPKGQDPASVTLDESLVLIAERIAKTGTGGKPAKAKKTTAKKADGEAAAKPKATKAKAATKSKAAAK PKAAAKTKKAAE >Mature_892_residues MNVVVVESPAKAKTINKYLGSGYKVLASFGHVRDLPAKDGSVLPDQDFEMSWEVDSASAKRMKDIADAVKSSDGLFLATD PDREGEAISWHVLDLLKKKRVLGDKPVKRVVFNAITKKAVLDAMANPRDIDVPLVDAYLARRALDYLVGFNLSPVLWRKL PGARSAGRVQSVALRLVCDRESEIERFVSEEYWNISALLKTPRGDEFEAKLVSADGKRLQSRGIKTGEDANRLKALLEGA TYVVDTVEAKPVKRNPGPPFTTSTLQQAASSRMGFGASRTMQVAQKLYEGIDIGGETVGLITYMRTDGVQIAPEAIDAAR LAIGEQFGERYVPEKARFYSTKAKNAQEAHEAIRPTDFTRTPDQVKRYLDADQLRLYDLIWKRGIASQMASAEIERTTVE ILADKNGEKAGLRAVGSVIRFDGFIAAYTDQKEDGEQSDDGDDEGRLPQINARENLAKQKINASQHFTEPPPRYSEASLI KKMEELGIGRPSTYAATLKTLSDREYIVIDKRKLIPHSRGRLVTAFLESFFTKYVEYDFTAALEEKLDRISAGELDWKQV LRDFWKDFFAQIEDTKELRVTNVLDALNEVLAPLVFPKREDGSDPRICQVCGTGNLSLKLGKYGAFVGCSNYPECNYTRQ LTSDGAEADAAASNEPKALGADPMTGEELTLRSGRFGPYIQRGDGKEAKRSSLPKGWKPEDIDHEKALALINLPRDIGKH PETGKMISAGLGRYGPFLLHDGSYANLESIEDVFSIGLNRAVTVIAEKQAKGPGRSGTPAALKELGDHPDGGAITVRDGR YGAYVNWGKVNATIPKGQDPASVTLDESLVLIAERIAKTGTGGKPAKAKKTTAKKADGEAAAKPKATKAKAATKSKAAAK PKAAAKTKKAAE
Specific function: The reaction catalyzed by topoisomerases leads to the conversion of one topological isomer of DNA to another [H]
COG id: COG0550
COG function: function code L; Topoisomerase IA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 Toprim domain [H]
Homologues:
- Organism=Homo sapiens, GI10835218, Length=648, Percent_Identity=26.2345679012346, Blast_Score=117, Evalue=4e-26
- Organism=Homo sapiens, GI4507635, Length=505, Percent_Identity=25.5445544554455, Blast_Score=107, Evalue=6e-23
- Organism=Escherichia coli, GI1787529, Length=661, Percent_Identity=38.124054462935, Blast_Score=438, Evalue=1e-124
- Organism=Escherichia coli, GI1788061, Length=650, Percent_Identity=23.5384615384615, Blast_Score=113, Evalue=6e-26
- Organism=Caenorhabditis elegans, GI32563869, Length=581, Percent_Identity=25.8175559380379, Blast_Score=114, Evalue=2e-25
- Organism=Caenorhabditis elegans, GI17555378, Length=522, Percent_Identity=26.2452107279693, Blast_Score=104, Evalue=3e-22
- Organism=Saccharomyces cerevisiae, GI6323263, Length=576, Percent_Identity=23.4375, Blast_Score=89, Evalue=5e-18
- Organism=Drosophila melanogaster, GI24585251, Length=611, Percent_Identity=26.6775777414075, Blast_Score=135, Evalue=1e-31
- Organism=Drosophila melanogaster, GI24640096, Length=597, Percent_Identity=24.4556113902848, Blast_Score=109, Evalue=7e-24
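The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records for sorting or filtering. This is a minimal sketch, assuming every entry carries the same six fields in the same order as above; the `Hit` type and `parse_hits` function are hypothetical names, and the sample text is two entries copied from the list.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One homologue entry; whitespace (including line breaks) between fields is tolerated.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract all homologue entries found in a block of record text."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]


SAMPLE = (
    "Organism=Homo sapiens, GI10835218, Length=648, "
    "Percent_Identity=26.2345679012346, Blast_Score=117, Evalue=4e-26, "
    "Organism=Escherichia coli, GI1787529, Length=661, "
    "Percent_Identity=38.124054462935, Blast_Score=438, Evalue=1e-124,"
)

hits = parse_hits(SAMPLE)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi, best.blast_score)
```

Sorting by `blast_score` (or by ascending `evalue`) places the Escherichia coli entry GI1787529 first among the listed homologues.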
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003601
- InterPro: IPR013497
- InterPro: IPR013824
- InterPro: IPR013825
- InterPro: IPR000380
- InterPro: IPR003602
- InterPro: IPR013498
- InterPro: IPR005733
- InterPro: IPR006171 [H]
Pfam domain/function: PF01131 Topoisom_bac; PF01751 Toprim; PF01396 zf-C4_Topoisom [H]
EC number: 5.99.1.2 [H]
Molecular weight: Translated: 97502; Mature: 97502
Theoretical pI: Translated: 9.22; Mature: 9.22
Prosite motif: PS00396 TOPOISOMERASE_I_PROK
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
1.2 %Met (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
1.2 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
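The Cys/Met figures are simple residue frequencies. As a rough sketch, assuming each percentage is the count of that residue divided by the protein length, they could be recomputed as shown below; `residue_percent` is a hypothetical helper and the ten-residue input is a toy sequence, not taken from the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return 100.0 * hits / len(protein)


toy = "MACDMKLAAA"  # toy sequence: 2 Met and 1 Cys in 10 residues
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
both = residue_percent(toy, "CM")
print(f"{cys:.1f} %Cys, {met:.1f} %Met, {both:.1f} %Cys+Met")
```

Applied to the 892-residue protein sequence and rounded to one decimal place, the same function should give values comparable to those listed above.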
Secondary structure:
>Translated Secondary Structure MNVVVVESPAKAKTINKYLGSGYKVLASFGHVRDLPAKDGSVLPDQDFEMSWEVDSASAK CCEEEECCCHHHHHHHHHHCCCHHHHHHHCCHHCCCCCCCCCCCCCCCCEEEECCCHHHH RMKDIADAVKSSDGLFLATDPDREGEAISWHVLDLLKKKRVLGDKPVKRVVFNAITKKAV HHHHHHHHHHCCCCEEEEECCCCCCCEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH LDAMANPRDIDVPLVDAYLARRALDYLVGFNLSPVLWRKLPGARSAGRVQSVALRLVCDR HHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCCCHHHHHHHHHHCCC ESEIERFVSEEYWNISALLKTPRGDEFEAKLVSADGKRLQSRGIKTGEDANRLKALLEGA HHHHHHHHHHHHCCEEHEEECCCCCCCHHEEECCCHHHHHHCCCCCCCHHHHHHHHHCCC TYVVDTVEAKPVKRNPGPPFTTSTLQQAASSRMGFGASRTMQVAQKLYEGIDIGGETVGL CEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEE ITYMRTDGVQIAPEAIDAARLAIGEQFGERYVPEKARFYSTKAKNAQEAHEAIRPTDFTR EEEEECCCCEECHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHCCCCCCCC TPDQVKRYLDADQLRLYDLIWKRGIASQMASAEIERTTVEILADKNGEKAGLRAVGSVIR CHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHH FDGFIAAYTDQKEDGEQSDDGDDEGRLPQINARENLAKQKINASQHFTEPPPRYSEASLI HCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHCCCCCCCCCCHHHHH KKMEELGIGRPSTYAATLKTLSDREYIVIDKRKLIPHSRGRLVTAFLESFFTKYVEYDFT HHHHHHCCCCCCHHHHHHHHCCCCCEEEEECHHCCCCCCCCHHHHHHHHHHHHHHHHHHH AALEEKLDRISAGELDWKQVLRDFWKDFFAQIEDTKELRVTNVLDALNEVLAPLVFPKRE HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCC DGSDPRICQVCGTGNLSLKLGKYGAFVGCSNYPECNYTRQLTSDGAEADAAASNEPKALG CCCCCCEEEEECCCCEEEEECCCEEEEECCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCC ADPMTGEELTLRSGRFGPYIQRGDGKEAKRSSLPKGWKPEDIDHEKALALINLPRDIGKH CCCCCCCCEEECCCCCCCCEECCCCCHHHHHCCCCCCCCCCCCHHHHHEEEECCHHHCCC PETGKMISAGLGRYGPFLLHDGSYANLESIEDVFSIGLNRAVTVIAEKQAKGPGRSGTPA CCCCHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEEHHHCCCCCCCCCHH ALKELGDHPDGGAITVRDGRYGAYVNWGKVNATIPKGQDPASVTLDESLVLIAERIAKTG HHHHCCCCCCCCEEEEECCCEEEEEEECEEEEECCCCCCCCEEEECHHHHHHHHHHHHCC TGGKPAKAKKTTAKKADGEAAAKPKATKAKAATKSKAAAKPKAAAKTKKAAE CCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCC >Mature Secondary Structure MNVVVVESPAKAKTINKYLGSGYKVLASFGHVRDLPAKDGSVLPDQDFEMSWEVDSASAK CCEEEECCCHHHHHHHHHHCCCHHHHHHHCCHHCCCCCCCCCCCCCCCCEEEECCCHHHH 
RMKDIADAVKSSDGLFLATDPDREGEAISWHVLDLLKKKRVLGDKPVKRVVFNAITKKAV HHHHHHHHHHCCCCEEEEECCCCCCCEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH LDAMANPRDIDVPLVDAYLARRALDYLVGFNLSPVLWRKLPGARSAGRVQSVALRLVCDR HHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCCCHHHHHHHHHHCCC ESEIERFVSEEYWNISALLKTPRGDEFEAKLVSADGKRLQSRGIKTGEDANRLKALLEGA HHHHHHHHHHHHCCEEHEEECCCCCCCHHEEECCCHHHHHHCCCCCCCHHHHHHHHHCCC TYVVDTVEAKPVKRNPGPPFTTSTLQQAASSRMGFGASRTMQVAQKLYEGIDIGGETVGL CEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEE ITYMRTDGVQIAPEAIDAARLAIGEQFGERYVPEKARFYSTKAKNAQEAHEAIRPTDFTR EEEEECCCCEECHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHCCCCCCCC TPDQVKRYLDADQLRLYDLIWKRGIASQMASAEIERTTVEILADKNGEKAGLRAVGSVIR CHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHH FDGFIAAYTDQKEDGEQSDDGDDEGRLPQINARENLAKQKINASQHFTEPPPRYSEASLI HCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHCCCCCCCCCCHHHHH KKMEELGIGRPSTYAATLKTLSDREYIVIDKRKLIPHSRGRLVTAFLESFFTKYVEYDFT HHHHHHCCCCCCHHHHHHHHCCCCCEEEEECHHCCCCCCCCHHHHHHHHHHHHHHHHHHH AALEEKLDRISAGELDWKQVLRDFWKDFFAQIEDTKELRVTNVLDALNEVLAPLVFPKRE HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCC DGSDPRICQVCGTGNLSLKLGKYGAFVGCSNYPECNYTRQLTSDGAEADAAASNEPKALG CCCCCCEEEEECCCCEEEEECCCEEEEECCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCC ADPMTGEELTLRSGRFGPYIQRGDGKEAKRSSLPKGWKPEDIDHEKALALINLPRDIGKH CCCCCCCCEEECCCCCCCCEECCCCCHHHHHCCCCCCCCCCCCHHHHHEEEECCHHHCCC PETGKMISAGLGRYGPFLLHDGSYANLESIEDVFSIGLNRAVTVIAEKQAKGPGRSGTPA CCCCHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEEHHHCCCCCCCCCHH ALKELGDHPDGGAITVRDGRYGAYVNWGKVNATIPKGQDPASVTLDESLVLIAERIAKTG HHHHCCCCCCCCEEEEECCCEEEEEEECEEEEECCCCCCCCEEEECHHHHHHHHHHHHCC TGGKPAKAKKTTAKKADGEAAAKPKATKAKAATKSKAAAKPKAAAKTKKAAE CCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA