Definition | Bacillus anthracis str. Sterne chromosome, complete genome. |
---|---|
Accession | NC_005945 |
Length | 5,228,663 |
The map label for this gene is topB
Identifier: 49183389
GI number: 49183389
Start: 391511
End: 393700
Strand: Reverse
Name: topB
Synonym: BAS0361
Alternate gene names: 49183389
Gene position: 393700-391511 (Counterclockwise)
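Because the gene lies on the reverse strand, its coding sequence reads from position 393700 down to 391511, i.e. it is the reverse complement of the forward-strand interval. A minimal sketch of that extraction follows; the function names and the toy genome string are illustrative assumptions, not part of the source database:

```python
# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def extract_gene(genome: str, start: int, end: int, strand: str) -> str:
    """Extract a gene given inclusive 1-based coordinates and a strand label."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "Reverse" else region

# Hypothetical 9-base genome, used only to illustrate the orientation.
toy_genome = "AAACATTTT"
print(extract_gene(toy_genome, 4, 6, "Reverse"))  # ATG
```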
Preceding gene: 49183392
Following gene: 49183383
Centisome position: 7.53
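The coordinate fields above are related by simple arithmetic. As a minimal sketch (the function names are illustrative assumptions), the gene length follows from the inclusive Start and End coordinates, and the listed centisome value appears consistent with the gene's 5' end (393700 on the reverse strand) expressed as a percentage of the genome length:

```python
GENOME_LENGTH = 5_228_663        # Length field of NC_005945
START, END = 391_511, 393_700    # Start and End fields for topB

def gene_length(start: int, end: int) -> int:
    """Length in bases for inclusive 1-based coordinates."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)

length = gene_length(START, END)
print(length)                         # 2190
print(length // 3 - 1)                # 729 codons, excluding the stop codon
print(centisome(END, GENOME_LENGTH))  # 7.53
```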
GC content: 36.58
Gene sequence:
>2190_bases ATGGCAAAAAGCGTTGTAATCGCTGAAAAACCTTCCGTTGCACGAGATATTGCGCGTGTACTGAAGTGCGACAAAAAAGG AAACGGCTACCTTGAAGGTAGTAAATATATTGTAACTTGGGCTTTAGGTCATCTTGTTACATTAGCTGATCCAGAAAGCT ATGATGTGAAATATAAAAAGTGGAATTTAGAAGATTTACCGATGCTACCTGAGCGTTTGAAATTAACTGTTATTAAACAA ACAGGGAAACAATTTAATGCTGTGAAGAGTCAGCTTCTTAGAAAAGATGTAAATGAAATCATTGTAGCGACAGACGCTGG ACGTGAAGGAGAATTGGTTGCTCGTTGGATTATTGATAAGGTTCGAATTAACAAACCAATTAAACGTCTATGGATTTCAT CTGTTACTGATAAAGCAATTAAAGATGGTTTCGCAAATTTAAAACCAGGTAAAGCATATGACAATTTATACGCTTCAGCT GTCGCACGTTCAGAAGCTGACTGGTATATCGGTCTTAACGCGACTCGAGCTTTAACGACTCGCTTTAATGCCCAGCTTAA CTGTGGCCGTGTGCAAACACCTACTGTTGCTATGATCGCTAATCGTGAAGATGAAATAAAGAACTTCAAAGCTCAAACTT ACTACGGCATTGAAGCTCAAACAACAAATCAATTAAAACTAACGTGGCAAGATGCAAATGGCAATAGCCGCAGTTTTAAT AAAGAAAAAATTGATGGTATTGTAAAAGGTTTAGATAAACATAATGCTACTGTTTTAGAAATTGATAAAAAACAGAAGAA GTCATTCTCTCCTGGTCTTTACGATTTAACTGAATTGCAACGTGATGCCAATAAAAAGTTCGGTTACTCTGCGAAAGAAA CATTGAATATTATGCAGAAATTGTATGAACAACATAAAGTGCTAACATACCCTCGTACAGATTCGCGTTACATTTCATCT GATATCGTTGGAACACTTCCAGAACGTCTAAAGGCGTGCGGCGTTGGGGAATATCGTCCTTTAGCACATAAAGTATTACA AAAGCCTATCAAGGCTAATAAATCATTTGTTGATGATAGTAAAGTAAGCGATCACCACGCAATTATTCCGACAGAAGGAT ACGTTAACTTCTCAGCCTTCACAGATAAAGAACGTAAAATTTATGATTTAGTTGTCAAACGTTTCTTAGCTGTTTTATTC CCAGCATTCGAATACGAACAACTAACGTTACGCACAAAGGTCGGCAATGAAACATTCATTGCACGCGGAAAGACAATTTT ACATGCCGGTTGGAAAGAAGTATATGAAAATCGCTTTGAAGATGATGATGTAACTGATGACGTAAAAGAGCAACTTTTAC CTCGCATTGAAAAAGGCGATACATTAACTGTAAAGTTAATTATGCAAACATCAGGTCAAACGAAAGCACCTGCACGTTTT AACGAAGCGACTTTACTTTCAGCAATGGAAAATCCTACAAAATATATGGATACGCAAAATAAACAACTTGCTGATACGTT AAAATCAACTGGTGGATTAGGTACTGTGGCAACACGAGCTGATATTATCGACAAACTATTCAATTCATTCTTAATTGAAA AACGCGGGAAAGATATTCACATTACTTCAAAAGGCCGTCAGTTACTTGATTTAGTACCAGAAGAGTTAAAATCACCTACA CTAACTGGTGAGTGGGAACAAAAACTAGAGGCAATTGCAAAAGGTAAACTGAAAAAAGAAGTATTCATTTCCGAAATGAA GAACTATACGAAAGAAATTGTTTCTGAAATTAAATCGAGTGATAAAAAATATAAACATGACAACATTTCAACAAAGTCTT GTCCAGATTGCGGGAAACCAATGCTAGAGGTAAACGGCAAAAAAGGAAAAATGCTCGTTTGCCAAGACCGTGAATGTGGT 
CATCGTAAAAATGTATCTCGTACAACAAACGCTCGTTGCCCTCAGTGTAAGAAGAAGTTAGAATTACGTGGTGAAGGTGC AGGACAAATCTTTGCATGTAAATGTGGCTATCGCGAGAAATTATCCACATTCCAAGAAAGACGTAAAAAGGAATCTGGAA ACAAAGCTGATAAGCGTGACGTTCAAAAATATATGAAACAGCAGAAAAAAGAGGAAGAACCATTAAATAACCCATTCGCA GAAGCATTAAAGAAATTAAAATTTGATTAA
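A GC content figure such as the one listed above is the percentage of G and C bases in the sequence. The sketch below shows one way to compute it; the function name is an assumption, and the example input is only the first 12 bases of the gene sequence, so it does not reproduce the full-gene value:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Ignore whitespace and any non-ACGT characters (e.g. line breaks).
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(b in "GC" for b in bases)
    return round(100.0 * gc / len(bases), 2)

print(gc_content("ATGGCAAAAAGC"))  # 41.67
```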
Upstream 100 bases:
>100_bases GTTCATTCCACTAAAAACCGTTCAAATATGGAGCGAAACTGAATCTCAAGAAGATGTATGCTATAGTTAACGTATGGAAT TTTGAAAGGATGTTTTTAAT
Downstream 100 bases:
>100_bases ACAAAAAGGTTCCTACCTCTATCGGTAGGAACCTTTTATGTTAGTGAATGATTTTACTTCCTTTTGCATTCTCAACGAAT TTTTCACTTGATTTGTTAAT
Product: DNA topoisomerase III
Products: NA
Alternate protein names: DNA topoisomerase III
Number of amino acids: Translated: 729; Mature: 728
Protein sequence:
>729_residues MAKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLADPESYDVKYKKWNLEDLPMLPERLKLTVIKQ TGKQFNAVKSQLLRKDVNEIIVATDAGREGELVARWIIDKVRINKPIKRLWISSVTDKAIKDGFANLKPGKAYDNLYASA VARSEADWYIGLNATRALTTRFNAQLNCGRVQTPTVAMIANREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFN KEKIDGIVKGLDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQKLYEQHKVLTYPRTDSRYISS DIVGTLPERLKACGVGEYRPLAHKVLQKPIKANKSFVDDSKVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLF PAFEYEQLTLRTKVGNETFIARGKTILHAGWKEVYENRFEDDDVTDDVKEQLLPRIEKGDTLTVKLIMQTSGQTKAPARF NEATLLSAMENPTKYMDTQNKQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIHITSKGRQLLDLVPEELKSPT LTGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSSDKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECG HRKNVSRTTNARCPQCKKKLELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRDVQKYMKQQKKEEEPLNNPFA EALKKLKFD
Sequences:
>Translated_729_residues MAKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLADPESYDVKYKKWNLEDLPMLPERLKLTVIKQ TGKQFNAVKSQLLRKDVNEIIVATDAGREGELVARWIIDKVRINKPIKRLWISSVTDKAIKDGFANLKPGKAYDNLYASA VARSEADWYIGLNATRALTTRFNAQLNCGRVQTPTVAMIANREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFN KEKIDGIVKGLDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQKLYEQHKVLTYPRTDSRYISS DIVGTLPERLKACGVGEYRPLAHKVLQKPIKANKSFVDDSKVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLF PAFEYEQLTLRTKVGNETFIARGKTILHAGWKEVYENRFEDDDVTDDVKEQLLPRIEKGDTLTVKLIMQTSGQTKAPARF NEATLLSAMENPTKYMDTQNKQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIHITSKGRQLLDLVPEELKSPT LTGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSSDKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECG HRKNVSRTTNARCPQCKKKLELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRDVQKYMKQQKKEEEPLNNPFA EALKKLKFD >Mature_728_residues AKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLADPESYDVKYKKWNLEDLPMLPERLKLTVIKQT GKQFNAVKSQLLRKDVNEIIVATDAGREGELVARWIIDKVRINKPIKRLWISSVTDKAIKDGFANLKPGKAYDNLYASAV ARSEADWYIGLNATRALTTRFNAQLNCGRVQTPTVAMIANREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFNK EKIDGIVKGLDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQKLYEQHKVLTYPRTDSRYISSD IVGTLPERLKACGVGEYRPLAHKVLQKPIKANKSFVDDSKVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLFP AFEYEQLTLRTKVGNETFIARGKTILHAGWKEVYENRFEDDDVTDDVKEQLLPRIEKGDTLTVKLIMQTSGQTKAPARFN EATLLSAMENPTKYMDTQNKQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIHITSKGRQLLDLVPEELKSPTL TGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSSDKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECGH RKNVSRTTNARCPQCKKKLELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRDVQKYMKQQKKEEEPLNNPFAE ALKKLKFD
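The translated and mature sequences above differ only by the N-terminal methionine (729 versus 728 residues). Assuming that initiator-Met removal is the only processing step, which is what the two listed sequences suggest, the relationship can be sketched as follows; the function name is hypothetical:

```python
def mature_sequence(translated: str) -> str:
    """Drop the initiator methionine, if present, from a translated protein."""
    return translated[1:] if translated.startswith("M") else translated

# First 20 residues of the translated sequence, used as a short example.
prefix = "MAKSVVIAEKPSVARDIARV"
print(mature_sequence(prefix))  # AKSVVIAEKPSVARDIARV
```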
Specific function: The reaction catalyzed by topoisomerases converts one topological isomer of DNA into another. TOP3 is a potent decatenase.
COG id: COG0550
COG function: function code L; Topoisomerase IA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 Toprim domain
Homologues:
Organism=Homo sapiens, GI4507635, Length=622, Percent_Identity=25.8842443729904, Blast_Score=139, Evalue=7e-33
Organism=Homo sapiens, GI10835218, Length=600, Percent_Identity=24.3333333333333, Blast_Score=135, Evalue=1e-31
Organism=Escherichia coli, GI1788061, Length=620, Percent_Identity=32.4193548387097, Blast_Score=336, Evalue=3e-93
Organism=Escherichia coli, GI1787529, Length=669, Percent_Identity=26.0089686098655, Blast_Score=162, Evalue=8e-41
Organism=Caenorhabditis elegans, GI32563869, Length=747, Percent_Identity=25.3012048192771, Blast_Score=145, Evalue=1e-34
Organism=Caenorhabditis elegans, GI17555378, Length=642, Percent_Identity=25.3894080996885, Blast_Score=115, Evalue=8e-26
Organism=Saccharomyces cerevisiae, GI6323263, Length=698, Percent_Identity=21.6332378223496, Blast_Score=80, Evalue=2e-15
Organism=Drosophila melanogaster, GI24585251, Length=599, Percent_Identity=26.0434056761269, Blast_Score=134, Evalue=2e-31
Organism=Drosophila melanogaster, GI24640096, Length=618, Percent_Identity=24.4336569579288, Blast_Score=119, Evalue=6e-27
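The homologue entries follow a regular `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be parsed into structured records. The sketch below is one possible approach; the regular expression, function name, and dictionary keys are assumptions, and the sample contains only two of the entries listed above:

```python
import re

# One homologue entry; fields are separated by ", " in the record.
ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\deE.+-]+)"
)

def parse_homologues(text: str) -> list[dict]:
    """Parse every homologue entry found in text into a dictionary."""
    hits = []
    for m in ENTRY.finditer(text):
        hits.append({
            "organism": m["organism"],
            "gi": int(m["gi"]),
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits

sample = (
    "Organism=Homo sapiens, GI4507635, Length=622, "
    "Percent_Identity=25.8842443729904, Blast_Score=139, Evalue=7e-33, "
    "Organism=Escherichia coli, GI1788061, Length=620, "
    "Percent_Identity=32.4193548387097, Blast_Score=336, Evalue=3e-93,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"])  # Escherichia coli 1788061
```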
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): TOP3_BACAH (A0R979)
Other databases:
- EMBL: CP000485
- RefSeq: YP_893279.1
- ProteinModelPortal: A0R979
- SMR: A0R979
- STRING: A0R979
- EnsemblBacteria: EBBACT00000070117
- GeneID: 4546423
- GenomeReviews: CP000485_GR
- KEGG: btl:BALH_0373
- eggNOG: COG0550
- GeneTree: EBGT00070000031993
- HOGENOM: HBG585507
- OMA: CPECHKK
- ProtClustDB: PRK07726
- BioCyc: BTHU412694:BALH_0373-MONOMER
- GO: GO:0005694
- InterPro: IPR003601
- InterPro: IPR013497
- InterPro: IPR013824
- InterPro: IPR013825
- InterPro: IPR000380
- InterPro: IPR003602
- InterPro: IPR005738
- InterPro: IPR006171
- Gene3D: G3DSA:1.10.460.10
- Gene3D: G3DSA:2.70.20.10
- PANTHER: PTHR11390
- PRINTS: PR00417
- SMART: SM00437
- SMART: SM00436
- SMART: SM00493
- TIGRFAMs: TIGR01056
Pfam domain/function: PF01131 Topoisom_bac; PF01751 Toprim; SSF56712 Topo_IA_core
EC number: 5.99.1.2
Molecular weight: Translated: 82813; Mature: 82682
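The two molecular weights above differ by roughly the mass of one methionine residue, which fits the mature protein lacking the initiator Met. A quick check, assuming an average methionine residue mass of about 131.19 Da (a standard textbook value, not stated in the record):

```python
# Average residue mass of methionine in Da (free Met minus one water).
MET_RESIDUE_MASS = 131.19

translated_mw = 82813  # Molecular weight field, translated protein
mature_mw = 82682      # Molecular weight field, mature protein

difference = translated_mw - mature_mw
print(difference)                                # 131
print(abs(difference - MET_RESIDUE_MASS) < 1.0)  # True
```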
Theoretical pI: Translated: 9.99; Mature: 9.99
Prosite motif: PS00396 TOPOISOMERASE_I_PROK
Important sites: ACT_SITE 310-310
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
3.0 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
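Residue percentages like those above are the count of a given amino acid divided by the protein length. The following sketch shows the calculation; the function name is an assumption, and the example uses only the first 30 residues of the translated sequence, so its output differs from the full-protein figures:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    if not protein:
        return 0.0
    count = sum(aa in residues for aa in protein)
    return round(100.0 * count / len(protein), 1)

# First 30 residues of the translated sequence (one Met, one Cys).
fragment = "MAKSVVIAEKPSVARDIARVLKCDKKGNGY"
print(residue_percent(fragment, "C"))   # 3.3
print(residue_percent(fragment, "M"))   # 3.3
print(residue_percent(fragment, "CM"))  # 6.7
```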
Secondary structure:
>Translated Secondary Structure MAKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLADPESYDVKYKK CCCCEEEECCCHHHHHHHHHHHCCCCCCCEECCCCEEEEEEHHHEEEECCCCCCCCEEEE WNLEDLPMLPERLKLTVIKQTGKQFNAVKSQLLRKDVNEIIVATDAGREGELVARWIIDK CCCCCCCCCCHHHEEHHHHHCCHHHHHHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHHH VRINKPIKRLWISSVTDKAIKDGFANLKPGKAYDNLYASAVARSEADWYIGLNATRALTT HHHCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCHHHHHH RFNAQLNCGRVQTPTVAMIANREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFN HHCCEECCCCCCCCEEEEEECCHHHHHCCCCCEEECCEECCCCEEEEEEECCCCCCCCCC KEKIDGIVKGLDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQK HHHHHHHHHHHHCCCCEEEEECCHHHCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHH LYEQHKVLTYPRTDSRYISSDIVGTLPERLKACGVGEYRPLAHKVLQKPIKANKSFVDDS HHHHCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCC KVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLFPAFEYEQLTLRTKVGNETFI CCCCCCEEECCCCCEEEHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCEEE ARGKTILHAGWKEVYENRFEDDDVTDDVKEQLLPRIEKGDTLTVKLIMQTSGQTKAPARF ECCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCEEEEEEEEECCCCCCCCCCC NEATLLSAMENPTKYMDTQNKQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIH CHHHHHHHHCCCHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEE ITSKGRQLLDLVPEELKSPTLTGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSS EECCCHHHHHHCHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHC DKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECGHRKNVSRTTNARCPQCKKKL CCHHCCCCCCCCCCCCCCCCCEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHH ELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRDVQKYMKQQKKEEEPLNNPFA HCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCHHH EALKKLKFD HHHHHHCCC >Mature Secondary Structure AKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLADPESYDVKYKK CCCEEEECCCHHHHHHHHHHHCCCCCCCEECCCCEEEEEEHHHEEEECCCCCCCCEEEE WNLEDLPMLPERLKLTVIKQTGKQFNAVKSQLLRKDVNEIIVATDAGREGELVARWIIDK CCCCCCCCCCHHHEEHHHHHCCHHHHHHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHHH VRINKPIKRLWISSVTDKAIKDGFANLKPGKAYDNLYASAVARSEADWYIGLNATRALTT HHHCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCHHHHHH RFNAQLNCGRVQTPTVAMIANREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFN 
HHCCEECCCCCCCCEEEEEECCHHHHHCCCCCEEECCEECCCCEEEEEEECCCCCCCCCC KEKIDGIVKGLDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQK HHHHHHHHHHHHCCCCEEEEECCHHHCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHH LYEQHKVLTYPRTDSRYISSDIVGTLPERLKACGVGEYRPLAHKVLQKPIKANKSFVDDS HHHHCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCC KVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLFPAFEYEQLTLRTKVGNETFI CCCCCCEEECCCCCEEEHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCEEE ARGKTILHAGWKEVYENRFEDDDVTDDVKEQLLPRIEKGDTLTVKLIMQTSGQTKAPARF ECCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCEEEEEEEEECCCCCCCCCCC NEATLLSAMENPTKYMDTQNKQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIH CHHHHHHHHCCCHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEE ITSKGRQLLDLVPEELKSPTLTGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSS EECCCHHHHHHCHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHC DKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECGHRKNVSRTTNARCPQCKKKL CCHHCCCCCCCCCCCCCCCCCEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHH ELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRDVQKYMKQQKKEEEPLNNPFA HCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCHHH EALKKLKFD HHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA