Definition Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession NC_008312
Length 7,750,108

The map label for this gene is clpB2 [H]

Identifier: 113474405

GI number: 113474405

Start: 855972

End: 858584

Strand: Reverse

Name: clpB2 [H]

Synonym: Tery_0543

Alternate gene names: 113474405

Gene position: 858584-855972 (Counterclockwise)

Preceding gene: 113474406

Following gene: 113474404

Centisome position: 11.08
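
The centisome value appears to be the gene's position expressed as a percentage of the chromosome length. The sketch below reproduces it under the assumption that the position used is the first coordinate of "Gene position" (858584, the 5' end on the reverse strand); the function name `centisome` is hypothetical and not part of the database.

```python
def centisome(position: int, genome_length: int) -> float:
    """Return a 1-based position as a percentage of the chromosome length."""
    if not 1 <= position <= genome_length:
        raise ValueError("position must lie within the chromosome")
    return 100.0 * position / genome_length


# Values taken from this record.
GENOME_LENGTH = 7_750_108
GENE_5PRIME = 858_584  # first coordinate of "Gene position" (assumed reference point)

print(round(centisome(GENE_5PRIME, GENOME_LENGTH), 2))  # 11.08
```

Using the other coordinate (855972) gives 11.04 instead, which is why the 5' end is assumed here.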

GC content: 39.49

Gene sequence:

>2613_bases
ATGCAGCCAAATAATCCTAACCAATTTACAGAAAAAGCTTGGGAGGCAATATCCCGTACCCCTGACATTGCTAAAACTTC
TCAAAATCAACAAATTGAAGCTGAACACCTAATGAAAGCTTTGTTAGAACAAAATGGACTGGTAGCTAGTCTATTTAGCA
AAGTAGGTGTTTCAACTACCAAAATACAAGAATACACAGATTCATTTATTAAGCGTCAACCCAAAGTCAAAAATATACCC
AATAATATTTACTTAGGTCGTAGTCTAGATGCCTTATTAGATAATGCCGAAAAATATCGCCAGGAATACAAAGATGAATA
TATATCTATTGAACATCTGATACTCGCTTACCTGAAAGACGATCACTTTGGCAAAAATCTCTATAAAGAATTCAAATTGG
ATGAAGTAAAGCTCAAAAAAACAATTTCCCAAGTTAGAGGTAAGCAAAAAGTAACAGACAAAAATCCTGAAGGCAAATAT
GAAGCTCTGGAAAAATACGGTCGCGACCTGACAGAATTTGCTCGTGAAGGTAAACTTGACCCAGTTATTGGGCGTGACGA
CGAAATTCGACGTACTATTCAAATACTCAGTCGTCGTACTAAAAATAACCCAGTTTTAATAGGTGAACCTGGAGTCGGGA
AAACTGCGATAGCTGAAGGTCTAGCACAAAGGATCATCACTCTTGATGTTCCTCAGTCTCTAAAAGACCGTAAACTTATC
GCTCTAGATATGGGTGCCTTAATTGCGGGTGCTAAATTTAGGGGTGAATTTGAAGAACGTCTGAAAGCTGTTCTCAAAGA
AGTTACTGACTCAGAGGGCAAAATTATCTTATTTATAGATGAAATTCATACTGTTGTTGGAGCAGGTGCAACCCAAGGGG
CAATGGATGCAGGTAACTTGCTGAAACCTATGTTAGCTAGGGGGGAACTAAGGTGTATTGGGGCGACGACTCTTGATGAG
TATCGTAAATACATTGAAAAAGATGCTGCCTTAGAACGTCGTTTCCAACAAGTTTATGTCGATCAACCTAGTGTGGAAGA
TGCAATATCAATTTTACGGGGTCTCAAAGAACGCTATGAGGTACATCATGGTGTAAAAATTTCTGATAGTTCTTTAGTTG
CTGCTGCTACTTTATCTACTAGATATATTAGCGATCGCTTCCTCCCAGATAAGGCCATCGACTTGGTGGATGAAGCTGCT
GCTAAACTTAAAATGGAAATCACTTCTAAACCTGAGGAACTTGATGAAATTGACCGCAAAATTCTCCAACTAGAAATGGA
GAAGTTATCTTTGCAAAAGGAAAGTGATACTGCTTCTAAAGAAAGGTTAGGAAGGCTAGAAAAAGATTTAGCAAATTTAA
AAGAGGGGCAACGGGCTCTTAATGCTCAGTGGGAATCAGAAAAAGGTATCATTAGTACAATTCAGACTGTCAAGGAAGAA
ATTGATAAAGTTAATATTGAGATTCAGCAAGCAGAACGAAATTATGACCTTAACCGTGCTGCTGAGTTGAAATATGGTAG
ACTGATTAATTTACAGAAACAGGTGGAGGAAGCAGAGGCTAAACTGGCAACAACTCAAACTAGTGGTCAGACTTTGTTAC
GAGAAGAGGTGACAGAAGCTGATATTGCTGAGATTATTTCTAAGTGGACTGGTATTCCTATCAGTAAATTAGTAGAGTCA
GAAAAAGAAAAACTGCTACATCTTGAAGGGGAGTTGCATAAGCGAGTTATTGGGCAGAATGAAGCTGTGAGTGCTGTTTC
TGATGCTATTCAGCGATCGCGTGCAGGTTTGGCTGACCCGAACCGTCCAGTTGCTAGTTTTATTTTCCTGGGTCCGACTG
GTGTGGGTAAAACAGAGTTAGGAAAGGCATTGGCAGCATATTTATTTGACACTGAAGATGCGATGGTGCGTATAGATATG
TCTGAGTATATGGAGAAACATTCTGTGTCTCGTTTAATTGGGGCACCTCCTGGATATGTGGGTTATGATGAAGGTGGTCA
GTTAACTGAAGCTATTCGCCGTCGCCCATATACGGTGATTTTGTTTGATGAAATTGAAAAAGCACATCCGGATGTGTTTA
ATATTATGCTGCAAATTTTGGATGATGGGCGGGTTACTGATAGTCAAGGTCACAAGGTTGATTTTAAGAATAGTGTGATT
ATTATGACAAGTAATATTGGTTCTCAGTATATTTTAGATGTTACTGATGATTATGAGCAAATGCAGGGTCGGGTAATGGA
GGCGTTGCGTGCTGCTTTCCGCCCAGAATTTCTCAACCGTATTGATGAGACGATTATTTTTCATGGGTTGCAGAAAGAGC
AGTTACGAGAAATTGTGCAGTTGCAGGTTGTGCGTTTAGAGAAAAGGTTGGCTGAACGTAAGATGTCATTGAAGCTTACT
GATGCTGCTATTAATTTTTTGGCAGATGTTGGGTATGATCCGGTGTATGGGGCTCGACCTTTAAAGCGGGCAATTCAGCG
AGAGTTGGAAACGCAAATTGCTAAAAGTATTTTGCGTAGTGAATTTAATGATGGTGATACTATTTTTGTAGATATTGAAA
ATGAGCGACTTTCGTTGAAACGGTTGCCTATGGAGTTGGTGACAGCTAAATGA
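
A GC content figure like the one listed above can be computed from a FASTA-style block such as this one. The following is a minimal sketch, assuming GC content means the percentage of G and C among all bases; whether the database computes it exactly this way is not stated in the record, and the helper name `gc_content` and the toy record are hypothetical.

```python
def gc_content(fasta_text: str) -> float:
    """Percentage of G and C bases in a FASTA-style record (header lines skipped)."""
    lines = (line.strip() for line in fasta_text.splitlines())
    bases = "".join(line for line in lines if line and not line.startswith(">")).upper()
    if not bases:
        raise ValueError("no sequence found")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy record, not the gene above: 12 bases, 2 G and 3 C.
example = ">toy_record\nATGCAGCC\nAAAT\n"
print(round(gc_content(example), 2))  # 41.67
```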

Upstream 100 bases:

>100_bases
ACCCAATAAGAAAAAACGGTTTTTTCTACTAAAATTTATTCATAACTACAAGTTAAAATCAAATAAAAATATAACGTATA
GGCAAAAAATAAAAAAAATT

Downstream 100 bases:

>100_bases
AGTCAACAGGAGGGGCGAATGGCTATTCGCCTCTAAACGAGTTAAAAGTAAGATTTTATACCCTTAAAACTTTGAGGTGC
CTAGTGATTTCCGGAAATTA
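
Because this gene lies on the reverse strand, its upstream flank sits at higher genome coordinates than the gene and is read as a reverse complement. The sketch below shows one way to derive the flank coordinates, assuming 1-based inclusive coordinates on the forward strand and ignoring wrap-around at the chromosome origin; `flank_coords` and `reverse_complement` are hypothetical helper names.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


def flank_coords(start: int, end: int, strand: str, size: int = 100) -> dict:
    """Forward-strand coordinates (1-based, inclusive) of the flanking regions."""
    left = (start - size, start - 1)   # lower coordinates than the gene
    right = (end + 1, end + size)      # higher coordinates than the gene
    if strand == "Reverse":
        # On the reverse strand the gene is read from `end` down to `start`.
        return {"upstream": right, "downstream": left}
    return {"upstream": left, "downstream": right}


print(flank_coords(855972, 858584, "Reverse"))
# {'upstream': (858585, 858684), 'downstream': (855872, 855971)}
print(reverse_complement("ATGCAG"))  # CTGCAT
```

Slices taken at these coordinates from a reverse-strand gene would still need `reverse_complement` applied before they read in the gene's 5' to 3' direction.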

Product: ATPase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 870; Mature: 870
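
The residue count follows from the gene coordinates: the span 855972 to 858584 covers 2613 bases, or 871 codons, the last of which is the stop codon (the gene sequence ends in TGA). A short sketch of that arithmetic, assuming inclusive coordinates and a stop codon included in the gene span; `protein_length` is a hypothetical name.

```python
def protein_length(start: int, end: int) -> int:
    """Amino acid count for a gene given inclusive coordinates, stop codon included."""
    gene_length = end - start + 1
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    return gene_length // 3 - 1  # drop the stop codon


print(protein_length(855972, 858584))  # 870
```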

Protein sequence:

>870_residues
MQPNNPNQFTEKAWEAISRTPDIAKTSQNQQIEAEHLMKALLEQNGLVASLFSKVGVSTTKIQEYTDSFIKRQPKVKNIP
NNIYLGRSLDALLDNAEKYRQEYKDEYISIEHLILAYLKDDHFGKNLYKEFKLDEVKLKKTISQVRGKQKVTDKNPEGKY
EALEKYGRDLTEFAREGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIITLDVPQSLKDRKLI
ALDMGALIAGAKFRGEFEERLKAVLKEVTDSEGKIILFIDEIHTVVGAGATQGAMDAGNLLKPMLARGELRCIGATTLDE
YRKYIEKDAALERRFQQVYVDQPSVEDAISILRGLKERYEVHHGVKISDSSLVAAATLSTRYISDRFLPDKAIDLVDEAA
AKLKMEITSKPEELDEIDRKILQLEMEKLSLQKESDTASKERLGRLEKDLANLKEGQRALNAQWESEKGIISTIQTVKEE
IDKVNIEIQQAERNYDLNRAAELKYGRLINLQKQVEEAEAKLATTQTSGQTLLREEVTEADIAEIISKWTGIPISKLVES
EKEKLLHLEGELHKRVIGQNEAVSAVSDAIQRSRAGLADPNRPVASFIFLGPTGVGKTELGKALAAYLFDTEDAMVRIDM
SEYMEKHSVSRLIGAPPGYVGYDEGGQLTEAIRRRPYTVILFDEIEKAHPDVFNIMLQILDDGRVTDSQGHKVDFKNSVI
IMTSNIGSQYILDVTDDYEQMQGRVMEALRAAFRPEFLNRIDETIIFHGLQKEQLREIVQLQVVRLEKRLAERKMSLKLT
DAAINFLADVGYDPVYGARPLKRAIQRELETQIAKSILRSEFNDGDTIFVDIENERLSLKRLPMELVTAK

Sequences:

>Translated_870_residues
MQPNNPNQFTEKAWEAISRTPDIAKTSQNQQIEAEHLMKALLEQNGLVASLFSKVGVSTTKIQEYTDSFIKRQPKVKNIP
NNIYLGRSLDALLDNAEKYRQEYKDEYISIEHLILAYLKDDHFGKNLYKEFKLDEVKLKKTISQVRGKQKVTDKNPEGKY
EALEKYGRDLTEFAREGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIITLDVPQSLKDRKLI
ALDMGALIAGAKFRGEFEERLKAVLKEVTDSEGKIILFIDEIHTVVGAGATQGAMDAGNLLKPMLARGELRCIGATTLDE
YRKYIEKDAALERRFQQVYVDQPSVEDAISILRGLKERYEVHHGVKISDSSLVAAATLSTRYISDRFLPDKAIDLVDEAA
AKLKMEITSKPEELDEIDRKILQLEMEKLSLQKESDTASKERLGRLEKDLANLKEGQRALNAQWESEKGIISTIQTVKEE
IDKVNIEIQQAERNYDLNRAAELKYGRLINLQKQVEEAEAKLATTQTSGQTLLREEVTEADIAEIISKWTGIPISKLVES
EKEKLLHLEGELHKRVIGQNEAVSAVSDAIQRSRAGLADPNRPVASFIFLGPTGVGKTELGKALAAYLFDTEDAMVRIDM
SEYMEKHSVSRLIGAPPGYVGYDEGGQLTEAIRRRPYTVILFDEIEKAHPDVFNIMLQILDDGRVTDSQGHKVDFKNSVI
IMTSNIGSQYILDVTDDYEQMQGRVMEALRAAFRPEFLNRIDETIIFHGLQKEQLREIVQLQVVRLEKRLAERKMSLKLT
DAAINFLADVGYDPVYGARPLKRAIQRELETQIAKSILRSEFNDGDTIFVDIENERLSLKRLPMELVTAK
>Mature_870_residues
MQPNNPNQFTEKAWEAISRTPDIAKTSQNQQIEAEHLMKALLEQNGLVASLFSKVGVSTTKIQEYTDSFIKRQPKVKNIP
NNIYLGRSLDALLDNAEKYRQEYKDEYISIEHLILAYLKDDHFGKNLYKEFKLDEVKLKKTISQVRGKQKVTDKNPEGKY
EALEKYGRDLTEFAREGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIITLDVPQSLKDRKLI
ALDMGALIAGAKFRGEFEERLKAVLKEVTDSEGKIILFIDEIHTVVGAGATQGAMDAGNLLKPMLARGELRCIGATTLDE
YRKYIEKDAALERRFQQVYVDQPSVEDAISILRGLKERYEVHHGVKISDSSLVAAATLSTRYISDRFLPDKAIDLVDEAA
AKLKMEITSKPEELDEIDRKILQLEMEKLSLQKESDTASKERLGRLEKDLANLKEGQRALNAQWESEKGIISTIQTVKEE
IDKVNIEIQQAERNYDLNRAAELKYGRLINLQKQVEEAEAKLATTQTSGQTLLREEVTEADIAEIISKWTGIPISKLVES
EKEKLLHLEGELHKRVIGQNEAVSAVSDAIQRSRAGLADPNRPVASFIFLGPTGVGKTELGKALAAYLFDTEDAMVRIDM
SEYMEKHSVSRLIGAPPGYVGYDEGGQLTEAIRRRPYTVILFDEIEKAHPDVFNIMLQILDDGRVTDSQGHKVDFKNSVI
IMTSNIGSQYILDVTDDYEQMQGRVMEALRAAFRPEFLNRIDETIIFHGLQKEQLREIVQLQVVRLEKRLAERKMSLKLT
DAAINFLADVGYDPVYGARPLKRAIQRELETQIAKSILRSEFNDGDTIFVDIENERLSLKRLPMELVTAK

Specific function: Part of a stress-induced multi-chaperone system, it is involved in the recovery of the cell from heat-induced damage, in cooperation with DnaK, DnaJ and GrpE. Acts before DnaK, in the processing of protein aggregates. Protein binding stimulates the ATPase

COG id: COG0542

COG function: function code O; ATPases with chaperone activity, ATP-binding subunit

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the clpA/clpB family [H]

Homologues:

Organism=Homo sapiens, GI13540606, Length=330, Percent_Identity=36.67, Blast_Score=206, Evalue=6e-53
Organism=Escherichia coli, GI1788943, Length=856, Percent_Identity=56.54, Blast_Score=978, Evalue=0.0
Organism=Escherichia coli, GI1787109, Length=254, Percent_Identity=50.79, Blast_Score=275, Evalue=8e-75
Organism=Saccharomyces cerevisiae, GI6320464, Length=726, Percent_Identity=52.89, Blast_Score=785, Evalue=0.0
Organism=Saccharomyces cerevisiae, GI6323002, Length=862, Percent_Identity=43.16, Blast_Score=691, Evalue=0.0
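
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be turned into a typed record with a few lines of code. This is a sketch only: the field names are taken from the lines above, while the function name `parse_homologue` and the choice of Python types are assumptions.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dictionary."""
    record = {}
    # A trailing comma, if present, is dropped before splitting.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787109, Length=254, "
    "Percent_Identity=50.79, Blast_Score=275, Evalue=8e-75"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1787109 275
```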

Paralogues:

None

Copy number: 560 Molecules/Cell In: Glucose minimal media [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR013093
- InterPro:   IPR003959
- InterPro:   IPR018368
- InterPro:   IPR017730
- InterPro:   IPR001270
- InterPro:   IPR019489
- InterPro:   IPR004176
- InterPro:   IPR023150 [H]

Pfam domain/function: PF00004 AAA; PF07724 AAA_2; PF02861 Clp_N; PF10431 ClpB_D2-small [H]

EC number: NA

Molecular weight: Translated: 98339; Mature: 98339

Theoretical pI: Translated: 5.55; Mature: 5.55

Prosite motif: PS00870 CLPAB_1 ; PS00871 CLPAB_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.1 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.1 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
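
The percentages above can be recomputed from residue counts. The sketch below assumes each value is the count divided by the total length and rounded to one decimal, with the combined figure rounded from the combined count rather than summed from the rounded parts; that would explain why 0.1 and 1.8 appear next to 2.0. The stand-in sequence uses counts tallied by hand from the 870-residue protein (1 Cys, 16 Met), and `cys_met_percent` is a hypothetical name.

```python
def cys_met_percent(protein: str) -> dict:
    """Percentage of Cys, Met, and Cys+Met residues, rounded to one decimal."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    total = len(protein)
    return {
        "Cys": round(100.0 * cys / total, 1),
        "Met": round(100.0 * met / total, 1),
        "Cys+Met": round(100.0 * (cys + met) / total, 1),  # rounded after summing
    }


# Synthetic stand-in with the same counts as the record: 1 C, 16 M, 870 residues.
stand_in = "C" + "M" * 16 + "A" * 853
print(cys_met_percent(stand_in))  # {'Cys': 0.1, 'Met': 1.8, 'Cys+Met': 2.0}
```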

Secondary structure:

>Translated Secondary Structure
MQPNNPNQFTEKAWEAISRTPDIAKTSQNQQIEAEHLMKALLEQNGLVASLFSKVGVSTT
CCCCCCCHHHHHHHHHHHCCCCCHHCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHCCCHH
KIQEYTDSFIKRQPKVKNIPNNIYLGRSLDALLDNAEKYRQEYKDEYISIEHLILAYLKD
HHHHHHHHHHHCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
DHFGKNLYKEFKLDEVKLKKTISQVRGKQKVTDKNPEGKYEALEKYGRDLTEFAREGKLD
CCCCHHHHHHHCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCHHHHHHHCCCCC
PVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIITLDVPQSLKDRKLI
CCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHEEECCCCCCCCCEEE
ALDMGALIAGAKFRGEFEERLKAVLKEVTDSEGKIILFIDEIHTVVGAGATQGAMDAGNL
EEEHHHHHHCHHHHCHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHHCCCCCCCCCCHHHH
LKPMLARGELRCIGATTLDEYRKYIEKDAALERRFQQVYVDQPSVEDAISILRGLKERYE
HHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
VHHGVKISDSSLVAAATLSTRYISDRFLPDKAIDLVDEAAAKLKMEITSKPEELDEIDRK
HHCCCEECCCCEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHEEHHHCCCCHHHHHHHHH
ILQLEMEKLSLQKESDTASKERLGRLEKDLANLKEGQRALNAQWESEKGIISTIQTVKEE
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH
IDKVNIEIQQAERNYDLNRAAELKYGRLINLQKQVEEAEAKLATTQTSGQTLLREEVTEA
HHHHCEEEEECCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
DIAEIISKWTGIPISKLVESEKEKLLHLEGELHKRVIGQNEAVSAVSDAIQRSRAGLADP
HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCC
NRPVASFIFLGPTGVGKTELGKALAAYLFDTEDAMVRIDMSEYMEKHSVSRLIGAPPGYV
CCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHHCCCCCCC
GYDEGGQLTEAIRRRPYTVILFDEIEKAHPDVFNIMLQILDDGRVTDSQGHKVDFKNSVI
CCCCCCHHHHHHHHCCCEEEEEHHHHHHCCHHHHHHHHHHCCCCCCCCCCCEEECCCCEE
IMTSNIGSQYILDVTDDYEQMQGRVMEALRAAFRPEFLNRIDETIIFHGLQKEQLREIVQ
EEEECCCCEEEEECCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHH
LQVVRLEKRLAERKMSLKLTDAAINFLADVGYDPVYGARPLKRAIQRELETQIAKSILRS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
EFNDGDTIFVDIENERLSLKRLPMELVTAK
CCCCCCEEEEEECCCCHHHHHCCHHHHCCC
>Mature Secondary Structure
MQPNNPNQFTEKAWEAISRTPDIAKTSQNQQIEAEHLMKALLEQNGLVASLFSKVGVSTT
CCCCCCCHHHHHHHHHHHCCCCCHHCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHCCCHH
KIQEYTDSFIKRQPKVKNIPNNIYLGRSLDALLDNAEKYRQEYKDEYISIEHLILAYLKD
HHHHHHHHHHHCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
DHFGKNLYKEFKLDEVKLKKTISQVRGKQKVTDKNPEGKYEALEKYGRDLTEFAREGKLD
CCCCHHHHHHHCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCHHHHHHHCCCCC
PVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIITLDVPQSLKDRKLI
CCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHEEECCCCCCCCCEEE
ALDMGALIAGAKFRGEFEERLKAVLKEVTDSEGKIILFIDEIHTVVGAGATQGAMDAGNL
EEEHHHHHHCHHHHCHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHHCCCCCCCCCCHHHH
LKPMLARGELRCIGATTLDEYRKYIEKDAALERRFQQVYVDQPSVEDAISILRGLKERYE
HHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
VHHGVKISDSSLVAAATLSTRYISDRFLPDKAIDLVDEAAAKLKMEITSKPEELDEIDRK
HHCCCEECCCCEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHHEEHHHCCCCHHHHHHHHH
ILQLEMEKLSLQKESDTASKERLGRLEKDLANLKEGQRALNAQWESEKGIISTIQTVKEE
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH
IDKVNIEIQQAERNYDLNRAAELKYGRLINLQKQVEEAEAKLATTQTSGQTLLREEVTEA
HHHHCEEEEECCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
DIAEIISKWTGIPISKLVESEKEKLLHLEGELHKRVIGQNEAVSAVSDAIQRSRAGLADP
HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCC
NRPVASFIFLGPTGVGKTELGKALAAYLFDTEDAMVRIDMSEYMEKHSVSRLIGAPPGYV
CCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHHCCCCCCC
GYDEGGQLTEAIRRRPYTVILFDEIEKAHPDVFNIMLQILDDGRVTDSQGHKVDFKNSVI
CCCCCCHHHHHHHHCCCEEEEEHHHHHHCCHHHHHHHHHHCCCCCCCCCCCEEECCCCEE
IMTSNIGSQYILDVTDDYEQMQGRVMEALRAAFRPEFLNRIDETIIFHGLQKEQLREIVQ
EEEECCCCEEEEECCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHH
LQVVRLEKRLAERKMSLKLTDAAINFLADVGYDPVYGARPLKRAIQRELETQIAKSILRS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
EFNDGDTIFVDIENERLSLKRLPMELVTAK
CCCCCCEEEEEECCCCHHHHHCCHHHHCCC
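
The secondary structure blocks interleave residue lines with per-residue state lines. The following sketch separates the two and summarizes the state composition, assuming the common convention that H is helix, E is strand, and C is coil (the record itself does not define the letters); both function names are hypothetical.

```python
from collections import Counter


def split_interleaved(lines: list) -> tuple:
    """Split alternating residue/state lines into two aligned strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_percentages(states: str) -> dict:
    """Percentage of each state (H, E, C), rounded to one decimal."""
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Final residue/state pair of the block above (30 positions).
tail = [
    "EFNDGDTIFVDIENERLSLKRLPMELVTAK",
    "CCCCCCEEEEEECCCCHHHHHCCHHHHCCC",
]
residues, states = split_interleaved(tail)
print(state_percentages(states))  # {'H': 30.0, 'E': 20.0, 'C': 50.0}
```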

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Serine endopeptidases [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11759840 [H]