| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is clpX
Identifier: 113474440
GI number: 113474440
Start: 936923
End: 938272
Strand: Reverse
Name: clpX
Synonym: Tery_0584
Alternate gene names: 113474440
Gene position: 938272-936923 (Counterclockwise)
Preceding gene: 113474441
Following gene: 113474437
Centisome position: 12.11
GC content: 41.78
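The positional fields above are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the gene's 5' coordinate (938272 on the reverse strand) expressed as a percentage of the chromosome length. Neither convention is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start, end):
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * position / genome_length


def gc_percent(seq):
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(seq.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


print(gene_length(936923, 938272))           # 1350
print(round(centisome(938272, 7750108), 2))  # 12.11
```

Passing the gene sequence listed in this record to `gc_percent` should give a figure comparable to the GC content field, provided that field was computed the same way.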
Gene sequence:
>1350_bases ATGTCTAAATACGACTCCCATCTAAAATGTTCATTCTGTGGCAAGTCTCAAGAGCAGGTTAGGAAATTGATAGCTGGACC TGGAGTTTATATATGTGATGAATGTGTAGAGTTGTGCAATGAGATTTTGGATGAGGAGCTTTTTGACTCCAATGCTACAG GAGCACAACCACCAATACCACGTCCAGCACCAGCACCCCAAAAACGAGGGACTGGTACTAAGAGATTATCTATTAGTCAA ATACCTAAGCCTAGGGAAATAAAGAATTATCTGGATGCTCATGTTATTGGTCAGGAGGAAGGTAAGAAGGTTTTATCAGT GGCAGTTTATAACCACTATAAACGTCTGAGTTTTCTAGAGGCCAAAAAAAGTGGTAAGTCCTCTCAAGATGAGGTGGAAT TACAAAAGTCTAATATTTTGTTGATTGGGCCCACAGGTTGTGGAAAAACGTTGTTGGCTCAAACTTTGGCGGATTTATTG GATGTGCCTTTTGCCGTGGCAGATGCGACGACTTTAACTGAAGCTGGATATGTTGGGGAGGACGTGGAGAATATTTTGCT ACGACTTTTACAAGTAGCAGATTTAGAAGTGGATGAAGCACAACGGGGAATTATATATATTGATGAGATTGATAAAATAG CTCGTAAGAGTGAGAACCCTTCTATAACAAGAGATGTTTCTGGGGAGGGTGTGCAGCAAGCCTTATTAAAGATGTTGGAG GGAACTGTTGCTAATGTTCCTCCACAAGGTGGTCGGAAACATCCCTATCAAGATTGTATTCAGATCGATACGAGTAATAT TTTATTTATCTGTGGTGGTGCTTTTGTTGGTTTAGAAAAGATAGTAGATCAAAGAATTGGTAAAAAGTCAATGGGCTTTA TTCACCAGAGTGGGGACAGTTATCAGGTTAAGGAGAAAAAAGTTGTAGATTTAATGAAGCAAATGGAACCAAATGATTTG GTGAAGTTTGGTTTGATCCCAGAATTGATTGGGCGAATACCTATGGTGGCTGTCGTTGAACCTCTCGATGAGGAGACTCT GATGGCAATTTTGACGAAACCTCAGAATGCTCTGGTGAAGCAGTATCAAAAGCTGTTACGGATGGATAATGTGAAGTTGG AGTTTGAGGAGGATGCTGTACGGGCGATCGCGAAGGAAGCATTTAGGAGAAAGACTGGGGCGCGAGCTTTGCGGGGTATT GTTGAGGAGTTGATGTTGGATGTGATGTATGAGCTACCATCACGGAAGGATGTGAGTCGTTGCACTATTACTAAGGAAAT GGTGGAAAAGCGATCAACTGCAGAGTTGTTATTGCATCCTTCGTCTTTGCCTAAACCGGAGTCAGCTTAA
Upstream 100 bases:
>100_bases AGAAGAGGCTAAAAACTATGGCCTGATTGATCAAGTGATTACTAGACAGAATCTTCCTGTACCAGGAGAGTCTGTCCCTG CAATGCAATAAGAGGCAGTT
Downstream 100 bases:
>100_bases TTTATGTATTGCTTCTCGTAAGCACTTAAATAAAAAACTTCATTTGGCTACTAGGCCGACACAGCTAAAATAGTGGATTA GCTTTTTTCTGATAATTCTT
Product: ATP-dependent protease ATP-binding subunit ClpX
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 449; Mature: 448
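The two residue counts follow from the 1350-base gene length: 450 codons, of which the last is the stop codon (TAA in the gene sequence), give 449 translated residues, and the mature form listed in this record lacks the initiator methionine. A minimal sketch of that bookkeeping, with a hypothetical function name:

```python
def protein_lengths(cds_bases):
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS includes one terminal stop codon and that the mature
    protein differs only by removal of the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(protein_lengths(1350))  # (449, 448)
```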
Protein sequence:
>449_residues MSKYDSHLKCSFCGKSQEQVRKLIAGPGVYICDECVELCNEILDEELFDSNATGAQPPIPRPAPAPQKRGTGTKRLSISQ IPKPREIKNYLDAHVIGQEEGKKVLSVAVYNHYKRLSFLEAKKSGKSSQDEVELQKSNILLIGPTGCGKTLLAQTLADLL DVPFAVADATTLTEAGYVGEDVENILLRLLQVADLEVDEAQRGIIYIDEIDKIARKSENPSITRDVSGEGVQQALLKMLE GTVANVPPQGGRKHPYQDCIQIDTSNILFICGGAFVGLEKIVDQRIGKKSMGFIHQSGDSYQVKEKKVVDLMKQMEPNDL VKFGLIPELIGRIPMVAVVEPLDEETLMAILTKPQNALVKQYQKLLRMDNVKLEFEEDAVRAIAKEAFRRKTGARALRGI VEELMLDVMYELPSRKDVSRCTITKEMVEKRSTAELLLHPSSLPKPESA
Sequences:
>Translated_449_residues MSKYDSHLKCSFCGKSQEQVRKLIAGPGVYICDECVELCNEILDEELFDSNATGAQPPIPRPAPAPQKRGTGTKRLSISQ IPKPREIKNYLDAHVIGQEEGKKVLSVAVYNHYKRLSFLEAKKSGKSSQDEVELQKSNILLIGPTGCGKTLLAQTLADLL DVPFAVADATTLTEAGYVGEDVENILLRLLQVADLEVDEAQRGIIYIDEIDKIARKSENPSITRDVSGEGVQQALLKMLE GTVANVPPQGGRKHPYQDCIQIDTSNILFICGGAFVGLEKIVDQRIGKKSMGFIHQSGDSYQVKEKKVVDLMKQMEPNDL VKFGLIPELIGRIPMVAVVEPLDEETLMAILTKPQNALVKQYQKLLRMDNVKLEFEEDAVRAIAKEAFRRKTGARALRGI VEELMLDVMYELPSRKDVSRCTITKEMVEKRSTAELLLHPSSLPKPESA
>Mature_448_residues SKYDSHLKCSFCGKSQEQVRKLIAGPGVYICDECVELCNEILDEELFDSNATGAQPPIPRPAPAPQKRGTGTKRLSISQI PKPREIKNYLDAHVIGQEEGKKVLSVAVYNHYKRLSFLEAKKSGKSSQDEVELQKSNILLIGPTGCGKTLLAQTLADLLD VPFAVADATTLTEAGYVGEDVENILLRLLQVADLEVDEAQRGIIYIDEIDKIARKSENPSITRDVSGEGVQQALLKMLEG TVANVPPQGGRKHPYQDCIQIDTSNILFICGGAFVGLEKIVDQRIGKKSMGFIHQSGDSYQVKEKKVVDLMKQMEPNDLV KFGLIPELIGRIPMVAVVEPLDEETLMAILTKPQNALVKQYQKLLRMDNVKLEFEEDAVRAIAKEAFRRKTGARALRGIV EELMLDVMYELPSRKDVSRCTITKEMVEKRSTAELLLHPSSLPKPESA
Specific function: ATP-dependent specificity component of the Clp protease. It directs the protease to specific substrates. Can perform chaperone functions in the absence of ClpP
COG id: COG1219
COG function: function code O; ATP-dependent protease Clp, ATPase subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ClpX chaperone family
Homologues:
Organism=Homo sapiens, GI7242140, Length=336, Percent_Identity=47.6190476190476, Blast_Score=304, Evalue=1e-82
Organism=Escherichia coli, GI1786642, Length=428, Percent_Identity=59.8130841121495, Blast_Score=494, Evalue=1e-141
Organism=Escherichia coli, GI1790366, Length=110, Percent_Identity=40.9090909090909, Blast_Score=91, Evalue=2e-19
Organism=Caenorhabditis elegans, GI71982908, Length=308, Percent_Identity=45.1298701298701, Blast_Score=273, Evalue=1e-73
Organism=Caenorhabditis elegans, GI71982905, Length=308, Percent_Identity=45.1298701298701, Blast_Score=273, Evalue=2e-73
Organism=Caenorhabditis elegans, GI71988663, Length=446, Percent_Identity=38.5650224215247, Blast_Score=265, Evalue=3e-71
Organism=Caenorhabditis elegans, GI71988660, Length=288, Percent_Identity=39.5833333333333, Blast_Score=176, Evalue=2e-44
Organism=Saccharomyces cerevisiae, GI6319704, Length=414, Percent_Identity=39.1304347826087, Blast_Score=275, Evalue=1e-74
Organism=Drosophila melanogaster, GI24648291, Length=319, Percent_Identity=46.3949843260188, Blast_Score=283, Evalue=2e-76
Organism=Drosophila melanogaster, GI24648289, Length=319, Percent_Identity=46.3949843260188, Blast_Score=283, Evalue=2e-76
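A single homologue entry in the `Organism=..., GI..., Length=..., ...` form can be read into a dictionary for sorting or filtering. The following is a minimal sketch, not a parser supplied by the database; the function name and the choice to store the GI number under the key `"GI"` are assumptions made for illustration.

```python
def parse_homologue(record):
    """Parse one 'Organism=..., GI..., Length=..., ...' entry into a dict."""
    fields = {}
    for item in record.strip().rstrip(",").split(", "):
        if "=" in item:
            key, value = item.split("=", 1)
            fields[key] = value
        elif item.startswith("GI"):
            # The GI number is written without an '=' sign in this record.
            fields["GI"] = item[2:]
    # Convert the numeric fields; names are taken from the record itself.
    fields["Length"] = int(fields["Length"])
    fields["Percent_Identity"] = float(fields["Percent_Identity"])
    fields["Blast_Score"] = int(fields["Blast_Score"])
    fields["Evalue"] = float(fields["Evalue"])
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786642, Length=428, "
    "Percent_Identity=59.8130841121495, Blast_Score=494, Evalue=1e-141"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1786642 494
```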
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): CLPX_TRIEI (Q118P6)
Other databases:
- EMBL: CP000393
- RefSeq: YP_720501.1
- ProteinModelPortal: Q118P6
- STRING: Q118P6
- GeneID: 4244610
- GenomeReviews: CP000393_GR
- KEGG: ter:Tery_0584
- NMPDR: fig|203124.1.peg.3579
- eggNOG: COG1219
- HOGENOM: HBG745965
- OMA: CGKSQEQ
- PhylomeDB: Q118P6
- ProtClustDB: PRK05342
- BioCyc: TERY203124:TERY_0584-MONOMER
- HAMAP: MF_00175
- InterPro: IPR003593
- InterPro: IPR013093
- InterPro: IPR019489
- InterPro: IPR004487
- InterPro: IPR010603
- PANTHER: PTHR11262:SF4
- SMART: SM00382
- TIGRFAMs: TIGR00382
Pfam domain/function: PF07724 AAA_2; PF10431 ClpB_D2-small; PF06689 zf-C4_ClpX
EC number: NA
Molecular weight: Translated: 49770; Mature: 49639
Theoretical pI: Translated: 6.05; Mature: 6.05
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein)
2.4 %Met (Translated Protein)
4.5 %Cys+Met (Translated Protein)
2.0 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
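The translated Cys and Met percentages (2.0 and 2.4) do not add up to the combined 4.5 because each figure appears to be rounded separately from the underlying residue counts. The sketch below illustrates this under the assumption that the translated protein contains 9 Cys and 11 Met; these counts are a hand tally of the protein sequence in this record and should be re-derived with `residue_percent` before being relied on. The function name is illustrative.

```python
def residue_percent(seq, residues):
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = "".join(seq.split()).upper()
    count = sum(seq.count(r) for r in residues)
    return 100.0 * count / len(seq)


# Assumed counts (hand tally, not stated in the record).
CYS, MET, LENGTH = 9, 11, 449

print(round(100.0 * CYS / LENGTH, 1))          # 2.0
print(round(100.0 * MET / LENGTH, 1))          # 2.4
print(round(100.0 * (CYS + MET) / LENGTH, 1))  # 4.5

# Mature protein: one Met fewer, 448 residues.
print(round(100.0 * (MET - 1) / (LENGTH - 1), 1))        # 2.2
print(round(100.0 * (CYS + MET - 1) / (LENGTH - 1), 1))  # 4.2
```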
Secondary structure:
>Translated Secondary Structure
MSKYDSHLKCSFCGKSQEQVRKLIAGPGVYICDECVELCNEILDEELFDSNATGAQPPIP
CCCCCCCCEEEECCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
RPAPAPQKRGTGTKRLSISQIPKPREIKNYLDAHVIGQEEGKKVLSVAVYNHYKRLSFLE
CCCCCCCCCCCCCCEEEHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH
AKKSGKSSQDEVELQKSNILLIGPTGCGKTLLAQTLADLLDVPFAVADATTLTEAGYVGE
HHHCCCCCCHHHHEECCCEEEEECCCCCHHHHHHHHHHHHCCCHHHHCHHHHHHCCCCCH
DVENILLRLLQVADLEVDEAQRGIIYIDEIDKIARKSENPSITRDVSGEGVQQALLKMLE
HHHHHHHHHHHHHCCCHHHHCCCEEEEHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC
GTVANVPPQGGRKHPYQDCIQIDTSNILFICGGAFVGLEKIVDQRIGKKSMGFIHQSGDS
CHHCCCCCCCCCCCCHHHHHEECCCCEEEEECCHHHHHHHHHHHHHCHHHHCHHCCCCCC
YQVKEKKVVDLMKQMEPNDLVKFGLIPELIGRIPMVAVVEPLDEETLMAILTKPQNALVK
CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCEEHCCCCCHHHHHHHHCCCHHHHHH
QYQKLLRMDNVKLEFEEDAVRAIAKEAFRRKTGARALRGIVEELMLDVMYELPSRKDVSR
HHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH
CTITKEMVEKRSTAELLLHPSSLPKPESA
HHHHHHHHHCCCCHHHEECCCCCCCCCCC
>Mature Secondary Structure
SKYDSHLKCSFCGKSQEQVRKLIAGPGVYICDECVELCNEILDEELFDSNATGAQPPIP
CCCCCCCEEEECCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
RPAPAPQKRGTGTKRLSISQIPKPREIKNYLDAHVIGQEEGKKVLSVAVYNHYKRLSFLE
CCCCCCCCCCCCCCEEEHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH
AKKSGKSSQDEVELQKSNILLIGPTGCGKTLLAQTLADLLDVPFAVADATTLTEAGYVGE
HHHCCCCCCHHHHEECCCEEEEECCCCCHHHHHHHHHHHHCCCHHHHCHHHHHHCCCCCH
DVENILLRLLQVADLEVDEAQRGIIYIDEIDKIARKSENPSITRDVSGEGVQQALLKMLE
HHHHHHHHHHHHHCCCHHHHCCCEEEEHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC
GTVANVPPQGGRKHPYQDCIQIDTSNILFICGGAFVGLEKIVDQRIGKKSMGFIHQSGDS
CHHCCCCCCCCCCCCHHHHHEECCCCEEEEECCHHHHHHHHHHHHHCHHHHCHHCCCCCC
YQVKEKKVVDLMKQMEPNDLVKFGLIPELIGRIPMVAVVEPLDEETLMAILTKPQNALVK
CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCEEHCCCCCHHHHHHHHCCCHHHHHH
QYQKLLRMDNVKLEFEEDAVRAIAKEAFRRKTGARALRGIVEELMLDVMYELPSRKDVSR
HHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH
CTITKEMVEKRSTAELLLHPSSLPKPESA
HHHHHHHHHCCCCHHHEECCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA