Definition | Sulfolobus solfataricus P2 chromosome, complete genome. |
---|---|
Accession | NC_002754 |
Length | 2,992,245 |
The map label for this gene is cpsA-1
Identifier: 15898196
GI number: 15898196
Start: 1193091
End: 1194272
Strand: Direct
Name: cpsA-1
Synonym: SSO1355
Alternate gene names: 15898196
Gene position: 1193091-1194272 (Clockwise)
Preceding gene: 15898193
Following gene: 15898197
Centisome position: 39.87
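The coordinate fields above are arithmetically related. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state its formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length.

    Assumption: the record derives its centisome value from the start
    coordinate; this is not stated in the record itself.
    """
    return 100.0 * start / genome_length


# Values taken from this record.
start, end, genome_length = 1193091, 1194272, 2992245

length = gene_length(start, end)
print(length)                                      # 1182
print(round(centisome(start, genome_length), 2))   # 39.87
print(length // 3 - 1)                             # 393 codons, excluding the stop codon
```

Under these assumptions the results agree with the listed gene sequence length (1182 bases), centisome position (39.87), and number of amino acids (393).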
GC content: 40.1
Gene sequence:
>1182_bases ATGGATTTAGTTGAGAAGTTAAAAAATGACGTAAGAGAAATAGAGGACTGGATAATTCAAATTAGAAGGAAAATCCATGA GTATCCGGAACTTTCCTACAAGGAGTATAACACCTCTAAACTAGTAGCGGAAACGTTAAGGAAATTGGGAGTAGAAGTGG AAGAAGGCGTTGGATTACCCACAGCAGTGGTTGGTAAGATTAGGGGAAGTAAACCAGGAAAGACTGTTGCTTTGAGAGCT GATATGGATGCCCTTCCGGTAGAGGAGAACACTGATCTAGAATTTAAATCCAAAGTTAAGGGAGTAATGCACGCATGTGG TCATGATACTCACGTAGCAATGCTCTTAGGTGGAGCTTATCTGTTAGTTAAGAATAAAGATTTAATCAGTGGTGAAATTA GGTTAATATTCCAACCGGCAGAGGAGGATGGAGGATTAGGAGGAGCAAAACCAATGATTGAGGCTGGAGTTATGAACGGT GTAGATTATGTATTTGGAATACATATATCGAGTAGTTATCCTTCTGGAGTTTTCGCAACTAGAAAAGGCCCTATAATGGC TACGCCGGACGCATTCAAGATAATCGTTCACGGGAAGGGCGGTCATGGTTCTGCTCCTCATGAGACTATTGACCCAATTT TTATATCCTTACAAATAGCTAACGCAATCTACGGCATAACAGCAAGGCAAATTGATCCAGTTCAACCCTTTATCATATCC ATTACTACAATACATTCAGGTACAAAGGATAACATAATACCAGATGATGCCGAAATGCAGGGAACAATTAGAAGTTTAGA CGAGAACGTTAGAAGTAAGGCTAAGGACTATATGAGAAGAATAGTTTCGTCAATATGTGGAATCTATGGTGCAACTTGTG AGGTTAAATTCATGGAAGACGTCTATCCAACTACCGTAAATAACCCTGAGGTAACTGATGAGGTAATGAAAATTCTATCT TCAATATCAACAGTTGTTGAGACAGAGCCAGTGCTAGGAGCGGAGGACTTCTCCAGATTCTTACAGAAGGCTCCAGGAAC GTATTTCTTTCTGGGAACCAGAAACGAAAAGAAAGGATGCATATATCCCAATCACAGCTCTAAGTTCTGTGTAGATGAGG ACGTGCTAAAATTAGGTGCCTTAGCTCACGCATTATTGGCAGTAAAGTTCAGTAATAAATAA
Upstream 100 bases:
>100_bases TTATTCATTATCCTACTTTATAATCCTTTAGGTACTAAAGATTTTCCGGCTAACTTTTATTAATGGATTAGCTTTTTATT CTTAGCTCGCTAAATCTCTT
Downstream 100 bases:
>100_bases AGGGCTTAATGCTAAATGAGTATTGGAAATGGAGAAGGAAAAGGTTCTCAATATTTTGAGGAATTCCTCAAATTTACCTT TAAGTTTAATTAAAGAATTC
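A GC content value such as the one listed for this gene can be recomputed from a nucleotide sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the whole sequence and that the spaces in the sequence lines are only display breaks; `gc_content` is a hypothetical helper, not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, because the sequences in this record
    are displayed in space-separated chunks.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Toy input for illustration; paste the gene sequence from the record
# to recompute its listed GC content.
print(gc_content("ATGC AT"))  # 2 of 6 bases are G or C
```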
Product: thermostable carboxypeptidase (cpsA-1)
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 393; Mature: 393
Protein sequence:
>393_residues MDLVEKLKNDVREIEDWIIQIRRKIHEYPELSYKEYNTSKLVAETLRKLGVEVEEGVGLPTAVVGKIRGSKPGKTVALRA DMDALPVEENTDLEFKSKVKGVMHACGHDTHVAMLLGGAYLLVKNKDLISGEIRLIFQPAEEDGGLGGAKPMIEAGVMNG VDYVFGIHISSSYPSGVFATRKGPIMATPDAFKIIVHGKGGHGSAPHETIDPIFISLQIANAIYGITARQIDPVQPFIIS ITTIHSGTKDNIIPDDAEMQGTIRSLDENVRSKAKDYMRRIVSSICGIYGATCEVKFMEDVYPTTVNNPEVTDEVMKILS SISTVVETEPVLGAEDFSRFLQKAPGTYFFLGTRNEKKGCIYPNHSSKFCVDEDVLKLGALAHALLAVKFSNK
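The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard NCBI ordering of the genetic code and translates up to the first stop codon; it assumes a forward-strand coding sequence in frame from its first base, and `translate` is an illustrative name. Alternative start codons are not handled.

```python
from itertools import product

# Standard genetic code in NCBI order (first base T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is stripped; codons with unexpected characters become 'X'.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the gene sequence in this record.
print(translate("ATGGATTTAGTTGAGAAGTTAAAA"))  # MDLVEKLK
```

The eight residues printed match the start of the listed protein sequence.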
Sequences:
>Translated_393_residues MDLVEKLKNDVREIEDWIIQIRRKIHEYPELSYKEYNTSKLVAETLRKLGVEVEEGVGLPTAVVGKIRGSKPGKTVALRA DMDALPVEENTDLEFKSKVKGVMHACGHDTHVAMLLGGAYLLVKNKDLISGEIRLIFQPAEEDGGLGGAKPMIEAGVMNG VDYVFGIHISSSYPSGVFATRKGPIMATPDAFKIIVHGKGGHGSAPHETIDPIFISLQIANAIYGITARQIDPVQPFIIS ITTIHSGTKDNIIPDDAEMQGTIRSLDENVRSKAKDYMRRIVSSICGIYGATCEVKFMEDVYPTTVNNPEVTDEVMKILS SISTVVETEPVLGAEDFSRFLQKAPGTYFFLGTRNEKKGCIYPNHSSKFCVDEDVLKLGALAHALLAVKFSNK >Mature_393_residues MDLVEKLKNDVREIEDWIIQIRRKIHEYPELSYKEYNTSKLVAETLRKLGVEVEEGVGLPTAVVGKIRGSKPGKTVALRA DMDALPVEENTDLEFKSKVKGVMHACGHDTHVAMLLGGAYLLVKNKDLISGEIRLIFQPAEEDGGLGGAKPMIEAGVMNG VDYVFGIHISSSYPSGVFATRKGPIMATPDAFKIIVHGKGGHGSAPHETIDPIFISLQIANAIYGITARQIDPVQPFIIS ITTIHSGTKDNIIPDDAEMQGTIRSLDENVRSKAKDYMRRIVSSICGIYGATCEVKFMEDVYPTTVNNPEVTDEVMKILS SISTVVETEPVLGAEDFSRFLQKAPGTYFFLGTRNEKKGCIYPNHSSKFCVDEDVLKLGALAHALLAVKFSNK
Specific function: Can release basic, acidic, aromatic, and, to a lesser extent, aliphatic amino acids
COG id: COG1473
COG function: function code R; Metal-dependent amidase/aminoacylase/carboxypeptidase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M20 family
Homologues:
Organism=Escherichia coli, GI87081880, Length=444, Percent_Identity=25.6756756756757, Blast_Score=120, Evalue=2e-28,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): CBPX1_SULSO (P80092)
Other databases:
- EMBL: Z48497
- EMBL: AE006641
- PIR: H90291
- PIR: S23180
- RefSeq: NP_342801.1
- ProteinModelPortal: P80092
- SMR: P80092
- GeneID: 1454370
- GenomeReviews: AE006641_GR
- KEGG: sso:SSO1355
- NMPDR: fig|273057.1.peg.1224
- HOGENOM: HBG708500
- OMA: IVEINHS
- PhylomeDB: P80092
- ProtClustDB: CLSK785363
- BioCyc: SSOL273057:SSO1355-MONOMER
- InterPro: IPR017439
- InterPro: IPR010168
- InterPro: IPR002933
- InterPro: IPR011650
- PIRSF: PIRSF005962
- TIGRFAMs: TIGR01891
Pfam domain/function: PF07687 M20_dimer; PF01546 Peptidase_M20; SSF55031 Peptidase_M20_dimer
EC number: NA
Molecular weight: Translated: 43069; Mature: 43069
Theoretical pI: Translated: 6.30; Mature: 6.30
Prosite motif: NA
Important sites: ACT_SITE 302-302 ACT_SITE 373-373
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
2.8 %Met (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
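Percentages of this kind are residue counts divided by protein length. A minimal sketch follows, assuming the values are simple count percentages over the full sequence; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`.

    Both arguments use one-letter amino acid codes; whitespace in the
    protein sequence is ignored.
    """
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(cleaned.count(r) for r in residues.upper())
    return 100.0 * count / len(cleaned)


# Toy peptide for illustration; paste the protein sequence from the
# record to recompute its Cys, Met, and Cys+Met percentages.
peptide = "MACM"
print(residue_percent(peptide, "C"))   # 25.0
print(residue_percent(peptide, "M"))   # 50.0
print(residue_percent(peptide, "CM"))  # 75.0
```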
Secondary structure:
>Translated Secondary Structure MDLVEKLKNDVREIEDWIIQIRRKIHEYPELSYKEYNTSKLVAETLRKLGVEVEEGVGLP CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCCC TAVVGKIRGSKPGKTVALRADMDALPVEENTDLEFKSKVKGVMHACGHDTHVAMLLGGAY HHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCEE LLVKNKDLISGEIRLIFQPAEEDGGLGGAKPMIEAGVMNGVDYVFGIHISSSYPSGVFAT EEEECCCEECCEEEEEEEECCCCCCCCCCCHHHHHHHHCCCCEEEEEEEECCCCCCEEEE RKGPIMATPDAFKIIVHGKGGHGSAPHETIDPIFISLQIANAIYGITARQIDPVQPFIIS CCCCEEECCCCEEEEEEECCCCCCCCCCCCCCEEEEEEEHHHHHHCHHCCCCCCCCEEEE ITTIHSGTKDNIIPDDAEMQGTIRSLDENVRSKAKDYMRRIVSSICGIYGATCEVKFMED EEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEHHC VYPTTVNNPEVTDEVMKILSSISTVVETEPVLGAEDFSRFLQKAPGTYFFLGTRNEKKGC CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCC IYPNHSSKFCVDEDVLKLGALAHALLAVKFSNK CCCCCCCCCCCCHHHHHHHHHHHHHHHEEECCC >Mature Secondary Structure MDLVEKLKNDVREIEDWIIQIRRKIHEYPELSYKEYNTSKLVAETLRKLGVEVEEGVGLP CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCCC TAVVGKIRGSKPGKTVALRADMDALPVEENTDLEFKSKVKGVMHACGHDTHVAMLLGGAY HHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCEE LLVKNKDLISGEIRLIFQPAEEDGGLGGAKPMIEAGVMNGVDYVFGIHISSSYPSGVFAT EEEECCCEECCEEEEEEEECCCCCCCCCCCHHHHHHHHCCCCEEEEEEEECCCCCCEEEE RKGPIMATPDAFKIIVHGKGGHGSAPHETIDPIFISLQIANAIYGITARQIDPVQPFIIS CCCCEEECCCCEEEEEEECCCCCCCCCCCCCCEEEEEEEHHHHHHCHHCCCCCCCCEEEE ITTIHSGTKDNIIPDDAEMQGTIRSLDENVRSKAKDYMRRIVSSICGIYGATCEVKFMED EEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEHHC VYPTTVNNPEVTDEVMKILSSISTVVETEPVLGAEDFSRFLQKAPGTYFFLGTRNEKKGC CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCC IYPNHSSKFCVDEDVLKLGALAHALLAVKFSNK CCCCCCCCCCCCHHHHHHHHHHHHHHHEEECCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7559343; 11427726; 1597179