Definition | Sulfolobus tokodaii str. 7 chromosome, complete genome. |
---|---|
Accession | NC_003106 |
Length | 2,694,756 |
Click here to switch to the map view.
The map label for this gene is triC
Identifier: 15921425
GI number: 15921425
Start: 1166207
End: 1168771
Strand: Direct
Name: triC
Synonym: ST1168
Alternate gene names: 15921425
Gene position: 1166207-1168771 (Clockwise)
Preceding gene: 15921424
Following gene: 15921426
Centisome position: 43.28
GC content: 36.57
Gene sequence:
>2565_bases GTGTTTTATAACTTTTTACTTTATTTAGTTATTCACGTTAATTACGCGTTGGACGCTCTCACGTTCAATAAATTCCTCAG TCTGGACGGAATAGTAAGCTGGCCTATGATAATTAAGGACAGAGTATATTTCCTTTCAGATCATGAAGGAATCTCAAACC TTTACTCCGTTAATCTGGAGGGCAAGGATTTAACAAAACATACAAACTTTACTGAATATTATTGCAGAAATGCAAGCAGT GATGGAAGAAGGATCGTGTTTCAAAACAGTGGGGATATCTATTTATATGATCCAGAAAAACAAGAATTAAAACTCCTTGA TATTGACCTCCCTACAGACAGAAAGAAAAAGCAAGGAAAGTTCGTTGAAGTCCTAGATTACACTACTGAAGCTATCGCAA ACGATAAGTACCTTAGTTTAATAAGCAGAGGGAAAGTGTTTTTAATGAGACACTGGGATGGTCCTGCAGTACAACTTGGA GAAAAACAAGGAGTGAGATATAAACAAATACAGTTACTCCCTAATGGTGATACAGTAGTTTTAGATACAAATGATGACAA ACTCACTTTTCTCAGCAAAGATGGTTCAATTAAGAAATTAAATGTAGACCTTGGAAGAATTGAAAGAATCAAAGTATCCC CAGACGGAAAGAAGATACTTATTTCAAACAATAGACTAGAACTCTGGCTTTACGAGGTTGATACTACAAACCTAAGACTG ATTGACAAGAGCGAATACGATGTAATATCCCAGATGGACTGGCATCCTGATAATGAATGGTTCGCCTATACTTTTCCAGA AAGTTATAGCACTCAGTCTATTAAATTAGCTCATATCTCTGGAAAAGTAATAAGAATAACGAGCCCTTACGGTTATGATT TTTCTCCCTCTTTCGATCCCGACGGAAGATACTTGTATTTCTTATCAGCAAGACATCTAGACCCAACTAACGATAAGGTA ATATTTAATATGAGTTTCCAAAGAGTCATAAAGCCTTACCTAGTAGTCCTCTCTAATACTTATTCACCGTTTAATCAATC TTTAGAAGAGACTACAAGTGATAAAAAAGTCGAGATCGAGGGTATTGAAGACAGAGTAATTCCCTTCCCTGTTGATGAGG ACTATTATATTAGGATTGAAGGAGCAAAGAACAATAAAGTATTTCTCTTCTCACTTCCTATTAAAGGATATAGGTATCCT GGCGAAACATTAGGCAAACTAGAAGTTTTTGACCTTGATAGCAAAACTAAAGAACTATATGCAGATAATGTAAAGAGTTT TTCATTAACCATAGATAAAGGGAAAATTCTGATACTATTCAAAGACTCCATAAGACTCTTTGACGTAAACACTAAACCAG ATCTAAACGCCACGGGCAAGAAAGGAGGAATAGTTGATCTCTCAAGAATTAAAGTATATGTTGACCCAGAAAGAGAGTGG AAACAAATGTTCAGAGAAGCGTGGAAGCTTATGCAACAAAACTATTGGAAGCCAGATGGACTTAAGGACTGGGAATCAGT ACTTTTGAAGTATGAGAAGTTAATTGATAGGATAAGCACTCGTTACGAACTCTCAGACTTAATACAGGAAATGCAAGGTG AGACAAAAACCTCCCATTCTTATGAGATGCCATACGACTATGATACTGCAGAACCTTTACCAATAGGCGGATTAGGAGCC GATTATGAATATGACAAAGAAAATAAATGCTATAAGATTGCTAGGATTTACGTAGGTGACCCAACAAATGAAAATGAGAG AAGTCCTCTGAGAGATCCAGGAGTTCAGCTTAACATTGGAGATTGCATAAAGGCTGTTGACGGTGAAGAAGTGAAATATA ATATTCTCTCCTACCTAGTTAATAAGGATCAAGTGGTACTAGATGTTATTACGAAAGGAAAGACTAAACGCGTTACTGTG AAATTATTAAAAGATGAGAAGTTCTTAATTTATAGATATTGGGTTGAGAAGAATAGACAATATGTTCATGAGAAAAGTAA GGGAAAGTTAGGATATGTTCATATCCCCGATATGATGTACCAAGGTTTCGCAGAGTTCTATAGACTCTTCCTCTCCGAAT TCCATAGAGAAGGATTAATAGTAGACGTTAGGTTTAACAGAGGCGGGTTCATCTCCGGTTTAATTTTGGAGAAGCTCCTT CTCAAAAGAATGGGCTATGTAGTGAGGAGAAACGGAAAAGAACTACCGCATCCTTTCTTCTCTTCTCCCGGAGTTATCGT AGCAATAACTAATCAATATGCAGGCTCTGACGGCGATATATTCTCCTATTTATTCAAAAAGTACAAGTTGGGAATATTAA TAGGAAGAAGGACTTGGGGTGGAGTTATAGGAATTAACGTAAGAGATCGATTGGCTGATAACTCAGCAGTATCTCAACCA GAGTTTGCAGTACATTTTCACGACATAGGATTAAAAATAGAGAACTATGGTGTAGATCCGGATATTGAAGTTGATATTAA ACCAGAAGATTATGCTAATGGAAGGGATCCGCAACTTGATACTGCAATTGAGCTAGCATTAAAACAACTTGAAGAAAAAA GCTAG
Upstream 100 bases:
>100_bases GGGTTATCGAAAATTGCGAGATTATTTGCAATTGCATTTAATAAACTGATTTTATATTATCTCACTCGTAGTAATTAAGT GAATTTAATTCTAAGCAAAC
Downstream 100 bases:
>100_bases AAAATTTATAGTATAAAAACTAATTTTATCATGAAACTAATTTCTGGATACGATATCCCAATGATTTTCACTATTATATA GCCGCTAACCTATACAACTA
Product: tricorn protease
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 854; Mature: 854
Protein sequence:
>854_residues MFYNFLLYLVIHVNYALDALTFNKFLSLDGIVSWPMIIKDRVYFLSDHEGISNLYSVNLEGKDLTKHTNFTEYYCRNASS DGRRIVFQNSGDIYLYDPEKQELKLLDIDLPTDRKKKQGKFVEVLDYTTEAIANDKYLSLISRGKVFLMRHWDGPAVQLG EKQGVRYKQIQLLPNGDTVVLDTNDDKLTFLSKDGSIKKLNVDLGRIERIKVSPDGKKILISNNRLELWLYEVDTTNLRL IDKSEYDVISQMDWHPDNEWFAYTFPESYSTQSIKLAHISGKVIRITSPYGYDFSPSFDPDGRYLYFLSARHLDPTNDKV IFNMSFQRVIKPYLVVLSNTYSPFNQSLEETTSDKKVEIEGIEDRVIPFPVDEDYYIRIEGAKNNKVFLFSLPIKGYRYP GETLGKLEVFDLDSKTKELYADNVKSFSLTIDKGKILILFKDSIRLFDVNTKPDLNATGKKGGIVDLSRIKVYVDPEREW KQMFREAWKLMQQNYWKPDGLKDWESVLLKYEKLIDRISTRYELSDLIQEMQGETKTSHSYEMPYDYDTAEPLPIGGLGA DYEYDKENKCYKIARIYVGDPTNENERSPLRDPGVQLNIGDCIKAVDGEEVKYNILSYLVNKDQVVLDVITKGKTKRVTV KLLKDEKFLIYRYWVEKNRQYVHEKSKGKLGYVHIPDMMYQGFAEFYRLFLSEFHREGLIVDVRFNRGGFISGLILEKLL LKRMGYVVRRNGKELPHPFFSSPGVIVAITNQYAGSDGDIFSYLFKKYKLGILIGRRTWGGVIGINVRDRLADNSAVSQP EFAVHFHDIGLKIENYGVDPDIEVDIKPEDYANGRDPQLDTAIELALKQLEEKS
Sequences:
>Translated_854_residues MFYNFLLYLVIHVNYALDALTFNKFLSLDGIVSWPMIIKDRVYFLSDHEGISNLYSVNLEGKDLTKHTNFTEYYCRNASS DGRRIVFQNSGDIYLYDPEKQELKLLDIDLPTDRKKKQGKFVEVLDYTTEAIANDKYLSLISRGKVFLMRHWDGPAVQLG EKQGVRYKQIQLLPNGDTVVLDTNDDKLTFLSKDGSIKKLNVDLGRIERIKVSPDGKKILISNNRLELWLYEVDTTNLRL IDKSEYDVISQMDWHPDNEWFAYTFPESYSTQSIKLAHISGKVIRITSPYGYDFSPSFDPDGRYLYFLSARHLDPTNDKV IFNMSFQRVIKPYLVVLSNTYSPFNQSLEETTSDKKVEIEGIEDRVIPFPVDEDYYIRIEGAKNNKVFLFSLPIKGYRYP GETLGKLEVFDLDSKTKELYADNVKSFSLTIDKGKILILFKDSIRLFDVNTKPDLNATGKKGGIVDLSRIKVYVDPEREW KQMFREAWKLMQQNYWKPDGLKDWESVLLKYEKLIDRISTRYELSDLIQEMQGETKTSHSYEMPYDYDTAEPLPIGGLGA DYEYDKENKCYKIARIYVGDPTNENERSPLRDPGVQLNIGDCIKAVDGEEVKYNILSYLVNKDQVVLDVITKGKTKRVTV KLLKDEKFLIYRYWVEKNRQYVHEKSKGKLGYVHIPDMMYQGFAEFYRLFLSEFHREGLIVDVRFNRGGFISGLILEKLL LKRMGYVVRRNGKELPHPFFSSPGVIVAITNQYAGSDGDIFSYLFKKYKLGILIGRRTWGGVIGINVRDRLADNSAVSQP EFAVHFHDIGLKIENYGVDPDIEVDIKPEDYANGRDPQLDTAIELALKQLEEKS >Mature_854_residues MFYNFLLYLVIHVNYALDALTFNKFLSLDGIVSWPMIIKDRVYFLSDHEGISNLYSVNLEGKDLTKHTNFTEYYCRNASS DGRRIVFQNSGDIYLYDPEKQELKLLDIDLPTDRKKKQGKFVEVLDYTTEAIANDKYLSLISRGKVFLMRHWDGPAVQLG EKQGVRYKQIQLLPNGDTVVLDTNDDKLTFLSKDGSIKKLNVDLGRIERIKVSPDGKKILISNNRLELWLYEVDTTNLRL IDKSEYDVISQMDWHPDNEWFAYTFPESYSTQSIKLAHISGKVIRITSPYGYDFSPSFDPDGRYLYFLSARHLDPTNDKV IFNMSFQRVIKPYLVVLSNTYSPFNQSLEETTSDKKVEIEGIEDRVIPFPVDEDYYIRIEGAKNNKVFLFSLPIKGYRYP GETLGKLEVFDLDSKTKELYADNVKSFSLTIDKGKILILFKDSIRLFDVNTKPDLNATGKKGGIVDLSRIKVYVDPEREW KQMFREAWKLMQQNYWKPDGLKDWESVLLKYEKLIDRISTRYELSDLIQEMQGETKTSHSYEMPYDYDTAEPLPIGGLGA DYEYDKENKCYKIARIYVGDPTNENERSPLRDPGVQLNIGDCIKAVDGEEVKYNILSYLVNKDQVVLDVITKGKTKRVTV KLLKDEKFLIYRYWVEKNRQYVHEKSKGKLGYVHIPDMMYQGFAEFYRLFLSEFHREGLIVDVRFNRGGFISGLILEKLL LKRMGYVVRRNGKELPHPFFSSPGVIVAITNQYAGSDGDIFSYLFKKYKLGILIGRRTWGGVIGINVRDRLADNSAVSQP EFAVHFHDIGLKIENYGVDPDIEVDIKPEDYANGRDPQLDTAIELALKQLEEKS
Specific function: Degrades oligopeptides in a sequential manner
COG id: COG4946
COG function: function code S; Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase S41B family
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): TRIC_SULTO (Q972G5)
Other databases:
- EMBL: BA000023 - RefSeq: NP_377094.1 - ProteinModelPortal: Q972G5 - MEROPS: S41.005 - GeneID: 1459157 - GenomeReviews: BA000023_GR - KEGG: sto:ST1168 - NMPDR: fig|273063.1.peg.1262 - HOGENOM: HBG562891 - OMA: GYIHIPD - ProtClustDB: CLSK803210 - BioCyc: STOK273063:ST1168-MONOMER - GO: GO:0005737 - GO: GO:0006508 - InterPro: IPR011659 - InterPro: IPR001478 - InterPro: IPR005151 - InterPro: IPR015943 - Gene3D: G3DSA:2.130.10.10 - SMART: SM00245
Pfam domain/function: PF07676 PD40; PF03572 Peptidase_S41; SSF50156 PDZ
EC number: NA
Molecular weight: Translated: 98911; Mature: 98911
Theoretical pI: Translated: 6.32; Mature: 6.32
Prosite motif: NA
Important sites: ACT_SITE 539-539 ACT_SITE 756-756 ACT_SITE 757-757
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein) 1.4 %Met (Translated Protein) 1.8 %Cys+Met (Translated Protein) 0.4 %Cys (Mature Protein) 1.4 %Met (Mature Protein) 1.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MFYNFLLYLVIHVNYALDALTFNKFLSLDGIVSWPMIIKDRVYFLSDHEGISNLYSVNLE CHHHHEEEHHHHHHHHHHHHHHHHHHCCCCCEECCEEEECEEEEEECCCCCCCEEEEECC GKDLTKHTNFTEYYCRNASSDGRRIVFQNSGDIYLYDPEKQELKLLDIDLPTDRKKKQGK CCCCCCCCCCHHHHCCCCCCCCCEEEEECCCCEEEECCCCCCEEEEEEECCCCCCHHCCC FVEVLDYTTEAIANDKYLSLISRGKVFLMRHWDGPAVQLGEKQGVRYKQIQLLPNGDTVV EEEEEHHHHHHHCCCHHHHHHHCCCEEEEEECCCCCEEECCCCCCEEEEEEEECCCCEEE LDTNDDKLTFLSKDGSIKKLNVDLGRIERIKVSPDGKKILISNNRLELWLYEVDTTNLRL EECCCCEEEEEECCCCEEEEEECCCCEEEEEECCCCCEEEEECCEEEEEEEEECCCEEEE IDKSEYDVISQMDWHPDNEWFAYTFPESYSTQSIKLAHISGKVIRITSPYGYDFSPSFDP EECCCCHHHHCCCCCCCCCEEEEECCCCCCCCEEEEEEECCEEEEEECCCCCCCCCCCCC DGRYLYFLSARHLDPTNDKVIFNMSFQRVIKPYLVVLSNTYSPFNQSLEETTSDKKVEIE CCCEEEEEEECCCCCCCCEEEEECCHHHHHHHHEEEEECCCCHHHHHHHHCCCCCEEEEE GIEDRVIPFPVDEDYYIRIEGAKNNKVFLFSLPIKGYRYPGETLGKLEVFDLDSKTKELY CCCCCEEECCCCCCEEEEEECCCCCEEEEEEECCCCCCCCCCCCCEEEEEECCCHHHHHH ADNVKSFSLTIDKGKILILFKDSIRLFDVNTKPDLNATGKKGGIVDLSRIKVYVDPEREW HCCCCEEEEEEECCEEEEEEECCEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCHHH KQMFREAWKLMQQNYWKPDGLKDWESVLLKYEKLIDRISTRYELSDLIQEMQGETKTSHS HHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC YEMPYDYDTAEPLPIGGLGADYEYDKENKCYKIARIYVGDPTNENERSPLRDPGVQLNIG CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEHH DCIKAVDGEEVKYNILSYLVNKDQVVLDVITKGKTKRVTVKLLKDEKFLIYRYWVEKNRQ HHHHCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEECCCEEEEEEEEECCCHH YVHEKSKGKLGYVHIPDMMYQGFAEFYRLFLSEFHREGLIVDVRFNRGGFISGLILEKLL HHHHCCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHH LKRMGYVVRRNGKELPHPFFSSPGVIVAITNQYAGSDGDIFSYLFKKYKLGILIGRRTWG HHHCCHHEECCCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHEEEEEEECCCCC GVIGINVRDRLADNSAVSQPEFAVHFHDIGLKIENYGVDPDIEVDIKPEDYANGRDPQLD CEEEECHHHHCCCCCCCCCCCEEEEEEEECEEEEECCCCCCEEEEECCCCCCCCCCCCHH TAIELALKQLEEKS HHHHHHHHHHHHCC >Mature Secondary Structure MFYNFLLYLVIHVNYALDALTFNKFLSLDGIVSWPMIIKDRVYFLSDHEGISNLYSVNLE CHHHHEEEHHHHHHHHHHHHHHHHHHCCCCCEECCEEEECEEEEEECCCCCCCEEEEECC GKDLTKHTNFTEYYCRNASSDGRRIVFQNSGDIYLYDPEKQELKLLDIDLPTDRKKKQGK CCCCCCCCCCHHHHCCCCCCCCCEEEEECCCCEEEECCCCCCEEEEEEECCCCCCHHCCC FVEVLDYTTEAIANDKYLSLISRGKVFLMRHWDGPAVQLGEKQGVRYKQIQLLPNGDTVV EEEEEHHHHHHHCCCHHHHHHHCCCEEEEEECCCCCEEECCCCCCEEEEEEEECCCCEEE LDTNDDKLTFLSKDGSIKKLNVDLGRIERIKVSPDGKKILISNNRLELWLYEVDTTNLRL EECCCCEEEEEECCCCEEEEEECCCCEEEEEECCCCCEEEEECCEEEEEEEEECCCEEEE IDKSEYDVISQMDWHPDNEWFAYTFPESYSTQSIKLAHISGKVIRITSPYGYDFSPSFDP EECCCCHHHHCCCCCCCCCEEEEECCCCCCCCEEEEEEECCEEEEEECCCCCCCCCCCCC DGRYLYFLSARHLDPTNDKVIFNMSFQRVIKPYLVVLSNTYSPFNQSLEETTSDKKVEIE CCCEEEEEEECCCCCCCCEEEEECCHHHHHHHHEEEEECCCCHHHHHHHHCCCCCEEEEE GIEDRVIPFPVDEDYYIRIEGAKNNKVFLFSLPIKGYRYPGETLGKLEVFDLDSKTKELY CCCCCEEECCCCCCEEEEEECCCCCEEEEEEECCCCCCCCCCCCCEEEEEECCCHHHHHH ADNVKSFSLTIDKGKILILFKDSIRLFDVNTKPDLNATGKKGGIVDLSRIKVYVDPEREW HCCCCEEEEEEECCEEEEEEECCEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCHHH KQMFREAWKLMQQNYWKPDGLKDWESVLLKYEKLIDRISTRYELSDLIQEMQGETKTSHS HHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC YEMPYDYDTAEPLPIGGLGADYEYDKENKCYKIARIYVGDPTNENERSPLRDPGVQLNIG CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEHH DCIKAVDGEEVKYNILSYLVNKDQVVLDVITKGKTKRVTVKLLKDEKFLIYRYWVEKNRQ HHHHCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEECCCEEEEEEEEECCCHH YVHEKSKGKLGYVHIPDMMYQGFAEFYRLFLSEFHREGLIVDVRFNRGGFISGLILEKLL HHHHCCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHH LKRMGYVVRRNGKELPHPFFSSPGVIVAITNQYAGSDGDIFSYLFKKYKLGILIGRRTWG HHHCCHHEECCCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHEEEEEEECCCCC GVIGINVRDRLADNSAVSQPEFAVHFHDIGLKIENYGVDPDIEVDIKPEDYANGRDPQLD CEEEECHHHHCCCCCCCCCCCEEEEEEEECEEEEECCCCCCEEEEECCCCCCCCCCCCHH TAIELALKQLEEKS HHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11572479