Definition | Sulfolobus tokodaii str. 7 chromosome, complete genome. |
---|---|
Accession | NC_003106 |
Length | 2,694,756 |
Click here to switch to the map view.
The map label for this gene is thsA [H]
Identifier: 15921515
GI number: 15921515
Start: 1252656
End: 1254362
Strand: Direct
Name: thsA [H]
Synonym: ST1253
Alternate gene names: 15921515
Gene position: 1252656-1254362 (Clockwise)
Preceding gene: 15921513
Following gene: 15921518
Centisome position: 46.48
GC content: 35.97
Gene sequence:
>1707_bases ATGCAAAACCAGATACGGTGTTTGAATATGGCAAACGCCCCAGTCTTATTACTAAAAGAAGGCACTCAAAGATCTTCAGG AAGGGATGCTTTAAAGAATAACATTTTAGCAGCAGTTACTTTAGCTGAAATGTTAAAAAGTAGTTTAGGACCAAGAGGAT TAGATAAAATGCTTATTGATAGCTTCGGAGATGTCACTATAACCAATGACGGTGCTACTATAGTAAAGGAAATGGAAATT CAGCATCCAGCAGCAAAGCTTTTAGTCGAGGCAGCAAAAGCTCAAGACGCCGAAGTAGGTGATGGTACAACTTCAGCAGT TGTTCTTGCAGGGCTGTTATTAGATAAGGCGGATGATTTATTAGACCAAAACATTCATCCAACAATAATTATTGAAGGGT ATAAGAAAGCTCTAAATAAATCCTTAGAAATTATTGATCAACTAGCTACTAAAATTGATGTATCTAACTTAAATTCACTT GCTACGAGAGATCAGTTAAAGAAGATAGTATATACAACAATGTCTAGTAAATTCATAGCTGGCGGAGAAGAGATGGATAA AATTATGAATATGGTAATTGATGCTGTCTCAATAGTTGCTGAACCCTTACCAGAAGGAGGATATAATGTACCATTGGATC TAATAAAGATTGATAAGAAAAAAGGAGGAAGCATCGAAGATAGTATGTTAGTTCATGGCCTAGTTTTAGATAAAGAAGTA GTTCATCCTGGAATGCCAAGAAGAGTTGAAAAAGCAAAAATTGCTGTATTAGATGCTGCATTAGAAGTAGAAAAACCAGA AATTTCAGCTAAAATCAGTATAACTAGCCCAGAACAGATTAAAGCTTTCCTTGATGAGGAAGCAAAGTATCTAAAAGATA TGGTTGATAAATTAGCTTCAATTGGCGCTAATGTTGTAATCTGCCAGAAAGGTATTGATGATGTTGCACAACACTTCTTA GCAAAGAAAGGAATCTTAGCAGTTAGAAGAGTAAAGAGAAGTGATATTGAGAAGTTAGAGAAAGCACTTGGTGCTAGAAT CATAAGTAGTATCAAGGATGCTACCCCAGAAGATTTAGGTTATGCTGAACTAGTAGAAGAAAGAAGAGTTGGTAATGACA AAATGGTATTCATTGAAGGTGCAAAGAATCCAAAGGCTGTAAACATATTATTAAGGGGTTCAAATGACATGGCATTAGAT GAGGCTGAAAGAAGTATTAATGATGCATTACACTCATTAAGGAATGTATTAATGAAGCCAATGATTGTTGCTGGTGGTGG TGCTGTAGAGACTGAGTTAGCATTAAGATTGAGAGAATACGCAAGATCTGTGGGTGGCAAAGAACAATTAGCAATTGAAA AGTTTGCTGAGGCATTAGAAGAAATACCAATGATATTAGCTGAAACTGCTGGTATGGAGCCAATTCAGACATTAATGGAT CTAAGAGCAAAGCATGCTAAAGGATTAATTAATGCTGGAGTTGATGTTATGAACGGAAAAATTGCTGATGATATGTTAGC TCTTAATGTATTAGAGCCAGTAAGAGTTAAAGCTCAAGTATTAAAGAGTGCTGTAGAAGCTGCTACCGCAATATTGAAGA TTGATGATCTAATAGCAGCTGCTCCATTAAAGAGTGGAGAGAAGAAAGGAGAGAAGAAAGAAGGAGGAGAAGAAGAGAAA TCATCAACTCCTTCTTCACTAGAATAA
Upstream 100 bases:
>100_bases GTTGACAAATTTCCATTTATAATTCTTAAGTTATTATTCTCCTCCTCTTAAATTGATCACATGAAAATATAACAGAAAAA TTTATATATAAGCGATTCTA
Downstream 100 bases:
>100_bases ACTAATTTTTTATTTATACATCATTAGTTGATTTGTTTCTTCTGAATTATATCCTTCAAGCTCATCAAGAATTTTTTTAA CTGCATCTTTGATTTTTTGG
Product: thermosome, alpha subunit
Products: NA
Alternate protein names: Chaperonin subunit alpha; Thermosome subunit 1 [H]
Number of amino acids: Translated: 568; Mature: 568
Protein sequence:
>568_residues MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLIDSFGDVTITNDGATIVKEMEI QHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDLLDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSL ATRDQLKKIVYTTMSSKFIAGGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLASIGANVVICQKGIDDVAQHFL AKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLGYAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALD EAERSINDALHSLRNVLMKPMIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAAAPLKSGEKKGEKKEGGEEEK SSTPSSLE
Sequences:
>Translated_568_residues MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLIDSFGDVTITNDGATIVKEMEI QHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDLLDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSL ATRDQLKKIVYTTMSSKFIAGGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLASIGANVVICQKGIDDVAQHFL AKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLGYAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALD EAERSINDALHSLRNVLMKPMIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAAAPLKSGEKKGEKKEGGEEEK SSTPSSLE >Mature_568_residues MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLIDSFGDVTITNDGATIVKEMEI QHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDLLDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSL ATRDQLKKIVYTTMSSKFIAGGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLASIGANVVICQKGIDDVAQHFL AKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLGYAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALD EAERSINDALHSLRNVLMKPMIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAAAPLKSGEKKGEKKEGGEEEK SSTPSSLE
Specific function: Molecular chaperone; binds unfolded polypeptides in vitro, and has a weak ATPase activity [H]
COG id: COG0459
COG function: function code O; Chaperonin GroEL (HSP60 family)
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the TCP-1 chaperonin family [H]
Homologues:
Organism=Homo sapiens, GI63162572, Length=555, Percent_Identity=37.6576576576577, Blast_Score=379, Evalue=1e-105, Organism=Homo sapiens, GI57863257, Length=545, Percent_Identity=41.2844036697248, Blast_Score=376, Evalue=1e-104, Organism=Homo sapiens, GI24307939, Length=529, Percent_Identity=41.3988657844991, Blast_Score=376, Evalue=1e-104, Organism=Homo sapiens, GI38455427, Length=513, Percent_Identity=42.495126705653, Blast_Score=375, Evalue=1e-104, Organism=Homo sapiens, GI5453607, Length=532, Percent_Identity=37.406015037594, Blast_Score=345, Evalue=7e-95, Organism=Homo sapiens, GI58761484, Length=555, Percent_Identity=34.7747747747748, Blast_Score=322, Evalue=4e-88, Organism=Homo sapiens, GI261399877, Length=488, Percent_Identity=36.2704918032787, Blast_Score=304, Evalue=2e-82, Organism=Homo sapiens, GI58331173, Length=518, Percent_Identity=36.1003861003861, Blast_Score=299, Evalue=5e-81, Organism=Homo sapiens, GI4502643, Length=524, Percent_Identity=35.3053435114504, Blast_Score=292, Evalue=7e-79, Organism=Homo sapiens, GI5453603, Length=535, Percent_Identity=34.9532710280374, Blast_Score=284, Evalue=1e-76, Organism=Homo sapiens, GI302058290, Length=511, Percent_Identity=33.6594911937378, Blast_Score=254, Evalue=1e-67, Organism=Homo sapiens, GI261399875, Length=443, Percent_Identity=34.0857787810384, Blast_Score=250, Evalue=2e-66, Organism=Homo sapiens, GI48762932, Length=557, Percent_Identity=33.0341113105925, Blast_Score=246, Evalue=5e-65, Organism=Homo sapiens, GI57863259, Length=395, Percent_Identity=37.9746835443038, Blast_Score=239, Evalue=7e-63, Organism=Homo sapiens, GI302058292, Length=518, Percent_Identity=31.8532818532818, Blast_Score=237, Evalue=3e-62, Organism=Homo sapiens, GI58331171, Length=525, Percent_Identity=31.047619047619, Blast_Score=230, Evalue=3e-60, Organism=Homo sapiens, GI58331185, Length=315, Percent_Identity=35.2380952380952, Blast_Score=197, Evalue=2e-50, Organism=Homo sapiens, GI7657253, Length=514, Percent_Identity=24.3190661478599, Blast_Score=145, Evalue=1e-34, Organism=Homo sapiens, GI25914754, Length=471, Percent_Identity=23.1422505307856, Blast_Score=89, Evalue=1e-17, Organism=Homo sapiens, GI9055272, Length=471, Percent_Identity=23.1422505307856, Blast_Score=89, Evalue=1e-17, Organism=Escherichia coli, GI1790586, Length=571, Percent_Identity=23.292469352014, Blast_Score=65, Evalue=2e-11, Organism=Caenorhabditis elegans, GI25144674, Length=536, Percent_Identity=41.044776119403, Blast_Score=372, Evalue=1e-103, Organism=Caenorhabditis elegans, GI17532603, Length=539, Percent_Identity=40.8163265306122, Blast_Score=371, Evalue=1e-103, Organism=Caenorhabditis elegans, GI17532601, Length=545, Percent_Identity=40.9174311926606, Blast_Score=368, Evalue=1e-102, Organism=Caenorhabditis elegans, GI25148561, Length=551, Percent_Identity=35.2087114337568, Blast_Score=342, Evalue=4e-94, Organism=Caenorhabditis elegans, GI17564182, Length=540, Percent_Identity=34.4444444444444, Blast_Score=324, Evalue=7e-89, Organism=Caenorhabditis elegans, GI25147750, Length=536, Percent_Identity=36.3805970149254, Blast_Score=308, Evalue=5e-84, Organism=Caenorhabditis elegans, GI32566944, Length=454, Percent_Identity=34.3612334801762, Blast_Score=276, Evalue=2e-74, Organism=Caenorhabditis elegans, GI25144678, Length=527, Percent_Identity=33.965844402277, Blast_Score=271, Evalue=9e-73, Organism=Caenorhabditis elegans, GI71998178, Length=569, Percent_Identity=30.0527240773286, Blast_Score=239, Evalue=3e-63, Organism=Caenorhabditis elegans, GI25144680, Length=402, Percent_Identity=35.8208955223881, Blast_Score=238, Evalue=7e-63, Organism=Caenorhabditis elegans, GI71981457, Length=328, Percent_Identity=40.8536585365854, Blast_Score=223, Evalue=2e-58, Organism=Caenorhabditis elegans, GI17555558, Length=574, Percent_Identity=21.602787456446, Blast_Score=68, Evalue=1e-11, Organism=Saccharomyces cerevisiae, GI6322446, Length=537, Percent_Identity=36.3128491620112, Blast_Score=365, Evalue=1e-101, Organism=Saccharomyces cerevisiae, GI6320418, Length=556, Percent_Identity=38.3093525179856, Blast_Score=352, Evalue=9e-98, Organism=Saccharomyces cerevisiae, GI6322524, Length=549, Percent_Identity=38.615664845173, Blast_Score=349, Evalue=5e-97, Organism=Saccharomyces cerevisiae, GI6320058, Length=515, Percent_Identity=38.0582524271845, Blast_Score=327, Evalue=2e-90, Organism=Saccharomyces cerevisiae, GI6322350, Length=527, Percent_Identity=36.8121442125237, Blast_Score=326, Evalue=5e-90, Organism=Saccharomyces cerevisiae, GI6322049, Length=511, Percent_Identity=34.2465753424658, Blast_Score=271, Evalue=2e-73, Organism=Saccharomyces cerevisiae, GI6320393, Length=534, Percent_Identity=32.3970037453184, Blast_Score=250, Evalue=4e-67, Organism=Saccharomyces cerevisiae, GI6322452, Length=545, Percent_Identity=30.6422018348624, Blast_Score=228, Evalue=2e-60, Organism=Drosophila melanogaster, GI17647245, Length=530, Percent_Identity=40.9433962264151, Blast_Score=360, Evalue=2e-99, Organism=Drosophila melanogaster, GI24649027, Length=549, Percent_Identity=39.1621129326047, Blast_Score=355, Evalue=4e-98, Organism=Drosophila melanogaster, GI24649029, Length=549, Percent_Identity=39.1621129326047, Blast_Score=355, Evalue=4e-98, Organism=Drosophila melanogaster, GI24647512, Length=563, Percent_Identity=36.9449378330373, Blast_Score=353, Evalue=2e-97, Organism=Drosophila melanogaster, GI24647510, Length=563, Percent_Identity=36.9449378330373, Blast_Score=353, Evalue=2e-97, Organism=Drosophila melanogaster, GI24583944, Length=521, Percent_Identity=39.5393474088292, Blast_Score=351, Evalue=9e-97, Organism=Drosophila melanogaster, GI24652903, Length=509, Percent_Identity=41.0609037328094, Blast_Score=347, Evalue=1e-95, Organism=Drosophila melanogaster, GI24645179, Length=536, Percent_Identity=38.8059701492537, Blast_Score=344, Evalue=1e-94, Organism=Drosophila melanogaster, GI18858175, Length=549, Percent_Identity=34.608378870674, Blast_Score=289, Evalue=3e-78, Organism=Drosophila melanogaster, GI28571140, Length=547, Percent_Identity=34.7349177330896, Blast_Score=289, Evalue=3e-78, Organism=Drosophila melanogaster, GI18859933, Length=537, Percent_Identity=32.9608938547486, Blast_Score=262, Evalue=5e-70, Organism=Drosophila melanogaster, GI19921848, Length=561, Percent_Identity=33.5115864527629, Blast_Score=250, Evalue=2e-66, Organism=Drosophila melanogaster, GI20130093, Length=359, Percent_Identity=26.1838440111421, Blast_Score=86, Evalue=5e-17, Organism=Drosophila melanogaster, GI45552711, Length=359, Percent_Identity=26.1838440111421, Blast_Score=86, Evalue=5e-17,
Paralogues:
None
Copy number: 2180 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 360 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 480 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). 15012 Molecules/Cell In: Growth Phase,
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR017998 - InterPro: IPR002194 - InterPro: IPR002423 - InterPro: IPR012714 [H]
Pfam domain/function: PF00118 Cpn60_TCP1 [H]
EC number: NA
Molecular weight: Translated: 61271; Mature: 61271
Theoretical pI: Translated: 5.14; Mature: 5.14
Prosite motif: PS00750 TCP1_1 ; PS00751 TCP1_2 ; PS00995 TCP1_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein) 3.7 %Met (Translated Protein) 4.0 %Cys+Met (Translated Protein) 0.4 %Cys (Mature Protein) 3.7 %Met (Mature Protein) 4.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLID CCCCEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH SFGDVTITNDGATIVKEMEIQHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDL CCCCEEEECCCHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHH LDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSLATRDQLKKIVYTTMSSKFIA HCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCCHHHC GGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLAS HCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHH IGANVVICQKGIDDVAQHFLAKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLG CCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC YAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALDEAERSINDALHSLRNVLMKP HHHHHHHHHCCCCCEEEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC MIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD EEEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAA HHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH APLKSGEKKGEKKEGGEEEKSSTPSSLE CCCCCCCCCCCCCCCCCCHHCCCCCCCC >Mature Secondary Structure MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLID CCCCEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH SFGDVTITNDGATIVKEMEIQHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDL CCCCEEEECCCHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHH LDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSLATRDQLKKIVYTTMSSKFIA HCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCCHHHC GGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLAS HCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHH IGANVVICQKGIDDVAQHFLAKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLG CCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC YAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALDEAERSINDALHSLRNVLMKP HHHHHHHHHCCCCCEEEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC MIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD EEEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAA HHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH APLKSGEKKGEKKEGGEEEKSSTPSSLE CCCCCCCCCCCCCCCCCCHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9245723; 11572479 [H]