Definition: Sulfolobus tokodaii str. 7 chromosome, complete genome.
Accession: NC_003106
Length: 2,694,756 bp

The map label for this gene is thsA [H]

Identifier: 15921515

GI number: 15921515

Start: 1252656

End: 1254362

Strand: Direct

Name: thsA [H]

Synonym: ST1253

Alternate gene names: 15921515

Gene position: 1252656-1254362 (Clockwise)

Preceding gene: 15921513

Following gene: 15921518

Centisome position: 46.48
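
The gene length and centisome position can be checked from the coordinates by simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming the centisome is the gene start expressed as a percentage of the genome length. The record does not state its formula; this assumption happens to reproduce the listed value. The function names are illustrative only.

```python
GENOME_LENGTH = 2_694_756          # Length field of NC_003106
START, END = 1_252_656, 1_254_362  # Start and End fields for thsA


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1707
print(round(centisome(START, GENOME_LENGTH), 2))  # 46.48
```

The length of 1707 agrees with the `>1707_bases` header of the gene sequence.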

GC content: 35.97

Gene sequence:

>1707_bases
ATGCAAAACCAGATACGGTGTTTGAATATGGCAAACGCCCCAGTCTTATTACTAAAAGAAGGCACTCAAAGATCTTCAGG
AAGGGATGCTTTAAAGAATAACATTTTAGCAGCAGTTACTTTAGCTGAAATGTTAAAAAGTAGTTTAGGACCAAGAGGAT
TAGATAAAATGCTTATTGATAGCTTCGGAGATGTCACTATAACCAATGACGGTGCTACTATAGTAAAGGAAATGGAAATT
CAGCATCCAGCAGCAAAGCTTTTAGTCGAGGCAGCAAAAGCTCAAGACGCCGAAGTAGGTGATGGTACAACTTCAGCAGT
TGTTCTTGCAGGGCTGTTATTAGATAAGGCGGATGATTTATTAGACCAAAACATTCATCCAACAATAATTATTGAAGGGT
ATAAGAAAGCTCTAAATAAATCCTTAGAAATTATTGATCAACTAGCTACTAAAATTGATGTATCTAACTTAAATTCACTT
GCTACGAGAGATCAGTTAAAGAAGATAGTATATACAACAATGTCTAGTAAATTCATAGCTGGCGGAGAAGAGATGGATAA
AATTATGAATATGGTAATTGATGCTGTCTCAATAGTTGCTGAACCCTTACCAGAAGGAGGATATAATGTACCATTGGATC
TAATAAAGATTGATAAGAAAAAAGGAGGAAGCATCGAAGATAGTATGTTAGTTCATGGCCTAGTTTTAGATAAAGAAGTA
GTTCATCCTGGAATGCCAAGAAGAGTTGAAAAAGCAAAAATTGCTGTATTAGATGCTGCATTAGAAGTAGAAAAACCAGA
AATTTCAGCTAAAATCAGTATAACTAGCCCAGAACAGATTAAAGCTTTCCTTGATGAGGAAGCAAAGTATCTAAAAGATA
TGGTTGATAAATTAGCTTCAATTGGCGCTAATGTTGTAATCTGCCAGAAAGGTATTGATGATGTTGCACAACACTTCTTA
GCAAAGAAAGGAATCTTAGCAGTTAGAAGAGTAAAGAGAAGTGATATTGAGAAGTTAGAGAAAGCACTTGGTGCTAGAAT
CATAAGTAGTATCAAGGATGCTACCCCAGAAGATTTAGGTTATGCTGAACTAGTAGAAGAAAGAAGAGTTGGTAATGACA
AAATGGTATTCATTGAAGGTGCAAAGAATCCAAAGGCTGTAAACATATTATTAAGGGGTTCAAATGACATGGCATTAGAT
GAGGCTGAAAGAAGTATTAATGATGCATTACACTCATTAAGGAATGTATTAATGAAGCCAATGATTGTTGCTGGTGGTGG
TGCTGTAGAGACTGAGTTAGCATTAAGATTGAGAGAATACGCAAGATCTGTGGGTGGCAAAGAACAATTAGCAATTGAAA
AGTTTGCTGAGGCATTAGAAGAAATACCAATGATATTAGCTGAAACTGCTGGTATGGAGCCAATTCAGACATTAATGGAT
CTAAGAGCAAAGCATGCTAAAGGATTAATTAATGCTGGAGTTGATGTTATGAACGGAAAAATTGCTGATGATATGTTAGC
TCTTAATGTATTAGAGCCAGTAAGAGTTAAAGCTCAAGTATTAAAGAGTGCTGTAGAAGCTGCTACCGCAATATTGAAGA
TTGATGATCTAATAGCAGCTGCTCCATTAAAGAGTGGAGAGAAGAAAGGAGAGAAGAAAGAAGGAGGAGAAGAAGAGAAA
TCATCAACTCCTTCTTCACTAGAATAA
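
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the count of G and C bases divided by the total number of bases, expressed as a percentage. The helper name `gc_content` is hypothetical, and the example uses only the first 20 bases of the sequence above, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks from wrapped sequence text
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


fragment = "ATGCAAAACCAGATACGGTG"  # first 20 bases of the gene sequence
print(gc_content(fragment))        # 45.0
```

Passing the full 1707-base sequence to the same function should give a figure comparable to the 35.97 listed in the record, provided the record uses the same definition.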

Upstream 100 bases:

>100_bases
GTTGACAAATTTCCATTTATAATTCTTAAGTTATTATTCTCCTCCTCTTAAATTGATCACATGAAAATATAACAGAAAAA
TTTATATATAAGCGATTCTA

Downstream 100 bases:

>100_bases
ACTAATTTTTTATTTATACATCATTAGTTGATTTGTTTCTTCTGAATTATATCCTTCAAGCTCATCAAGAATTTTTTTAA
CTGCATCTTTGATTTTTTGG

Product: thermosome, alpha subunit

Products: NA

Alternate protein names: Chaperonin subunit alpha; Thermosome subunit 1 [H]

Number of amino acids: Translated: 568; Mature: 568

Protein sequence:

>568_residues
MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLIDSFGDVTITNDGATIVKEMEI
QHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDLLDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSL
ATRDQLKKIVYTTMSSKFIAGGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV
VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLASIGANVVICQKGIDDVAQHFL
AKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLGYAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALD
EAERSINDALHSLRNVLMKPMIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD
LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAAAPLKSGEKKGEKKEGGEEEK
SSTPSSLE
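
The protein sequence corresponds to a codon-by-codon translation of the gene sequence: 1707 bases give 569 codons, that is, 568 residues plus a stop codon. The following is a minimal sketch, assuming the standard codon assignments (the record does not name the translation table it used) and a hand-built lookup table rather than any particular library. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, '*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAAAACCAGATACGG"))           # MQNQIR
print(translate("TCATCAACTCCTTCTTCACTAGAATAA"))  # SSTPSSLE
```

The two calls use the first 18 and the last 27 bases of the gene sequence and reproduce the start and end of the protein sequence.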

Sequences:

>Translated_568_residues
MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLIDSFGDVTITNDGATIVKEMEI
QHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDLLDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSL
ATRDQLKKIVYTTMSSKFIAGGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV
VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLASIGANVVICQKGIDDVAQHFL
AKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLGYAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALD
EAERSINDALHSLRNVLMKPMIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD
LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAAAPLKSGEKKGEKKEGGEEEK
SSTPSSLE
>Mature_568_residues
MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLIDSFGDVTITNDGATIVKEMEI
QHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDLLDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSL
ATRDQLKKIVYTTMSSKFIAGGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV
VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLASIGANVVICQKGIDDVAQHFL
AKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLGYAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALD
EAERSINDALHSLRNVLMKPMIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD
LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAAAPLKSGEKKGEKKEGGEEEK
SSTPSSLE

Specific function: Molecular chaperone; binds unfolded polypeptides in vitro, and has a weak ATPase activity [H]

COG id: COG0459

COG function: function code O; Chaperonin GroEL (HSP60 family)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the TCP-1 chaperonin family [H]

Homologues:

Organism=Homo sapiens, GI63162572, Length=555, Percent_Identity=37.6576576576577, Blast_Score=379, Evalue=1e-105,
Organism=Homo sapiens, GI57863257, Length=545, Percent_Identity=41.2844036697248, Blast_Score=376, Evalue=1e-104,
Organism=Homo sapiens, GI24307939, Length=529, Percent_Identity=41.3988657844991, Blast_Score=376, Evalue=1e-104,
Organism=Homo sapiens, GI38455427, Length=513, Percent_Identity=42.495126705653, Blast_Score=375, Evalue=1e-104,
Organism=Homo sapiens, GI5453607, Length=532, Percent_Identity=37.406015037594, Blast_Score=345, Evalue=7e-95,
Organism=Homo sapiens, GI58761484, Length=555, Percent_Identity=34.7747747747748, Blast_Score=322, Evalue=4e-88,
Organism=Homo sapiens, GI261399877, Length=488, Percent_Identity=36.2704918032787, Blast_Score=304, Evalue=2e-82,
Organism=Homo sapiens, GI58331173, Length=518, Percent_Identity=36.1003861003861, Blast_Score=299, Evalue=5e-81,
Organism=Homo sapiens, GI4502643, Length=524, Percent_Identity=35.3053435114504, Blast_Score=292, Evalue=7e-79,
Organism=Homo sapiens, GI5453603, Length=535, Percent_Identity=34.9532710280374, Blast_Score=284, Evalue=1e-76,
Organism=Homo sapiens, GI302058290, Length=511, Percent_Identity=33.6594911937378, Blast_Score=254, Evalue=1e-67,
Organism=Homo sapiens, GI261399875, Length=443, Percent_Identity=34.0857787810384, Blast_Score=250, Evalue=2e-66,
Organism=Homo sapiens, GI48762932, Length=557, Percent_Identity=33.0341113105925, Blast_Score=246, Evalue=5e-65,
Organism=Homo sapiens, GI57863259, Length=395, Percent_Identity=37.9746835443038, Blast_Score=239, Evalue=7e-63,
Organism=Homo sapiens, GI302058292, Length=518, Percent_Identity=31.8532818532818, Blast_Score=237, Evalue=3e-62,
Organism=Homo sapiens, GI58331171, Length=525, Percent_Identity=31.047619047619, Blast_Score=230, Evalue=3e-60,
Organism=Homo sapiens, GI58331185, Length=315, Percent_Identity=35.2380952380952, Blast_Score=197, Evalue=2e-50,
Organism=Homo sapiens, GI7657253, Length=514, Percent_Identity=24.3190661478599, Blast_Score=145, Evalue=1e-34,
Organism=Homo sapiens, GI25914754, Length=471, Percent_Identity=23.1422505307856, Blast_Score=89, Evalue=1e-17,
Organism=Homo sapiens, GI9055272, Length=471, Percent_Identity=23.1422505307856, Blast_Score=89, Evalue=1e-17,
Organism=Escherichia coli, GI1790586, Length=571, Percent_Identity=23.292469352014, Blast_Score=65, Evalue=2e-11,
Organism=Caenorhabditis elegans, GI25144674, Length=536, Percent_Identity=41.044776119403, Blast_Score=372, Evalue=1e-103,
Organism=Caenorhabditis elegans, GI17532603, Length=539, Percent_Identity=40.8163265306122, Blast_Score=371, Evalue=1e-103,
Organism=Caenorhabditis elegans, GI17532601, Length=545, Percent_Identity=40.9174311926606, Blast_Score=368, Evalue=1e-102,
Organism=Caenorhabditis elegans, GI25148561, Length=551, Percent_Identity=35.2087114337568, Blast_Score=342, Evalue=4e-94,
Organism=Caenorhabditis elegans, GI17564182, Length=540, Percent_Identity=34.4444444444444, Blast_Score=324, Evalue=7e-89,
Organism=Caenorhabditis elegans, GI25147750, Length=536, Percent_Identity=36.3805970149254, Blast_Score=308, Evalue=5e-84,
Organism=Caenorhabditis elegans, GI32566944, Length=454, Percent_Identity=34.3612334801762, Blast_Score=276, Evalue=2e-74,
Organism=Caenorhabditis elegans, GI25144678, Length=527, Percent_Identity=33.965844402277, Blast_Score=271, Evalue=9e-73,
Organism=Caenorhabditis elegans, GI71998178, Length=569, Percent_Identity=30.0527240773286, Blast_Score=239, Evalue=3e-63,
Organism=Caenorhabditis elegans, GI25144680, Length=402, Percent_Identity=35.8208955223881, Blast_Score=238, Evalue=7e-63,
Organism=Caenorhabditis elegans, GI71981457, Length=328, Percent_Identity=40.8536585365854, Blast_Score=223, Evalue=2e-58,
Organism=Caenorhabditis elegans, GI17555558, Length=574, Percent_Identity=21.602787456446, Blast_Score=68, Evalue=1e-11,
Organism=Saccharomyces cerevisiae, GI6322446, Length=537, Percent_Identity=36.3128491620112, Blast_Score=365, Evalue=1e-101,
Organism=Saccharomyces cerevisiae, GI6320418, Length=556, Percent_Identity=38.3093525179856, Blast_Score=352, Evalue=9e-98,
Organism=Saccharomyces cerevisiae, GI6322524, Length=549, Percent_Identity=38.615664845173, Blast_Score=349, Evalue=5e-97,
Organism=Saccharomyces cerevisiae, GI6320058, Length=515, Percent_Identity=38.0582524271845, Blast_Score=327, Evalue=2e-90,
Organism=Saccharomyces cerevisiae, GI6322350, Length=527, Percent_Identity=36.8121442125237, Blast_Score=326, Evalue=5e-90,
Organism=Saccharomyces cerevisiae, GI6322049, Length=511, Percent_Identity=34.2465753424658, Blast_Score=271, Evalue=2e-73,
Organism=Saccharomyces cerevisiae, GI6320393, Length=534, Percent_Identity=32.3970037453184, Blast_Score=250, Evalue=4e-67,
Organism=Saccharomyces cerevisiae, GI6322452, Length=545, Percent_Identity=30.6422018348624, Blast_Score=228, Evalue=2e-60,
Organism=Drosophila melanogaster, GI17647245, Length=530, Percent_Identity=40.9433962264151, Blast_Score=360, Evalue=2e-99,
Organism=Drosophila melanogaster, GI24649027, Length=549, Percent_Identity=39.1621129326047, Blast_Score=355, Evalue=4e-98,
Organism=Drosophila melanogaster, GI24649029, Length=549, Percent_Identity=39.1621129326047, Blast_Score=355, Evalue=4e-98,
Organism=Drosophila melanogaster, GI24647512, Length=563, Percent_Identity=36.9449378330373, Blast_Score=353, Evalue=2e-97,
Organism=Drosophila melanogaster, GI24647510, Length=563, Percent_Identity=36.9449378330373, Blast_Score=353, Evalue=2e-97,
Organism=Drosophila melanogaster, GI24583944, Length=521, Percent_Identity=39.5393474088292, Blast_Score=351, Evalue=9e-97,
Organism=Drosophila melanogaster, GI24652903, Length=509, Percent_Identity=41.0609037328094, Blast_Score=347, Evalue=1e-95,
Organism=Drosophila melanogaster, GI24645179, Length=536, Percent_Identity=38.8059701492537, Blast_Score=344, Evalue=1e-94,
Organism=Drosophila melanogaster, GI18858175, Length=549, Percent_Identity=34.608378870674, Blast_Score=289, Evalue=3e-78,
Organism=Drosophila melanogaster, GI28571140, Length=547, Percent_Identity=34.7349177330896, Blast_Score=289, Evalue=3e-78,
Organism=Drosophila melanogaster, GI18859933, Length=537, Percent_Identity=32.9608938547486, Blast_Score=262, Evalue=5e-70,
Organism=Drosophila melanogaster, GI19921848, Length=561, Percent_Identity=33.5115864527629, Blast_Score=250, Evalue=2e-66,
Organism=Drosophila melanogaster, GI20130093, Length=359, Percent_Identity=26.1838440111421, Blast_Score=86, Evalue=5e-17,
Organism=Drosophila melanogaster, GI45552711, Length=359, Percent_Identity=26.1838440111421, Blast_Score=86, Evalue=5e-17,
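
Each homologue entry above is a comma-separated run of `key=value` fields, with the GI number given as a bare `GI…` token. The following is a minimal sketch of parsing one such line into a dictionary, assuming every line follows exactly this layout. The function name `parse_hit` and the `GI` key are introduced here for illustration.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    hit = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare token such as GI1790586
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Escherichia coli, GI1790586, Length=571, "
        "Percent_Identity=23.292469352014, Blast_Score=65, Evalue=2e-11,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1790586 23.3
```

Once parsed, the hits can be sorted or filtered, for example by `Blast_Score` or by an `Evalue` threshold.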

Paralogues:

None

Copy number:

- 2180 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli).
- 360 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli).
- 480 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli).
- 15012 Molecules/Cell In: Growth Phase,

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017998
- InterPro:   IPR002194
- InterPro:   IPR002423
- InterPro:   IPR012714 [H]

Pfam domain/function: PF00118 Cpn60_TCP1 [H]

EC number: NA

Molecular weight: Translated: 61271; Mature: 61271

Theoretical pI: Translated: 5.14; Mature: 5.14

Prosite motif: PS00750 TCP1_1; PS00751 TCP1_2; PS00995 TCP1_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
3.7 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
3.7 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)
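
The percentages above are residue counts divided by the sequence length. The following is a minimal sketch, assuming that definition; the helper name `residue_percent` is hypothetical, and the example uses only the first ten residues of the protein rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    return 100.0 * sum(aa in residues for aa in protein) / len(protein)


fragment = "MQNQIRCLNM"  # first 10 residues of the protein sequence
print(residue_percent(fragment, "C"))   # 10.0
print(residue_percent(fragment, "M"))   # 20.0
print(residue_percent(fragment, "CM"))  # 30.0
```

For the full 568-residue sequence, the listed 0.4, 3.7 and 4.0 are what one decimal place of rounding gives for 2 Cys and 21 Met (2/568, 21/568 and 23/568).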

Secondary structure:

>Translated Secondary Structure
MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLID
CCCCEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH
SFGDVTITNDGATIVKEMEIQHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDL
CCCCEEEECCCHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHH
LDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSLATRDQLKKIVYTTMSSKFIA
HCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCCHHHC
GGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV
CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH
VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLAS
HCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHH
IGANVVICQKGIDDVAQHFLAKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLG
CCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
YAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALDEAERSINDALHSLRNVLMKP
HHHHHHHHHCCCCCEEEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC
MIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD
EEEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAA
HHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
APLKSGEKKGEKKEGGEEEKSSTPSSLE
CCCCCCCCCCCCCCCCCCHHCCCCCCCC
>Mature Secondary Structure
MQNQIRCLNMANAPVLLLKEGTQRSSGRDALKNNILAAVTLAEMLKSSLGPRGLDKMLID
CCCCEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH
SFGDVTITNDGATIVKEMEIQHPAAKLLVEAAKAQDAEVGDGTTSAVVLAGLLLDKADDL
CCCCEEEECCCHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHH
LDQNIHPTIIIEGYKKALNKSLEIIDQLATKIDVSNLNSLATRDQLKKIVYTTMSSKFIA
HCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCCHHHC
GGEEMDKIMNMVIDAVSIVAEPLPEGGYNVPLDLIKIDKKKGGSIEDSMLVHGLVLDKEV
CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH
VHPGMPRRVEKAKIAVLDAALEVEKPEISAKISITSPEQIKAFLDEEAKYLKDMVDKLAS
HCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHH
IGANVVICQKGIDDVAQHFLAKKGILAVRRVKRSDIEKLEKALGARIISSIKDATPEDLG
CCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
YAELVEERRVGNDKMVFIEGAKNPKAVNILLRGSNDMALDEAERSINDALHSLRNVLMKP
HHHHHHHHHCCCCCEEEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC
MIVAGGGAVETELALRLREYARSVGGKEQLAIEKFAEALEEIPMILAETAGMEPIQTLMD
EEEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
LRAKHAKGLINAGVDVMNGKIADDMLALNVLEPVRVKAQVLKSAVEAATAILKIDDLIAA
HHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
APLKSGEKKGEKKEGGEEEKSSTPSSLE
CCCCCCCCCCCCCCCCCCHHCCCCCCCC
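
The per-residue state strings above can be summarised as a composition. The following is a minimal sketch, assuming the common convention that H denotes helix, E denotes strand and C denotes coil (the record does not define the letters). The function name `ss_composition` and the ten-character example string are hypothetical and not taken from the record.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts[s] / len(states) for s in "HEC"}


print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To apply it to the record, concatenate only the state lines (every second line of the block), not the residue lines.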

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9245723; 11572479 [H]