Definition | Clostridium perfringens str. 13, complete genome. |
---|---|
Accession | NC_003366 |
Length | 3,031,430 |
The map label for this gene is groEL [H]
Identifier: 18311271
GI number: 18311271
Start: 2633990
End: 2635609
Strand: Reverse
Name: groEL [H]
Synonym: CPE2289
Alternate gene names: 18311271
Gene position: 2635609-2633990 (Counterclockwise)
Preceding gene: 18311272
Following gene: 18311270
Centisome position: 86.94
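The coordinate fields above are enough to reproduce the gene length and the centisome position. The following is a minimal sketch, assuming the centisome value is 100 times the gene's first transcribed coordinate (2635609 on the reverse strand) divided by the genome length. That convention is inferred from the numbers in this record, not stated by it, and the function names are illustrative.

```python
GENOME_LENGTH = 3_031_430          # "Length" field of NC_003366
START, END = 2_633_990, 2_635_609  # "Start" / "End" fields (forward-strand coordinates)


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature in bases."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (2 decimals)."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
# Reverse-strand gene: transcription starts at the higher coordinate (END).
# Assumption: the record's centisome value is computed from that coordinate.
position_pct = centisome(END, GENOME_LENGTH)

print(length)        # gene length in bases
print(length // 3)   # number of codons, including the stop codon
print(position_pct)  # centisome position
```

Under these assumptions the sketch gives 1620 bases (540 codons) and a centisome position of 86.94, which agrees with the values listed in this record.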
GC content: 33.7
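GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation is shown here; the helper name `gc_content` is hypothetical, and the demo input is only the first 30 bases of the gene sequence, so its result is not expected to match the whole-gene figure of 33.7.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among the unambiguous bases (A, C, G, T) in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # ignore spaces, N, etc.
    if not bases:
        return 0.0
    gc = sum(b in "GC" for b in bases)
    return round(100 * gc / len(bases), 1)


# First 30 bases of the gene sequence listed in this record.
first_codons = "ATGGCTAAAACATTATTATTCGGTGAAGAA"
print(gc_content(first_codons))
```

Passing the full 1620-base sequence (with the spaces left in, since they are ignored) should give the whole-gene value.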
Gene sequence:
>1620_bases ATGGCTAAAACATTATTATTCGGTGAAGAAGCAAGAAGATCTATGCAAGCGGGTGTAGATAAATTAGCTAACACTGTTAA GGTTACATTAGGACCAAAAGGAAGAAATGTTATTTTAGATAAAAAATTTGGATCACCATTAATAACAAATGATGGGGTTA CAATAGCAAGAGAAATTGAACTTGAAGATGCTTATGAAAATATGGGAGCTCAACTTGTAAAAGAAGTAGCTACAAAGACT AATGATGTGGCAGGAGATGGAACTACTACAGCTACCTTATTAGCTCAAGCAATTATAAGAGAAGGATTAAAAAATGTAAC AGCAGGGGCAAATCCTATATTAATAAGAAATGGAATTAAAACTGCAGTTGAAAAAGCTGTAGAGGAAATACAAAAAATTT CTAAGCCTGTAAATGGAAAAGAAGACATAGCTAGAGTTGCTGCAATTTCAGCGGCTGATGAAAAAATTGGTAAGCTAATT GCAGATGCTATGGAAAAGGTAGGAAATGAAGGCGTTATAACTGTAGAAGAATCTAAATCAATGGGAACTGAGTTAGATGT TGTTGAAGGTATGCAATTTGATAGAGGATATGTATCAGCTTATATGGTTACTGATACTGAAAAAATGGAAGCTGTTTTAG ATAATCCATTAGTATTAATAACAGATAAGAAAATAAGCAATATACAAGATTTATTACCATTACTTGAGCAAATAGTTCAA GCAGGTAAAAAACTTTTAATAATAGCTGATGATATAGAAGGCGAAGCTATGACAACATTAGTTGTTAATAAATTAAGAGG AACATTTACTTGTGTTGGAGTTAAAGCACCTGGATTTGGTGATAGAAGAAAAGAAATGTTACAAGATATAGCTACTTTAA CAGGTGGCGTTGTTATATCTGATGAAGTAGGCGGAGATTTAAAAGAAGCTACATTAGATATGCTTGGAGAAGCTGAAAGT GTTAAGGTAACTAAAGAAAGTACTACAATAGTTAATGGAAGAGGAAACTCAGAAGAGATTAAAAATAGAGTTAACCAAAT AAAATTACAATTAGAAGCTACTACTTCTGAATTTGACAAAGAAAAATTACAAGAAAGATTAGCTAAATTAGCAGGTGGGG TTGCAGTAGTTAAGGTTGGAGCTGCCACTGAAACAGAGCTTAAGGAAAGTAAGCTAAGAATAGAGGATGCTTTAGCAGCT ACAAAGGCAGCTGTTGAAGAAGGAATAGTTCCAGGTGGTGGAACAGCTTACGTAAATGTAATAAATGAAGTTGCAAAATT AACCTCTGATATTCAAGATGAACAAGTTGGTATAAATATAATTGTAAGATCTTTAGAAGAACCTATGAGACAAATAGCTC ATAATGCAGGACTAGAAGGTTCAGTTATAATAGAAAAAGTTAAAAATAGTGATGCAGGTGTAGGATTTGATGCTTTAAGA GGAGAATATAAAGATATGATTAAAGCTGGAATAGTTGATCCAACTAAGGTTACAAGATCAGCTCTTCAAAATGCAGCATC AGTAGCATCAACATTCTTAACAACAGAGGCTGCTGTAGCAGATATTCCAGAAAAAGAAATGCCTCAAGGCGCAGGTATGG GAATGGACGGAATGTACTAA
Upstream 100 bases:
>100_bases ATATACTATTTTAAGACAAGACGATATACTAGCAATAGTTGAATAGTTTTAAAATATAAGTGATTTAGATATTCATAATA TATTTGGGAGGTAAATTAAT
Downstream 100 bases:
>100_bases TAAAAGAATAAAAAGGATGACAGTTAAGTCATCCTTTTTTTATTTAATTCTTATTGAAAATAATTACATAATATGATTAA AATGGTTTAAATGTCGAAAA
Product: chaperonin GroEL
Products: NA
Alternate protein names: GroEL protein; Protein Cpn60 [H]
Number of amino acids: Translated: 539; Mature: 538
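The two counts follow from the gene length: 1620 bases are 540 codons, the last of which is a stop codon, leaving 539 translated residues. The mature count of 538 is consistent with removal of the initiator methionine, which is an inference from the numbers rather than something the record states. The sketch below illustrates this with the standard codon assignments; `translate` and `mature` are hypothetical helper names, and the demo input is only the first ten codons of the gene followed by its stop codon.

```python
from itertools import product

# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    cds = cds.upper().replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop a leading Met (assumption: initiator methionine is removed)."""
    return protein[1:] if protein.startswith("M") else protein


# First ten codons of the gene sequence plus its TAA stop codon.
fragment = "ATGGCTAAAACATTATTATTCGGTGAAGAA" + "TAA"
translated = translate(fragment)
print(translated, len(translated))
print(mature(translated), len(mature(translated)))
```

The translated fragment begins with the same residues as the protein sequence in this record (MAKTLLFGEE), and the mature form is one residue shorter.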
Protein sequence:
>539_residues MAKTLLFGEEARRSMQAGVDKLANTVKVTLGPKGRNVILDKKFGSPLITNDGVTIAREIELEDAYENMGAQLVKEVATKT NDVAGDGTTTATLLAQAIIREGLKNVTAGANPILIRNGIKTAVEKAVEEIQKISKPVNGKEDIARVAAISAADEKIGKLI ADAMEKVGNEGVITVEESKSMGTELDVVEGMQFDRGYVSAYMVTDTEKMEAVLDNPLVLITDKKISNIQDLLPLLEQIVQ AGKKLLIIADDIEGEAMTTLVVNKLRGTFTCVGVKAPGFGDRRKEMLQDIATLTGGVVISDEVGGDLKEATLDMLGEAES VKVTKESTTIVNGRGNSEEIKNRVNQIKLQLEATTSEFDKEKLQERLAKLAGGVAVVKVGAATETELKESKLRIEDALAA TKAAVEEGIVPGGGTAYVNVINEVAKLTSDIQDEQVGINIIVRSLEEPMRQIAHNAGLEGSVIIEKVKNSDAGVGFDALR GEYKDMIKAGIVDPTKVTRSALQNAASVASTFLTTEAAVADIPEKEMPQGAGMGMDGMY
Sequences:
>Translated_539_residues MAKTLLFGEEARRSMQAGVDKLANTVKVTLGPKGRNVILDKKFGSPLITNDGVTIAREIELEDAYENMGAQLVKEVATKT NDVAGDGTTTATLLAQAIIREGLKNVTAGANPILIRNGIKTAVEKAVEEIQKISKPVNGKEDIARVAAISAADEKIGKLI ADAMEKVGNEGVITVEESKSMGTELDVVEGMQFDRGYVSAYMVTDTEKMEAVLDNPLVLITDKKISNIQDLLPLLEQIVQ AGKKLLIIADDIEGEAMTTLVVNKLRGTFTCVGVKAPGFGDRRKEMLQDIATLTGGVVISDEVGGDLKEATLDMLGEAES VKVTKESTTIVNGRGNSEEIKNRVNQIKLQLEATTSEFDKEKLQERLAKLAGGVAVVKVGAATETELKESKLRIEDALAA TKAAVEEGIVPGGGTAYVNVINEVAKLTSDIQDEQVGINIIVRSLEEPMRQIAHNAGLEGSVIIEKVKNSDAGVGFDALR GEYKDMIKAGIVDPTKVTRSALQNAASVASTFLTTEAAVADIPEKEMPQGAGMGMDGMY
>Mature_538_residues AKTLLFGEEARRSMQAGVDKLANTVKVTLGPKGRNVILDKKFGSPLITNDGVTIAREIELEDAYENMGAQLVKEVATKTN DVAGDGTTTATLLAQAIIREGLKNVTAGANPILIRNGIKTAVEKAVEEIQKISKPVNGKEDIARVAAISAADEKIGKLIA DAMEKVGNEGVITVEESKSMGTELDVVEGMQFDRGYVSAYMVTDTEKMEAVLDNPLVLITDKKISNIQDLLPLLEQIVQA GKKLLIIADDIEGEAMTTLVVNKLRGTFTCVGVKAPGFGDRRKEMLQDIATLTGGVVISDEVGGDLKEATLDMLGEAESV KVTKESTTIVNGRGNSEEIKNRVNQIKLQLEATTSEFDKEKLQERLAKLAGGVAVVKVGAATETELKESKLRIEDALAAT KAAVEEGIVPGGGTAYVNVINEVAKLTSDIQDEQVGINIIVRSLEEPMRQIAHNAGLEGSVIIEKVKNSDAGVGFDALRG EYKDMIKAGIVDPTKVTRSALQNAASVASTFLTTEAAVADIPEKEMPQGAGMGMDGMY
Specific function: Prevents misfolding and promotes the refolding and proper assembly of unfolded polypeptides generated under stress conditions [H]
COG id: COG0459
COG function: function code O; Chaperonin GroEL (HSP60 family)
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the chaperonin (HSP60) family [H]
Homologues:
Organism=Homo sapiens, GI41399285, Length=527, Percent_Identity=49.146110056926, Blast_Score=497, Evalue=1e-140
Organism=Homo sapiens, GI31542947, Length=527, Percent_Identity=49.146110056926, Blast_Score=497, Evalue=1e-140
Organism=Homo sapiens, GI24307939, Length=135, Percent_Identity=34.0740740740741, Blast_Score=72, Evalue=2e-12
Organism=Homo sapiens, GI38455427, Length=552, Percent_Identity=22.1014492753623, Blast_Score=71, Evalue=2e-12
Organism=Escherichia coli, GI1790586, Length=526, Percent_Identity=59.3155893536122, Blast_Score=608, Evalue=1e-175
Organism=Caenorhabditis elegans, GI17555558, Length=529, Percent_Identity=48.5822306238185, Blast_Score=502, Evalue=1e-142
Organism=Caenorhabditis elegans, GI193210679, Length=210, Percent_Identity=48.5714285714286, Blast_Score=203, Evalue=2e-52
Organism=Caenorhabditis elegans, GI25144674, Length=187, Percent_Identity=31.0160427807487, Blast_Score=74, Evalue=2e-13
Organism=Saccharomyces cerevisiae, GI6323288, Length=524, Percent_Identity=51.1450381679389, Blast_Score=512, Evalue=1e-146
Organism=Saccharomyces cerevisiae, GI6322524, Length=515, Percent_Identity=24.6601941747573, Blast_Score=79, Evalue=1e-15
Organism=Saccharomyces cerevisiae, GI6322446, Length=543, Percent_Identity=23.2044198895028, Blast_Score=76, Evalue=1e-14
Organism=Saccharomyces cerevisiae, GI6320418, Length=590, Percent_Identity=23.2203389830508, Blast_Score=73, Evalue=1e-13
Organism=Drosophila melanogaster, GI24641193, Length=527, Percent_Identity=48.7666034155598, Blast_Score=507, Evalue=1e-144
Organism=Drosophila melanogaster, GI24641191, Length=527, Percent_Identity=48.7666034155598, Blast_Score=507, Evalue=1e-144
Organism=Drosophila melanogaster, GI45550936, Length=527, Percent_Identity=46.8690702087287, Blast_Score=482, Evalue=1e-136
Organism=Drosophila melanogaster, GI45550132, Length=527, Percent_Identity=46.8690702087287, Blast_Score=482, Evalue=1e-136
Organism=Drosophila melanogaster, GI45550935, Length=527, Percent_Identity=46.8690702087287, Blast_Score=482, Evalue=1e-136
Organism=Drosophila melanogaster, GI17864606, Length=541, Percent_Identity=42.1441774491682, Blast_Score=426, Evalue=1e-119
Organism=Drosophila melanogaster, GI24584129, Length=531, Percent_Identity=36.346516007533, Blast_Score=302, Evalue=3e-82
Organism=Drosophila melanogaster, GI19921262, Length=531, Percent_Identity=36.346516007533, Blast_Score=302, Evalue=3e-82
Organism=Drosophila melanogaster, GI17647245, Length=535, Percent_Identity=23.1775700934579, Blast_Score=67, Evalue=4e-11
Organism=Drosophila melanogaster, GI24652903, Length=131, Percent_Identity=30.5343511450382, Blast_Score=66, Evalue=8e-11
Paralogues:
None
Copy number:
2180 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli).
360 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli).
480 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli).
15012 Molecules/Cell In: Growth Phase,
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR018370
- InterPro: IPR001844
- InterPro: IPR002423 [H]
Pfam domain/function: PF00118 Cpn60_TCP1 [H]
EC number: NA
Molecular weight: Translated: 57368; Mature: 57236
Theoretical pI: Translated: 4.57; Mature: 4.57
Prosite motif: PS00296 CHAPERONINS_CPN60
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
3.2 %Met (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
3.0 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
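The percentages above are residue counts divided by protein length. A minimal sketch of that calculation follows; `residue_percent` and `percent` are hypothetical helper names. The counts of 1 Cys and 17 Met in the 539-residue translated protein (16 Met in the 538-residue mature form) are a hand tally of the sequence listed in this record, made for this example, and are not figures the record itself reports.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    protein = protein.upper().replace(" ", "")
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)


def percent(count: int, length: int) -> float:
    """Count as a percentage of length, rounded to one decimal."""
    return round(100 * count / length, 1)


# Short fragment copied from the protein sequence; it contains the single Cys.
fragment = "TFTCVGVKAPGF"
print(residue_percent(fragment, "C"), residue_percent(fragment, "M"))

# Hand tally (assumption, see the note above): 1 Cys, 17 Met in 539 residues.
print(percent(1, 539), percent(17, 539), percent(18, 539))  # translated
print(percent(1, 538), percent(16, 538), percent(17, 538))  # mature
```

Because each figure is rounded independently, the Cys+Met value need not equal the sum of the rounded Cys and Met values.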
Secondary structure:
>Translated Secondary Structure
MAKTLLFGEEARRSMQAGVDKLANTVKVTLGPKGRNVILDKKFGSPLITNDGVTIAREIE
CCCCEECCHHHHHHHHHHHHHHCCEEEEEECCCCCEEEEECCCCCCEEECCCCEEEEEEE
LEDAYENMGAQLVKEVATKTNDVAGDGTTTATLLAQAIIREGLKNVTAGANPILIRNGIK
HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEECCHH
TAVEKAVEEIQKISKPVNGKEDIARVAAISAADEKIGKLIADAMEKVGNEGVITVEESKS
HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCC
MGTELDVVEGMQFDRGYVSAYMVTDTEKMEAVLDNPLVLITDKKISNIQDLLPLLEQIVQ
CCCCCHHHHCCCCCCCEEEEEEEECHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHH
AGKKLLIIADDIEGEAMTTLVVNKLRGTFTCVGVKAPGFGDRRKEMLQDIATLTGGVVIS
CCCEEEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCEEEE
DEVGGDLKEATLDMLGEAESVKVTKESTTIVNGRGNSEEIKNRVNQIKLQLEATTSEFDK
CCCCCCHHHHHHHHHCCCCCEEEEECCCEEEECCCCHHHHHHHHHHEEEEEECCHHHHHH
EKLQERLAKLAGGVAVVKVGAATETELKESKLRIEDALAATKAAVEEGIVPGGGTAYVNV
HHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH
INEVAKLTSDIQDEQVGINIIVRSLEEPMRQIAHNAGLEGSVIIEKVKNSDAGVGFDALR
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHEECCCCCCCCCHHHHH
GEYKDMIKAGIVDPTKVTRSALQNAASVASTFLTTEAAVADIPEKEMPQGAGMGMDGMY
HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCC
>Mature Secondary Structure
AKTLLFGEEARRSMQAGVDKLANTVKVTLGPKGRNVILDKKFGSPLITNDGVTIAREIE
CCCEECCHHHHHHHHHHHHHHCCEEEEEECCCCCEEEEECCCCCCEEECCCCEEEEEEE
LEDAYENMGAQLVKEVATKTNDVAGDGTTTATLLAQAIIREGLKNVTAGANPILIRNGIK
HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEECCHH
TAVEKAVEEIQKISKPVNGKEDIARVAAISAADEKIGKLIADAMEKVGNEGVITVEESKS
HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCC
MGTELDVVEGMQFDRGYVSAYMVTDTEKMEAVLDNPLVLITDKKISNIQDLLPLLEQIVQ
CCCCCHHHHCCCCCCCEEEEEEEECHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHH
AGKKLLIIADDIEGEAMTTLVVNKLRGTFTCVGVKAPGFGDRRKEMLQDIATLTGGVVIS
CCCEEEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCEEEE
DEVGGDLKEATLDMLGEAESVKVTKESTTIVNGRGNSEEIKNRVNQIKLQLEATTSEFDK
CCCCCCHHHHHHHHHCCCCCEEEEECCCEEEECCCCHHHHHHHHHHEEEEEECCHHHHHH
EKLQERLAKLAGGVAVVKVGAATETELKESKLRIEDALAATKAAVEEGIVPGGGTAYVNV
HHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH
INEVAKLTSDIQDEQVGINIIVRSLEEPMRQIAHNAGLEGSVIIEKVKNSDAGVGFDALR
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHEECCCCCCCCCHHHHH
GEYKDMIKAGIVDPTKVTRSALQNAASVASTFLTTEAAVADIPEKEMPQGAGMGMDGMY
HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA