Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is groL
Identifier: 187735900
GI number: 187735900
Start: 1683318
End: 1684970
Strand: Direct
Name: groL
Synonym: Amuc_1408
Alternate gene names: 187735900
Gene position: 1683318-1684970 (Clockwise)
Preceding gene: 187735899
Following gene: 187735901
Centisome position: 63.19
GC content: 59.1
Gene sequence:
>1653_bases ATGGCTAAACAAATCCAATTTGACGAAACCGCCCGCCAGGCTCTGCTCCGCGGCGTGGAACAGATTGCCAAGGCTGTCAA GAGCACGCTGGGCCCTGCCGGCCGCAACGTAGTGATTGACAAAAAATTCGGTTCCCCCCTCATCACCAAGGACGGCGTAA CCGTGGCCAAGGAAATTGAACTGGAAGACCCGTTTGAAAACATGGGCGCCCAGCTTGTCCGGGAAGTATCTTCCAAAACC AATGACGTGGCCGGCGACGGCACCACTACCGCTACCGTGCTGGCTGAAAGCATTTACCGCGAAGGCCTGCGCAACGTTAC TGCCGGGGCCAACCCCATCTCCCTCCAGAGGGGCATCATGAAGGCTGCGGATTCCGTTGTGGAAGAACTCAAGAAGATCA GCAAGCCTGTCGACTCCAGCAAGGAAGTGGCCCAGGTCGCTACCGTCTCCGCCAACTGGGACGCTGAAATCGGCAACATC ATCGCGGAAGCCATGGACAAGGTGGGCAAGGACGGCACCATCACCGTGGAAGAAGCCAAGGGCATTGAAACTACGCTGGA CGTGGTGGAAGGCATGCAGTTTGACAAGGGATACCTGTCCCCCTACTTCGTGACGAACGCGGAAACGATGGAAGCGGTGC TGGAAAACCCCTACATCCTCATCCACGAAAAGAAAATCAACAACCTGAAGGACTTTCTTCCGCTGCTTGAAAAAGTGGCC AAGAGCGGCCGTCCCTTCCTGGTAATCGCGGAAGACATTGAAGGCGAAGCCCTCGCCACCCTGGTAGTCAACCGTCTGCG CGGCGTGCTGAACATCTGCGCGGTCAAGGCTCCCGGCTTCGGCGACCGCCGCAAGGCCATGATGGAAGACATCGCCATCC TTACCGGCGGCAAGTGCATCACGGAAGACCTGGGCATCAAGCTGGAAAACGTGGGCATCGAAGACCTCGGCCAGGCCAAG CGCGTGGTTGTTTCCAAGGATGAAACCGTTATCGTGGAAGGTTCCGCCAAATCTTCCGATATTGAAGCCCGCATTTCCCA GATTCGCCGCCAGATCAAGGACACCACGTCCGACTACGACCGCGAAAAACTCCAGGAACGCCTGGCCAAGCTGGCCGGCG GTGTGGCCGTCATCCATGTGGGTGCCGCTACGGAAACGGAAATGAAGGAAAAGAAGGCCCGTGTGGACGACGCCCTGCAC GCTACCCGCGCTGCGGTGGAAGAAGGCATCGTTCCCGGCGGCGGCGTGGCGCTGATTCGCGCCCAGAAAGCCATTGACAC CCTCAAACTGGAAGGTGATGAAGCAACCGGCGCCCAGATCGTTTATCGCGCTGTGGAAGCCCCGCTCCGCCAGCTGGCCT GCAATGCCGGCCGCGAAGGAGCCCTCATCGTCGCCAACGTGAAAGGCATGAAGAATACTGCCGAAGGTTACAACGTGGCC ACGGACAAGTATGAAGACCTGCTTTCCGCCGGCGTGGTGGATCCGACCAAGGTGACCCGTTCCGCTCTGCAGAATGCGGC CTCCATCGCCGGCCTGCTGCTTACCACGGAATGCGTCATTGCCGACAAGCCCGAGAAGAAGAGCTGCAGCTGCGGCTCCG GAGCTTCCGACATGGGCGGCATGGGAGGAATGGGCGGCATGGGCATGATGTAA
Upstream 100 bases:
>100_bases CGGGGAAGACTACCTTATCCTGTCTGAAAACGATATTCTGGCCATCATCGGCTAAATGGCACCACACACACATTCAACAT CATTTTAAATATTTAACATT
Downstream 100 bases:
>100_bases GCCCCGTCCGTTTCTTCGAGACCAAGGCAATCGGCTTCAAGGCTCCCCGGAAAAACCGGGGAGCCTTTTTTAATATGCTT CATCAGGAGCCGGTTTCGGG
Product: chaperonin GroEL
Products: NA
Alternate protein names: GroEL protein; Protein Cpn60
Number of amino acids: Translated: 550; Mature: 549
Protein sequence:
>550_residues MAKQIQFDETARQALLRGVEQIAKAVKSTLGPAGRNVVIDKKFGSPLITKDGVTVAKEIELEDPFENMGAQLVREVSSKT NDVAGDGTTTATVLAESIYREGLRNVTAGANPISLQRGIMKAADSVVEELKKISKPVDSSKEVAQVATVSANWDAEIGNI IAEAMDKVGKDGTITVEEAKGIETTLDVVEGMQFDKGYLSPYFVTNAETMEAVLENPYILIHEKKINNLKDFLPLLEKVA KSGRPFLVIAEDIEGEALATLVVNRLRGVLNICAVKAPGFGDRRKAMMEDIAILTGGKCITEDLGIKLENVGIEDLGQAK RVVVSKDETVIVEGSAKSSDIEARISQIRRQIKDTTSDYDREKLQERLAKLAGGVAVIHVGAATETEMKEKKARVDDALH ATRAAVEEGIVPGGGVALIRAQKAIDTLKLEGDEATGAQIVYRAVEAPLRQLACNAGREGALIVANVKGMKNTAEGYNVA TDKYEDLLSAGVVDPTKVTRSALQNAASIAGLLLTTECVIADKPEKKSCSCGSGASDMGGMGGMGGMGMM
Sequences:
>Translated_550_residues MAKQIQFDETARQALLRGVEQIAKAVKSTLGPAGRNVVIDKKFGSPLITKDGVTVAKEIELEDPFENMGAQLVREVSSKT NDVAGDGTTTATVLAESIYREGLRNVTAGANPISLQRGIMKAADSVVEELKKISKPVDSSKEVAQVATVSANWDAEIGNI IAEAMDKVGKDGTITVEEAKGIETTLDVVEGMQFDKGYLSPYFVTNAETMEAVLENPYILIHEKKINNLKDFLPLLEKVA KSGRPFLVIAEDIEGEALATLVVNRLRGVLNICAVKAPGFGDRRKAMMEDIAILTGGKCITEDLGIKLENVGIEDLGQAK RVVVSKDETVIVEGSAKSSDIEARISQIRRQIKDTTSDYDREKLQERLAKLAGGVAVIHVGAATETEMKEKKARVDDALH ATRAAVEEGIVPGGGVALIRAQKAIDTLKLEGDEATGAQIVYRAVEAPLRQLACNAGREGALIVANVKGMKNTAEGYNVA TDKYEDLLSAGVVDPTKVTRSALQNAASIAGLLLTTECVIADKPEKKSCSCGSGASDMGGMGGMGGMGMM >Mature_549_residues AKQIQFDETARQALLRGVEQIAKAVKSTLGPAGRNVVIDKKFGSPLITKDGVTVAKEIELEDPFENMGAQLVREVSSKTN DVAGDGTTTATVLAESIYREGLRNVTAGANPISLQRGIMKAADSVVEELKKISKPVDSSKEVAQVATVSANWDAEIGNII AEAMDKVGKDGTITVEEAKGIETTLDVVEGMQFDKGYLSPYFVTNAETMEAVLENPYILIHEKKINNLKDFLPLLEKVAK SGRPFLVIAEDIEGEALATLVVNRLRGVLNICAVKAPGFGDRRKAMMEDIAILTGGKCITEDLGIKLENVGIEDLGQAKR VVVSKDETVIVEGSAKSSDIEARISQIRRQIKDTTSDYDREKLQERLAKLAGGVAVIHVGAATETEMKEKKARVDDALHA TRAAVEEGIVPGGGVALIRAQKAIDTLKLEGDEATGAQIVYRAVEAPLRQLACNAGREGALIVANVKGMKNTAEGYNVAT DKYEDLLSAGVVDPTKVTRSALQNAASIAGLLLTTECVIADKPEKKSCSCGSGASDMGGMGGMGGMGMM
Specific function: Prevents misfolding and promotes the refolding and proper assembly of unfolded polypeptides generated under stress conditions
COG id: COG0459
COG function: function code O; Chaperonin GroEL (HSP60 family)
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the chaperonin (HSP60) family
Homologues:
Organism=Homo sapiens, GI41399285, Length=529, Percent_Identity=48.7712665406427, Blast_Score=506, Evalue=1e-143, Organism=Homo sapiens, GI31542947, Length=529, Percent_Identity=48.7712665406427, Blast_Score=506, Evalue=1e-143, Organism=Homo sapiens, GI24307939, Length=352, Percent_Identity=22.7272727272727, Blast_Score=68, Evalue=3e-11, Organism=Escherichia coli, GI1790586, Length=529, Percent_Identity=61.8147448015123, Blast_Score=660, Evalue=0.0, Organism=Caenorhabditis elegans, GI17555558, Length=530, Percent_Identity=48.8679245283019, Blast_Score=516, Evalue=1e-146, Organism=Caenorhabditis elegans, GI193210679, Length=211, Percent_Identity=47.8672985781991, Blast_Score=198, Evalue=7e-51, Organism=Saccharomyces cerevisiae, GI6323288, Length=524, Percent_Identity=50.9541984732824, Blast_Score=533, Evalue=1e-152, Organism=Drosophila melanogaster, GI24641193, Length=532, Percent_Identity=49.6240601503759, Blast_Score=521, Evalue=1e-148, Organism=Drosophila melanogaster, GI24641191, Length=532, Percent_Identity=49.6240601503759, Blast_Score=521, Evalue=1e-148, Organism=Drosophila melanogaster, GI45550936, Length=525, Percent_Identity=46.6666666666667, Blast_Score=495, Evalue=1e-140, Organism=Drosophila melanogaster, GI45550132, Length=525, Percent_Identity=46.6666666666667, Blast_Score=495, Evalue=1e-140, Organism=Drosophila melanogaster, GI45550935, Length=525, Percent_Identity=46.6666666666667, Blast_Score=495, Evalue=1e-140, Organism=Drosophila melanogaster, GI17864606, Length=556, Percent_Identity=44.7841726618705, Blast_Score=456, Evalue=1e-128, Organism=Drosophila melanogaster, GI24584129, Length=549, Percent_Identity=37.1584699453552, Blast_Score=323, Evalue=1e-88, Organism=Drosophila melanogaster, GI19921262, Length=549, Percent_Identity=37.1584699453552, Blast_Score=323, Evalue=1e-88,
Paralogues:
None
Copy number: 2180 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 360 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 480 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). 15012 Molecules/Cell In: Growth Phase,
Swissprot (AC and ID): CH60_AKKM8 (B2UKV8)
Other databases:
- EMBL: CP001071 - RefSeq: YP_001878012.1 - ProteinModelPortal: B2UKV8 - SMR: B2UKV8 - GeneID: 6275718 - GenomeReviews: CP001071_GR - KEGG: amu:Amuc_1408 - HOGENOM: HBG625289 - OMA: ATDKYED - GO: GO:0005737 - HAMAP: MF_00600 - InterPro: IPR018370 - InterPro: IPR001844 - InterPro: IPR002423 - PANTHER: PTHR11353 - PRINTS: PR00298 - TIGRFAMs: TIGR02348
Pfam domain/function: PF00118 Cpn60_TCP1; SSF48592 GroEL-ATPase
EC number: NA
Molecular weight: Translated: 58445; Mature: 58314
Theoretical pI: Translated: 4.93; Mature: 4.93
Prosite motif: PS00296 CHAPERONINS_CPN60
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein) 2.9 %Met (Translated Protein) 4.0 %Cys+Met (Translated Protein) 1.1 %Cys (Mature Protein) 2.7 %Met (Mature Protein) 3.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MAKQIQFDETARQALLRGVEQIAKAVKSTLGPAGRNVVIDKKFGSPLITKDGVTVAKEIE CCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEECCCCCEEEECC LEDPFENMGAQLVREVSSKTNDVAGDGTTTATVLAESIYREGLRNVTAGANPISLQRGIM CCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH KAADSVVEELKKISKPVDSSKEVAQVATVSANWDAEIGNIIAEAMDKVGKDGTITVEEAK HHHHHHHHHHHHHHCCCCCCHHHHHHHEECCCCCHHHHHHHHHHHHHHCCCCEEEEECCC GIETTLDVVEGMQFDKGYLSPYFVTNAETMEAVLENPYILIHEKKINNLKDFLPLLEKVA CCHHHHHHHHCCCCCCCCCCCEEEECHHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHH KSGRPFLVIAEDIEGEALATLVVNRLRGVLNICAVKAPGFGDRRKAMMEDIAILTGGKCI HCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEE TEDLGIKLENVGIEDLGQAKRVVVSKDETVIVEGSAKSSDIEARISQIRRQIKDTTSDYD HHHHCCEEECCCHHHHCCHHEEEEECCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHH REKLQERLAKLAGGVAVIHVGAATETEMKEKKARVDDALHATRAAVEEGIVPGGGVALIR HHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEE AQKAIDTLKLEGDEATGAQIVYRAVEAPLRQLACNAGREGALIVANVKGMKNTAEGYNVA HHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCC TDKYEDLLSAGVVDPTKVTRSALQNAASIAGLLLTTECVIADKPEKKSCSCGSGASDMGG HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC MGGMGGMGMM CCCCCCCCCC >Mature Secondary Structure AKQIQFDETARQALLRGVEQIAKAVKSTLGPAGRNVVIDKKFGSPLITKDGVTVAKEIE CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEECCCCCEEEECC LEDPFENMGAQLVREVSSKTNDVAGDGTTTATVLAESIYREGLRNVTAGANPISLQRGIM CCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH KAADSVVEELKKISKPVDSSKEVAQVATVSANWDAEIGNIIAEAMDKVGKDGTITVEEAK HHHHHHHHHHHHHHCCCCCCHHHHHHHEECCCCCHHHHHHHHHHHHHHCCCCEEEEECCC GIETTLDVVEGMQFDKGYLSPYFVTNAETMEAVLENPYILIHEKKINNLKDFLPLLEKVA CCHHHHHHHHCCCCCCCCCCCEEEECHHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHH KSGRPFLVIAEDIEGEALATLVVNRLRGVLNICAVKAPGFGDRRKAMMEDIAILTGGKCI HCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEE TEDLGIKLENVGIEDLGQAKRVVVSKDETVIVEGSAKSSDIEARISQIRRQIKDTTSDYD HHHHCCEEECCCHHHHCCHHEEEEECCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHH REKLQERLAKLAGGVAVIHVGAATETEMKEKKARVDDALHATRAAVEEGIVPGGGVALIR HHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEE AQKAIDTLKLEGDEATGAQIVYRAVEAPLRQLACNAGREGALIVANVKGMKNTAEGYNVA HHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCC TDKYEDLLSAGVVDPTKVTRSALQNAASIAGLLLTTECVIADKPEKKSCSCGSGASDMGG HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC MGGMGGMGMM CCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA