Definition Francisella tularensis subsp. holarctica LVS chromosome, complete genome.
Accession NC_007880
Length 1,895,994

Click here to switch to the map view.

The map label for this gene is groEL [H]

Identifier: 89256983

GI number: 89256983

Start: 1645403

End: 1647037

Strand: Reverse

Name: groEL [H]

Synonym: FTL_1714

Alternate gene names: 89256983

Gene position: 1647037-1645403 (Counterclockwise)

Preceding gene: 89256984

Following gene: 89256981

Centisome position: 86.87

GC content: 38.9

Gene sequence:

>1635_bases
ATGGCTGCAAAACAAGTTTTATTTTCAGATGAAGCTCGTGCAAAAATGCTAGATGGTGTTAACACACTAGCAAATGCTGT
AAAAGTTACTTTAGGTCCAAAAGGTCGTAATGTTGTTTTAGATAAATCATTTGGCGCGCCAACTATCACTAAAGATGGTG
TATCTGTTGCTAAAGAAATTGAACTAGAAGATAAGTTTGAGAATATGGGGGCTCAGATAGTTAAAGAAGTAGCTTCAAAG
ACAGCGGATGTTGCTGGTGATGGTACTACTACAGCGACTGTACTTGCTCAGGCATTATTGACAGAGGGTCTAAAAGCTGT
CACTGCAGGTATGAATCCTATGGATCTAAAAAGAGGTATCGACAAAGCAACTGCTAGGTTAGTTGAAGAATTAAAAGCAC
TTTCTAAACCATGTTCAGATCCAAAATCAATTGAGCAAGTTGGTACTATCTCTGCTAACTCTGATGCTACTGTAGGTAAG
CTTATCGCTGACGCAATGGCAAAAGTTGGTAAAGAAGGTGTGATTACAGTTGAAGAAGGCAAAGGCTTTGAAGATGAGCT
TGATGTAGTTGAAGGTATGCAGTTTGATAGAGGTTATCTATCTCCGTATTTTGCAACAAATCAAGAGAATATGACTACTG
ATTTAGAGAATCCATATATTCTAATAGTTGATAAGAAAATCTCTAATATCCGCGATTTATTACCGATATTAGAAGGTGTT
TCTAAATCTGGTAGAGCGTTACTAATAATAGCTGAAGATGTAGAAAGTGAAGCTCTAGCTACTTTAGTTGTAAATAATAT
GCGTGGTGTAGTTAAAGTATGTGCTGTCAAAGCTCCTGGCTTTGGTGATAGAAGGAAAGCTATGCTAGAAGATATCGCTA
CTCTAACTGGAGCTACGTTTGTATCAGAAGACCTAAGCATGAAGTTAGAAGAAACTAACATGGAGCATTTAGGTACGGCT
AGTAGAGTACAAGTAACAAAAGATAATACAACAATTATTGATGGTGCTGGTGAAAAAGAAGCTATCGCTAAACGAATAAA
TGTAATCAAAGCTAATATTGCTGAAGCTAACTCTGATTATGATCGTGAGAAGCTGCAAGAAAGATTGGCTAAACTTTCTG
GTGGTGTCGCGGTGATAAAAGTTGGTGCTGTTACAGAAGCTGAGATGAAAGAGAAGAAAGATCGTGTCGATGATGCTTTA
CATGCTACTCGTGCGGCTGTAGAAGAAGGTATTGTTGCTGGTGGTGGCGTTGCTTTAATTAGAGCACAAAAAGCATTAGA
TGGCTTAACAGGTGAAAATGACGATCAAAACCATGGTATAGCGCTACTTAGAAAAGCAATAGAAGCTCCTCTAAGACAGA
TAGTATCAAATGCTGGCGGTGAGTCTTCTGTAGTTGTTAACCAAGTTAAAGCTAATCAAGGTAACTATGGTTATAATGCT
GCAAATGATACTTATGGTGATATGGTTGAGATGGGTATTTTAGATCCTACTAAAGTTACTCGTTCAGCTCTACAACATGC
TGCTTCAATTGCTGGACTTATGATCACTACAGAGGCGATGATCGGTGAGATTAAAGAAGCTGCTCCTGCTATGCCTATGG
GCGGTGGCATGGGCGGTATGCCTGGCATGATGTAA

Upstream 100 bases:

>100_bases
AAAGTTGGTGATGAAACTCTTCTAATGATGAGAGAAGAAGATATCATGGGTATTATTGCATAATCAAAGAATCTTACAAA
TTTTAAAGGAGAAATTAAAA

Downstream 100 bases:

>100_bases
TAGTCTTTATATTGTTATTAATCTAATAATACTTTTATATTATACTCAACTAGCTTATTTTCATTTAATTATTATTTCTT
ATATTACTTTAGCTCCGGAC

Product: chaperonin GroEL

Products: NA

Alternate protein names: GroEL protein; Protein Cpn60 [H]

Number of amino acids: Translated: 544; Mature: 543

Protein sequence:

>544_residues
MAAKQVLFSDEARAKMLDGVNTLANAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAKEIELEDKFENMGAQIVKEVASK
TADVAGDGTTTATVLAQALLTEGLKAVTAGMNPMDLKRGIDKATARLVEELKALSKPCSDPKSIEQVGTISANSDATVGK
LIADAMAKVGKEGVITVEEGKGFEDELDVVEGMQFDRGYLSPYFATNQENMTTDLENPYILIVDKKISNIRDLLPILEGV
SKSGRALLIIAEDVESEALATLVVNNMRGVVKVCAVKAPGFGDRRKAMLEDIATLTGATFVSEDLSMKLEETNMEHLGTA
SRVQVTKDNTTIIDGAGEKEAIAKRINVIKANIAEANSDYDREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKDRVDDAL
HATRAAVEEGIVAGGGVALIRAQKALDGLTGENDDQNHGIALLRKAIEAPLRQIVSNAGGESSVVVNQVKANQGNYGYNA
ANDTYGDMVEMGILDPTKVTRSALQHAASIAGLMITTEAMIGEIKEAAPAMPMGGGMGGMPGMM

Sequences:

>Translated_544_residues
MAAKQVLFSDEARAKMLDGVNTLANAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAKEIELEDKFENMGAQIVKEVASK
TADVAGDGTTTATVLAQALLTEGLKAVTAGMNPMDLKRGIDKATARLVEELKALSKPCSDPKSIEQVGTISANSDATVGK
LIADAMAKVGKEGVITVEEGKGFEDELDVVEGMQFDRGYLSPYFATNQENMTTDLENPYILIVDKKISNIRDLLPILEGV
SKSGRALLIIAEDVESEALATLVVNNMRGVVKVCAVKAPGFGDRRKAMLEDIATLTGATFVSEDLSMKLEETNMEHLGTA
SRVQVTKDNTTIIDGAGEKEAIAKRINVIKANIAEANSDYDREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKDRVDDAL
HATRAAVEEGIVAGGGVALIRAQKALDGLTGENDDQNHGIALLRKAIEAPLRQIVSNAGGESSVVVNQVKANQGNYGYNA
ANDTYGDMVEMGILDPTKVTRSALQHAASIAGLMITTEAMIGEIKEAAPAMPMGGGMGGMPGMM
>Mature_543_residues
AAKQVLFSDEARAKMLDGVNTLANAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAKEIELEDKFENMGAQIVKEVASKT
ADVAGDGTTTATVLAQALLTEGLKAVTAGMNPMDLKRGIDKATARLVEELKALSKPCSDPKSIEQVGTISANSDATVGKL
IADAMAKVGKEGVITVEEGKGFEDELDVVEGMQFDRGYLSPYFATNQENMTTDLENPYILIVDKKISNIRDLLPILEGVS
KSGRALLIIAEDVESEALATLVVNNMRGVVKVCAVKAPGFGDRRKAMLEDIATLTGATFVSEDLSMKLEETNMEHLGTAS
RVQVTKDNTTIIDGAGEKEAIAKRINVIKANIAEANSDYDREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKDRVDDALH
ATRAAVEEGIVAGGGVALIRAQKALDGLTGENDDQNHGIALLRKAIEAPLRQIVSNAGGESSVVVNQVKANQGNYGYNAA
NDTYGDMVEMGILDPTKVTRSALQHAASIAGLMITTEAMIGEIKEAAPAMPMGGGMGGMPGMM

Specific function: Prevents misfolding and promotes the refolding and proper assembly of unfolded polypeptides generated under stress conditions [H]

COG id: COG0459

COG function: function code O; Chaperonin GroEL (HSP60 family)

Gene ontology:

Cell location: Cytoplasm [H]

Metaboloic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the chaperonin (HSP60) family [H]

Homologues:

Organism=Homo sapiens, GI41399285, Length=523, Percent_Identity=49.9043977055449, Blast_Score=509, Evalue=1e-144,
Organism=Homo sapiens, GI31542947, Length=523, Percent_Identity=49.9043977055449, Blast_Score=509, Evalue=1e-144,
Organism=Escherichia coli, GI1790586, Length=524, Percent_Identity=74.4274809160305, Blast_Score=773, Evalue=0.0,
Organism=Caenorhabditis elegans, GI17555558, Length=524, Percent_Identity=51.1450381679389, Blast_Score=519, Evalue=1e-147,
Organism=Caenorhabditis elegans, GI193210679, Length=206, Percent_Identity=51.4563106796116, Blast_Score=203, Evalue=2e-52,
Organism=Saccharomyces cerevisiae, GI6323288, Length=522, Percent_Identity=54.9808429118774, Blast_Score=559, Evalue=1e-160,
Organism=Saccharomyces cerevisiae, GI6322524, Length=161, Percent_Identity=29.8136645962733, Blast_Score=68, Evalue=3e-12,
Organism=Drosophila melanogaster, GI24641193, Length=523, Percent_Identity=49.3307839388145, Blast_Score=523, Evalue=1e-148,
Organism=Drosophila melanogaster, GI24641191, Length=523, Percent_Identity=49.3307839388145, Blast_Score=523, Evalue=1e-148,
Organism=Drosophila melanogaster, GI45550936, Length=523, Percent_Identity=49.3307839388145, Blast_Score=516, Evalue=1e-146,
Organism=Drosophila melanogaster, GI45550132, Length=523, Percent_Identity=49.3307839388145, Blast_Score=516, Evalue=1e-146,
Organism=Drosophila melanogaster, GI45550935, Length=523, Percent_Identity=49.3307839388145, Blast_Score=516, Evalue=1e-146,
Organism=Drosophila melanogaster, GI17864606, Length=542, Percent_Identity=42.0664206642066, Blast_Score=428, Evalue=1e-120,
Organism=Drosophila melanogaster, GI24584129, Length=533, Percent_Identity=38.0863039399625, Blast_Score=328, Evalue=8e-90,
Organism=Drosophila melanogaster, GI19921262, Length=533, Percent_Identity=38.0863039399625, Blast_Score=328, Evalue=8e-90,

Paralogues:

None

Copy number: 2180 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 360 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 480 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). 15012 Molecules/Cell In: Growth Phase,

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018370
- InterPro:   IPR001844
- InterPro:   IPR002423 [H]

Pfam domain/function: PF00118 Cpn60_TCP1 [H]

EC number: NA

Molecular weight: Translated: 57403; Mature: 57272

Theoretical pI: Translated: 4.72; Mature: 4.72

Prosite motif: PS00296 CHAPERONINS_CPN60

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
4.2 %Met     (Translated Protein)
4.6 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
4.1 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MAAKQVLFSDEARAKMLDGVNTLANAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAKEI
CCCCHHHHCCHHHHHHHHHHHHHHHHEEEEECCCCCEEEEECCCCCCCCCCCCHHHHHHH
ELEDKFENMGAQIVKEVASKTADVAGDGTTTATVLAQALLTEGLKAVTAGMNPMDLKRGI
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH
DKATARLVEELKALSKPCSDPKSIEQVGTISANSDATVGKLIADAMAKVGKEGVITVEEG
HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECC
KGFEDELDVVEGMQFDRGYLSPYFATNQENMTTDLENPYILIVDKKISNIRDLLPILEGV
CCCHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCEEEEECCHHHHHHHHHHHHHCC
SKSGRALLIIAEDVESEALATLVVNNMRGVVKVCAVKAPGFGDRRKAMLEDIATLTGATF
CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHHHHHHHH
VSEDLSMKLEETNMEHLGTASRVQVTKDNTTIIDGAGEKEAIAKRINVIKANIAEANSDY
HHHHHHHHHHHCCHHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHCCCCC
DREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKDRVDDALHATRAAVEEGIVAGGGVALI
CHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH
RAQKALDGLTGENDDQNHGIALLRKAIEAPLRQIVSNAGGESSVVVNQVKANQGNYGYNA
HHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCC
ANDTYGDMVEMGILDPTKVTRSALQHAASIAGLMITTEAMIGEIKEAAPAMPMGGGMGGM
CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC
PGMM
CCCC
>Mature Secondary Structure 
AAKQVLFSDEARAKMLDGVNTLANAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAKEI
CCCHHHHCCHHHHHHHHHHHHHHHHEEEEECCCCCEEEEECCCCCCCCCCCCHHHHHHH
ELEDKFENMGAQIVKEVASKTADVAGDGTTTATVLAQALLTEGLKAVTAGMNPMDLKRGI
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH
DKATARLVEELKALSKPCSDPKSIEQVGTISANSDATVGKLIADAMAKVGKEGVITVEEG
HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECC
KGFEDELDVVEGMQFDRGYLSPYFATNQENMTTDLENPYILIVDKKISNIRDLLPILEGV
CCCHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCEEEEECCHHHHHHHHHHHHHCC
SKSGRALLIIAEDVESEALATLVVNNMRGVVKVCAVKAPGFGDRRKAMLEDIATLTGATF
CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHHHHHHHH
VSEDLSMKLEETNMEHLGTASRVQVTKDNTTIIDGAGEKEAIAKRINVIKANIAEANSDY
HHHHHHHHHHHCCHHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHCCCCC
DREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKDRVDDALHATRAAVEEGIVAGGGVALI
CHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH
RAQKALDGLTGENDDQNHGIALLRKAIEAPLRQIVSNAGGESSVVVNQVKANQGNYGYNA
HHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCC
ANDTYGDMVEMGILDPTKVTRSALQHAASIAGLMITTEAMIGEIKEAAPAMPMGGGMGGM
CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC
PGMM
CCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA