Definition: Listeria monocytogenes Clip81459, complete genome.
Accession: NC_012488
Length: 2,912,690

The map label for this gene is gcp [H]

Identifier: 226224681

GI number: 226224681

Start: 2154851

End: 2155885

Strand: Reverse

Name: gcp [H]

Synonym: Lm4b_02096

Alternate gene names: 226224681

Gene position: 2155885-2154851 (Counterclockwise)

Preceding gene: 226224682

Following gene: 226224678

Centisome position: 74.02
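
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the centisome position is the End coordinate (the 5' end of this reverse-strand gene) expressed as a percentage of the genome length. That assumption reproduces the listed value, but the record itself does not state the formula.

```python
GENOME_LENGTH = 2_912_690  # "Length" field of this record


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive coordinate pair."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals.

    Assumption: the database derives this field from a single coordinate.
    """
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(2154851, 2155885))      # matches the 1035-base gene sequence
    print(centisome(2155885, GENOME_LENGTH))  # matches the listed 74.02
```

Under the same assumption, using the Start coordinate instead gives 73.98, so only the End coordinate matches the listed figure.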

GC content: 42.32
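
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows, assuming the value above is computed over the gene sequence alone and rounded to two decimals; the function name `gc_percent` is hypothetical and not part of the database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (two decimals)."""
    seq = "".join(sequence.split()).upper()  # tolerate line breaks in FASTA text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # Toy input, not the gene itself: 4 of 8 bases are G or C.
    print(gc_percent("GGCCAATT"))
```

Passing the full gene sequence from the record (with the FASTA header line removed) should allow the listed figure to be checked.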

Gene sequence:

>1035_bases
GTGGGTGGACTTATGAAAAAAAATACATTAATTCTTGGAATAGAATCTAGCTGCGATGAAACAGCTGCTTCTGTTGTAAA
AAATGGTAATGAAATTATATCGAGTGTGGTGGCTTCTCAAATTGAGAGCCATAAACGATTTGGCGGGGTAGTTCCTGAAA
TCGCATCAAGACATCACGTGGAGCAAATTACGCTTGTGATTGAAGAAGCTTTAAAACAAGCAAATGTGACGATGGATGAT
TTAGACGGGATAGCTGTGACAGAAGGTCCAGGTCTAGTTGGTGCGCTGCTTATCGGAGTGAATGCGGCTAAAACGCTCGC
TTTTATGCACAATTTACCTTTAGTTGGCGTGCATCATATTGCTGGCCATATTTATGCAAATCGTTTTGAAACTGAATTCA
AATTTCCGCTACTTTCATTAGTTGTCAGCGGCGGTCATACTGAACTAGTTTTAATGAAAGCAGATAATGAGTTTGAAATT
ATCGGGGAGACAAGGGACGATGCAGCTGGTGAAGCTTATGATAAAGTGGCACGTACACTTGGTCTCGCATACCCAGGCGG
CGTGCAAATTGATAAACTTGCCAAAGACGGCGAAGATACGTTTCATTTCCCAAGAGCGATGATGGATGAGGGTTCGTTTG
ATTTTAGTTTTAGTGGATTGAAGTCTTCCTTTATTAACACGCTTCATAATTTAAGGCAGCGTGGTGAGGAGCCAAATCCA
AACGATATGGCGGCGAGTTTTCAAGCAAGCGTTGTGGATGTTTTAGTAAGCAAAACGATTCGTGCTGCTAAACAATACGA
TGTGAAACAACTGCTTCTTGCGGGAGGCGTGGCAGCGAACCAAGGTTTACGAGAACGCCTTATTCAAGAAGTAAAACTAG
AGCTTCCAGAGACAGAACTGATTATTCCACCATTAGCCTTATGCGGAGACAATGCGGCAATGATTGCTGCTGCAGGGACT
GTGAGTTTCTTACAAGGAAAACGTAGTGGTTTTGATATGAATGCGAATCCGGGATTATTGCTGGAAGATATATAA
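
The gene sequence is given in coding orientation: it begins with the alternative start codon GTG and ends with the stop codon TAA. The sketch below shows how such a sequence maps to the protein listed further down, assuming the standard codon assignments and assuming that the initiator codon is read as methionine whatever its sequence, as is usual in bacteria. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence given 5'->3' in coding orientation.

    The first codon is always emitted as M (assumption: it is a genuine
    initiator codon such as ATG, GTG or TTG). Translation stops at the
    first stop codon.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First nine codons of the gene above, followed by a stop codon.
    print(translate_cds("GTGGGTGGACTTATGAAAAAAAATACATAA"))
```

The fragment used here yields MGGLMKKNT, which agrees with the first nine residues of the protein sequence in this record. The 1035 bases form 345 codons, that is 344 residues plus the stop codon.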

Upstream 100 bases:

>100_bases
CCAACGATGTAGCACAAGGGCTTTATAAGAAATTAGGATTTCAAGACGGCGCCATTCGGAAAAACTATTATCCGGATACG
AAAGAAGACGCGCTAGTGAT

Downstream 100 bases:

>100_bases
AAAGCGCGCGATTTTTCGTTCCGCACGCTTCGGACTGCAGTGAAAGTGAACTATTACTTTCGCTGCAGTCTATTTTTATT
AGCTCAATTTTCTGGTTATG

Product: putative DNA-binding/iron metalloprotein/AP endonuclease

Products: NA

Alternate protein names: Glycoprotease [H]

Number of amino acids: Translated: 344; Mature: 343

Protein sequence:

>344_residues
MGGLMKKNTLILGIESSCDETAASVVKNGNEIISSVVASQIESHKRFGGVVPEIASRHHVEQITLVIEEALKQANVTMDD
LDGIAVTEGPGLVGALLIGVNAAKTLAFMHNLPLVGVHHIAGHIYANRFETEFKFPLLSLVVSGGHTELVLMKADNEFEI
IGETRDDAAGEAYDKVARTLGLAYPGGVQIDKLAKDGEDTFHFPRAMMDEGSFDFSFSGLKSSFINTLHNLRQRGEEPNP
NDMAASFQASVVDVLVSKTIRAAKQYDVKQLLLAGGVAANQGLRERLIQEVKLELPETELIIPPLALCGDNAAMIAAAGT
VSFLQGKRSGFDMNANPGLLLEDI

Sequences:

>Translated_344_residues
MGGLMKKNTLILGIESSCDETAASVVKNGNEIISSVVASQIESHKRFGGVVPEIASRHHVEQITLVIEEALKQANVTMDD
LDGIAVTEGPGLVGALLIGVNAAKTLAFMHNLPLVGVHHIAGHIYANRFETEFKFPLLSLVVSGGHTELVLMKADNEFEI
IGETRDDAAGEAYDKVARTLGLAYPGGVQIDKLAKDGEDTFHFPRAMMDEGSFDFSFSGLKSSFINTLHNLRQRGEEPNP
NDMAASFQASVVDVLVSKTIRAAKQYDVKQLLLAGGVAANQGLRERLIQEVKLELPETELIIPPLALCGDNAAMIAAAGT
VSFLQGKRSGFDMNANPGLLLEDI
>Mature_343_residues
GGLMKKNTLILGIESSCDETAASVVKNGNEIISSVVASQIESHKRFGGVVPEIASRHHVEQITLVIEEALKQANVTMDDL
DGIAVTEGPGLVGALLIGVNAAKTLAFMHNLPLVGVHHIAGHIYANRFETEFKFPLLSLVVSGGHTELVLMKADNEFEII
GETRDDAAGEAYDKVARTLGLAYPGGVQIDKLAKDGEDTFHFPRAMMDEGSFDFSFSGLKSSFINTLHNLRQRGEEPNPN
DMAASFQASVVDVLVSKTIRAAKQYDVKQLLLAGGVAANQGLRERLIQEVKLELPETELIIPPLALCGDNAAMIAAAGTV
SFLQGKRSGFDMNANPGLLLEDI

Specific function: Could be a metalloprotease. [C]

COG id: COG0533

COG function: function code O; Metal-dependent proteases with possible chaperone activity

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M22 family [H]

Homologues:

Organism=Homo sapiens, GI116812636, Length=340, Percent_Identity=34.1176470588235, Blast_Score=169, Evalue=3e-42,
Organism=Homo sapiens, GI8923380, Length=325, Percent_Identity=34.1538461538462, Blast_Score=145, Evalue=8e-35,
Organism=Escherichia coli, GI1789445, Length=337, Percent_Identity=42.433234421365, Blast_Score=265, Evalue=3e-72,
Organism=Caenorhabditis elegans, GI17557464, Length=324, Percent_Identity=31.7901234567901, Blast_Score=149, Evalue=2e-36,
Organism=Caenorhabditis elegans, GI71995670, Length=334, Percent_Identity=33.5329341317365, Blast_Score=139, Evalue=2e-33,
Organism=Saccharomyces cerevisiae, GI6320099, Length=365, Percent_Identity=32.6027397260274, Blast_Score=152, Evalue=6e-38,
Organism=Saccharomyces cerevisiae, GI6322891, Length=349, Percent_Identity=30.0859598853868, Blast_Score=120, Evalue=3e-28,
Organism=Drosophila melanogaster, GI20129063, Length=353, Percent_Identity=31.728045325779, Blast_Score=171, Evalue=5e-43,
Organism=Drosophila melanogaster, GI21357207, Length=334, Percent_Identity=32.6347305389222, Blast_Score=151, Evalue=5e-37,
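
Each homologue line above is a comma-separated list of key=value fields plus a bare GI identifier. For downstream filtering (for example by E-value), a line can be turned into a typed record. This is only a sketch: `parse_homologue` is a hypothetical helper, and the field names are taken from the lines as they appear here.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


if __name__ == "__main__":
    line = ("Organism=Escherichia coli, GI1789445, Length=337, "
            "Percent_Identity=42.433234421365, Blast_Score=265, Evalue=3e-72,")
    hit = parse_homologue(line)
    print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Sorting the parsed records by `Evalue` would place the Escherichia coli hit first, since it has the smallest E-value and highest BLAST score in the list.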

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR022450
- InterPro:   IPR000905
- InterPro:   IPR017860
- InterPro:   IPR017861 [H]

Pfam domain/function: PF00814 Peptidase_M22 [H]

EC number: 3.4.24.57 [H]

Molecular weight: Translated: 36875; Mature: 36743

Theoretical pI: Translated: 4.82; Mature: 4.82

Prosite motif: PS01016 GLYCOPROTEASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
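
The percentages above follow from simple residue counts. A minimal sketch, assuming that the mature protein is the translated protein without its initiator methionine (consistent with the 344 and 343 residue sequences in this record) and that values are rounded to one decimal; the function names are hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given residue letters."""
    seq = "".join(protein.split()).upper()
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * hits / len(seq), 1)


def mature_form(protein: str) -> str:
    """Drop the initiator Met, if present (assumed processing step)."""
    seq = "".join(protein.split()).upper()
    return seq[1:] if seq.startswith("M") else seq


if __name__ == "__main__":
    toy = "MCAAAAAAAA"  # toy sequence, not the protein in this record
    print(residue_percent(toy, "C"), residue_percent(toy, "CM"))
    print(residue_percent(mature_form(toy), "M"))
```

For the protein in this record, the translated sequence contains 2 Cys and 10 Met in 344 residues (0.6 % and 2.9 %), and the mature sequence contains 2 Cys and 9 Met in 343 residues (0.6 % and 2.6 %), in agreement with the listed values.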

Secondary structure:

>Translated Secondary Structure
MGGLMKKNTLILGIESSCDETAASVVKNGNEIISSVVASQIESHKRFGGVVPEIASRHHV
CCCCCCCCEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
EQITLVIEEALKQANVTMDDLDGIAVTEGPGLVGALLIGVNAAKTLAFMHNLPLVGVHHI
HHHHHHHHHHHHHCCCCHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHH
AGHIYANRFETEFKFPLLSLVVSGGHTELVLMKADNEFEIIGETRDDAAGEAYDKVARTL
HHHHHHHHCCCCCCCHHHHHHHCCCCEEEEEEECCCCEEEECCCCCCCCHHHHHHHHHHH
GLAYPGGVQIDKLAKDGEDTFHFPRAMMDEGSFDFSFSGLKSSFINTLHNLRQRGEEPNP
CCCCCCCCCHHHHHCCCCCHHHCCHHHHCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCC
NDMAASFQASVVDVLVSKTIRAAKQYDVKQLLLAGGVAANQGLRERLIQEVKLELPETEL
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCE
IIPPLALCGDNAAMIAAAGTVSFLQGKRSGFDMNANPGLLLEDI
ECCCHHHCCCCCEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECC
>Mature Secondary Structure 
GGLMKKNTLILGIESSCDETAASVVKNGNEIISSVVASQIESHKRFGGVVPEIASRHHV
CCCCCCCEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
EQITLVIEEALKQANVTMDDLDGIAVTEGPGLVGALLIGVNAAKTLAFMHNLPLVGVHHI
HHHHHHHHHHHHHCCCCHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHH
AGHIYANRFETEFKFPLLSLVVSGGHTELVLMKADNEFEIIGETRDDAAGEAYDKVARTL
HHHHHHHHCCCCCCCHHHHHHHCCCCEEEEEEECCCCEEEECCCCCCCCHHHHHHHHHHH
GLAYPGGVQIDKLAKDGEDTFHFPRAMMDEGSFDFSFSGLKSSFINTLHNLRQRGEEPNP
CCCCCCCCCHHHHHCCCCCHHHCCHHHHCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCC
NDMAASFQASVVDVLVSKTIRAAKQYDVKQLLLAGGVAANQGLRERLIQEVKLELPETEL
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCE
IIPPLALCGDNAAMIAAAGTVSFLQGKRSGFDMNANPGLLLEDI
ECCCHHHCCCCCEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11679669 [H]