Definition Prochlorococcus marinus str. MIT 9313 chromosome, complete genome.
Accession NC_005071
Length 2,410,873

The map label for this gene is gcp [H]

Identifier: 33863583

GI number: 33863583

Start: 1401516

End: 1402586

Strand: Reverse

Name: gcp [H]

Synonym: PMT1315

Alternate gene names: 33863583

Gene position: 1402586-1401516 (Counterclockwise)
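
The coordinates above determine the gene length. A minimal sketch of that arithmetic, assuming the Start and End values are 1-based and inclusive (an assumption, although it agrees with the 1071-base sequence and the 356-residue translation listed in this record). The function name is illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return abs(end - start) + 1


length = gene_length(1401516, 1402586)  # Start and End from this record
codons = length // 3                    # codon count, stop codon included
residues = codons - 1                   # translated residues, stop excluded
print(length, codons, residues)
```

Because the strand is Reverse, the coding sequence reads from 1402586 down to 1401516, which is how the "Gene position" field lists it.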

Preceding gene: 109150048

Following gene: 33863582

Centisome position: 58.18
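
The record does not state how the centisome position is derived. The sketch below assumes it is the first coordinate of the "Gene position" field (1402586) expressed as a percentage of the chromosome length, which reproduces the listed value. The function name is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to two decimals."""
    return round(100.0 * position / genome_length, 2)


# 1402586 is the first coordinate of "Gene position"; 2410873 is the Length field.
print(centisome(1402586, 2410873))
```

Using the lower coordinate (1401516) instead gives a slightly smaller value, so the choice of reference coordinate matters for genes on the reverse strand.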

GC content: 56.12
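
GC content is conventionally the percentage of G and C bases in a sequence. The record does not say whether ambiguous bases are handled specially, so this sketch simply assumes an unambiguous A/C/G/T string. The function name is illustrative.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# Example on the first 30 bases of the gene sequence in this record.
print(round(gc_content("ATGCCGACGGTGCTTGCCCTCGAAACAAGT"), 2))
```

Applying the same function to the full 1071-base sequence would be the way to check the listed value.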

Gene sequence:

>1071_bases
ATGCCGACGGTGCTTGCCCTCGAAACAAGTTGTGACGAGTCGGCTGCCGCAGTTCTACGTCTAAATAACGGGTGTTTGCA
GGTTATCGCTAGCCGAATTGCTTCTCAGGTTGAGAAACATGCCCAGTGGGGAGGCGTGGTACCGGAAGTGGCCTCTCGCT
TGCATGTGGAGGCTCTGCCTCATCTTGTAGAGGAGGTTTTGCAGGAGGCGGGACAGTCGATGGCTCGCTTTGATGCTGTA
GCGGCAACGGTGACCCCTGGGCTGGCCGGAGCACTGATGGTGGGATCGGTGACAGGCCGATCTTTGGCTGCTCTCCATGC
GCTGCCTTTTTTTGGCATTCATCACCTGGAGGGGCATTTGGCTTCAGTGCGCTTGGCGGAACATCCTCCACGCCCTCCTT
ATCTGGTGCTACTTGTGAGCGGGGGACACACTGAGTTGATTCGGGTCGGGGCAGAGAGTGAGATGGTGCGTCTTGGACGT
AGCCATGATGATGCTGCTGGAGAGGCTTTTGACAAAGTTGGTCGTTTGCTCGGTTTGGCCTATCCAGGTGGCCCCGCTAT
TCAGGCGTTGGCCGCGACTGGAGACTCTGGCAGATTTTCTTTGCCCAAAGGACGGGTCTCTAAGCCTGGTGGTGGATTTC
ATCCCTACGATTTCTCTTTCAGCGGATTGAAGACCGCCATGCTGCGCCTGGTTCAGGCTCTTTCAGAGGCTGATGAAGAC
CTGCCCCGCGCAGATCTTGCTGCCAGTTTTGAGCAAGTGGTTGCAGATGTTTTGGTCGAGCGCAGCTTGCTCTGCGCTAA
TGACCAGGGCCTAAAAACTGTGGTGATGGTTGGAGGAGTCGCTGCTAACCGCCGCCTAAGAGAACTGATGAGCAAACGTG
GACAAGAACAGGGAATTGAAGTGCACACGGCACCGCTTCGATACTGCACGGATAATGCCGCCATGATTGGAGCAGCGGCC
CTGCAGCGCTTGGTGTCTGGGGTTAATGGCAGCTCTCTTGAGCTGGGAGTGGCAGCCCGATGGCCATTGGACAAAACCGA
GGTTTTGTACCATTCACCTCCCCCATTTTGA
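
The listed gene sequence is already in coding orientation (it starts with ATG and ends with TGA), so it can be translated directly. A minimal sketch using the standard codon assignments; the compact table construction and the function name are illustrative choices, not part of the source record.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 30 bases of the gene sequence in this record.
print(translate("ATGCCGACGGTGCTTGCCCTCGAAACAAGT"))
print(translate("GTTTTGTACCATTCACCTCCCCCATTTTGA"))
```

The two fragments translate to the start and end of the protein sequence given in this record, with the final codon TGA read as a stop.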

Upstream 100 bases:

>100_bases
ACAACGGCGAAGAGACGACGCATCGGTTTCGATTCCTGTAAGACCTGCTCCGGAACGGCAGGATGTCACAAGTAAAATGT
AAATGAGGCTTCCCACAACC

Downstream 100 bases:

>100_bases
AGAAGCGAATTAGCTTGACCTTGTCTGCTGTGCAGCGATGGCTTCTGAATCTCCACTTGATTCCAACACCTCTGCGGAAC
CTGTCAGTAGTGAAGAGCTA

Product: DNA-binding/iron metalloprotein/AP endonuclease

Products: NA

Alternate protein names: Glycoprotease [H]

Number of amino acids: Translated: 356; Mature: 355
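
In this record the mature sequence is the translated sequence without its first residue, which matches removal of the initiator methionine. A small sketch of that step; the function name is hypothetical, and whether the methionine is actually cleaved in vivo is not something this record establishes.

```python
def remove_initiator_met(protein: str) -> str:
    """Drop a leading methionine, if present, to model the 'mature' form."""
    return protein[1:] if protein.startswith("M") else protein


translated_prefix = "MPTVLALETS"  # first ten residues of the translated sequence
print(remove_initiator_met(translated_prefix))
```

Applied to the full 356-residue translation, this yields the 355-residue mature sequence.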

Protein sequence:

>356_residues
MPTVLALETSCDESAAAVLRLNNGCLQVIASRIASQVEKHAQWGGVVPEVASRLHVEALPHLVEEVLQEAGQSMARFDAV
AATVTPGLAGALMVGSVTGRSLAALHALPFFGIHHLEGHLASVRLAEHPPRPPYLVLLVSGGHTELIRVGAESEMVRLGR
SHDDAAGEAFDKVGRLLGLAYPGGPAIQALAATGDSGRFSLPKGRVSKPGGGFHPYDFSFSGLKTAMLRLVQALSEADED
LPRADLAASFEQVVADVLVERSLLCANDQGLKTVVMVGGVAANRRLRELMSKRGQEQGIEVHTAPLRYCTDNAAMIGAAA
LQRLVSGVNGSSLELGVAARWPLDKTEVLYHSPPPF

Sequences:

>Translated_356_residues
MPTVLALETSCDESAAAVLRLNNGCLQVIASRIASQVEKHAQWGGVVPEVASRLHVEALPHLVEEVLQEAGQSMARFDAV
AATVTPGLAGALMVGSVTGRSLAALHALPFFGIHHLEGHLASVRLAEHPPRPPYLVLLVSGGHTELIRVGAESEMVRLGR
SHDDAAGEAFDKVGRLLGLAYPGGPAIQALAATGDSGRFSLPKGRVSKPGGGFHPYDFSFSGLKTAMLRLVQALSEADED
LPRADLAASFEQVVADVLVERSLLCANDQGLKTVVMVGGVAANRRLRELMSKRGQEQGIEVHTAPLRYCTDNAAMIGAAA
LQRLVSGVNGSSLELGVAARWPLDKTEVLYHSPPPF
>Mature_355_residues
PTVLALETSCDESAAAVLRLNNGCLQVIASRIASQVEKHAQWGGVVPEVASRLHVEALPHLVEEVLQEAGQSMARFDAVA
ATVTPGLAGALMVGSVTGRSLAALHALPFFGIHHLEGHLASVRLAEHPPRPPYLVLLVSGGHTELIRVGAESEMVRLGRS
HDDAAGEAFDKVGRLLGLAYPGGPAIQALAATGDSGRFSLPKGRVSKPGGGFHPYDFSFSGLKTAMLRLVQALSEADEDL
PRADLAASFEQVVADVLVERSLLCANDQGLKTVVMVGGVAANRRLRELMSKRGQEQGIEVHTAPLRYCTDNAAMIGAAAL
QRLVSGVNGSSLELGVAARWPLDKTEVLYHSPPPF

Specific function: Could be a metalloprotease [C]

COG id: COG0533

COG function: function code O; Metal-dependent proteases with possible chaperone activity

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M22 family [H]

Homologues:

Organism=Homo sapiens, GI116812636, Length=352, Percent_Identity=34.6590909090909, Blast_Score=162, Evalue=5e-40
Organism=Homo sapiens, GI8923380, Length=354, Percent_Identity=29.3785310734463, Blast_Score=122, Evalue=7e-28
Organism=Escherichia coli, GI1789445, Length=340, Percent_Identity=45, Blast_Score=273, Evalue=1e-74
Organism=Caenorhabditis elegans, GI17557464, Length=339, Percent_Identity=33.6283185840708, Blast_Score=144, Evalue=5e-35
Organism=Caenorhabditis elegans, GI71995670, Length=314, Percent_Identity=28.343949044586, Blast_Score=102, Evalue=4e-22
Organism=Saccharomyces cerevisiae, GI6320099, Length=369, Percent_Identity=27.6422764227642, Blast_Score=110, Evalue=3e-25
Organism=Saccharomyces cerevisiae, GI6322891, Length=291, Percent_Identity=27.1477663230241, Blast_Score=82, Evalue=1e-16
Organism=Drosophila melanogaster, GI20129063, Length=377, Percent_Identity=29.4429708222812, Blast_Score=144, Evalue=1e-34
Organism=Drosophila melanogaster, GI21357207, Length=322, Percent_Identity=29.1925465838509, Blast_Score=123, Evalue=2e-28
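
Each homologue line is a comma-separated list of key=value fields plus a bare GI identifier. A minimal parsing sketch, assuming every line follows the layout seen in this record; the function name and the output dictionary keys are illustrative, not a defined schema.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a typed dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the numeric part as a string
    # Convert the numeric fields; missing keys raise KeyError by design.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1789445, Length=340, "
        "Percent_Identity=45, Blast_Score=273, Evalue=1e-74,")
hit = parse_homologue(line)
print(hit["Organism"], hit["Blast_Score"], round(hit["Percent_Identity"], 1))
```

Parsed this way, the hits can be sorted by Blast_Score or Evalue; in this record the Escherichia coli entry has the highest score.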

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR022450
- InterPro:   IPR000905
- InterPro:   IPR017861 [H]

Pfam domain/function: PF00814 Peptidase_M22 [H]

EC number: 3.4.24.57 [H]

Molecular weight: Translated: 37553; Mature: 37421

Theoretical pI: Translated: 6.57; Mature: 6.57

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
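
These percentages can be recomputed from the sequences in this record. The sketch below assumes each value is a residue count divided by the sequence length, rounded to one decimal place, with Cys+Met computed from the combined count rather than by adding the rounded values. The function name is illustrative.

```python
TRANSLATED = (
    "MPTVLALETSCDESAAAVLRLNNGCLQVIASRIASQVEKHAQWGGVVPEVASRLHVEALPHLVEEVLQEAGQSMARFDAV"
    "AATVTPGLAGALMVGSVTGRSLAALHALPFFGIHHLEGHLASVRLAEHPPRPPYLVLLVSGGHTELIRVGAESEMVRLGR"
    "SHDDAAGEAFDKVGRLLGLAYPGGPAIQALAATGDSGRFSLPKGRVSKPGGGFHPYDFSFSGLKTAMLRLVQALSEADED"
    "LPRADLAASFEQVVADVLVERSLLCANDQGLKTVVMVGGVAANRRLRELMSKRGQEQGIEVHTAPLRYCTDNAAMIGAAA"
    "LQRLVSGVNGSSLELGVAARWPLDKTEVLYHSPPPF"
)
MATURE = TRANSLATED[1:]  # translated sequence without the initiator Met


def cys_met_percent(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


print(cys_met_percent(TRANSLATED))
print(cys_met_percent(MATURE))
```

Only the Met figures differ between the two forms, because the single residue removed from the mature protein is a methionine.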

Secondary structure:

>Translated Secondary Structure
MPTVLALETSCDESAAAVLRLNNGCLQVIASRIASQVEKHAQWGGVVPEVASRLHVEALP
CCCEEEEECCCCCHHHHHEEECCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
HLVEEVLQEAGQSMARFDAVAATVTPGLAGALMVGSVTGRSLAALHALPFFGIHHLEGHL
HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
ASVRLAEHPPRPPYLVLLVSGGHTELIRVGAESEMVRLGRSHDDAAGEAFDKVGRLLGLA
HEEEECCCCCCCCEEEEEEECCCEEEEEECCHHHHHHHCCCCCCHHHHHHHHHHHHHCCC
YPGGPAIQALAATGDSGRFSLPKGRVSKPGGGFHPYDFSFSGLKTAMLRLVQALSEADED
CCCCHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC
LPRADLAASFEQVVADVLVERSLLCANDQGLKTVVMVGGVAANRRLRELMSKRGQEQGIE
CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHCCHHCCCE
VHTAPLRYCTDNAAMIGAAALQRLVSGVNGSSLELGVAARWPLDKTEVLYHSPPPF
EEECCHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCEEECCCCCC
>Mature Secondary Structure 
PTVLALETSCDESAAAVLRLNNGCLQVIASRIASQVEKHAQWGGVVPEVASRLHVEALP
CCEEEEECCCCCHHHHHEEECCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
HLVEEVLQEAGQSMARFDAVAATVTPGLAGALMVGSVTGRSLAALHALPFFGIHHLEGHL
HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
ASVRLAEHPPRPPYLVLLVSGGHTELIRVGAESEMVRLGRSHDDAAGEAFDKVGRLLGLA
HEEEECCCCCCCCEEEEEEECCCEEEEEECCHHHHHHHCCCCCCHHHHHHHHHHHHHCCC
YPGGPAIQALAATGDSGRFSLPKGRVSKPGGGFHPYDFSFSGLKTAMLRLVQALSEADED
CCCCHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC
LPRADLAASFEQVVADVLVERSLLCANDQGLKTVVMVGGVAANRRLRELMSKRGQEQGIE
CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHCCHHCCCE
VHTAPLRYCTDNAAMIGAAALQRLVSGVNGSSLELGVAARWPLDKTEVLYHSPPPF
EEECCHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCEEECCCCCC
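
Each prediction line pairs one state letter with each residue. The sketch below assumes the usual three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, and summarises a prediction string as percentages. The function name is illustrative.

```python
from collections import Counter


def state_composition(prediction: str) -> dict:
    """Percentage of each secondary-structure state letter in a prediction string."""
    if not prediction:
        raise ValueError("empty prediction")
    counts = Counter(prediction)
    return {state: 100.0 * n / len(prediction) for state, n in counts.items()}


# First 60-residue prediction line from the translated secondary structure above.
first_line = "CCCEEEEECCCCCHHHHHEEECCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH"
composition = state_composition(first_line)
print({state: round(pct, 1) for state, pct in sorted(composition.items())})
```

Concatenating all prediction lines before calling the function gives the composition for the whole protein.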

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA