Definition: Streptococcus pneumoniae D39, complete genome.
Accession: NC_008533
Length: 2,046,115 bp

Map label: pepC [H]

Identifier: 116516913

GI number: 116516913

Start: 259030

End: 260364

Strand: Direct

Name: pepC [H]

Synonym: SPD_0261

Alternate gene names: 116516913

Gene position: 259030-260364 (Clockwise)

Preceding gene: 116516048

Following gene: 116515488

Centisome position: 12.66
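
Note on derived coordinates: the gene length and centisome position can be recomputed from the Start, End, and genome Length fields. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of genome length; the record does not state its formula, but this assumption reproduces the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_046_115   # Length of NC_008533
START, END = 259_030, 260_364

def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                     # 1335
print(round(centisome(START, GENOME_LENGTH), 2))   # 12.66
```

The length of 1335 bases agrees with the ">1335_bases" header of the gene sequence, and 1335 / 3 = 445 codons corresponds to 444 amino acids plus one stop codon.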

GC content: 43.67 %

Gene sequence:

>1335_bases
ATGAACGCGATTCAAGAATCATTTACTGATAAACTATTTGCCAACTATGAAGCAAATGTAAAATACCAAGCGATTGAAAA
TGCTGCCAGCCACAACGGAATTTTTGCAGCTCTGGAACGTCGCCAAAGCCATGTAGACAACACACCTGTTTTCTCATTGG
ATTTAACCAAGGACAAGGTCACTAACCAGAAAGCGTCTGGTCGTTGCTGGATGTTTGCGGCTCTCAACACCTTCCGCCAC
AAACTCATCTCGCAATACAAATTGGAGAACTTTGAGTTGTCACAAGCCCACACTTTCTTCTGGGATAAGTATGAGAAATC
AAACTGGTTCTTGGAGCAAGTCATTGCGACTTCAGACCAAGAATTGACTAGCCGCAAGGTTAGCTTCTTACTCCAAACAC
CTCAACAAGATGGCGGTCAATGGGATATGGTCGTTTCCCTCTTTGAAAAATACGGTGTCGTGCCTAAGTCAGTTTATCCT
GAGTCTGTTTCATCTAGCAGCAGTCGTGAGCTAAATGCGATCCTTAATAAATTGCTTCGTCAAGATGCTCAAATCTTGCG
TGACTTGCTTGTTTCTGGTGCAGATCAAGCGACTGTTCAAGCTAAGAAAGAAGACCTCTTGCAAGAAATCTTTAACTTTC
TTGCTATGTCATTGGGACTTCCACCACGCAAGTTTGACTTTGCTTATCGCGATAAAGATAACAACTACAAAAGTGAAAAA
GGAATCACACCACAAGAGTTTTACAAGAAATATGTCAATCTTCCTTTAGAAGACTACGTTTCTGTTATCAATGCTCCAAC
TGCTGATAAACCTTACGGAAAATCTTACACAGTTGAGATGTTGGGGAATGTGGTTGGTAGCCGTGCAGTTCGCTACATCA
ACGTTCCAATGGAGCGCTTGAAAGAATTGGCGATTGCCCAAATGCAAGCAGGTGAGACTGTTTGGTTTGGTTCTGATGTC
GGCCAGCTCAGCAACCGTAAGGCTGGCATCCTTGCGACAGATGTTTATGACTTTGAATCAAGCATGGACATTAAACTTAC
TCAAGACAAGGCTGGACGTTTGGACTATAGTGAAAGCTTGATGACCCACGCCATGGTCTTGACAGGTGTTGACTTGGACG
AAAATGGTAAATCAACCAAGTGGAAGGTTGAAAACTCATGGGGAGACAAGGTCGGTACAGATGGTTACTTTGTTGCCTCA
GACGCTTGGATGGACGAATACACATACCAAATCGTTGTTCGTAAGGAATTGCTGACAGCAGAAGAACAAGCTGCCTATGG
AGCAGAACCAATCGTACTTGCACCATGGGATCCAATGGGAGCCTTGGCTGAATAA
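
Note on GC content: the GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name gc_percent is hypothetical, and the fragment used is only the first 30 bases of the gene, so its value is not expected to match the whole-gene figure.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)

# First 30 bases of the gene sequence listed above (illustration only).
fragment = "ATGAACGCGATTCAAGAATCATTTACTGAT"
print(round(gc_percent(fragment), 2))
```

To check the record's value, pass the full 1335-base sequence (the lines after the ">1335_bases" header, joined) to the same function.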

Upstream 100 bases:

>100_bases
TTCGAAAAAGGTGAGGGACAATGTCCTCGCCTTTTATGTTTTTTAGTTGCTTCCTTTGTGAAAAGAGTTATAATAGACTG
TAGAATAAAAGGAGGAATCT

Downstream 100 bases:

>100_bases
AAGCATAGAAAAAAGGAATCAGATTTTAGAACCTGGTTCCTTTTTAGTTGCTTGATTACATGATGTGAAGAACATGTGCC
ACAATACCCACTGCGAAGAG

Product: aminopeptidase C

Products: NA

Alternate protein names: Bleomycin hydrolase [H]

Number of amino acids: Translated: 444; Mature: 444

Protein sequence:

>444_residues
MNAIQESFTDKLFANYEANVKYQAIENAASHNGIFAALERRQSHVDNTPVFSLDLTKDKVTNQKASGRCWMFAALNTFRH
KLISQYKLENFELSQAHTFFWDKYEKSNWFLEQVIATSDQELTSRKVSFLLQTPQQDGGQWDMVVSLFEKYGVVPKSVYP
ESVSSSSSRELNAILNKLLRQDAQILRDLLVSGADQATVQAKKEDLLQEIFNFLAMSLGLPPRKFDFAYRDKDNNYKSEK
GITPQEFYKKYVNLPLEDYVSVINAPTADKPYGKSYTVEMLGNVVGSRAVRYINVPMERLKELAIAQMQAGETVWFGSDV
GQLSNRKAGILATDVYDFESSMDIKLTQDKAGRLDYSESLMTHAMVLTGVDLDENGKSTKWKVENSWGDKVGTDGYFVAS
DAWMDEYTYQIVVRKELLTAEEQAAYGAEPIVLAPWDPMGALAE
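
Note on translation: the protein sequence above corresponds to a direct translation of the gene sequence. The sketch below assumes the standard codon assignments and stops at the first stop codon; it is an illustration, not the pipeline used to produce this record. The name translate and the "X" placeholder for unrecognized codons are choices made here.

```python
from itertools import product

# Standard codon assignments in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGAACGCGATTCAAGAATCATTTACTGAT"))  # MNAIQESFTD
print(translate("GGAGCCTTGGCTGAATAA"))              # GALAE
```

Both fragments match the corresponding ends of the 444-residue protein sequence listed above.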

Sequences:

>Translated_444_residues
MNAIQESFTDKLFANYEANVKYQAIENAASHNGIFAALERRQSHVDNTPVFSLDLTKDKVTNQKASGRCWMFAALNTFRH
KLISQYKLENFELSQAHTFFWDKYEKSNWFLEQVIATSDQELTSRKVSFLLQTPQQDGGQWDMVVSLFEKYGVVPKSVYP
ESVSSSSSRELNAILNKLLRQDAQILRDLLVSGADQATVQAKKEDLLQEIFNFLAMSLGLPPRKFDFAYRDKDNNYKSEK
GITPQEFYKKYVNLPLEDYVSVINAPTADKPYGKSYTVEMLGNVVGSRAVRYINVPMERLKELAIAQMQAGETVWFGSDV
GQLSNRKAGILATDVYDFESSMDIKLTQDKAGRLDYSESLMTHAMVLTGVDLDENGKSTKWKVENSWGDKVGTDGYFVAS
DAWMDEYTYQIVVRKELLTAEEQAAYGAEPIVLAPWDPMGALAE
>Mature_444_residues
MNAIQESFTDKLFANYEANVKYQAIENAASHNGIFAALERRQSHVDNTPVFSLDLTKDKVTNQKASGRCWMFAALNTFRH
KLISQYKLENFELSQAHTFFWDKYEKSNWFLEQVIATSDQELTSRKVSFLLQTPQQDGGQWDMVVSLFEKYGVVPKSVYP
ESVSSSSSRELNAILNKLLRQDAQILRDLLVSGADQATVQAKKEDLLQEIFNFLAMSLGLPPRKFDFAYRDKDNNYKSEK
GITPQEFYKKYVNLPLEDYVSVINAPTADKPYGKSYTVEMLGNVVGSRAVRYINVPMERLKELAIAQMQAGETVWFGSDV
GQLSNRKAGILATDVYDFESSMDIKLTQDKAGRLDYSESLMTHAMVLTGVDLDENGKSTKWKVENSWGDKVGTDGYFVAS
DAWMDEYTYQIVVRKELLTAEEQAAYGAEPIVLAPWDPMGALAE

Specific function: Unknown

COG id: COG3579

COG function: function code E; Aminopeptidase C

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase C1 family [H]

Homologues:

Organism=Homo sapiens, GI4557367, Length=392, Percent_Identity=44.1326530612245, Blast_Score=358, Evalue=5e-99,
Organism=Saccharomyces cerevisiae, GI6324090, Length=417, Percent_Identity=41.2470023980815, Blast_Score=315, Evalue=9e-87,
Organism=Drosophila melanogaster, GI161077632, Length=451, Percent_Identity=39.4678492239468, Blast_Score=336, Evalue=2e-92,
Organism=Drosophila melanogaster, GI24640588, Length=451, Percent_Identity=39.4678492239468, Blast_Score=335, Evalue=3e-92,
Organism=Drosophila melanogaster, GI161077630, Length=368, Percent_Identity=41.5760869565217, Blast_Score=296, Evalue=1e-80,
Organism=Drosophila melanogaster, GI20130175, Length=408, Percent_Identity=27.4509803921569, Blast_Score=160, Evalue=2e-39,
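
Note on parsing: each homologue line above is a comma-separated list of key=value fields plus a bare GI identifier. The sketch below shows one way to read such a line into a dictionary with typed values; the function name parse_homologue and the "GI" key are assumptions made for illustration, and organism names containing commas are not handled.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of this record into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI4557367
    # Convert the numeric fields when present.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record

line = ("Organism=Homo sapiens, GI4557367, Length=392, "
        "Percent_Identity=44.1326530612245, Blast_Score=358, Evalue=5e-99,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
```

The parser tolerates the trailing comma on each line and leaves any unrecognized key=value field as a string.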

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000169
- InterPro:   IPR004134 [H]

Pfam domain/function: PF03051 Peptidase_C1_2 [H]

EC number: 3.4.22.40 [H]

Molecular weight: Translated: 50256; Mature: 50256

Theoretical pI: Translated: 4.83; Mature: 4.83

Prosite motif: PS00139 THIOL_PROTEASE_CYS ; PS00639 THIOL_PROTEASE_HIS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
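
Note on Cys/Met content: these percentages are consistent with simple residue counts over the 444-residue protein (1 Cys and 12 Met), rounded to one decimal place. A minimal sketch, assuming that definition; the function name residue_percent is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return 100.0 * hits / len(seq)

# Counts for the full 444-residue protein: 1 Cys, 12 Met.
print(round(100.0 * 1 / 444, 1))    # 0.2
print(round(100.0 * 12 / 444, 1))   # 2.7
print(round(100.0 * 13 / 444, 1))   # 2.9

# The same calculation on a short fragment (first 10 residues of the protein).
print(residue_percent("MNAIQESFTD", "M"))   # 10.0
```

Passing the full protein sequence with "C", "M", or "CM" as the residue set should reproduce the three values in the table above.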

Secondary structure:

>Translated Secondary Structure
MNAIQESFTDKLFANYEANVKYQAIENAASHNGIFAALERRQSHVDNTPVFSLDLTKDKV
CCHHHHHHHHHHHHCCCCCEEEHHHHHHHCCCCHHHHHHHHHHCCCCCCEEEEECCHHHH
TNQKASGRCWMFAALNTFRHKLISQYKLENFELSQAHTFFWDKYEKSNWFLEQVIATSDQ
CCCCCCCCEEEHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHCCCH
ELTSRKVSFLLQTPQQDGGQWDMVVSLFEKYGVVPKSVYPESVSSSSSRELNAILNKLLR
HHHHHHHHHHEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHH
QDAQILRDLLVSGADQATVQAKKEDLLQEIFNFLAMSLGLPPRKFDFAYRDKDNNYKSEK
HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCC
GITPQEFYKKYVNLPLEDYVSVINAPTADKPYGKSYTVEMLGNVVGSRAVRYINVPMERL
CCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHEEEEECCHHHH
KELAIAQMQAGETVWFGSDVGQLSNRKAGILATDVYDFESSMDIKLTQDKAGRLDYSESL
HHHHHHHHHCCCEEEECCCHHHHCCCCCCEEEEEEECCCCCCCEEEECCCCCCCCHHHHH
MTHAMVLTGVDLDENGKSTKWKVENSWGDKVGTDGYFVASDAWMDEYTYQIVVRKELLTA
HHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCEEEECCCHHHHHHEEEEEHHHHHHH
EEQAAYGAEPIVLAPWDPMGALAE
HHHHHCCCCCEEEECCCCCCCCCC
>Mature Secondary Structure
MNAIQESFTDKLFANYEANVKYQAIENAASHNGIFAALERRQSHVDNTPVFSLDLTKDKV
CCHHHHHHHHHHHHCCCCCEEEHHHHHHHCCCCHHHHHHHHHHCCCCCCEEEEECCHHHH
TNQKASGRCWMFAALNTFRHKLISQYKLENFELSQAHTFFWDKYEKSNWFLEQVIATSDQ
CCCCCCCCEEEHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHCCCH
ELTSRKVSFLLQTPQQDGGQWDMVVSLFEKYGVVPKSVYPESVSSSSSRELNAILNKLLR
HHHHHHHHHHEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHH
QDAQILRDLLVSGADQATVQAKKEDLLQEIFNFLAMSLGLPPRKFDFAYRDKDNNYKSEK
HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCC
GITPQEFYKKYVNLPLEDYVSVINAPTADKPYGKSYTVEMLGNVVGSRAVRYINVPMERL
CCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHEEEEECCHHHH
KELAIAQMQAGETVWFGSDVGQLSNRKAGILATDVYDFESSMDIKLTQDKAGRLDYSESL
HHHHHHHHHCCCEEEECCCHHHHCCCCCCEEEEEEECCCCCCCEEEECCCCCCCCHHHHH
MTHAMVLTGVDLDENGKSTKWKVENSWGDKVGTDGYFVASDAWMDEYTYQIVVRKELLTA
HHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCEEEECCCHHHHHHEEEEEHHHHHHH
EEQAAYGAEPIVLAPWDPMGALAE
HHHHHCCCCCEEEECCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7925365 [H]