Definition: Streptococcus pneumoniae ATCC 700669, complete genome.
Accession: NC_011900
Length: 2,221,315 bp

The map label for this gene is pepO

Identifier: 221232386

GI number: 221232386

Start: 1585122

End: 1587014

Strand: Reverse

Name: pepO

Synonym: SPN23F_16490

Alternate gene names: NA

Gene position: 1587014-1585122 (Counterclockwise)

Preceding gene: 221232391

Following gene: 221232384

Centisome position: 71.44
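
The coordinate fields above can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates, and it assumes that the centisome position is the first coding base of the gene (position 1587014, since the gene lies on the reverse strand) expressed as a percentage of the genome length. The function names are illustrative and are not part of the source database.

```python
GENOME_LENGTH = 2_221_315           # from the "Length" field
START, END = 1_585_122, 1_587_014   # from the "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the first coding base is at END.
position = centisome(END, GENOME_LENGTH)
print(length, position)
```

Under these assumptions the sketch gives a gene length of 1893 bases and a centisome position of 71.44, matching the record.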

GC content: 44.69
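
A GC content figure of this kind is usually the percentage of G and C bases in the gene sequence. The record does not state its exact method, so the following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, demonstrated only on the first 18 bases of the gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100 * gc / len(seq), 2)


# First six codons of the gene sequence, used only as a small example.
first_codons = "ATGACACGTTATCAAGAT"
print(gc_content(first_codons))
```

Applying the same function to the full 1893-base sequence should reproduce the reported value if the database uses the same definition.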

Gene sequence:

>1893_bases
ATGACACGTTATCAAGATGATTTTTATGATGCTATCAATGGAGAATGGCAACAGACAGCTGAAATCCCAGCAGATAAGTC
TCAAACAGGAGGTTTTGTTGATTTAGACCAGGAAATTGAAGACCTGATGCTGGCGACAACAGACAAGTGGTTAGCAGGTG
AAGAAGTGCCTGAGGATGCTATCTTGGAAAACTTTGTCAAATACCACCGCCTAGTTCGTGATTTTGACAAGAGAGAAGCT
GACGGTATCACACCTGTCTTACCACTCCTTAAAGAATTCCAAGAATTGGAAACTTTTGCGGATTTTACAGCTAAACTAGC
AGAGTTTGAGCTTGCAGGAAAACCAAACTTCCTTCCTTTTGGTGTATCGCCAGACTTTATGGATGCTAGAATCAATGTTC
TATGGGCTAGCGCTCCAAGCACAATCTTGCCAGATACGACCTACTATGCAGAAGAACATCCTCAGCGCGAAGAGCTCTTG
ACTCTTTGGAAAGAAAGCAGCGCAAATCTCCTCAAGGCTTATGATTTCTCTGATGAAGAAATTGAAGACTTGCTAGAAAA
AAGACTTGAATTGGACCGCCGAGTTGTGGCAGTGGTGCTCTCTAATGAAGAAAGTTCAGAATATGCTAAACTCTATCATC
CATATTCTTACGAAGATTTCAAGAAATTCGCGCCTGCCCTACCTTTGGATGACTTCTTCAAAGCAGTTATTGGGCAATTA
CCAGACAAGGTTATTGTAGACGAGGAACGTTTCTGGCAAGCAGCAGAGCAATTTTACAGTGAGGAAGCCTGGTCTCTCCT
TAAAGCAACCTTGATTTTGAGTGTTGTCAATCTTTCAACCAGCTATTTAACAGAGGATATCCGTGTTTTGTCTGGTGCCT
ACAGCCGTGCCCTTTCTGGAGTTCCAGAGGCAAAAGATAAGGTCAAAGCAGCTTATCATCTAGCACAGGAACCTTTCAAG
CAAGCCCTGGGGCTTTGGTACGCCCGTGAGAAGTTCTCTCCAGAAGCCAAGGCGGATGTGGAGAAAAAAGTGGCAACCAT
GATTGATGTCTATAAGGAGCGTCTGCTTAAGAATGACTGGCTCACTCCAGAAACCTGTAAACAGGCTATCGTGAAGCTCA
ATGTGATCAAACCTTATATTGGCTATCCAGAAGAATTGCCTGCACGTTACAAGGATAAGGTAGTGAATGAAACTGCCAGT
CTTTTTGAGAATGCTCTAGCCTTTGCGCGTGTGGAAATCAAGCACAGTTGGAGTAAGTGGAACCAGCCTGTAGACTATAA
GGAATGGGGCATGCCTGCTCATATGGTCAATGCCTACTACAATCCTCAGAAGAACCTGATTGTCTTTCCAGCGGCCATTT
TACAGGCGCCTTTCTATGACTTGCATCAGTCATCTTCTGCTAACTACGGTGGTATTGGGGCAGTGATTGCCCATGAAATT
TCCCACGCCTTTGATACTAACGGGGCTTCCTTTGACGAAAATGGTAGCCTCAAGGATTGGTGGACAGAGAGCGACTATGC
TGCCTTCAAGGAGAAAACACAAAAAGTCATTGACCAATTTGATGGACAGGATTCTTATGGAGCAACCATTAACGGTAAAT
TGACTGTATCAGAAAACGTGGCTGACTTGGGAGGAATCGCAGCAGCGCTTGAAGCAGCTAAGAGAGAAGCAGACTTCTCA
GCAGAAGAGTTCTTCTACAACTTCGGTCGCATCTGGCGCATGAAAGGTCGACCAGAATTTATGAAACTTTTGGCTAGCGT
CGATGTGCACGCACCAGCCAAACTCCGTGTCAATGTGCAAGTACCAAACTTCGACGATTTCTTTACAACCTATGATGTCA
AAGAAGGAGACGGAATGTGGCGTTCACCAGAGGAGCGCGTGATTATTTGGTAA

Upstream 100 bases:

>100_bases
TTTTATAATAGCTAAAAAATAATAGTTAAATGGTTATTAATTGCATTCCCTAGTGATTTTTGTTAAGATAAATGCAAATA
CAAATGAAAGCGAGAACAAG

Downstream 100 bases:

>100_bases
TACAAAAAATCTAGTCATCAGAAAAAAGGCATCCAATCACGTGTGAAAACCTTGATAGGATGCCTTTTGTAATAGAAAGA
TTTGCTGGATAGTTTACTTA

Product: endopeptidase O

Products: NA

Alternate protein names: ORF6 [H]

Number of amino acids: Translated: 630; Mature: 629
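
The residue counts follow from the gene length: 1893 bases are 631 codons, one of which is the stop codon, leaving 630 translated residues, and 629 once the initiator methionine is removed. The sketch below checks the reading frame on the first and last codons of the gene sequence. It assumes the standard codon assignments (the bacterial genetic code uses the same assignments and differs only in permitted start codons); `translate` is an illustrative helper.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGACACGTTATCAAGAT"))     # first six codons of the gene
print(translate("GAGCGCGTGATTATTTGGTAA"))  # last seven codons, including stop

translated_residues = 1893 // 3 - 1        # drop the stop codon
mature_residues = translated_residues - 1  # drop the initiator Met
print(translated_residues, mature_residues)
```

The two fragments translate to `MTRYQD` and `ERVIIW*`, which agree with the start and end of the protein sequence in this record.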

Sequences:

>Translated_630_residues
MTRYQDDFYDAINGEWQQTAEIPADKSQTGGFVDLDQEIEDLMLATTDKWLAGEEVPEDAILENFVKYHRLVRDFDKREA
DGITPVLPLLKEFQELETFADFTAKLAEFELAGKPNFLPFGVSPDFMDARINVLWASAPSTILPDTTYYAEEHPQREELL
TLWKESSANLLKAYDFSDEEIEDLLEKRLELDRRVVAVVLSNEESSEYAKLYHPYSYEDFKKFAPALPLDDFFKAVIGQL
PDKVIVDEERFWQAAEQFYSEEAWSLLKATLILSVVNLSTSYLTEDIRVLSGAYSRALSGVPEAKDKVKAAYHLAQEPFK
QALGLWYAREKFSPEAKADVEKKVATMIDVYKERLLKNDWLTPETCKQAIVKLNVIKPYIGYPEELPARYKDKVVNETAS
LFENALAFARVEIKHSWSKWNQPVDYKEWGMPAHMVNAYYNPQKNLIVFPAAILQAPFYDLHQSSSANYGGIGAVIAHEI
SHAFDTNGASFDENGSLKDWWTESDYAAFKEKTQKVIDQFDGQDSYGATINGKLTVSENVADLGGIAAALEAAKREADFS
AEEFFYNFGRIWRMKGRPEFMKLLASVDVHAPAKLRVNVQVPNFDDFFTTYDVKEGDGMWRSPEERVIIW
>Mature_629_residues
TRYQDDFYDAINGEWQQTAEIPADKSQTGGFVDLDQEIEDLMLATTDKWLAGEEVPEDAILENFVKYHRLVRDFDKREAD
GITPVLPLLKEFQELETFADFTAKLAEFELAGKPNFLPFGVSPDFMDARINVLWASAPSTILPDTTYYAEEHPQREELLT
LWKESSANLLKAYDFSDEEIEDLLEKRLELDRRVVAVVLSNEESSEYAKLYHPYSYEDFKKFAPALPLDDFFKAVIGQLP
DKVIVDEERFWQAAEQFYSEEAWSLLKATLILSVVNLSTSYLTEDIRVLSGAYSRALSGVPEAKDKVKAAYHLAQEPFKQ
ALGLWYAREKFSPEAKADVEKKVATMIDVYKERLLKNDWLTPETCKQAIVKLNVIKPYIGYPEELPARYKDKVVNETASL
FENALAFARVEIKHSWSKWNQPVDYKEWGMPAHMVNAYYNPQKNLIVFPAAILQAPFYDLHQSSSANYGGIGAVIAHEIS
HAFDTNGASFDENGSLKDWWTESDYAAFKEKTQKVIDQFDGQDSYGATINGKLTVSENVADLGGIAAALEAAKREADFSA
EEFFYNFGRIWRMKGRPEFMKLLASVDVHAPAKLRVNVQVPNFDDFFTTYDVKEGDGMWRSPEERVIIW

Specific function: Unknown

COG id: COG3590

COG function: function code O; Predicted metalloendopeptidase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M13 family [H]

Homologues:

Organism=Homo sapiens, GI239049391, Length=679, Percent_Identity=27.540500736377, Blast_Score=210, Evalue=4e-54,
Organism=Homo sapiens, GI164519140, Length=663, Percent_Identity=25.9426847662142, Blast_Score=208, Evalue=1e-53,
Organism=Homo sapiens, GI4503443, Length=663, Percent_Identity=25.9426847662142, Blast_Score=208, Evalue=2e-53,
Organism=Homo sapiens, GI164519136, Length=663, Percent_Identity=25.9426847662142, Blast_Score=207, Evalue=2e-53,
Organism=Homo sapiens, GI164519138, Length=663, Percent_Identity=25.9426847662142, Blast_Score=207, Evalue=2e-53,
Organism=Homo sapiens, GI157426891, Length=665, Percent_Identity=25.5639097744361, Blast_Score=206, Evalue=5e-53,
Organism=Homo sapiens, GI116256333, Length=683, Percent_Identity=26.7935578330893, Blast_Score=204, Evalue=2e-52,
Organism=Homo sapiens, GI116256331, Length=683, Percent_Identity=26.7935578330893, Blast_Score=204, Evalue=2e-52,
Organism=Homo sapiens, GI116256329, Length=683, Percent_Identity=26.7935578330893, Blast_Score=204, Evalue=2e-52,
Organism=Homo sapiens, GI116256327, Length=683, Percent_Identity=26.7935578330893, Blast_Score=204, Evalue=2e-52,
Organism=Homo sapiens, GI153945761, Length=542, Percent_Identity=26.9372693726937, Blast_Score=199, Evalue=9e-51,
Organism=Homo sapiens, GI153945836, Length=542, Percent_Identity=26.9372693726937, Blast_Score=199, Evalue=1e-50,
Organism=Homo sapiens, GI82617560, Length=542, Percent_Identity=26.9372693726937, Blast_Score=198, Evalue=1e-50,
Organism=Homo sapiens, GI153945771, Length=542, Percent_Identity=26.9372693726937, Blast_Score=198, Evalue=1e-50,
Organism=Homo sapiens, GI90403592, Length=573, Percent_Identity=23.2111692844677, Blast_Score=136, Evalue=7e-32,
Organism=Homo sapiens, GI4557691, Length=482, Percent_Identity=24.6887966804979, Blast_Score=103, Evalue=4e-22,
Organism=Caenorhabditis elegans, GI86562497, Length=680, Percent_Identity=24.1176470588235, Blast_Score=190, Evalue=2e-48,
Organism=Caenorhabditis elegans, GI25148650, Length=701, Percent_Identity=25.2496433666191, Blast_Score=175, Evalue=7e-44,
Organism=Caenorhabditis elegans, GI17533333, Length=484, Percent_Identity=24.5867768595041, Blast_Score=172, Evalue=6e-43,
Organism=Caenorhabditis elegans, GI32563993, Length=403, Percent_Identity=25.8064516129032, Blast_Score=160, Evalue=2e-39,
Organism=Caenorhabditis elegans, GI71987442, Length=541, Percent_Identity=24.9537892791128, Blast_Score=155, Evalue=5e-38,
Organism=Caenorhabditis elegans, GI17564342, Length=319, Percent_Identity=30.0940438871473, Blast_Score=146, Evalue=3e-35,
Organism=Caenorhabditis elegans, GI17534401, Length=372, Percent_Identity=26.8817204301075, Blast_Score=138, Evalue=1e-32,
Organism=Caenorhabditis elegans, GI17533319, Length=488, Percent_Identity=24.3852459016393, Blast_Score=110, Evalue=3e-24,
Organism=Caenorhabditis elegans, GI17570047, Length=382, Percent_Identity=24.0837696335079, Blast_Score=108, Evalue=1e-23,
Organism=Caenorhabditis elegans, GI71994787, Length=316, Percent_Identity=26.8987341772152, Blast_Score=102, Evalue=7e-22,
Organism=Caenorhabditis elegans, GI71988640, Length=325, Percent_Identity=26.7692307692308, Blast_Score=100, Evalue=2e-21,
Organism=Caenorhabditis elegans, GI71998977, Length=200, Percent_Identity=32.5, Blast_Score=97, Evalue=2e-20,
Organism=Caenorhabditis elegans, GI71998975, Length=200, Percent_Identity=32.5, Blast_Score=97, Evalue=2e-20,
Organism=Caenorhabditis elegans, GI17532485, Length=315, Percent_Identity=27.6190476190476, Blast_Score=87, Evalue=2e-17,
Organism=Caenorhabditis elegans, GI71986886, Length=204, Percent_Identity=28.4313725490196, Blast_Score=76, Evalue=4e-14,
Organism=Caenorhabditis elegans, GI193204436, Length=313, Percent_Identity=23.6421725239617, Blast_Score=72, Evalue=1e-12,
Organism=Caenorhabditis elegans, GI17534885, Length=338, Percent_Identity=23.3727810650888, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24643425, Length=666, Percent_Identity=26.5765765765766, Blast_Score=233, Evalue=3e-61,
Organism=Drosophila melanogaster, GI24640050, Length=670, Percent_Identity=25.9701492537313, Blast_Score=211, Evalue=9e-55,
Organism=Drosophila melanogaster, GI24640052, Length=670, Percent_Identity=25.9701492537313, Blast_Score=211, Evalue=9e-55,
Organism=Drosophila melanogaster, GI17737761, Length=472, Percent_Identity=27.9661016949153, Blast_Score=188, Evalue=9e-48,
Organism=Drosophila melanogaster, GI45551938, Length=542, Percent_Identity=25.830258302583, Blast_Score=186, Evalue=4e-47,
Organism=Drosophila melanogaster, GI45550777, Length=541, Percent_Identity=25.3234750462107, Blast_Score=185, Evalue=7e-47,
Organism=Drosophila melanogaster, GI24650487, Length=407, Percent_Identity=26.044226044226, Blast_Score=139, Evalue=9e-33,
Organism=Drosophila melanogaster, GI221474862, Length=342, Percent_Identity=28.3625730994152, Blast_Score=128, Evalue=1e-29,
Organism=Drosophila melanogaster, GI24650765, Length=333, Percent_Identity=29.4294294294294, Blast_Score=117, Evalue=2e-26,
Organism=Drosophila melanogaster, GI24650889, Length=680, Percent_Identity=21.1764705882353, Blast_Score=106, Evalue=6e-23,
Organism=Drosophila melanogaster, GI21355943, Length=329, Percent_Identity=24.9240121580547, Blast_Score=96, Evalue=6e-20,
Organism=Drosophila melanogaster, GI281362760, Length=212, Percent_Identity=28.7735849056604, Blast_Score=95, Evalue=2e-19,
Organism=Drosophila melanogaster, GI24650885, Length=545, Percent_Identity=23.8532110091743, Blast_Score=92, Evalue=1e-18,
Organism=Drosophila melanogaster, GI24641622, Length=244, Percent_Identity=27.4590163934426, Blast_Score=76, Evalue=7e-14,
Organism=Drosophila melanogaster, GI24649148, Length=348, Percent_Identity=21.551724137931, Blast_Score=66, Evalue=9e-11,
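
The homologue entries above share a regular layout: comma-separated `key=value` items, a bare `GI…` identifier, and a trailing comma. A minimal parsing sketch under that assumption follows; `parse_homologue` and the choice of field types are illustrative, not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for item in line.strip().rstrip(",").split(","):
        item = item.strip()
        if "=" in item:
            key, value = item.split("=", 1)
            record[key] = value
        elif item.startswith("GI"):
            # The GI number is written without "=", e.g. "GI239049391".
            record["GI"] = item[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Homo sapiens, GI239049391, Length=679, "
        "Percent_Identity=27.540500736377, Blast_Score=210, Evalue=4e-54,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

Parsed records can then be sorted or filtered, for example by `Evalue` or `Blast_Score`, to rank hits across organisms.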

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000718
- InterPro:   IPR018497
- InterPro:   IPR008753 [H]

Pfam domain/function: PF01431 Peptidase_M13; PF05649 Peptidase_M13_N [H]

EC number: NA

Molecular weight: Translated: 71927; Mature: 71796

Theoretical pI: Translated: 4.45; Mature: 4.45

Prosite motif: PS00142 ZINC_PROTEASE
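
The PROSITE zinc protease signature is built around the HExxH zinc-binding motif. The sketch below searches a fragment of the protein sequence for that core motif. It uses the simplified regular expression `HE..H`, which is an assumption made for illustration and is looser than the full PROSITE pattern.

```python
import re

# Simplified core of the zinc-binding signature: H-E-x-x-H.
HEXXH = re.compile(r"HE..H")

# Fragment taken from the protein sequence in this record.
fragment = "GGIGAVIAHEISHAFDTNG"

match = HEXXH.search(fragment)
if match:
    print(match.group(), match.start())
```

In this fragment the motif is `HEISH`, starting at 0-based offset 8. A match on the simplified expression alone does not establish a PROSITE hit; the full pattern places additional constraints on the flanking residues.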

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
1.4 %Cys+Met (Mature Protein)
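
These percentages are residue counts divided by sequence length. The mature values suggest that the combined figure is computed from raw counts rather than by adding the rounded values (0.2 + 1.3 would give 1.5, not 1.4). The sketch below follows that assumption; `cys_met_percent` is a hypothetical helper, shown on the first ten residues of the protein only.

```python
def cys_met_percent(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met, each rounded to one decimal place."""
    seq = protein.upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")

    def pct(count: int) -> float:
        return round(100 * count / len(seq), 1)

    # The combined value is derived from raw counts, not from rounded parts.
    return {"Cys": pct(cys), "Met": pct(met), "Cys+Met": pct(cys + met)}


print(cys_met_percent("MTRYQDDFYD"))  # first ten residues, as a small example
```

Running the function on the translated and mature sequences separately should reproduce the two sets of values if the database uses the same definition.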

Secondary structure:

>Translated Secondary Structure
MTRYQDDFYDAINGEWQQTAEIPADKSQTGGFVDLDQEIEDLMLATTDKWLAGEEVPEDA
CCCCCHHHHHHHCCCHHHHHCCCCCCCCCCCEEECHHHHHHHHHHCCCCCCCCCCCCHHH
ILENFVKYHRLVRDFDKREADGITPVLPLLKEFQELETFADFTAKLAEFELAGKPNFLPF
HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCEEEC
GVSPDFMDARINVLWASAPSTILPDTTYYAEEHPQREELLTLWKESSANLLKAYDFSDEE
CCCCCCCCCEEEEEEECCCCCCCCCCCHHHCCCCCHHHHHHHHHHCCCCEEEECCCCHHH
IEDLLEKRLELDRRVVAVVLSNEESSEYAKLYHPYSYEDFKKFAPALPLDDFFKAVIGQL
HHHHHHHHHHHHHHEEEEEECCCCCHHHHHHCCCCCHHHHHHHCCCCCHHHHHHHHHHHC
PDKVIVDEERFWQAAEQFYSEEAWSLLKATLILSVVNLSTSYLTEDIRVLSGAYSRALSG
CCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
VPEAKDKVKAAYHLAQEPFKQALGLWYAREKFSPEAKADVEKKVATMIDVYKERLLKNDW
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCC
LTPETCKQAIVKLNVIKPYIGYPEELPARYKDKVVNETASLFENALAFARVEIKHSWSKW
CCHHHHHHHHHHHHHHCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
NQPVDYKEWGMPAHMVNAYYNPQKNLIVFPAAILQAPFYDLHQSSSANYGGIGAVIAHEI
CCCCCHHHHCCCHHHHHHEECCCCCEEEECHHHHHCCCHHHHCCCCCCCCCHHHHHHHHH
SHAFDTNGASFDENGSLKDWWTESDYAAFKEKTQKVIDQFDGQDSYGATINGKLTVSENV
HHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEECCEEEECCCH
ADLGGIAAALEAAKREADFSAEEFFYNFGRIWRMKGRPEFMKLLASVDVHAPAKLRVNVQ
HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHEEEECCCHHHHHHHHHCCCCCCCEEEEEEE
VPNFDDFFTTYDVKEGDGMWRSPEERVIIW
CCCCCCCCEEECCCCCCCCCCCCCCCEEEC
>Mature Secondary Structure
TRYQDDFYDAINGEWQQTAEIPADKSQTGGFVDLDQEIEDLMLATTDKWLAGEEVPEDA
CCCCHHHHHHHCCCHHHHHCCCCCCCCCCCEEECHHHHHHHHHHCCCCCCCCCCCCHHH
ILENFVKYHRLVRDFDKREADGITPVLPLLKEFQELETFADFTAKLAEFELAGKPNFLPF
HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCEEEC
GVSPDFMDARINVLWASAPSTILPDTTYYAEEHPQREELLTLWKESSANLLKAYDFSDEE
CCCCCCCCCEEEEEEECCCCCCCCCCCHHHCCCCCHHHHHHHHHHCCCCEEEECCCCHHH
IEDLLEKRLELDRRVVAVVLSNEESSEYAKLYHPYSYEDFKKFAPALPLDDFFKAVIGQL
HHHHHHHHHHHHHHEEEEEECCCCCHHHHHHCCCCCHHHHHHHCCCCCHHHHHHHHHHHC
PDKVIVDEERFWQAAEQFYSEEAWSLLKATLILSVVNLSTSYLTEDIRVLSGAYSRALSG
CCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
VPEAKDKVKAAYHLAQEPFKQALGLWYAREKFSPEAKADVEKKVATMIDVYKERLLKNDW
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCC
LTPETCKQAIVKLNVIKPYIGYPEELPARYKDKVVNETASLFENALAFARVEIKHSWSKW
CCHHHHHHHHHHHHHHCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
NQPVDYKEWGMPAHMVNAYYNPQKNLIVFPAAILQAPFYDLHQSSSANYGGIGAVIAHEI
CCCCCHHHHCCCHHHHHHEECCCCCEEEECHHHHHCCCHHHHCCCCCCCCCHHHHHHHHH
SHAFDTNGASFDENGSLKDWWTESDYAAFKEKTQKVIDQFDGQDSYGATINGKLTVSENV
HHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEECCEEEECCCH
ADLGGIAAALEAAKREADFSAEEFFYNFGRIWRMKGRPEFMKLLASVDVHAPAKLRVNVQ
HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHEEEECCCHHHHHHHHHCCCCCCCEEEEEEE
VPNFDDFFTTYDVKEGDGMWRSPEERVIIW
CCCCCCCCEEECCCCCCCCCCCCCCCEEEC
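
The prediction lines pair each residue with a one-letter state. The record does not define the letters; the sketch below assumes the common three-state convention (H = helix, E = strand, C = coil) and summarizes a prediction string as percentages. `ss_composition` is an illustrative helper, shown on a short made-up string rather than on the full prediction.

```python
from collections import Counter

# Assumed meaning of the state letters (not stated in the record).
STATE_NAMES = {"H": "helix", "E": "strand", "C": "coil"}


def ss_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states.upper())
    return {
        name: round(100 * counts[letter] / len(states), 1)
        for letter, name in STATE_NAMES.items()
    }


print(ss_composition("HHHHEECCCC"))  # toy input: 4 H, 2 E, 4 C
```

To summarize the full prediction, concatenate the state lines (every second line of each block above) before calling the function.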

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7927711 [H]