Definition: Yersinia pestis CO92 chromosome, complete genome.
Accession: NC_003143
Length: 4,653,728 bp

The map label for this gene is degS [H]

Identifier: 218930580

GI number: 218930580

Start: 3980992

End: 3982080

Strand: Direct

Name: degS [H]

Synonym: YPO3568

Alternate gene names: 218930580

Gene position: 3980992-3982080 (Clockwise)

Preceding gene: 218930578

Following gene: 218930588

Centisome position: 85.54
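
The coordinate fields above are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the gene start expressed as a percentage of the genome length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    # Values taken from the Start, End and genome Length fields of this record.
    print(gene_length(3980992, 3982080))             # 1089
    print(round(centisome(3980992, 4653728), 2))     # 85.54
```

Under these assumptions the results agree with the 1089-base gene sequence and the centisome value listed in the record.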

GC content: 48.12
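
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch, assuming the value above was computed that way over the 1089-base gene sequence (48.12% would then correspond to roughly 524 G or C bases); the function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Toy input; paste the gene sequence from this record to check its value.
    print(gc_content("ATGC"))  # 50.0
```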

Gene sequence:

>1089_bases
ATGTTTCTTAAGCTATTGCGTTCTATTATTTTGGGGCTAATTGTTGCCGGTATTCTGCTGGTTGCCCTACCCATGCTCCG
CAGCCCAGGTTATTTATTCTCTGGAAAAAGCAATAACGTAAATGAAGAGGTTCCTACCAGTTATAACCAAGCAGTACGTC
GTGCCGCACCGGCGGTGGTCAATGTCTATAACCGGAGCCTGAGCGCTACACAGCAGGGGTTAGCCATCCGCACGCTGGGC
TCGGGTGTGATCATGAGCGATAAGGGCTATATCCTTACCAATAAACACGTTATCAATGATGCAGAACAGATCATTGTCGC
CATGCAAAATGGCCGTATCTCAGAAGCTTTATTGGTCGGTTCAGATAATCTGACAGATTTAGCCGTCCTAAAGATTGACG
CAACAAACCTGCCGGTGATCCCCATTAATATTAACCGCACACCACATATTGGTGACGTCGTGTTGGCAATTGGTAACCCT
TATAACCTTGGGCAGACAGTAACGCAGGGGATTATCAGTGCAACCGGGCGTATTGGTTTAAGCTCTTCCGGGCGGCAAAA
TTTCCTGCAAACAGATGCATCAATTAATCAGGGTAATTCCGGCGGTGCGCTGGTCAACACCCTTGGCGAGCTAATGGGGA
TCAACACGCTCTCATTTGATAAAAGCAATAATGGCGAAACACCGGAAGGCATCGGCTTTGCGATCCCAACAGCACTGGCA
ACGAAAGTGATGGAAAAACTGATCCGTGATGGGCGGGTGATCCGTGGTTATATCGGTATTACCGGCGAGGAGTACCCACC
GTTTAATGCTAACGATAATGGCTCAGATCGGGTACACGGTATTAAGGTCAAAAAAGTTTCACCAGACGGCCCAGCGGCCC
AGGCAGGAATACACGTTGGCGATATCATTCTTAACGTGAATAATAAACCGGCAACCTCCGTGATCGAAACTATGGATCAG
GTCGCAGAAGTCCGCCCTGGTACGACCATTCCTGTTTTACTATTACGTAATGGTCAGCAGATAGCGGTTCAAATCACCAT
CACTGAACTCGATCAGAATGAGATGCTGACCACCCAAGCAGCAGATTAA

Upstream 100 bases:

>100_bases
TGGGGGTTTTCTGGGCACAAAAATCGGACACAGCACGATGCTGTGTCCGCCTAACTCATGCTATTCTCCTTTTCACCCAT
TCATTTATGCCTTAAACCTC

Downstream 100 bases:

>100_bases
CGCGCAAGCAGCCAATACCCGTGTCACTTAACGGTATGACGGGTATGTATAAAGAATAATGGGTACCTTGACGCCGATGA
ATATAAACATAAAAAATGCC

Product: serine endoprotease

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 362; Mature: 362
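
The residue count follows from the gene length: 1089 bases are 363 codons, and the last codon (TAA) is a stop, leaving 362 residues. The sketch below translates a coding sequence with the standard codon assignments, which is an assumption here (the record does not name a translation table); `translate` is an illustrative helper, not a tool used by the source database.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the gene sequence above, followed by its stop codon.
    print(translate("ATGTTTCTTAAGCTATTGCGTTCTTAA"))  # MFLKLLRS
    print(1089 // 3 - 1)                             # 362
```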

Protein sequence:

>362_residues
MFLKLLRSIILGLIVAGILLVALPMLRSPGYLFSGKSNNVNEEVPTSYNQAVRRAAPAVVNVYNRSLSATQQGLAIRTLG
SGVIMSDKGYILTNKHVINDAEQIIVAMQNGRISEALLVGSDNLTDLAVLKIDATNLPVIPININRTPHIGDVVLAIGNP
YNLGQTVTQGIISATGRIGLSSSGRQNFLQTDASINQGNSGGALVNTLGELMGINTLSFDKSNNGETPEGIGFAIPTALA
TKVMEKLIRDGRVIRGYIGITGEEYPPFNANDNGSDRVHGIKVKKVSPDGPAAQAGIHVGDIILNVNNKPATSVIETMDQ
VAEVRPGTTIPVLLLRNGQQIAVQITITELDQNEMLTTQAAD

Sequences:

>Translated_362_residues
MFLKLLRSIILGLIVAGILLVALPMLRSPGYLFSGKSNNVNEEVPTSYNQAVRRAAPAVVNVYNRSLSATQQGLAIRTLG
SGVIMSDKGYILTNKHVINDAEQIIVAMQNGRISEALLVGSDNLTDLAVLKIDATNLPVIPININRTPHIGDVVLAIGNP
YNLGQTVTQGIISATGRIGLSSSGRQNFLQTDASINQGNSGGALVNTLGELMGINTLSFDKSNNGETPEGIGFAIPTALA
TKVMEKLIRDGRVIRGYIGITGEEYPPFNANDNGSDRVHGIKVKKVSPDGPAAQAGIHVGDIILNVNNKPATSVIETMDQ
VAEVRPGTTIPVLLLRNGQQIAVQITITELDQNEMLTTQAAD
>Mature_362_residues
MFLKLLRSIILGLIVAGILLVALPMLRSPGYLFSGKSNNVNEEVPTSYNQAVRRAAPAVVNVYNRSLSATQQGLAIRTLG
SGVIMSDKGYILTNKHVINDAEQIIVAMQNGRISEALLVGSDNLTDLAVLKIDATNLPVIPININRTPHIGDVVLAIGNP
YNLGQTVTQGIISATGRIGLSSSGRQNFLQTDASINQGNSGGALVNTLGELMGINTLSFDKSNNGETPEGIGFAIPTALA
TKVMEKLIRDGRVIRGYIGITGEEYPPFNANDNGSDRVHGIKVKKVSPDGPAAQAGIHVGDIILNVNNKPATSVIETMDQ
VAEVRPGTTIPVLLLRNGQQIAVQITITELDQNEMLTTQAAD

Specific function: Senses the outer membrane protein (OMP) signal triggered by the accumulation of unassembled porins in the envelope, then initiates RseA degradation by cleaving RseA in its periplasmic domain, which makes RseA a substrate for subsequent cleavage by RseP [H]

COG id: COG0265

COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain

Gene ontology:

Cell location: Periplasm (Potential) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PDZ (DHR) domain [H]

Homologues:

Organism=Homo sapiens, GI22129776, Length=352, Percent_Identity=33.52, Blast_Score=137, Evalue=2e-32
Organism=Homo sapiens, GI4506141, Length=317, Percent_Identity=33.75, Blast_Score=136, Evalue=4e-32
Organism=Homo sapiens, GI24308541, Length=296, Percent_Identity=34.12, Blast_Score=124, Evalue=1e-28
Organism=Homo sapiens, GI7019477, Length=286, Percent_Identity=31.12, Blast_Score=105, Evalue=5e-23
Organism=Escherichia coli, GI1789630, Length=351, Percent_Identity=71.23, Blast_Score=489, Evalue=1e-139
Organism=Escherichia coli, GI1789629, Length=271, Percent_Identity=45.39, Blast_Score=212, Evalue=4e-56
Organism=Escherichia coli, GI1786356, Length=281, Percent_Identity=42.70, Blast_Score=202, Evalue=3e-53
Organism=Drosophila melanogaster, GI24646839, Length=288, Percent_Identity=34.03, Blast_Score=127, Evalue=1e-29
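
Each homologue entry is a comma-separated run of `key=value` fields plus a bare GI token. A minimal parsing sketch follows; the function name and the choice to store the GI under a `"GI"` key are assumptions made for illustration, and the example line uses a rounded identity value.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict, tolerating a trailing comma."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            record[key] = value
        elif token.startswith("GI"):
            record["GI"] = token[2:]  # bare GI token, e.g. GI1789630
    # Convert the numeric fields; all other values stay as strings.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


if __name__ == "__main__":
    hit = parse_homologue(
        "Organism=Escherichia coli, GI1789630, Length=351, "
        "Percent_Identity=71.23, Blast_Score=489, Evalue=1e-139,"
    )
    print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed this way, the entries can be sorted by `Evalue` or `Percent_Identity`; in this list the closest hit is the Escherichia coli entry with roughly 71% identity.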

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001478
- InterPro:   IPR009003
- InterPro:   IPR011783
- InterPro:   IPR001254
- InterPro:   IPR001940 [H]

Pfam domain/function: PF00595 PDZ; PF00089 Trypsin [H]

EC number: 3.4.21.-

Molecular weight: Translated: 38374; Mature: 38374

Theoretical pI: Translated: 6.53; Mature: 6.53

Prosite motif: PS50106 PDZ

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)
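
The percentages above can be reproduced by counting residues in the protein sequence. This is a sketch under the assumption that each value is the residue count divided by the sequence length, rounded to one decimal; `residue_percent` is a hypothetical helper.

```python
# Protein sequence copied from this record (362 residues).
PROTEIN = (
    "MFLKLLRSIILGLIVAGILLVALPMLRSPGYLFSGKSNNVNEEVPTSYNQAVRRAAPAVVNVYNRSLSATQQGLAIRTLG"
    "SGVIMSDKGYILTNKHVINDAEQIIVAMQNGRISEALLVGSDNLTDLAVLKIDATNLPVIPININRTPHIGDVVLAIGNP"
    "YNLGQTVTQGIISATGRIGLSSSGRQNFLQTDASINQGNSGGALVNTLGELMGINTLSFDKSNNGETPEGIGFAIPTALA"
    "TKVMEKLIRDGRVIRGYIGITGEEYPPFNANDNGSDRVHGIKVKKVSPDGPAAQAGIHVGDIILNVNNKPATSVIETMDQ"
    "VAEVRPGTTIPVLLLRNGQQIAVQITITELDQNEMLTTQAAD"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return 100.0 * count / len(seq)


if __name__ == "__main__":
    print(round(residue_percent(PROTEIN, "C"), 1))   # 0.0
    print(round(residue_percent(PROTEIN, "M"), 1))   # 2.2
    print(round(residue_percent(PROTEIN, "CM"), 1))  # 2.2
```

The sequence contains 8 Met and no Cys, so 8/362 gives the 2.2% listed for both Met and Cys+Met.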

Secondary structure:

>Translated Secondary Structure
MFLKLLRSIILGLIVAGILLVALPMLRSPGYLFSGKSNNVNEEVPTSYNQAVRRAAPAVV
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHCCHHH
NVYNRSLSATQQGLAIRTLGSGVIMSDKGYILTNKHVINDAEQIIVAMQNGRISEALLVG
HHHHHHHHHHHCCEEEEEECCCEEECCCCEEEECCHHHCCHHEEEEEECCCCCEEEEEEC
SDNLTDLAVLKIDATNLPVIPININRTPHIGDVVLAIGNPYNLGQTVTQGIISATGRIGL
CCCCCEEEEEEEECCCCCEEEEECCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCC
SSSGRQNFLQTDASINQGNSGGALVNTLGELMGINTLSFDKSNNGETPEGIGFAIPTALA
CCCCCHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCHHHHHHH
TKVMEKLIRDGRVIRGYIGITGEEYPPFNANDNGSDRVHGIKVKKVSPDGPAAQAGIHVG
HHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCHHHCCEEEE
DIILNVNNKPATSVIETMDQVAEVRPGTTIPVLLLRNGQQIAVQITITELDQNEMLTTQA
EEEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCEEEEEEEEEECCCCCEEEECC
AD
CC
>Mature Secondary Structure
MFLKLLRSIILGLIVAGILLVALPMLRSPGYLFSGKSNNVNEEVPTSYNQAVRRAAPAVV
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHCCHHH
NVYNRSLSATQQGLAIRTLGSGVIMSDKGYILTNKHVINDAEQIIVAMQNGRISEALLVG
HHHHHHHHHHHCCEEEEEECCCEEECCCCEEEECCHHHCCHHEEEEEECCCCCEEEEEEC
SDNLTDLAVLKIDATNLPVIPININRTPHIGDVVLAIGNPYNLGQTVTQGIISATGRIGL
CCCCCEEEEEEEECCCCCEEEEECCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCC
SSSGRQNFLQTDASINQGNSGGALVNTLGELMGINTLSFDKSNNGETPEGIGFAIPTALA
CCCCCHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCHHHHHHH
TKVMEKLIRDGRVIRGYIGITGEEYPPFNANDNGSDRVHGIKVKKVSPDGPAAQAGIHVG
HHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCHHHCCEEEE
DIILNVNNKPATSVIETMDQVAEVRPGTTIPVLLLRNGQQIAVQITITELDQNEMLTTQA
EEEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCEEEEEEEEEECCCCCEEEECC
AD
CC
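
The prediction strings above pair each residue with a one-letter state. The sketch below summarises such a string as percentages, assuming the common convention H = helix, E = strand, C = coil (the record does not define the letters); the function name is illustrative.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    ss = "".join(ss.split()).upper()
    if not ss:
        raise ValueError("empty secondary-structure string")
    counts = Counter(ss)
    return {state: 100.0 * counts.get(state, 0) / len(ss) for state in "HEC"}


if __name__ == "__main__":
    # Toy input; join the state lines of the record to summarise the full prediction.
    print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

A prediction with substantial helix and strand fractions would be in line with the "Alpha Beta" structure class reported for this protein.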

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Serine endopeptidases [C]

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]