Definition | Vibrio cholerae M66-2 chromosome I, complete genome. |
---|---|
Accession | NC_012578 |
Length | 2,892,523 bp |
The map label for this gene is degS [H]
Identifier: 227080748
GI number: 227080748
Start: 555850
End: 556908
Strand: Reverse
Name: degS [H]
Synonym: VCM66_0523
Alternate gene names: 227080748
Gene position: 556908-555850 (Counterclockwise)
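The Start and End coordinates can be cross-checked against the 1059-base gene sequence and the 352-residue product. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive; the function name `gene_length` is hypothetical and not part of this record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return abs(end - start) + 1


length = gene_length(555850, 556908)
codons = length // 3  # includes the terminal stop codon

print(length, codons)  # 1059 353
```

353 codons correspond to 352 residues plus one stop codon, which agrees with the amino acid count listed in this record.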
Preceding gene: 227080749
Following gene: 227080742
Centisome position: 19.25
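The centisome value appears to be the gene position expressed as a percentage of the chromosome length. A minimal sketch, assuming the position used is the 5' end of the gene (coordinate 556908, since the gene lies on the reverse strand) and that the chromosome length is the 2,892,523 given at the top of the record; the function name `centisome` is hypothetical.

```python
def centisome(position: int, chromosome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / chromosome_length


print(round(centisome(556908, 2892523), 2))  # 19.25
```

Using the other coordinate (555850) gives 19.22 instead, so the listed value of 19.25 is consistent with the 5' end assumption.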
GC content: 49.01
Gene sequence:
>1059_bases ATGCTGAAATTTTGGGTTCGCTCAATCAGCCTTGGGTTGTTGGCTGCGATTGCCATTATTATGGTGACACCCTCACTACG CGCCAAATTAATGCCCGTTGTCGAACAACCACGCAACATCGGCGCTCTACAAATCTCATTTAATGAAGCGGTACGCAAAG CCGCCCCTGCCGTCGTCAATATTTATAACCGTAAATACAGCGAAAATGATCGCCGTAAACTCTCGATTCAAGGTTTAGGA TCCGGTGTCATTGTCAGCGAAAAAGGCTACATCATCACCAACTACCACGTCGTCGCGCAGGCCGATCAAATTGTCGTTGC TCTACAAGATGGGCGAGCCGCAGCAGCACAATTGGTGGGAAAAGATCGCCGTACCGATATTGCCGTATTACGCGTAGAAG GCACGGGTTTACCAGTGATTCCACTCAATCCAGATTACCATCCTAAAGTGGGGGACGTGGTGTTGGCGATTGGTAACCCT TACAACTTAGGGCAAACCACGACTTTCGGAATTATCTCGGCTACCGGACGTTCATCCATCAGCGCTGATGGTCGCCAAGC CTTTATTCAAACTGATGCCGCAATCAATGACGGCAACTCAGGTGGTGCATTGGTCAATACCCAAGGTGAACTGGTCGGCA TCAATACCGCCTCTTTTCAACAAGCCACCGATCTCGAAACTTACGGGATTTCGTTTGCGATTCCCTACTCTTTGGCCAGT AAAATTATGACCAAAATCATTGCTGATGGCCGCGTGATCCGCGGTTATATTGGCGTCGACGGTCAAGATATTAACTCGAT GACATCACGTTTGCTGGGGAATGAGCATGTCGGTGGGATCATTATTTTAGGGGTTGACCCGAATGGACCCGCAGCCCGAG CAGGCTTTCTGGAGCAAGATATTTTGCTGAAAATCGACGGTAAAAAAATTAATGGTCGCCAGAATGTCACAGATACCGTC ACCGATCTTCGCCCCGGCACTGTGGTGGATTTCACCCTACTGCGTAAGGGTGAAGAGATTGTACTCCCAGTTACGATTGG TGAAGACACTCGTGATTAG
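The GC content field (49.01) is presumably the percentage of G and C bases in the gene sequence above. A small sketch of how such a value could be recomputed is shown here on a toy input; the function name `gc_content` is hypothetical, and the method used by the database itself is not stated in this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input; whitespace such as the spaces between sequence chunks is skipped.
print(gc_content("ATGC TGAA"))  # 37.5
```

Pasting the full 1059-base sequence into `gc_content` would allow a comparison with the listed value.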
Upstream 100 bases:
>100_bases TATTCATAAGCTACAGATGCAACTTTGGCGAAATGCCTTTCCAGTGTTATTCTTGCCCACTCGCAGTGAGGAATTTAGTC AATATTGAGTTGGGGAAACT
Downstream 100 bases:
>100_bases ACGAGTGTGATTCTGAAAGGTTCCACTCATCGACGATGAGACCTCGACGTTAAAAAATGGAGCACAAAAAAGCGGAGCCC TTGGGCTCCGCTTTTTACTA
Product: protease DegS
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 352; Mature: 352
Protein sequence:
>352_residues MLKFWVRSISLGLLAAIAIIMVTPSLRAKLMPVVEQPRNIGALQISFNEAVRKAAPAVVNIYNRKYSENDRRKLSIQGLG SGVIVSEKGYIITNYHVVAQADQIVVALQDGRAAAAQLVGKDRRTDIAVLRVEGTGLPVIPLNPDYHPKVGDVVLAIGNP YNLGQTTTFGIISATGRSSISADGRQAFIQTDAAINDGNSGGALVNTQGELVGINTASFQQATDLETYGISFAIPYSLAS KIMTKIIADGRVIRGYIGVDGQDINSMTSRLLGNEHVGGIIILGVDPNGPAARAGFLEQDILLKIDGKKINGRQNVTDTV TDLRPGTVVDFTLLRKGEEIVLPVTIGEDTRD
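The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last six codons of the gene. This is a sketch only, assuming the standard genetic code (the sense codon assignments of the bacterial table are the same); the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace."""
    dna = "".join(dna.upper().split())
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGCTGAAATTTTGGGTT"))  # MLKFWV (first six codons)
print(translate("GAAGACACTCGTGATTAG"))  # EDTRD* (last six codons)
```

The two fragments reproduce the start (MLKFWV) and end (EDTRD, followed by the stop codon) of the protein sequence above. The gene sequence is already given on the coding strand, so no reverse complement is needed here.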
Sequences:
>Translated_352_residues MLKFWVRSISLGLLAAIAIIMVTPSLRAKLMPVVEQPRNIGALQISFNEAVRKAAPAVVNIYNRKYSENDRRKLSIQGLG SGVIVSEKGYIITNYHVVAQADQIVVALQDGRAAAAQLVGKDRRTDIAVLRVEGTGLPVIPLNPDYHPKVGDVVLAIGNP YNLGQTTTFGIISATGRSSISADGRQAFIQTDAAINDGNSGGALVNTQGELVGINTASFQQATDLETYGISFAIPYSLAS KIMTKIIADGRVIRGYIGVDGQDINSMTSRLLGNEHVGGIIILGVDPNGPAARAGFLEQDILLKIDGKKINGRQNVTDTV TDLRPGTVVDFTLLRKGEEIVLPVTIGEDTRD
>Mature_352_residues MLKFWVRSISLGLLAAIAIIMVTPSLRAKLMPVVEQPRNIGALQISFNEAVRKAAPAVVNIYNRKYSENDRRKLSIQGLG SGVIVSEKGYIITNYHVVAQADQIVVALQDGRAAAAQLVGKDRRTDIAVLRVEGTGLPVIPLNPDYHPKVGDVVLAIGNP YNLGQTTTFGIISATGRSSISADGRQAFIQTDAAINDGNSGGALVNTQGELVGINTASFQQATDLETYGISFAIPYSLAS KIMTKIIADGRVIRGYIGVDGQDINSMTSRLLGNEHVGGIIILGVDPNGPAARAGFLEQDILLKIDGKKINGRQNVTDTV TDLRPGTVVDFTLLRKGEEIVLPVTIGEDTRD
Specific function: Senses the OMP signal triggered by the accumulation of unassembled porins in the envelope, then initiates RseA degradation by cleaving RseA in its periplasmic domain, which makes it an attractive substrate for subsequent cleavage by RseP [H]
COG id: COG0265
COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
Gene ontology:
Cell location: Periplasm (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 PDZ (DHR) domain [H]
Homologues:
Organism=Homo sapiens, GI4506141, Length=326, Percent_Identity=34.6625766871166, Blast_Score=153, Evalue=2e-37
Organism=Homo sapiens, GI22129776, Length=311, Percent_Identity=34.4051446945338, Blast_Score=130, Evalue=3e-30
Organism=Homo sapiens, GI24308541, Length=325, Percent_Identity=31.0769230769231, Blast_Score=115, Evalue=6e-26
Organism=Homo sapiens, GI7019477, Length=317, Percent_Identity=32.1766561514196, Blast_Score=105, Evalue=6e-23
Organism=Escherichia coli, GI1789630, Length=351, Percent_Identity=51.2820512820513, Blast_Score=329, Evalue=1e-91
Organism=Escherichia coli, GI1789629, Length=355, Percent_Identity=35.7746478873239, Blast_Score=194, Evalue=1e-50
Organism=Escherichia coli, GI1786356, Length=276, Percent_Identity=40.5797101449275, Blast_Score=192, Evalue=3e-50
Organism=Drosophila melanogaster, GI24646839, Length=374, Percent_Identity=29.4117647058824, Blast_Score=114, Evalue=1e-25
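Each homologue entry is a comma-separated group of six fields (organism, GI number, alignment length, percent identity, BLAST score, E-value). A minimal sketch of parsing one such entry into typed values follows; the function name `parse_homologue` and the dictionary layout are assumptions made for illustration, not a format defined by the database.

```python
def parse_homologue(record: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue entry."""
    fields = [f.strip() for f in record.strip().rstrip(",").split(",")]
    parsed = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            parsed[key] = value
        elif field.startswith("GI"):
            # The GI number is the only field written without a key.
            parsed["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        parsed[key] = int(parsed[key])
    for key in ("Percent_Identity", "Evalue"):
        parsed[key] = float(parsed[key])
    return parsed


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789630, Length=351, "
    "Percent_Identity=51.2820512820513, Blast_Score=329, Evalue=1e-91,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1789630 51.3
```

Once parsed, the entries can be sorted by `Evalue` or `Blast_Score`; in the list above, the closest hit is the Escherichia coli entry with 51.3% identity.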
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001478
- InterPro: IPR009003
- InterPro: IPR011783
- InterPro: IPR001254
- InterPro: IPR001940 [H]
Pfam domain/function: PF00595 PDZ; PF00089 Trypsin [H]
EC number: 3.4.21.-
Molecular weight: Translated: 37556; Mature: 37556
Theoretical pI: Translated: 8.70; Mature: 8.70
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
1.4 %Met (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
1.4 %Cys+Met (Mature Protein)
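The Cys/Met percentages can be recomputed from the 352-residue protein sequence given earlier in this record. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal place; the function name `residue_percent` is hypothetical.

```python
PROTEIN = (
    "MLKFWVRSISLGLLAAIAIIMVTPSLRAKLMPVVEQPRNIGALQISFNEAVRKAAPAVVNIYNRKYSENDRRKLSIQGLG"
    "SGVIVSEKGYIITNYHVVAQADQIVVALQDGRAAAAQLVGKDRRTDIAVLRVEGTGLPVIPLNPDYHPKVGDVVLAIGNP"
    "YNLGQTTTFGIISATGRSSISADGRQAFIQTDAAINDGNSGGALVNTQGELVGINTASFQQATDLETYGISFAIPYSLAS"
    "KIMTKIIADGRVIRGYIGVDGQDINSMTSRLLGNEHVGGIIILGVDPNGPAARAGFLEQDILLKIDGKKINGRQNVTDTV"
    "TDLRPGTVVDFTLLRKGEEIVLPVTIGEDTRD"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"),
      residue_percent(PROTEIN, "M"),
      residue_percent(PROTEIN, "CM"))  # 0.0 1.4 1.4
```

The sequence contains five methionines and no cysteine, and 5 out of 352 residues is about 1.4%, which matches the listed values.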
Secondary structure:
>Translated Secondary Structure
MLKFWVRSISLGLLAAIAIIMVTPSLRAKLMPVVEQPRNIGALQISFNEAVRKAAPAVVN
CCEEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCEEEEEEHHHHHHHHCCCEEE
IYNRKYSENDRRKLSIQGLGSGVIVSEKGYIITNYHVVAQADQIVVALQDGRAAAAQLVG
EECCCCCCCCCEEEEEEECCCEEEEECCCEEEEEEEEEEECCEEEEEEECCCHHHHHHHC
KDRRTDIAVLRVEGTGLPVIPLNPDYHPKVGDVVLAIGNPYNLGQTTTFGIISATGRSSI
CCCCCEEEEEEECCCCEEEEECCCCCCCCCCCEEEEECCCCCCCCEEEEEEEEECCCCCC
SADGRQAFIQTDAAINDGNSGGALVNTQGELVGINTASFQQATDLETYGISFAIPYSLAS
CCCCCEEEEEECCEECCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCEEEEEHHHHHH
KIMTKIIADGRVIRGYIGVDGQDINSMTSRLLGNEHVGGIIILGVDPNGPAARAGFLEQD
HHHHHHHCCCEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCCHHCCCCCCC
ILLKIDGKKINGRQNVTDTVTDLRPGTVVDFTLLRKGEEIVLPVTIGEDTRD
EEEEECCEEECCCCCCCHHHHHCCCCCEEEEEEECCCCEEEEEEEECCCCCC
>Mature Secondary Structure
MLKFWVRSISLGLLAAIAIIMVTPSLRAKLMPVVEQPRNIGALQISFNEAVRKAAPAVVN
CCEEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCEEEEEEHHHHHHHHCCCEEE
IYNRKYSENDRRKLSIQGLGSGVIVSEKGYIITNYHVVAQADQIVVALQDGRAAAAQLVG
EECCCCCCCCCEEEEEEECCCEEEEECCCEEEEEEEEEEECCEEEEEEECCCHHHHHHHC
KDRRTDIAVLRVEGTGLPVIPLNPDYHPKVGDVVLAIGNPYNLGQTTTFGIISATGRSSI
CCCCCEEEEEEECCCCEEEEECCCCCCCCCCCEEEEECCCCCCCCEEEEEEEEECCCCCC
SADGRQAFIQTDAAINDGNSGGALVNTQGELVGINTASFQQATDLETYGISFAIPYSLAS
CCCCCEEEEEECCEECCCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCEEEEEHHHHHH
KIMTKIIADGRVIRGYIGVDGQDINSMTSRLLGNEHVGGIIILGVDPNGPAARAGFLEQD
HHHHHHHCCCEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCCHHCCCCCCC
ILLKIDGKKINGRQNVTDTVTDLRPGTVVDFTLLRKGEEIVLPVTIGEDTRD
EEEEECCEEECCCCCCCHHHHHCCCCCEEEEEEECCCCEEEEEEEECCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Serine endopeptidases [C]
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]