Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is degQ
Identifier: 157162712
GI number: 157162712
Start: 3424331
End: 3425698
Strand: Direct
Name: degQ
Synonym: EcHS_A3423
Alternate gene names: 157162712
Gene position: 3424331-3425698 (Clockwise)
Preceding gene: 157162711
Following gene: 157162713
Centisome position: 73.74
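The coordinate fields above are related by simple arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either convention, but both reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538   # Length
START = 3_424_331           # Start
END = 3_425_698             # End


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 1368
print(centisome(START, GENOME_LENGTH))  # 73.74
```

The computed length of 1368 matches the `>1368_bases` header of the gene sequence.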
GC content: 52.05
Gene sequence:
>1368_bases
ATGAAAAAACAAACCCAGCTGTTGAGTGCATTAGCGTTAAGTGTCGGGTTAACTCTCTCGGCGTCATTTCAGGCCGTCGC
GTCGATTCCAGGCCAGGTTGCCGATCAGGCCCCTCTCCCCAGTCTGGCTCCAATGCTGGAAAAAGTGCTTCCGGCAGTGG
TGAGCGTACGGGTGGAAGGAACGGCCAGTCAGGGACAGAAAATCCCGGAAGAATTCAAAAAGTTTTTTGGTGATGATTTA
CCGGATCAACCTGCACAACCCTTCGAAGGTTTAGGCTCCGGTGTCATCATCAACGCCAGTAAAGGCTATGTGCTGACCAA
CAACCATGTGATTAATCAGGCACAGAAAATCAGTATTCAGCTCAATGATGGGCGCGAGTTTGATGCAAAACTGATTGGTA
GCGATGACCAGAGCGATATCGCCCTGTTACAAATTCAAAACCCGAGCAAATTAACGCAAATCGCTATTGCCGACTCCGAT
AAATTGCGCGTCGGTGATTTTGCCGTAGCGGTCGGTAACCCATTTGGCCTTGGGCAAACCGCCACCTCTGGCATTGTTTC
CGCATTAGGCCGCAGCGGGTTGAATCTTGAAGGTCTGGAAAACTTTATCCAGACAGATGCTTCCATTAACCGCGGTAACT
CCGGCGGTGCACTATTAAACCTTAACGGTGAGTTAATTGGCATCAACACTGCAATCCTTGCGCCTGGCGGCGGGAGCGTC
GGGATTGGATTTGCCATCCCCAGTAATATGGCGCGAACACTGGCGCAGCAGCTTATCGACTTTGGTGAAATCAAACGCGG
TTTGTTAGGCATCAAAGGCACCGAGATGAGTGCCGATATCGCCAAAGCCTTCAACCTTGACGTGCAGCGTGGCGCGTTTG
TCAGCGAAGTGTTGCCAGGTTCTGGCTCGGCAAAAGCGGGCGTCAAAGCGGGCGATATTATTACCAGCCTCAACGGCAAA
CCGCTGAATAGCTTTGCTGAGTTGCGCTCTCGTATCGCGACCACCGAGCCGGGCACGAAAGTGAAGCTTGGCCTGCTGCG
TAACGGCAAACCACTGGAAGTAGAAGTGACGCTCGATACCAGCACCTCTTCGTCGGCCAGCGCTGAAATGATCACGCCAG
CGCTGGAAGGTGCAACGTTGAGCGATGGTCAGCTAAAAGATGGCGGCAAAGGTATTAAAATCGATGAAGTTGTCAAAGGA
AGCCCAGCTGCTCAGGCTGGCTTGCAAAAAGACGATGTGATCATTGGCGTCAACCGCGATCGGGTGAACTCGATTGCTGA
AATGCGTAAAGTGCTGGCGGCAAAACCGGCCATCATCGCCCTGCAAATTGTACGCGGCAATGAAAGCATCTATCTGCTGA
TGCGTTAA
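A GC content figure such as the one listed above is normally the percentage of G and C bases in the sequence. The following is a minimal sketch under that assumption; `gc_percent` is a hypothetical helper name, and the example input is only the first 24 bases of the gene, so its result is not expected to match the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First 24 bases of the gene sequence listed above.
first_codons = "ATGAAAAAACAAACCCAGCTGTTG"
print(gc_percent(first_codons))  # 37.5
```

Stripping whitespace first means the sequence can be pasted in exactly as it appears in the record, spaces included.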
Upstream 100 bases:
>100_bases
TCATTTAATCTGGTGTCTCATTGTTAGCCGTCTGAAAATTCAATAACATCAAACTGTTTTGAATCTCTTTTCTTATCATT
CAGGTACGAGAGCAGGAATA
Downstream 100 bases:
>100_bases
TGTCGTAAACCGGGCATCAGGCTTACGTGTGATGTCCGGTTAACTCGTGGTATGCTGCTGCCGTTCCCTTTTTTAATGAC
GCCTCCATCATGTTTGTGAA
Product: serine endoprotease
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 455; Mature: 455
Protein sequence:
>455_residues
MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEGTASQGQKIPEEFKKFFGDDL
PDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSD
KLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV
GIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPGSGSAKAGVKAGDIITSLNGK
PLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDTSTSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKG
SPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR
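The 1368-base gene corresponds to 456 codons, that is, 455 residues plus a stop codon. The sketch below illustrates the translation step on the first 24 and last 9 bases of the gene sequence listed above. It assumes the standard codon assignments (NCBI translation table 11, used for bacteria, assigns the same amino acids as table 1); the table is built from the conventional TCAG ordering, and `translate` is a hypothetical helper, not part of the record.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


print(translate("ATGAAAAAACAAACCCAGCTGTTG"))  # MKKQTQLL
print(translate("ATGCGTTAA"))                 # MR*
print(1368 // 3 - 1)                          # 455
```

The two translated fragments agree with the start (`MKKQTQLL`) and end (`MR`) of the protein sequence above.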
Sequences:
>Translated_455_residues
MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEGTASQGQKIPEEFKKFFGDDL
PDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSD
KLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV
GIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPGSGSAKAGVKAGDIITSLNGK
PLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDTSTSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKG
SPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR
>Mature_455_residues
MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEGTASQGQKIPEEFKKFFGDDL
PDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSD
KLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV
GIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPGSGSAKAGVKAGDIITSLNGK
PLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDTSTSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKG
SPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR
Specific function: Protease that shares specificity with DegP
COG id: COG0265
COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
Gene ontology:
Cell location: Periplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 PDZ (DHR) domains
Homologues:
Organism=Homo sapiens, GI22129776, Length=316, Percent_Identity=36.3924050632911, Blast_Score=161, Evalue=1e-39
Organism=Homo sapiens, GI4506141, Length=294, Percent_Identity=35.7142857142857, Blast_Score=159, Evalue=7e-39
Organism=Homo sapiens, GI24308541, Length=305, Percent_Identity=30.8196721311475, Blast_Score=147, Evalue=2e-35
Organism=Homo sapiens, GI7019477, Length=296, Percent_Identity=33.7837837837838, Blast_Score=133, Evalue=4e-31
Organism=Escherichia coli, GI1789629, Length=455, Percent_Identity=100, Blast_Score=890, Evalue=0.0
Organism=Escherichia coli, GI1786356, Length=440, Percent_Identity=57.9545454545455, Blast_Score=506, Evalue=1e-144
Organism=Escherichia coli, GI1789630, Length=271, Percent_Identity=44.6494464944649, Blast_Score=209, Evalue=2e-55
Organism=Drosophila melanogaster, GI24646839, Length=289, Percent_Identity=32.5259515570934, Blast_Score=126, Evalue=4e-29
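Each homologue entry follows the same `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so the entries can be read into structured rows. The following is a minimal sketch, assuming the field order shown above never varies; `parse_homologues` and the dictionary keys are hypothetical names chosen for illustration.

```python
import re

# One homologue entry, in the field order used by the record above.
ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    rows = []
    for match in ENTRY.finditer(text):
        rows.append({
            "organism": match["organism"].strip(),
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": int(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return rows


sample = (
    "Organism=Homo sapiens, GI22129776, Length=316, "
    "Percent_Identity=36.3924050632911, Blast_Score=161, Evalue=1e-39, "
    "Organism=Escherichia coli, GI1789629, Length=455, "
    "Percent_Identity=100, Blast_Score=890, Evalue=0.0,"
)
for row in parse_homologues(sample):
    print(row["organism"], row["gi"], round(row["identity"], 1), row["evalue"])
```

Because the parser scans for entries rather than splitting on commas, it works whether the entries are on one line or one per line.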
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): DEGQ_ECOLI (P39099)
Other databases:
- EMBL: U15661
- EMBL: U32495
- EMBL: U18997
- EMBL: U00096
- EMBL: AP009048
- PIR: JC6051
- RefSeq: AP_003776.1
- RefSeq: NP_417701.1
- ProteinModelPortal: P39099
- SMR: P39099
- DIP: DIP-9424N
- MINT: MINT-1246722
- STRING: P39099
- MEROPS: S01.274
- SWISS-2DPAGE: P39099
- 2DBase-Ecoli: P39099
- EnsemblBacteria: EBESCT00000002944
- EnsemblBacteria: EBESCT00000014642
- GeneID: 947812
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW3203
- KEGG: eco:b3234
- EchoBASE: EB2496
- EcoGene: EG12612
- eggNOG: COG0265
- GeneTree: EBGT00050000010522
- HOGENOM: HBG585708
- OMA: IQIADSD
- ProtClustDB: PRK10139
- BioCyc: EcoCyc:G7682-MONOMER
- BioCyc: MetaCyc:G7682-MONOMER
- Genevestigator: P39099
- GO: GO:0006508
- InterPro: IPR001478
- InterPro: IPR009003
- InterPro: IPR011782
- InterPro: IPR001254
- InterPro: IPR001940
- PRINTS: PR00834
- SMART: SM00228
- TIGRFAMs: TIGR02037
Pfam domain/function: PF00595 PDZ; PF00089 Trypsin; SSF50156 PDZ; SSF50494 Pept_Ser_Cys
EC number: 3.4.21.-
Molecular weight: Translated: 47206; Mature: 47206
Theoretical pI: Translated: 5.60; Mature: 5.60
Prosite motif: PS50106 PDZ
Important sites: ACT_SITE 109-109 ACT_SITE 139-139 ACT_SITE 214-214
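The three ACT_SITE positions can be checked against the protein sequence. The sketch below assumes the positions are 1-based residue numbers in the translated sequence (the record does not say so explicitly) and uses only the first 240 residues, copied from the protein sequence above; `residues_at` is a hypothetical helper.

```python
# First 240 residues of the protein sequence listed in this record.
PROTEIN_PREFIX = (
    "MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEGTASQGQKIPEEFKKFFGDDL"
    "PDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSD"
    "KLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV"
)
ACTIVE_SITES = (109, 139, 214)  # from the ACT_SITE entries above


def residues_at(sequence: str, positions) -> dict:
    """Map each 1-based position (assumed convention) to its residue letter."""
    return {position: sequence[position - 1] for position in positions}


print(residues_at(PROTEIN_PREFIX, ACTIVE_SITES))  # {109: 'H', 139: 'D', 214: 'S'}
```

Under that numbering the three sites are His, Asp and Ser, the catalytic triad expected for a trypsin-like serine protease, which is consistent with the COG and Pfam annotations in this record.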
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
1.5 %Met (Mature Protein)
1.5 %Cys+Met (Mature Protein)
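Residue percentages like these can be computed by counting letters in the protein sequence. The following is a minimal sketch, assuming the percentage is simply the residue count divided by the sequence length; `residue_percent` is a hypothetical helper, and the example input is only the last 55 residues of the protein, so its values differ from the whole-protein figures above.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions whose residue is one of the given letters."""
    sequence = "".join(sequence.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(letter in residues for letter in sequence)
    return 100.0 * hits / len(sequence)


# Last 55 residues of the protein sequence in this record.
c_terminal = "SPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR"
print(round(residue_percent(c_terminal, "C"), 1))   # 0.0
print(round(residue_percent(c_terminal, "M"), 1))   # 3.6
print(round(residue_percent(c_terminal, "CM"), 1))  # 3.6
```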
Secondary structure:
>Translated Secondary Structure
MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEG
CCHHHHHHHHHHHHHCCEEECCHHHHHHCCCHHCCCCCCCHHHHHHHHHHHHHEEEEEEC
TASQGQKIPEEFKKFFGDDLPDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQ
CHHCCCCCHHHHHHHHCCCCCCCCCCCHHHCCCCEEEECCCCEEEECCCEECCEEEEEEE
LNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQT
ECCCCCCCEEEECCCCCCCEEEEEECCCCCEEEEEEECCCCEEECEEEEEECCCCCCCCH
ATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV
HHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEECCEEEEEEEEEEECCCCCE
GIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPG
EEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCHHHHHCCC
SGSAKAGVKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDT
CCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEEEC
STSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKGSPAAQAGLQKDDVIIGVNRD
CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHCCCCHHHCCCCCCCEEEECCHH
RVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR
HHHHHHHHHHHHHCCCCEEEEEEEECCCEEEEEEC
>Mature Secondary Structure
MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEG
CCHHHHHHHHHHHHHCCEEECCHHHHHHCCCHHCCCCCCCHHHHHHHHHHHHHEEEEEEC
TASQGQKIPEEFKKFFGDDLPDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQ
CHHCCCCCHHHHHHHHCCCCCCCCCCCHHHCCCCEEEECCCCEEEECCCEECCEEEEEEE
LNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQT
ECCCCCCCEEEECCCCCCCEEEEEECCCCCEEEEEEECCCCEEECEEEEEECCCCCCCCH
ATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV
HHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEECCEEEEEEEEEEECCCCCE
GIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPG
EEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCHHHHHCCC
SGSAKAGVKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDT
CCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEEEC
STSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKGSPAAQAGLQKDDVIIGVNRD
CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHCCCCHHHCCCCCCCEEEECCHH
RVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR
HHHHHHHHHHHHHCCCCEEEEEEEECCCEEEEEEC
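The secondary structure strings pair each residue with a one-letter state. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not define, and summarizes a structure string as percentages; `ss_composition` is a hypothetical helper, and the example input is the final structure chunk from the record.

```python
from collections import Counter

# Assumed three-state convention; not defined in the record itself.
STATE_NAMES = {"H": "helix", "E": "strand", "C": "coil"}


def ss_composition(structure: str) -> dict:
    """Percentage of residues in each state, ignoring whitespace."""
    structure = "".join(structure.split()).upper()
    if not structure:
        raise ValueError("empty structure string")
    unknown = set(structure) - set(STATE_NAMES)
    if unknown:
        raise ValueError(f"unexpected state letters: {sorted(unknown)}")
    counts = Counter(structure)
    return {
        name: round(100.0 * counts[letter] / len(structure), 1)
        for letter, name in STATE_NAMES.items()
    }


# Final chunk of the predicted structure, aligned with residues RVNSIAEM...LLMR.
last_chunk = "HHHHHHHHHHHHHCCCCEEEEEEEECCCEEEEEEC"
print(ss_composition(last_chunk))
```

Rejecting unexpected letters guards against accidentally passing a residue line instead of a structure line, since the two alternate in the record.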
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8576051; 9278503