Definition | Legionella pneumophila str. Corby chromosome, complete genome. |
---|---|
Accession | NC_009494 |
Length | 3,576,470 |
Click here to switch to the map view.
The map label for this gene is sohB [H]
Identifier: 148360499
GI number: 148360499
Start: 1036505
End: 1037440
Strand: Reverse
Name: sohB [H]
Synonym: LPC_2440
Alternate gene names: 148360499
Gene position: 1037440-1036505 (Counterclockwise)
Preceding gene: 148360487
Following gene: 148360500
Centisome position: 29.01
GC content: 38.14
Gene sequence:
>936_bases ATGGAATTCCTTAGTGAATATGGCATGTTTCTCTTAAAATGCATCACTCTCGTCATCGCATTACTCATTCTTCTGGCAGG TATATTTTCTATGGGTCGCAAAACCAAACCCAAACTGGAAATTACCTCTCTTAATGAAGAGTATGAGCACCTGAATGCCT TAATGAATAAAGAGATTTTAGGAAAAAAACCTGGAAAAAAGAAAAAAGATAAAACCAAGCGCCCGGTTCTTTATGTCATT GATTTCAGCGGTGATATCAAAGCAACCCAAGTTGAACAACTAAGAGATGAAGTCACATCGGTATTAAGTATAGCAAAACC TGAGGATGAGGTTTTAGTAAGGTTGGAAAGCCCTGGAGGGGCCGTCAATGGTTATGGATTGGCTGCGTCCCAGTTACAGC GAATCCGTGATAAAAAAATTCCTTTGACTGTCAGCATAGACAAAATGGCAGCCAGCGGCGGATATTTAATGGCCTGCGTA GCAAATAAAATAATTGCTGCCCCATTCGCAATAATCGGCTCTATAGGTGTAGTTGCTCAAATTCCTAATTTTCATCGCTG GTTAAAGAAAAACAACATTGATGTTGAGCTATTAACTGCAGGAGAATACAAACGCACCCTAACACTTTTTGCGGAAAATA CTGAAAAAGGTAGAAAAAAGTTTCAGGAAGATTTGGAAAAAATTCATACTGCATTCAGAGAGTATGTTCTTAAAAATAGA AGTCAACTCGATATAGATAAGGTTGCAACTGGTGAACATTGGATAGCAAAAGATGCTTTTGATCTCAGGCTCGTTGATAA GTTAGCTACTAGCGATGAATATTTAATAGAAAAAATGGCGGAGTTTAATGCATTCAAATTAACTGTCCATGCCAAATTGC CAATTATAAGCAAGGTGCTTAAACCGGCAATGAGACTCATTCATCCGTGGATTTAA
Upstream 100 bases:
>100_bases AGAAAAATCCCCAATGCCTTCGTTAAAAGTTATTTAATTTTTGCTTGCGAAGTGAGTGGATGATCGCCATAATCTGCTTT ATTTTTTGCAAAGGTATATG
Downstream 100 bases:
>100_bases AAATTAGTACTGAAACAGCTTGGGAACGCTATTGAAATTTAATACCGTTAAATCAGGCAGCATTCCATAGGAAACGAGAT CAATTTTATCATGGGCTTAA
Product: putative periplasmic protease
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 311; Mature: 311
Protein sequence:
>311_residues MEFLSEYGMFLLKCITLVIALLILLAGIFSMGRKTKPKLEITSLNEEYEHLNALMNKEILGKKPGKKKKDKTKRPVLYVI DFSGDIKATQVEQLRDEVTSVLSIAKPEDEVLVRLESPGGAVNGYGLAASQLQRIRDKKIPLTVSIDKMAASGGYLMACV ANKIIAAPFAIIGSIGVVAQIPNFHRWLKKNNIDVELLTAGEYKRTLTLFAENTEKGRKKFQEDLEKIHTAFREYVLKNR SQLDIDKVATGEHWIAKDAFDLRLVDKLATSDEYLIEKMAEFNAFKLTVHAKLPIISKVLKPAMRLIHPWI
Sequences:
>Translated_311_residues MEFLSEYGMFLLKCITLVIALLILLAGIFSMGRKTKPKLEITSLNEEYEHLNALMNKEILGKKPGKKKKDKTKRPVLYVI DFSGDIKATQVEQLRDEVTSVLSIAKPEDEVLVRLESPGGAVNGYGLAASQLQRIRDKKIPLTVSIDKMAASGGYLMACV ANKIIAAPFAIIGSIGVVAQIPNFHRWLKKNNIDVELLTAGEYKRTLTLFAENTEKGRKKFQEDLEKIHTAFREYVLKNR SQLDIDKVATGEHWIAKDAFDLRLVDKLATSDEYLIEKMAEFNAFKLTVHAKLPIISKVLKPAMRLIHPWI >Mature_311_residues MEFLSEYGMFLLKCITLVIALLILLAGIFSMGRKTKPKLEITSLNEEYEHLNALMNKEILGKKPGKKKKDKTKRPVLYVI DFSGDIKATQVEQLRDEVTSVLSIAKPEDEVLVRLESPGGAVNGYGLAASQLQRIRDKKIPLTVSIDKMAASGGYLMACV ANKIIAAPFAIIGSIGVVAQIPNFHRWLKKNNIDVELLTAGEYKRTLTLFAENTEKGRKKFQEDLEKIHTAFREYVLKNR SQLDIDKVATGEHWIAKDAFDLRLVDKLATSDEYLIEKMAEFNAFKLTVHAKLPIISKVLKPAMRLIHPWI
Specific function: Possible protease [H]
COG id: COG0616
COG function: function code OU; Periplasmic serine proteases (ClpP class)
Gene ontology:
Cell location: Cell membrane; Single-pass membrane protein (Potential) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase S49 family [H]
Homologues:
Organism=Escherichia coli, GI1787527, Length=305, Percent_Identity=47.8688524590164, Blast_Score=286, Evalue=8e-79, Organism=Escherichia coli, GI1788064, Length=204, Percent_Identity=27.9411764705882, Blast_Score=62, Evalue=4e-11,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002142 - InterPro: IPR013703 [H]
Pfam domain/function: PF01343 Peptidase_S49; PF08496 Peptidase_S49_N [H]
EC number: 3.4.21.- [C]
Molecular weight: Translated: 35024; Mature: 35024
Theoretical pI: Translated: 9.91; Mature: 9.91
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein) 2.6 %Met (Translated Protein) 3.2 %Cys+Met (Translated Protein) 0.6 %Cys (Mature Protein) 2.6 %Met (Mature Protein) 3.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MEFLSEYGMFLLKCITLVIALLILLAGIFSMGRKTKPKLEITSLNEEYEHLNALMNKEIL CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHHHHH GKKPGKKKKDKTKRPVLYVIDFSGDIKATQVEQLRDEVTSVLSIAKPEDEVLVRLESPGG CCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCC AVNGYGLAASQLQRIRDKKIPLTVSIDKMAASGGYLMACVANKIIAAPFAIIGSIGVVAQ CCCCCCHHHHHHHHHHHCCCCEEEEHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH IPNFHRWLKKNNIDVELLTAGEYKRTLTLFAENTEKGRKKFQEDLEKIHTAFREYVLKNR CCHHHHHHHCCCCCEEEEECCCCEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCC SQLDIDKVATGEHWIAKDAFDLRLVDKLATSDEYLIEKMAEFNAFKLTVHAKLPIISKVL CCCCHHHHCCCCCHHHCCHHHHHHHHHHHCCHHHHHHHHHCCCEEEEEEEECCHHHHHHH KPAMRLIHPWI HHHHHHHCCCC >Mature Secondary Structure MEFLSEYGMFLLKCITLVIALLILLAGIFSMGRKTKPKLEITSLNEEYEHLNALMNKEIL CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHHHHH GKKPGKKKKDKTKRPVLYVIDFSGDIKATQVEQLRDEVTSVLSIAKPEDEVLVRLESPGG CCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCC AVNGYGLAASQLQRIRDKKIPLTVSIDKMAASGGYLMACVANKIIAAPFAIIGSIGVVAQ CCCCCCHHHHHHHHHHHCCCCEEEEHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH IPNFHRWLKKNNIDVELLTAGEYKRTLTLFAENTEKGRKKFQEDLEKIHTAFREYVLKNR CCHHHHHHHCCCCCEEEEECCCCEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCC SQLDIDKVATGEHWIAKDAFDLRLVDKLATSDEYLIEKMAEFNAFKLTVHAKLPIISKVL CCCCHHHHCCCCCHHHCCHHHHHHHHHHHCCHHHHHHHHHCCCEEEEEEEECCHHHHHHH KPAMRLIHPWI HHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on peptide bonds (Peptidases) [C]
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 7542800 [H]