The gene/protein map for NC_011750 is currently unavailable.
Definition Escherichia coli IAI39 chromosome, complete genome.
Accession NC_011750
Length 5,132,068

Click here to switch to the map view.

The map label for this gene is sohB

Identifier: 218699982

GI number: 218699982

Start: 1680251

End: 1681300

Strand: Direct

Name: sohB

Synonym: ECIAI39_1611

Alternate gene names: 218699982

Gene position: 1680251-1681300 (Clockwise)

Preceding gene: 218699978

Following gene: 218699984

Centisome position: 32.74

GC content: 50.29

Gene sequence:

>1050_bases
GTGGAATTGTTGTCTGAATATGGTTTGTTTTTGGCGAAAATCGTTACCGTTGTGCTAGCGATTGCGGCGATTGCCGCCAT
TATTGTCAATGTTGCTCAACGTAATAAACGCCAGCGTGGCGAGTTACGGGTCAACAATCTCAGCGAACAGTATAAGGAGA
TGAAAGAAGAACTGGCCGCGGCGCTGATGGATTCACATCAGCAAAAACAGTGGCACAAAGCGCAGAAGAAAAAGCACAAG
CAAGAAGCGAAAGCAGCAAAAGCGAAAGCCAAACTGGGCGAGGTGGCAACTGACAGTAAACCCCGCGTCTGGGTGCTGGA
TTTTAAAGGCAGCATGGACGCCCATGAAGTGAACTCGCTACGTGAAGAGATAACGGCGGTACTCGCAGCATTCAAATCGC
AGGATCAGGTTGTGCTCCGTCTGGAAAGCCCTGGTGGCATGGTGCATGGTTACGGGTTGGCGGCTTCGCAGCTGCAGCGT
CTGCGTGATAAAAACATTCCTTTAACTGTTACGGTAGACAAAGTCGCTGCCAGCGGCGGTTACATGATGGCCTGTGTGGC
AGACAAAATTGTTTCCGCACCGTTTGCTATTGTGGGTTCCATTGGCGTGGTGGCACAAATGCCCAACTTTAACCGATTCC
TGAAAAGCAAAGATATTGATATCGAACTGCACACCGCCGGGCAGTATAAGCGCACGCTGACCTTGCTGGGTGAAAACACC
GAAGAAGGGCGGGAGAAATTCCGCGAAGAGCTGAACGAAACGCATCAGTTATTTAAAGATTTTGTGAAGCGTATGCGTCC
GTCTCTGGATATTGAACAGGTGGCAACGGGTGAACACTGGTACGGACAACAGGCGGTAGAGAAAGGCCTGGTTGATGAAA
TCAACACCAGTGATGAAGTTATTCTTAGCCTGATGGAAGGCCGTGAAGTGGTCAATGTACGCTATATGCAACGAAAACGA
CTCATTGACCGATTCACCGGCAGCGCGACAGAGAGCGCCGATCGATTGTTGTTACGCTGGTGGCAGCGAGGGCAGAAGCC
ATTGATGTAA

Upstream 100 bases:

>100_bases
ACTTTGTCATACTTTCGCTGCAATAACATCTCTGCGAGACGGCTTAACATGCCTGTTGTAAACTGTGAGCCAAAGCGTTG
TTTAACCAAGGTGGGGACTC

Downstream 100 bases:

>100_bases
AAGACAAACGCGAGGGTAAGACCTCGCGTTTTGCTTTAATCAACCAGGTGATATTTTTCTGAAAGCACATGGGCCAGGTG
TTTGAACATATTAAACACCG

Product: putative periplasmic protease

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 349; Mature: 349

Protein sequence:

>349_residues
MELLSEYGLFLAKIVTVVLAIAAIAAIIVNVAQRNKRQRGELRVNNLSEQYKEMKEELAAALMDSHQQKQWHKAQKKKHK
QEAKAAKAKAKLGEVATDSKPRVWVLDFKGSMDAHEVNSLREEITAVLAAFKSQDQVVLRLESPGGMVHGYGLAASQLQR
LRDKNIPLTVTVDKVAASGGYMMACVADKIVSAPFAIVGSIGVVAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENT
EEGREKFREELNETHQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEVILSLMEGREVVNVRYMQRKR
LIDRFTGSATESADRLLLRWWQRGQKPLM

Sequences:

>Translated_349_residues
MELLSEYGLFLAKIVTVVLAIAAIAAIIVNVAQRNKRQRGELRVNNLSEQYKEMKEELAAALMDSHQQKQWHKAQKKKHK
QEAKAAKAKAKLGEVATDSKPRVWVLDFKGSMDAHEVNSLREEITAVLAAFKSQDQVVLRLESPGGMVHGYGLAASQLQR
LRDKNIPLTVTVDKVAASGGYMMACVADKIVSAPFAIVGSIGVVAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENT
EEGREKFREELNETHQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEVILSLMEGREVVNVRYMQRKR
LIDRFTGSATESADRLLLRWWQRGQKPLM
>Mature_349_residues
MELLSEYGLFLAKIVTVVLAIAAIAAIIVNVAQRNKRQRGELRVNNLSEQYKEMKEELAAALMDSHQQKQWHKAQKKKHK
QEAKAAKAKAKLGEVATDSKPRVWVLDFKGSMDAHEVNSLREEITAVLAAFKSQDQVVLRLESPGGMVHGYGLAASQLQR
LRDKNIPLTVTVDKVAASGGYMMACVADKIVSAPFAIVGSIGVVAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENT
EEGREKFREELNETHQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEVILSLMEGREVVNVRYMQRKR
LIDRFTGSATESADRLLLRWWQRGQKPLM

Specific function: Multicopy suppressor of the htrA (degP) null phenotype. It is possibly a protease, not essential for bacterial viability [H]

COG id: COG0616

COG function: function code OU; Periplasmic serine proteases (ClpP class)

Gene ontology:

Cell location: Cell inner membrane; Single-pass membrane protein [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase S49 family [H]

Homologues:

Organism=Escherichia coli, GI1787527, Length=349, Percent_Identity=99.4269340974212, Blast_Score=713, Evalue=0.0,
Organism=Escherichia coli, GI1788064, Length=170, Percent_Identity=29.4117647058824, Blast_Score=66, Evalue=4e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002142
- InterPro:   IPR013703 [H]

Pfam domain/function: PF01343 Peptidase_S49; PF08496 Peptidase_S49_N [H]

EC number: 3.4.21.- [C]

Molecular weight: Translated: 39387; Mature: 39387

Theoretical pI: Translated: 9.81; Mature: 9.81

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
3.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MELLSEYGLFLAKIVTVVLAIAAIAAIIVNVAQRNKRQRGELRVNNLSEQYKEMKEELAA
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
ALMDSHQQKQWHKAQKKKHKQEAKAAKAKAKLGEVATDSKPRVWVLDFKGSMDAHEVNSL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHH
REEITAVLAAFKSQDQVVLRLESPGGMVHGYGLAASQLQRLRDKNIPLTVTVDKVAASGG
HHHHHHHHHHHCCCCEEEEEECCCCCEEEECCHHHHHHHHHHHCCCCEEEEEHHHHCCCC
YMMACVADKIVSAPFAIVGSIGVVAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENT
EEHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHCCCCCEEEECCCCHHHHHHEECCCC
EEGREKFREELNETHQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEV
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCHHHHHCCHHHHCCCHHHH
ILSLMEGREVVNVRYMQRKRLIDRFTGSATESADRLLLRWWQRGQKPLM
HHHHHHCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure
MELLSEYGLFLAKIVTVVLAIAAIAAIIVNVAQRNKRQRGELRVNNLSEQYKEMKEELAA
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
ALMDSHQQKQWHKAQKKKHKQEAKAAKAKAKLGEVATDSKPRVWVLDFKGSMDAHEVNSL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHH
REEITAVLAAFKSQDQVVLRLESPGGMVHGYGLAASQLQRLRDKNIPLTVTVDKVAASGG
HHHHHHHHHHHCCCCEEEEEECCCCCEEEECCHHHHHHHHHHHCCCCEEEEEHHHHCCCC
YMMACVADKIVSAPFAIVGSIGVVAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENT
EEHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHCCCCCEEEECCCCHHHHHHEECCCC
EEGREKFREELNETHQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEV
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCHHHHHCCHHHHCCCHHHH
ILSLMEGREVVNVRYMQRKRLIDRFTGSATESADRLLLRWWQRGQKPLM
HHHHHHCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases) [C]

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 1885549; 9097039; 9278503; 3029379 [H]