Definition: Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession: NC_004631
Length: 4,791,961 bp

The map label for this gene is yhjJ

Identifier: 29144176

GI number: 29144176

Start: 4038014

End: 4039501

Strand: Direct

Name: yhjJ

Synonym: t3904

Alternate gene names: 29144176

Gene position: 4038014-4039501 (Clockwise)

Preceding gene: 29144175

Following gene: 29144178

Centisome position: 84.27
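
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes that the gene length is End - Start + 1 (inclusive coordinates) and that the centisome position is the start coordinate expressed as a percentage of the genome length. Neither definition is stated in this record, and the function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 4_791_961   # NC_004631, in bases
START, END = 4_038_014, 4_039_501


def gene_length(start: int, end: int) -> int:
    """Length of a gene given inclusive start/end coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 1488
print(centisome(START, GENOME_LENGTH))  # 84.27
```

Under these assumptions both values agree with the record: 1488 bases, which is 496 codons (495 residues plus the stop codon), and a centisome position of 84.27.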

GC content: 53.9

Gene sequence:

>1488_bases
ATGCAGGGCACAAAAATTCGACTCTTAGCGGGCAGTCTGTTGATGTTGGCCTCTGCCGGCTATGTGCAGGCAGATGCGCT
CCAGCCCGATCCGGCATGGCAACAGGGGACGCTGGCTAATGGGTTACAGTGGCAAGTGTTGGCTACGCCTCAGCGCCCCA
GCGATCGTATTGAAGTTCGTCTACAGGTTAATACCGGTTCGCTCACCGAAAGTACGCAACAGAGCGGGTTTAGCCATGCG
ATTCCCCGTATCGCGCTGACGCAAAGCGGTGGTCTGGATGCCGCACAGGCACGTTCTTTATGGCAGCAAGGATTTGATCC
GAAACGTCCCATGCCGCCCGTTATTGTTTCTTATGATTCCACGCTCTATAGCCTCAGTTTACCCAATAACCGTAACGATC
TGCTGAAAGAAGCGCTGACCTATCTGGCTAACGTCTCCGGTAAATTAACCATTACGCCAGAGACGGTGAATCATGCGTTA
AGCAGCGAAGATATGGTTGCGACGTGGCCAGCAGATACTAAAGAGGGCTGGTGGCGTTATCGGCTGAAAGGATCGGCGTT
ATTGGGGCACGATCCCGCGGAACCGTTAAAGCAGCCGGTAGACGCAGCCAAAATTCAGGCTTTCTATGAAAAATGGTACA
CCCCGGATGCCATGACGCTGATTGTTGTCGGCAACATTGATGCGCGCTCCGTCGCCGAGCAGATCAATAAAACGTTCGGT
ACGCTGAAAGGTAAACGCGAAACGCCCGCCCCGGTGCCGACGCTTTCGCCGCTGCGGGCGGAATCAGTGAGCATTATGAC
CGATGCGGTGCGCCAGGATCGTCTCTCCATTATGTGGGATACGCCGTGGCAACCGATTCGCGAGTCGGCGGCGCTGTTGC
GCTACTGGCAGGCGGATCTGGCGCGCGAAGCGCTGTTCTGGCATATCCAGCAAGAGCTTACTAAAAATAACGCGAAAGAT
ATTGGTCTGGGGTTTGACTGCCGGGTTCTGTTCCTGCGCGCGCAGTGCGCCATCAACATTGAATCACCTAATGATAAGCT
CAATACCAATTTGAGCCTGGTGGCGAATGAACTGGCGAAAGTACGCGATAAAGGTTTGTCGGAAGAGGAGTTTACTGCGC
TGGTGGCGCAGAAAAATCTCGAATTGCAAAAGCTATTCGCGACCTACGCGCGTACCGATACTGACATTTTGACTGGACAG
CGTATGCGCTCGCTGCAGAATCAGGTGGTGGATATCGCGCCGGAGCAGTATCAGAAGCTGCGTCAGAATTTCCTCAACAG
CCTGACCGTCGATATGCTCAATCAGAATCTACGTCAGCAGCTATCGCAGGAGATGGCATTAATTTTGCTGCAACCGCAAG
GCGAGCCGGAATTTAATATGAAGGCGTTAAAGGCGACGTGGGATGAAATCATGGTTCCGGCAACTGCCGCCGCTGTTGAA
GCAGATGAGGCGCATCCGGAAGTGACGGAAACACCGGCGGCACAGTAA
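
The GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence with line breaks already removed.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, rounded to one decimal."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return round(100 * gc / len(dna), 1)


# Toy input, not the gene itself: 2 of 4 bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

To apply it to this record, join the sequence lines under `>1488_bases` into one string before calling the function.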

Upstream 100 bases:

>100_bases
TTAGGTTCTCGCGGTCTAACACGAAGTGTTTTTACGTCATATTCAGGCATCCCTGCCGGGGCTGTCTTTTTATTACCAGG
ATTGTTGATCAGGGGTTCAC

Downstream 100 bases:

>100_bases
GCGGCCGCCTGTCGGCCCGGTAAGCGCAGCGCCACCGGGCAAGAACGAATGACTTACTGCGGCATCGCGTCGTGCGGAAT
AATGGCCCCACGGTACTGAA

Product: zinc-protease

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 495; Mature: 495

Protein sequence:

>495_residues
MQGTKIRLLAGSLLMLASAGYVQADALQPDPAWQQGTLANGLQWQVLATPQRPSDRIEVRLQVNTGSLTESTQQSGFSHA
IPRIALTQSGGLDAAQARSLWQQGFDPKRPMPPVIVSYDSTLYSLSLPNNRNDLLKEALTYLANVSGKLTITPETVNHAL
SSEDMVATWPADTKEGWWRYRLKGSALLGHDPAEPLKQPVDAAKIQAFYEKWYTPDAMTLIVVGNIDARSVAEQINKTFG
TLKGKRETPAPVPTLSPLRAESVSIMTDAVRQDRLSIMWDTPWQPIRESAALLRYWQADLAREALFWHIQQELTKNNAKD
IGLGFDCRVLFLRAQCAINIESPNDKLNTNLSLVANELAKVRDKGLSEEEFTALVAQKNLELQKLFATYARTDTDILTGQ
RMRSLQNQVVDIAPEQYQKLRQNFLNSLTVDMLNQNLRQQLSQEMALILLQPQGEPEFNMKALKATWDEIMVPATAAAVE
ADEAHPEVTETPAAQ
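
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the standard genetic code table (for sense codons, the bacterial table assigns the same amino acids) and translates until the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCAGGGCACAAAAATTCGACTCTTAGCG"))  # MQGTKIRLLA
print(translate("GCGGCACAGTAA"))                    # AAQ
```

The two fragments match the start (MQGTKIRLLA) and end (AAQ) of the 495-residue protein listed above.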

Sequences:

>Translated_495_residues
MQGTKIRLLAGSLLMLASAGYVQADALQPDPAWQQGTLANGLQWQVLATPQRPSDRIEVRLQVNTGSLTESTQQSGFSHA
IPRIALTQSGGLDAAQARSLWQQGFDPKRPMPPVIVSYDSTLYSLSLPNNRNDLLKEALTYLANVSGKLTITPETVNHAL
SSEDMVATWPADTKEGWWRYRLKGSALLGHDPAEPLKQPVDAAKIQAFYEKWYTPDAMTLIVVGNIDARSVAEQINKTFG
TLKGKRETPAPVPTLSPLRAESVSIMTDAVRQDRLSIMWDTPWQPIRESAALLRYWQADLAREALFWHIQQELTKNNAKD
IGLGFDCRVLFLRAQCAINIESPNDKLNTNLSLVANELAKVRDKGLSEEEFTALVAQKNLELQKLFATYARTDTDILTGQ
RMRSLQNQVVDIAPEQYQKLRQNFLNSLTVDMLNQNLRQQLSQEMALILLQPQGEPEFNMKALKATWDEIMVPATAAAVE
ADEAHPEVTETPAAQ
>Mature_495_residues
MQGTKIRLLAGSLLMLASAGYVQADALQPDPAWQQGTLANGLQWQVLATPQRPSDRIEVRLQVNTGSLTESTQQSGFSHA
IPRIALTQSGGLDAAQARSLWQQGFDPKRPMPPVIVSYDSTLYSLSLPNNRNDLLKEALTYLANVSGKLTITPETVNHAL
SSEDMVATWPADTKEGWWRYRLKGSALLGHDPAEPLKQPVDAAKIQAFYEKWYTPDAMTLIVVGNIDARSVAEQINKTFG
TLKGKRETPAPVPTLSPLRAESVSIMTDAVRQDRLSIMWDTPWQPIRESAALLRYWQADLAREALFWHIQQELTKNNAKD
IGLGFDCRVLFLRAQCAINIESPNDKLNTNLSLVANELAKVRDKGLSEEEFTALVAQKNLELQKLFATYARTDTDILTGQ
RMRSLQNQVVDIAPEQYQKLRQNFLNSLTVDMLNQNLRQQLSQEMALILLQPQGEPEFNMKALKATWDEIMVPATAAAVE
ADEAHPEVTETPAAQ

Specific function: Unknown

COG id: COG0612

COG function: function code R; Predicted Zn-dependent peptidases

Gene ontology:

Cell location: Periplasm (Potential)

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M16 family

Homologues:

Organism=Escherichia coli, GI1789946, Length=498, Percent_Identity=85.140562248996, Blast_Score=887, Evalue=0.0
Organism=Escherichia coli, GI1787770, Length=443, Percent_Identity=23.0248306997743, Blast_Score=85, Evalue=1e-17
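
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI number. A minimal parsing sketch follows; `parse_homologue` is a hypothetical helper, and the layout is inferred only from the two lines in this record.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dict of string fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number has no '=' separator in this record.
            record["GI"] = field[2:]
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789946, Length=498, "
    "Percent_Identity=85.140562248996, Blast_Score=887, Evalue=0.0,"
)
print(hit["GI"], round(float(hit["Percent_Identity"]), 1))  # 1789946 85.1
```

Values are kept as strings so that callers can decide how to convert fields such as `Evalue`.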

Paralogues:

None

Copy number: 40 molecules/cell in stationary phase, rich media (based on E. coli) [C]

Swissprot (AC and ID): YHJJ_SALTI (Q8Z286)

Other databases:

- EMBL:   AL627281
- EMBL:   AE014613
- RefSeq:   NP_458306.1
- RefSeq:   NP_807518.1
- ProteinModelPortal:   Q8Z286
- GeneID:   1071200
- GeneID:   1250411
- GenomeReviews:   AE014613_GR
- GenomeReviews:   AL513382_GR
- KEGG:   stt:t3904
- KEGG:   sty:STY4190
- HOGENOM:   HBG417164
- OMA:   LWQQSID
- ProtClustDB:   CLSK880707
- BioCyc:   SENT209261:T3904-MONOMER
- BioCyc:   SENT220341:STY4190-MONOMER
- GO:   GO:0006508
- InterPro:   IPR011249
- InterPro:   IPR011237
- InterPro:   IPR011765
- InterPro:   IPR007863
- Gene3D:   G3DSA:3.30.830.10

Pfam domain/function: PF00675 Peptidase_M16; PF05193 Peptidase_M16_C; SSF63411 Metalloenz_metal-bd

EC number: NA

Molecular weight: Translated: 55124; Mature: 55124

Theoretical pI: Translated: 5.23; Mature: 5.23

Prosite motif: PS00143 INSULINASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
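
The percentages above are consistent with 2 Cys and 12 Met residues in the 495-residue protein. A minimal sketch of the calculation follows, assuming the value is the residue count divided by the protein length and rounded to one decimal; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


# Toy sequence with the same composition as the record: 2 C, 12 M, 495 residues.
toy = "C" * 2 + "M" * 12 + "A" * 481
print(residue_percent(toy, "C"))   # 0.4
print(residue_percent(toy, "M"))   # 2.4
print(residue_percent(toy, "CM"))  # 2.8
```

Passing the protein sequence from this record in place of the toy string should give the same three figures.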

Secondary structure:

>Translated Secondary Structure
MQGTKIRLLAGSLLMLASAGYVQADALQPDPAWQQGTLANGLQWQVLATPQRPSDRIEVR
CCCCEEEHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEE
LQVNTGSLTESTQQSGFSHAIPRIALTQSGGLDAAQARSLWQQGFDPKRPMPPVIVSYDS
EEECCCCCCCHHHHCCHHHCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCEEEEECC
TLYSLSLPNNRNDLLKEALTYLANVSGKLTITPETVNHALSSEDMVATWPADTKEGWWRY
EEEEEECCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHCCCCCEEEECCCCCCCCCEEE
RLKGSALLGHDPAEPLKQPVDAAKIQAFYEKWYTPDAMTLIVVGNIDARSVAEQINKTFG
EEECCEEECCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHH
TLKGKRETPAPVPTLSPLRAESVSIMTDAVRQDRLSIMWDTPWQPIRESAALLRYWQADL
HCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHHHHH
AREALFWHIQQELTKNNAKDIGLGFDCRVLFLRAQCAINIESPNDKLNTNLSLVANELAK
HHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCHHHHHHHHHH
VRDKGLSEEEFTALVAQKNLELQKLFATYARTDTDILTGQRMRSLQNQVVDIAPEQYQKL
HHHCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHH
RQNFLNSLTVDMLNQNLRQQLSQEMALILLQPQGEPEFNMKALKATWDEIMVPATAAAVE
HHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCCCCHHHHHHHHHHHHCCCHHHHCC
ADEAHPEVTETPAAQ
CCCCCCCCCCCCCCC
>Mature Secondary Structure
MQGTKIRLLAGSLLMLASAGYVQADALQPDPAWQQGTLANGLQWQVLATPQRPSDRIEVR
CCCCEEEHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEEE
LQVNTGSLTESTQQSGFSHAIPRIALTQSGGLDAAQARSLWQQGFDPKRPMPPVIVSYDS
EEECCCCCCCHHHHCCHHHCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCEEEEECC
TLYSLSLPNNRNDLLKEALTYLANVSGKLTITPETVNHALSSEDMVATWPADTKEGWWRY
EEEEEECCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHCCCCCEEEECCCCCCCCCEEE
RLKGSALLGHDPAEPLKQPVDAAKIQAFYEKWYTPDAMTLIVVGNIDARSVAEQINKTFG
EEECCEEECCCCHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHH
TLKGKRETPAPVPTLSPLRAESVSIMTDAVRQDRLSIMWDTPWQPIRESAALLRYWQADL
HCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHHHHH
AREALFWHIQQELTKNNAKDIGLGFDCRVLFLRAQCAINIESPNDKLNTNLSLVANELAK
HHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCHHHHHHHHHH
VRDKGLSEEEFTALVAQKNLELQKLFATYARTDTDILTGQRMRSLQNQVVDIAPEQYQKL
HHHCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHH
RQNFLNSLTVDMLNQNLRQQLSQEMALILLQPQGEPEFNMKALKATWDEIMVPATAAAVE
HHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCCCCHHHHHHHHHHHHCCCHHHHCC
ADEAHPEVTETPAAQ
CCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11677608; 12644504