Definition | Bacillus cereus E33L, complete genome. |
---|---|
Accession | NC_006274 |
Length | 5,300,915 |
The map label for this gene is ymxG [H]
Identifier: 52141684
GI number: 52141684
Start: 3697633
End: 3698874
Strand: Reverse
Name: ymxG [H]
Synonym: BCZK3564
Alternate gene names: 52141684
Gene position: 3698874-3697633 (Counterclockwise)
Preceding gene: 52141681
Following gene: 52141683
Centisome position: 69.78
GC content: 33.66
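The coordinate fields above are related by simple arithmetic, sketched below. Two points are assumptions rather than something the record states: coordinates are taken to be 1-based and inclusive, and the centisome position is taken to be computed from the gene's first base on the reverse strand (coordinate 3698874), which is the reading that reproduces the listed 69.78. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_300_915
GENE_START = 3_697_633
GENE_END = 3_698_874


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = "".join(sequence.split()).upper()  # drop the spaces between chunks
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


print(gene_length(GENE_START, GENE_END))   # 1242
print(centisome(GENE_END, GENOME_LENGTH))  # 69.78
```

The computed length of 1242 bases matches the gene sequence header, and 1242 bases correspond to 413 codons plus a stop codon. Passing the gene sequence to `gc_percent` would give a value to compare with the GC content field.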
Gene sequence:
>1242_bases TTGATTAAAAAATATACTTGTAAAAATGGTGTAAGAATAGTTATGGAGAATATACCAACTGTAAGATCGGTTGCGATTGG TATTTGGATCCATGCAGGATCAAGAAATGAAAATGAAAAAAACAACGGGATTTCTCACTTTTTAGAGCATATGTTCTTTA AGGGAACGGAAACTCGTAGTGCACGCGAAATTGCAGAATCATTTGATAGCATTGGTGGACAAGTGAATGCTTTTACTTCA AAAGAATACACTTGTTACTATGCAAAAGTGCTAGATGAGCATGCTAAATATGCTTTAGATGTATTAGCAGATATGTTCTT TAATTCAACATTTGATGAAGAAGAACTGAAAAAAGAGAAGAATGTCGTATGTGAAGAAATTAAAATGTACGAAGATGCTC CAGATGACATTGTGCATGATATGTTAACGAAAGCAACATATGAAACGCATCCGCTTGGATATCCTATTTTAGGAACAGAA GAAACGCTTAATACGTTTACGGGTGATACGCTACGCCAATATATTAAAGATCATTACACACCTGAAAATGTAGTTGTATC AGTTGCAGGAAATATTGATGAAGCCTTTTTACAAACGGTAGAGCAATATTTCGGTAGTTATGAAGGAACGACAAACCGTG AACAAGTACATAGCCCAATTTTCCATTTTAATAAGGTAGCACGTAAAAAGGAAACAGAACAAGCTCATTTATGTTTAGGA TATAAAGGCTTACAAATGGGACACGAAGATATTTATAACTTAATTGTATTAAATAACGTTTTAGGCGGTAGTATGAGTAG CCGTTTATTCCAAGAAGTACGTGAGCAACGCGGGTTAGCTTACTCAGTGTTTTCTTACCATTCTTCTTATGAAGATACAG GTATGTTAACGCTGTATGGTGGAACAGGTAGCCAACAATTAGATACACTGTATGAAACAATGCAAGAAACATTAGAAACA TTGAAAAATACAGGTATTACAGAAAAAGAGCTAATTAATAGTAAAGAGCAATTAAAAGGAAACTTAATGTTAAGTTTAGA AAGTACGAATAGCCGTATGAGCCGTAATGGTAAAAATGAATTGCTACTTCGTAAGCATCGTTCACTTGATGAGATTATTG AAAGTGTAAACACTGTAACAAAAGAAAATGTAGATGAATTAATTCGTAACATGTTTACAGATGAATTCTCTGCAGCATTA ATTAGTCCAGATGGAAAACTTCCAAAAGGAATAAAACTATAA
Upstream 100 bases:
>100_bases AACGTGTGGATTAAATACATTTGCATGACATTCATCCTTAAACTTGTTATCATTAAATAATAGCATTTATTTAGAGAAAC TGGCAGTAGGAGGAAAGTTT
Downstream 100 bases:
>100_bases TTGCTATATATGATGTAAAAATAACCGTATCTCGTTTAAAAGAGATACGGTTATTTTTTTTGGGTATACAGAATAGGTAC TAATTATAGATAACTTACAT
Product: zinc protease
Products: NA
Alternate protein names: ORFP [H]
Number of amino acids: Translated: 413; Mature: 413
Protein sequence:
>413_residues MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRSAREIAESFDSIGGQVNAFTS KEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEKNVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTE ETLNTFTGDTLRQYIKDHYTPENVVVSVAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYGGTGSQQLDTLYETMQETLET LKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNELLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAAL ISPDGKLPKGIKL
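The gene sequence starts with TTG, yet the protein starts with M. This is consistent with translation under the bacterial genetic code, where an alternative start codon is read as methionine at the initiation position. The sketch below illustrates that convention; the set of start codons is restricted to ATG, GTG and TTG as a simplifying assumption, and the helper name `translate` is hypothetical.

```python
BASES = "TCAG"
# Standard codon table in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the three most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading a start codon as M at position 1."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("TTGATTAAAAAATATACTTGTAAAAATGGT"))  # MIKKYTCKNG
```

The output matches the first ten residues of the protein sequence above. Sequences containing ambiguous bases such as N are not handled by this sketch.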
Sequences:
>Translated_413_residues MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRSAREIAESFDSIGGQVNAFTS KEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEKNVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTE ETLNTFTGDTLRQYIKDHYTPENVVVSVAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYGGTGSQQLDTLYETMQETLET LKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNELLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAAL ISPDGKLPKGIKL
>Mature_413_residues MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRSAREIAESFDSIGGQVNAFTS KEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEKNVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTE ETLNTFTGDTLRQYIKDHYTPENVVVSVAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYGGTGSQQLDTLYETMQETLET LKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNELLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAAL ISPDGKLPKGIKL
Specific function: Unknown
COG id: COG0612
COG function: function code R; Predicted Zn-dependent peptidases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M16 family [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | GI94538354 | 420 | 30 | 197 | 2e-50 |
Homo sapiens | GI46593007 | 415 | 25.3012048192771 | 144 | 2e-34 |
Homo sapiens | GI24308013 | 459 | 25.0544662309368 | 122 | 5e-28 |
Homo sapiens | GI50592988 | 425 | 20.2352941176471 | 79 | 1e-14 |
Homo sapiens | GI155969707 | 119 | 36.1344537815126 | 74 | 2e-13 |
Escherichia coli | GI1787770 | 177 | 29.3785310734463 | 87 | 2e-18 |
Escherichia coli | GI2367164 | 172 | 26.1627906976744 | 62 | 6e-11 |
Caenorhabditis elegans | GI71999683 | 416 | 28.6057692307692 | 181 | 4e-46 |
Caenorhabditis elegans | GI17553678 | 418 | 26.3157894736842 | 168 | 5e-42 |
Caenorhabditis elegans | GI17510601 | 368 | 22.2826086956522 | 87 | 1e-17 |
Saccharomyces cerevisiae | GI6323192 | 394 | 30.7106598984772 | 199 | 6e-52 |
Saccharomyces cerevisiae | GI6321813 | 427 | 26.6978922716628 | 149 | 6e-37 |
Saccharomyces cerevisiae | GI6319426 | 400 | 22.25 | 70 | 6e-13 |
Drosophila melanogaster | GI21357875 | 412 | 29.6116504854369 | 193 | 1e-49 |
Drosophila melanogaster | GI24646943 | 412 | 29.6116504854369 | 193 | 1e-49 |
Drosophila melanogaster | GI19921772 | 437 | 24.7139588100687 | 127 | 1e-29 |
Drosophila melanogaster | GI24665395 | 406 | 21.1822660098522 | 69 | 7e-12 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011249
- InterPro: IPR011237
- InterPro: IPR011765
- InterPro: IPR001431
- InterPro: IPR007863 [H]
Pfam domain/function: PF00675 Peptidase_M16; PF05193 Peptidase_M16_C [H]
EC number: 3.4.99.- [C]
Molecular weight: Translated: 46991; Mature: 46991
Theoretical pI: Translated: 5.06; Mature: 5.06
Prosite motif: PS00143 INSULINASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
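The Cys/Met percentages above are consistent with 4 cysteine and 13 methionine residues among the 413 residues of the protein sequence in this record. A minimal sketch of that calculation follows; the function name `residue_percent` is illustrative, and rounding to one decimal place is an assumption based on how the values are displayed.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a non-empty protein."""
    seq = "".join(protein.split()).upper()  # drop the spaces between chunks
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


# Toy 413-residue sequence with the same Cys and Met counts as the record.
toy = "C" * 4 + "M" * 13 + "A" * 396
print(residue_percent(toy, "C"))   # 1.0
print(residue_percent(toy, "M"))   # 3.1
print(residue_percent(toy, "CM"))  # 4.1
```

Passing the actual protein sequence in place of `toy` would give values to compare with the fields above.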
Secondary structure:
>Translated Secondary Structure
MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRS
CCCCEECCCCCEEHHHCCCHHHHEEEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCHH
AREIAESFDSIGGQVNAFTSKEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEK
HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
NVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTEETLNTFTGDTLRQYIKDHYT
HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCC
PENVVVSVAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG
CCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEC
YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYG
CHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCEEEHHHHHCCCCCCCEEEEEC
GTGSQQLDTLYETMQETLETLKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNE
CCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEEECCCHHHHCCCCHH
LLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAALISPDGKLPKGIKL
HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCC
>Mature Secondary Structure
MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRS
CCCCEECCCCCEEHHHCCCHHHHEEEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCHH
AREIAESFDSIGGQVNAFTSKEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEK
HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
NVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTEETLNTFTGDTLRQYIKDHYT
HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCC
PENVVVSVAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG
CCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEC
YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYG
CHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCEEEHHHHHCCCCCCCEEEEEC
GTGSQQLDTLYETMQETLETLKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNE
CCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEEECCCHHHHCCCCHH
LLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAALISPDGKLPKGIKL
HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: Zn [C]
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Endopeptidases of unknown catalytic mechanism [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9384377; 8098035 [H]