| Definition | Bacillus cereus AH820, complete genome. |
|---|---|
| Accession | NC_011773 |
| Length | 5,302,683 |
Click here to switch to the map view.
The map label for this gene is ymxG [H]
Identifier: 218904932
GI number: 218904932
Start: 3651963
End: 3653204
Strand: Reverse
Name: ymxG [H]
Synonym: BCAH820_3816
Alternate gene names: 218904932
Gene position: 3653204-3651963 (Counterclockwise)
Preceding gene: 218904933
Following gene: 218904931
Centisome position: 68.89
GC content: 33.17
Gene sequence:
>1242_bases TTGATTAAAAAATATACTTGTAAAAATGGTGTAAGAATAGTTATGGAGAATATACCAACTGTAAGATCAGTTGCGATTGG TATTTGGATCCATGCAGGATCAAGAAATGAAAATGAAAAAAACAACGGAATTTCTCACTTTTTAGAGCATATGTTCTTTA AGGGAACGGAAACTCGTAGTGCACGCGAAATTGCAGAATCATTTGATAGCATTGGTGGACAAGTGAATGCTTTTACTTCA AAAGAATACACTTGTTACTATGCAAAAGTGTTAGATGAGCATGCTAAATATGCTTTAGATGTATTAGCAGATATGTTCTT TAATTCAACATTTGATGAAGAAGAATTGAAAAAAGAGAAGAATGTCGTATGTGAAGAAATTAAAATGTACGAAGATGCTC CAGATGATATTGTGCATGATATGTTAACGAAAGCAACATATGAAACGCATCCGCTTGGATATCCTATTTTAGGAACAGAA GAAACGCTTAATACGTTTACAGGTGATACGCTACGCCAATATATTAAAGATCATTACACACCTGAAAATGTAGTTGTATC AATTGCAGGAAATATTGATGAAGCCTTTTTACAAACGGTAGAGCAATATTTCGGTAGTTATGAAGGAACGACAAACCGTG AACAAGTACATAGCCCAATTTTCCACTTTAATAAGGTAGCACGTAAAAAGGAAACAGAACAAGCTCATTTATGTTTAGGA TATAAAGGCTTACAAATGGGACACGAAGATATTTATAACTTAATTGTATTAAATAACGTTTTAGGCGGTAGTATGAGTAG CCGTTTATTCCAAGAAGTACGTGAGCAACGCGGGTTAGCTTACTCAGTGTTTTCTTACCATTCTTCTTATGAAGATACAG GTATGTTAACGCTGTATGGTGGAACAGGTAGCCAACAATTAGATACACTGTATGAAACAATGCAAGAAACATTAGAAACA TTGAAAAATACAGGTATTACAGAAAAAGAGCTTATTAATAGTAAAGAGCAATTAAAAGGAAACTTAATGTTAAGTTTAGA AAGTACGAATAGCCGTATGAGCCGTAATGGTAAAAATGAATTGCTACTTCGTAAGCATCGTTCACTTGATGAGATTATTG AAAGTGTAAACACTGTAACAAAAGAAAATGTAGATGAATTAATTCGTAACATGTTTACAGATGAATTCTCTGCAGCATTA ATTAGTCCAGATGGAAAACTTCCAAAAGGAATAAAACTATAA
Upstream 100 bases:
>100_bases AACGTGTGGATTAAATACATTTGCATGACATTCATCCTTAAACTTGTTATCATTAAATAATAGCATTTATTTAGAGAAAC TGGCAGTAGGAGGAAAGTTT
Downstream 100 bases:
>100_bases TTGCTATATATGATTTAAAAATAACCGTATCTCTTTTAAACGAGATACGGTTATTTTTTGTGGATATATAGAATAGGTAC TAATTATGGATAACTTAAAT
Product: zinc protease, insulinase family
Products: NA
Alternate protein names: ORFP [H]
Number of amino acids: Translated: 413; Mature: 413
Protein sequence:
>413_residues MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRSAREIAESFDSIGGQVNAFTS KEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEKNVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTE ETLNTFTGDTLRQYIKDHYTPENVVVSIAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYGGTGSQQLDTLYETMQETLET LKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNELLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAAL ISPDGKLPKGIKL
Sequences:
>Translated_413_residues MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRSAREIAESFDSIGGQVNAFTS KEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEKNVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTE ETLNTFTGDTLRQYIKDHYTPENVVVSIAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYGGTGSQQLDTLYETMQETLET LKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNELLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAAL ISPDGKLPKGIKL >Mature_413_residues MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRSAREIAESFDSIGGQVNAFTS KEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEKNVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTE ETLNTFTGDTLRQYIKDHYTPENVVVSIAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYGGTGSQQLDTLYETMQETLET LKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNELLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAAL ISPDGKLPKGIKL
Specific function: Unknown
COG id: COG0612
COG function: function code R; Predicted Zn-dependent peptidases
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M16 family [H]
Homologues:
Organism=Homo sapiens, GI94538354, Length=420, Percent_Identity=30, Blast_Score=196, Evalue=3e-50, Organism=Homo sapiens, GI46593007, Length=415, Percent_Identity=25.3012048192771, Blast_Score=143, Evalue=3e-34, Organism=Homo sapiens, GI24308013, Length=459, Percent_Identity=25.0544662309368, Blast_Score=122, Evalue=6e-28, Organism=Homo sapiens, GI50592988, Length=425, Percent_Identity=20.7058823529412, Blast_Score=79, Evalue=1e-14, Organism=Homo sapiens, GI155969707, Length=119, Percent_Identity=36.1344537815126, Blast_Score=74, Evalue=2e-13, Organism=Escherichia coli, GI1787770, Length=177, Percent_Identity=28.8135593220339, Blast_Score=86, Evalue=3e-18, Organism=Escherichia coli, GI2367164, Length=172, Percent_Identity=26.1627906976744, Blast_Score=62, Evalue=6e-11, Organism=Caenorhabditis elegans, GI71999683, Length=416, Percent_Identity=28.6057692307692, Blast_Score=181, Evalue=5e-46, Organism=Caenorhabditis elegans, GI17553678, Length=418, Percent_Identity=26.3157894736842, Blast_Score=168, Evalue=6e-42, Organism=Caenorhabditis elegans, GI17510601, Length=368, Percent_Identity=22.2826086956522, Blast_Score=87, Evalue=1e-17, Organism=Saccharomyces cerevisiae, GI6323192, Length=394, Percent_Identity=30.7106598984772, Blast_Score=199, Evalue=7e-52, Organism=Saccharomyces cerevisiae, GI6321813, Length=427, Percent_Identity=26.6978922716628, Blast_Score=149, Evalue=5e-37, Organism=Saccharomyces cerevisiae, GI6319426, Length=400, Percent_Identity=22.25, Blast_Score=70, Evalue=7e-13, Organism=Drosophila melanogaster, GI21357875, Length=412, Percent_Identity=29.6116504854369, Blast_Score=193, Evalue=2e-49, Organism=Drosophila melanogaster, GI24646943, Length=412, Percent_Identity=29.6116504854369, Blast_Score=193, Evalue=2e-49, Organism=Drosophila melanogaster, GI19921772, Length=439, Percent_Identity=24.6013667425968, Blast_Score=127, Evalue=1e-29, Organism=Drosophila melanogaster, GI24665395, Length=406, Percent_Identity=21.4285714285714, Blast_Score=69, Evalue=6e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011249 - InterPro: IPR011237 - InterPro: IPR011765 - InterPro: IPR001431 - InterPro: IPR007863 [H]
Pfam domain/function: PF00675 Peptidase_M16; PF05193 Peptidase_M16_C [H]
EC number: 3.4.99.- [C]
Molecular weight: Translated: 47005; Mature: 47005
Theoretical pI: Translated: 5.06; Mature: 5.06
Prosite motif: PS00143 INSULINASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 4.1 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 3.1 %Met (Mature Protein) 4.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRS CCCCEECCCCCEEHHHCCCHHHHEEEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCHH AREIAESFDSIGGQVNAFTSKEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEK HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH NVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTEETLNTFTGDTLRQYIKDHYT HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCC PENVVVSIAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG CCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEC YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYG CHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCEEEHHHHHCCCCCCCEEEEEC GTGSQQLDTLYETMQETLETLKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNE CCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEEECCCHHHHCCCCHH LLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAALISPDGKLPKGIKL HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCC >Mature Secondary Structure MIKKYTCKNGVRIVMENIPTVRSVAIGIWIHAGSRNENEKNNGISHFLEHMFFKGTETRS CCCCEECCCCCEEHHHCCCHHHHEEEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCHH AREIAESFDSIGGQVNAFTSKEYTCYYAKVLDEHAKYALDVLADMFFNSTFDEEELKKEK HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH NVVCEEIKMYEDAPDDIVHDMLTKATYETHPLGYPILGTEETLNTFTGDTLRQYIKDHYT HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCC PENVVVSIAGNIDEAFLQTVEQYFGSYEGTTNREQVHSPIFHFNKVARKKETEQAHLCLG CCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEC YKGLQMGHEDIYNLIVLNNVLGGSMSSRLFQEVREQRGLAYSVFSYHSSYEDTGMLTLYG CHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCEEEHHHHHCCCCCCCEEEEEC GTGSQQLDTLYETMQETLETLKNTGITEKELINSKEQLKGNLMLSLESTNSRMSRNGKNE CCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEEECCCHHHHCCCCHH LLLRKHRSLDEIIESVNTVTKENVDELIRNMFTDEFSAALISPDGKLPKGIKL HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: Zn [C]
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Endopeptidases of unknown catalytic mechanism [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9384377; 8098035 [H]