Definition | Bacillus cereus E33L, complete genome. |
---|---|
Accession | NC_006274 |
Length | 5,300,915 |
The map label for this gene is 52141713
Identifier: 52141713
GI number: 52141713
Start: 3666425
End: 3667348
Strand: Reverse
Name: 52141713
Synonym: BCZK3534
Alternate gene names: NA
Gene position: 3667348-3666425 (Counterclockwise)
Preceding gene: 52141712
Following gene: 52141714
Centisome position: 69.18
GC content: 34.96
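The derived fields above can be recomputed from the coordinates and the sequence. The following is a minimal sketch, not the database's own code: the function names are illustrative, and the centisome formula is an assumption (the gene's 5' coordinate, 3667348 for this reverse-strand gene, as a percentage of the genome length). That assumption reproduces the listed 69.18, but the record does not state how the value was derived.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (assumed formula)."""
    return 100.0 * position / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper().replace(" ", "")
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


print(gene_length(3666425, 3667348))           # 924
print(round(centisome(3667348, 5300915), 2))   # 69.18
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.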
Gene sequence:
>924_bases
ATGAAAATTTTTGATGCTCATTGTGATGTGCTTCTTCAGTTATGGAGTGCACAAGGGAAAAAGGATTTTAAACATGATCC
GCAATTACATATTACATTTGAACAGTTAAAGAGAAGAAAAGGAAGTATACAATGTTTTGCTATATATGTGCCAGAAACAG
TGGCATATGAAAACCGATTTGAAGTAGCATTGCAAATGGTAGATATTTTTTATAATGAAATTTTATCCCTACCAGGTGTG
AAGTTTATTCAGACAAAAGATGATATTAACATGTTAAAACAGGATGAGATTGGGGCGATATTAACACTAGAAGGATGCGA
AGCAATTGGAAAAGATGCAATGAAATTACGGTTGTTGTATCGCCTAGGAGTACGTTCTTTTGGACTAACATGGAATTATG
CCAATCTATTAGCTGATGGTGCGTTAGAGACACGTGGGGCGGGTTTAACGACTTCTGGTAAACATGTTGTACAAGAACTG
AATGCGTTACATGTATGGACTGATGTGTCTCATCTGAATGAAAGAAGTTTTTGGGATGTGATAGAAATTGCAAAAAATCC
TATAGCTTCCCACTCAAATTGTATGAAACTTTGCGAGCACCCACGTAATTTAAGCGATGAACAACTTAAAGTTCTTATAA
AGCGAAATGGCATGATCGGTGTCACTTTTGTACCACAGTTCCTAACAAATGAAAAACAAGCGAATGTAGCAGATATAGTA
AGACATATCGAATATATTTGTTCATTAGGTGGAGAGAACAATATAGGGTTTGGTTCGGATTTTGATGGGATTTTAGAAAC
AGTTGTAAATGTATCAGCGTATCGAGATTATGAAAATGTAATGAACGAGCTTTGTAAACACTATGCTGCATCTACTGTAG
AGCGATTTTTATATGATAATTTTGTTGAACATATAAGCTTTTAA
Upstream 100 bases:
>100_bases
GAAGAACGTACAGCAATAAAATTAATTGTAGAACCTCGTTAAGTTTAAACAACCTGTTTGCAGTATGCAAACAGGTTGTT
TTTATAATGGAGGAATGAAA
Downstream 100 bases:
>100_bases
GATTTTTGTTTTGAAAATACAGAAAAATTAGAATTGATTCATTTTATATTATTGCTTTTTTTAAGACCTAATACTAGAAT
GAAAAACAAAATAGCTTTAT
Product: renal dipeptidase family protein
Products: NA
Alternate protein names: ORF X [H]
Number of amino acids: Translated: 307; Mature: 307
Protein sequence:
>307_residues
MKIFDAHCDVLLQLWSAQGKKDFKHDPQLHITFEQLKRRKGSIQCFAIYVPETVAYENRFEVALQMVDIFYNEILSLPGV
KFIQTKDDINMLKQDEIGAILTLEGCEAIGKDAMKLRLLYRLGVRSFGLTWNYANLLADGALETRGAGLTTSGKHVVQEL
NALHVWTDVSHLNERSFWDVIEIAKNPIASHSNCMKLCEHPRNLSDEQLKVLIKRNGMIGVTFVPQFLTNEKQANVADIV
RHIEYICSLGGENNIGFGSDFDGILETVVNVSAYRDYENVMNELCKHYAASTVERFLYDNFVEHISF
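The 307-residue protein is the conceptual translation of the 924-base gene sequence (307 codons plus a stop codon). The sketch below illustrates that translation under stated assumptions: it uses the standard codon assignments (which the bacterial table 11 shares), reads the sequence in frame from its first base, and stops at the first stop codon. The name `translate` and the table-building code are illustrative, not taken from the database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "").replace("\n", "")
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last six codons of the gene sequence in this record.
print(translate("ATGAAAATTTTTGATGCTCATTGTGAT"))  # MKIFDAHCD
print(translate("GAACATATAAGCTTTTAA"))           # EHISF
```

The two fragments reproduce the first nine and last five residues of the listed protein.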
Sequences:
>Translated_307_residues
MKIFDAHCDVLLQLWSAQGKKDFKHDPQLHITFEQLKRRKGSIQCFAIYVPETVAYENRFEVALQMVDIFYNEILSLPGV
KFIQTKDDINMLKQDEIGAILTLEGCEAIGKDAMKLRLLYRLGVRSFGLTWNYANLLADGALETRGAGLTTSGKHVVQEL
NALHVWTDVSHLNERSFWDVIEIAKNPIASHSNCMKLCEHPRNLSDEQLKVLIKRNGMIGVTFVPQFLTNEKQANVADIV
RHIEYICSLGGENNIGFGSDFDGILETVVNVSAYRDYENVMNELCKHYAASTVERFLYDNFVEHISF
>Mature_307_residues
MKIFDAHCDVLLQLWSAQGKKDFKHDPQLHITFEQLKRRKGSIQCFAIYVPETVAYENRFEVALQMVDIFYNEILSLPGV
KFIQTKDDINMLKQDEIGAILTLEGCEAIGKDAMKLRLLYRLGVRSFGLTWNYANLLADGALETRGAGLTTSGKHVVQEL
NALHVWTDVSHLNERSFWDVIEIAKNPIASHSNCMKLCEHPRNLSDEQLKVLIKRNGMIGVTFVPQFLTNEKQANVADIV
RHIEYICSLGGENNIGFGSDFDGILETVVNVSAYRDYENVMNELCKHYAASTVERFLYDNFVEHISF
Specific function: Unknown
COG id: COG2355
COG function: function code E; Zn-dependent dipeptidase, microsomal dipeptidase homolog
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M19 family [H]
Homologues:
Organism | GI | Length | Percent_Identity | Blast_Score | Evalue |
---|---|---|---|---|---|
Homo sapiens | 189458885 | 279 | 24.7311827956989 | 108 | 6e-24 |
Homo sapiens | 4758190 | 279 | 24.7311827956989 | 108 | 6e-24 |
Homo sapiens | 11641269 | 248 | 25 | 89 | 5e-18 |
Homo sapiens | 193211608 | 214 | 27.1028037383178 | 88 | 1e-17 |
Homo sapiens | 193211610 | 214 | 27.1028037383178 | 85 | 9e-17 |
Drosophila melanogaster | 221475880 | 264 | 28.7878787878788 | 123 | 1e-28 |
Drosophila melanogaster | 161083233 | 255 | 27.4509803921569 | 116 | 1e-26 |
Drosophila melanogaster | 281362638 | 244 | 28.2786885245902 | 109 | 2e-24 |
Drosophila melanogaster | 281362636 | 244 | 28.2786885245902 | 109 | 2e-24 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008257 [H]
Pfam domain/function: PF01244 Peptidase_M19 [H]
EC number: NA
Molecular weight: Translated: 34953; Mature: 34953
Theoretical pI: Translated: 5.95; Mature: 5.95
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.3 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
4.6 %Cys+Met (Translated Protein)
2.3 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
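These percentages are residue counts divided by protein length. As a minimal sketch (the helper name `residue_percent` is hypothetical, and rounding to one decimal place is an assumption chosen to match the figures shown), the calculation could look like this:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        return 0.0
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)


# Toy input: the first ten residues of the protein in this record.
fragment = "MKIFDAHCDV"
print(residue_percent(fragment, "C"))   # 10.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 20.0
```

Applied to the full sequence, which contains 7 Cys and 7 Met among its 307 residues, the same calculation gives 2.3 % each and 4.6 % combined, in line with the values listed.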
Secondary structure:
>Translated Secondary Structure
MKIFDAHCDVLLQLWSAQGKKDFKHDPQLHITFEQLKRRKGSIQCFAIYVPETVAYENRF
CCEEHHHHHHHHHHHHCCCCCCCCCCCCEEEEHHHHHCCCCCEEEEEEECCCHHHHHHHH
EVALQMVDIFYNEILSLPGVKFIQTKDDINMLKQDEIGAILTLEGCEAIGKDAMKLRLLY
HHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHCCCCEEEEECCHHHHCHHHHHHHHHH
RLGVRSFGLTWNYANLLADGALETRGAGLTTSGKHVVQELNALHVWTDVSHLNERSFWDV
HHHHHHHCCCCCHHHHHHCCCHHCCCCCCCCCHHHHHHHHHHHHEEHHHHHHCCCHHHHH
IEIAKNPIASHSNCMKLCEHPRNLSDEQLKVLIKRNGMIGVTFVPQFLTNEKQANVADIV
HHHHHCCCCCCHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEEECHHHHCCCCCCCHHHHH
RHIEYICSLGGENNIGFGSDFDGILETVVNVSAYRDYENVMNELCKHYAASTVERFLYDN
HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FVEHISF
HHHHHCC
>Mature Secondary Structure
MKIFDAHCDVLLQLWSAQGKKDFKHDPQLHITFEQLKRRKGSIQCFAIYVPETVAYENRF
CCEEHHHHHHHHHHHHCCCCCCCCCCCCEEEEHHHHHCCCCCEEEEEEECCCHHHHHHHH
EVALQMVDIFYNEILSLPGVKFIQTKDDINMLKQDEIGAILTLEGCEAIGKDAMKLRLLY
HHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHCCCCEEEEECCHHHHCHHHHHHHHHH
RLGVRSFGLTWNYANLLADGALETRGAGLTTSGKHVVQELNALHVWTDVSHLNERSFWDV
HHHHHHHCCCCCHHHHHHCCCHHCCCCCCCCCHHHHHHHHHHHHEEHHHHHHCCCHHHHH
IEIAKNPIASHSNCMKLCEHPRNLSDEQLKVLIKRNGMIGVTFVPQFLTNEKQANVADIV
HHHHHCCCCCCHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEEECHHHHCCCCCCCHHHHH
RHIEYICSLGGENNIGFGSDFDGILETVVNVSAYRDYENVMNELCKHYAASTVERFLYDN
HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FVEHISF
HHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 1313537 [H]