Definition: Clostridium botulinum A str. ATCC 3502, complete genome.
Accession: NC_009495
Length: 3,886,916 bp

The map label for this gene is 148379183

Identifier: 148379183

GI number: 148379183

Start: 1318554

End: 1319882

Strand: Direct

Name: 148379183

Synonym: CBO1200

Alternate gene names: NA

Gene position: 1318554-1319882 (Clockwise)

Preceding gene: 148379182

Following gene: 148379184

Centisome position: 33.92
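
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length. That definition is an assumption, although it reproduces the listed value. The function names are illustrative only.

```python
GENOME_LENGTH = 3_886_916          # from the Length field
START, END = 1_318_554, 1_319_882  # from the Start and End fields


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 1329, matching the >1329_bases header
print(centisome(START, GENOME_LENGTH))  # 33.92, matching the Centisome position field
```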

GC content: 30.32

Gene sequence:

>1329_bases
ATGAAATTTATACAGTATAAAACTACTAAAAAAGATAAATTCTTAAAAAACGAAATTAATAATACTACAGATGAAAGACC
AACATTACATGTTTCACAAATTGAAAATAGGATCATAGAGGGATTTGGAAGTTGTTTTAATGAGCTGGGGATGAAAGCAT
TAAATCACTTAGACAAAGATGAAAGAAATAAAGTACTTGATCAACTATTTTCTACTAAAGGTGATTGTCGTTTTAATCTT
TGCAGAATGCCTATTGGAGCAAGTGATTATGCAACAGAATGGTATAGCTATAACGAAAATGAAAACGATTTTGATATGGA
GAAATTTAGTATTCAAAAAGATAAAAGACTTCTTATACCTTATATAAAAGAAGCATTAAAAAGAAATCCTAATATAATTC
TAACAGCTTCTCCATGGAGTCCTCCAACTTGGATGAAAACACAGAAGGCTTACAATTTTGGGACTCTACGCTTTGAAGAG
AAAGTTTTAAAAGCATATGCTCTTTATTTCGTTAAATTTGTACAAGCATATGAGGAAGAAGGAATAACAATTCATCAAGT
TCATGTTCAGAATGAAGTTGTTGCAGATCAAAAATTTCCGTCTTGTAGATGGACAGGAGATCAATTAACGGATTTTATTA
AAAATTATTTAGGACCAGCCTTTGAAAAGCATAATATTAGCAGTGAAATTTGGTTAGGGACAATTAATGCGCCAGAGCCC
TATGTTGAATGGCTAGAAGATTATACACAAGATTTTGATGTTTATGCTGGCTTAGTATTGAGAGATACAAAAGCATATAA
GTATGTTAAAGGAGTAGGATATCAATGGGCAGGAAAAAATGCTATTCAGCGTTCAGTTGAAGCATTTGCAGAAAAAAGAT
TTATCCAAACTGAAAATGAGTGCGGTAATGGTAAAAATACATGGATATATGCAGAATATGTTTTTAATCTATTCAGACAT
TATATTGTAAATGGTGTAAATGGATATATGTATTGGAATGCTGTTTTAGAACCAAAAGGAATGAGTACTTGGGGATGGGA
ACAGAATTCTATGATAACTGTAAATCCAGAAACTAAAGAAGTTATGTATAATCCAGAGTTCTATGTAATGAAACATTTTT
CTCATTTTGTTCAAAAAGGGGCAAAAAGATTAACTACTTCAGGTGTAGATTCTGTTGATACAGTAGCTTTCAGGAATCCA
GATGAATCTATAATTATTGTAATTTCAAATAAAAATGATGATTCTAGAATTGCTAATATTGAATTTACAGGTGAAATATT
TGAAGTTGAATTAGAAGGACATTCATTTAACACTATTGTTTTAGAATAA
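
The sequence blocks in this record use a FASTA-like layout: a header such as `>1329_bases` followed by lines wrapped at 80 characters. The sketch below shows one way to rejoin such a block, check it against the length declared in the header, and compute a GC percentage comparable to the GC content field. The helper names and the toy input are hypothetical, and the header parsing assumes the leading-integer form used for the nucleotide blocks.

```python
def read_block(text: str) -> tuple[str, str]:
    """Split a FASTA-like block into (header, sequence), joining wrapped lines."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a header line starting with '>'")
    return lines[0][1:], "".join(lines[1:]).upper()


def declared_length(header: str) -> int:
    """Read the leading integer from headers such as '1329_bases'."""
    return int(header.split("_")[0])


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    if not seq:
        raise ValueError("empty sequence")
    return round(100.0 * (seq.count("G") + seq.count("C")) / len(seq), 2)


# Toy block in the same layout (hypothetical data, not the gene above).
example = """>12_bases
ATGCAT
GGCTAA
"""
header, seq = read_block(example)
assert declared_length(header) == len(seq)
print(gc_percent(seq))  # 41.67 for the toy block
```

Running the same three calls on the gene sequence block should yield a value comparable to the GC content field, provided the tool that produced the record used the same definition.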

Upstream 100 bases:

>100_bases
GGTGTACAAATTATTTTACTTAAATTTTATAAGTTAGATAAAATATACCCAAATATTATTAGCGATTTAGAAAATAGAAA
GATATAGAAGGAGGAAGAAT

Downstream 100 bases:

>100_bases
TTATTAAGAGGATGCCCGGATAAGGGCTTCCTTTTTCAAATAAGCTAAAAGAGGGAAAATAATATGGATAATAAGAATTA
TTTAAGTATAGATATTGGAG

Product: O-glycosyl hydrolase, family 30

Products: D-glucose; N-acylsphingosine

Alternate protein names: Glucan Endo-1,6-Beta-Glucosidase; Glycoside Hydrolase Family; O-Glycosyl Hydrolase Family; Glycoside Hydrolase Family Protein; O-Glycosyl Hydrolase; Glycosyl Hydrolase; Ricin B Lectin; O-Glycosyl Hydrolase Family Protein; Glycoside Hydrolase; Lysosomal Glucosyl Ceramidase-Like Protein; LPXTG-Motif Cell Wall Anchor Domain Protein; Cellulosome Protein Dockerin Type I; Glycosyl Hydrolase Family Protein; Beta-Glycosidase; Helix-Turn-Helix AraC Type; Glycosyl Hydrolase Family; Glycosyl Hydrolases Family; Endo-1,6-Beta-D-Glucanase; Glucuronoarabinoxylan Endo-1,4-Beta-Xylanase

Number of amino acids: Translated: 442; Mature: 442

Protein sequence:

>442_residues
MKFIQYKTTKKDKFLKNEINNTTDERPTLHVSQIENRIIEGFGSCFNELGMKALNHLDKDERNKVLDQLFSTKGDCRFNL
CRMPIGASDYATEWYSYNENENDFDMEKFSIQKDKRLLIPYIKEALKRNPNIILTASPWSPPTWMKTQKAYNFGTLRFEE
KVLKAYALYFVKFVQAYEEEGITIHQVHVQNEVVADQKFPSCRWTGDQLTDFIKNYLGPAFEKHNISSEIWLGTINAPEP
YVEWLEDYTQDFDVYAGLVLRDTKAYKYVKGVGYQWAGKNAIQRSVEAFAEKRFIQTENECGNGKNTWIYAEYVFNLFRH
YIVNGVNGYMYWNAVLEPKGMSTWGWEQNSMITVNPETKEVMYNPEFYVMKHFSHFVQKGAKRLTTSGVDSVDTVAFRNP
DESIIIVISNKNDDSRIANIEFTGEIFEVELEGHSFNTIVLE
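
The protein length follows from the gene length: 1329 bases give 443 codons, the last of which is the TAA stop codon, leaving 442 residues. The sketch below translates DNA with the standard codon assignments and checks the first and last codons of the gene against the protein sequence. The function and constant names are illustrative; this is not the pipeline that produced the record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 15 bases of the gene sequence in this record.
print(translate("ATGAAATTTATACAGTATAAAACT"))  # MKFIQYKT
print(translate("ATTGTTTTAGAATAA"))           # IVLE
```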

Sequences:

>Translated_442_residues
MKFIQYKTTKKDKFLKNEINNTTDERPTLHVSQIENRIIEGFGSCFNELGMKALNHLDKDERNKVLDQLFSTKGDCRFNL
CRMPIGASDYATEWYSYNENENDFDMEKFSIQKDKRLLIPYIKEALKRNPNIILTASPWSPPTWMKTQKAYNFGTLRFEE
KVLKAYALYFVKFVQAYEEEGITIHQVHVQNEVVADQKFPSCRWTGDQLTDFIKNYLGPAFEKHNISSEIWLGTINAPEP
YVEWLEDYTQDFDVYAGLVLRDTKAYKYVKGVGYQWAGKNAIQRSVEAFAEKRFIQTENECGNGKNTWIYAEYVFNLFRH
YIVNGVNGYMYWNAVLEPKGMSTWGWEQNSMITVNPETKEVMYNPEFYVMKHFSHFVQKGAKRLTTSGVDSVDTVAFRNP
DESIIIVISNKNDDSRIANIEFTGEIFEVELEGHSFNTIVLE
>Mature_442_residues
MKFIQYKTTKKDKFLKNEINNTTDERPTLHVSQIENRIIEGFGSCFNELGMKALNHLDKDERNKVLDQLFSTKGDCRFNL
CRMPIGASDYATEWYSYNENENDFDMEKFSIQKDKRLLIPYIKEALKRNPNIILTASPWSPPTWMKTQKAYNFGTLRFEE
KVLKAYALYFVKFVQAYEEEGITIHQVHVQNEVVADQKFPSCRWTGDQLTDFIKNYLGPAFEKHNISSEIWLGTINAPEP
YVEWLEDYTQDFDVYAGLVLRDTKAYKYVKGVGYQWAGKNAIQRSVEAFAEKRFIQTENECGNGKNTWIYAEYVFNLFRH
YIVNGVNGYMYWNAVLEPKGMSTWGWEQNSMITVNPETKEVMYNPEFYVMKHFSHFVQKGAKRLTTSGVDSVDTVAFRNP
DESIIIVISNKNDDSRIANIEFTGEIFEVELEGHSFNTIVLE

Specific function: Unknown

COG id: COG5520

COG function: function code M; O-Glycosyl hydrolase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI284807150, Length=409, Percent_Identity=28.361858190709, Blast_Score=146, Evalue=4e-35,
Organism=Homo sapiens, GI54607043, Length=410, Percent_Identity=28.2926829268293, Blast_Score=145, Evalue=7e-35,
Organism=Homo sapiens, GI54607045, Length=410, Percent_Identity=28.2926829268293, Blast_Score=145, Evalue=7e-35,
Organism=Homo sapiens, GI54607047, Length=410, Percent_Identity=28.2926829268293, Blast_Score=145, Evalue=7e-35,
Organism=Homo sapiens, GI284807152, Length=371, Percent_Identity=28.8409703504043, Blast_Score=135, Evalue=9e-32,
Organism=Caenorhabditis elegans, GI115532472, Length=434, Percent_Identity=30.8755760368664, Blast_Score=167, Evalue=1e-41,
Organism=Caenorhabditis elegans, GI115532470, Length=434, Percent_Identity=30.8755760368664, Blast_Score=166, Evalue=2e-41,
Organism=Caenorhabditis elegans, GI17539758, Length=454, Percent_Identity=28.6343612334802, Blast_Score=163, Evalue=1e-40,
Organism=Caenorhabditis elegans, GI17539756, Length=459, Percent_Identity=28.9760348583878, Blast_Score=160, Evalue=1e-39,
Organism=Caenorhabditis elegans, GI25151335, Length=386, Percent_Identity=30.3108808290155, Blast_Score=151, Evalue=7e-37,
Organism=Caenorhabditis elegans, GI193204210, Length=442, Percent_Identity=27.6018099547511, Blast_Score=149, Evalue=3e-36,
Organism=Caenorhabditis elegans, GI17542884, Length=472, Percent_Identity=26.6949152542373, Blast_Score=147, Evalue=1e-35,
Organism=Drosophila melanogaster, GI21355305, Length=433, Percent_Identity=24.0184757505774, Blast_Score=99, Evalue=7e-21,
Organism=Drosophila melanogaster, GI161078544, Length=429, Percent_Identity=23.5431235431235, Blast_Score=94, Evalue=2e-19,
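
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below shows one way to turn such a line into a typed dictionary. `parse_hit` is a hypothetical helper, and it assumes organism names contain no commas.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed numeric fields."""
    hit = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare identifier, stored without the prefix
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Caenorhabditis elegans, GI115532472, Length=434, "
        "Percent_Identity=30.8755760368664, Blast_Score=167, Evalue=1e-41,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

With the lines parsed this way, the hits can be sorted by `Evalue` or `Blast_Score` to rank them.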

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: 3.2.1.45

Molecular weight: Translated: 51569; Mature: 51569
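
A molecular weight of this kind is usually estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that approach and uses commonly tabulated average residue masses. The tool behind this record may use slightly different values, so a small difference from the listed figure would not be surprising.

```python
# Average residue masses in daltons (commonly tabulated values; an assumption here).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water molecule accounts for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Sum of average residue masses plus one water, in daltons."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


print(round(molecular_weight("MKFIQYKT"), 1))  # first eight residues of the protein
```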

Theoretical pI: Translated: 5.54; Mature: 5.54

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
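
These percentages are residue counts divided by the protein length. A hand count of the protein sequence in this record gives 5 Cys and 10 Met among 442 residues, which rounds to the listed values. The sketch below shows the calculation; `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# Arithmetic check against the listed values (counts taken by hand from the record).
print(round(100.0 * 5 / 442, 1))   # 1.1  %Cys
print(round(100.0 * 10 / 442, 1))  # 2.3  %Met
print(round(100.0 * 15 / 442, 1))  # 3.4  %Cys+Met

# The function applied to a toy peptide (hypothetical data).
print(residue_percent("MCAAAAAAAA", "CM"))  # 20.0
```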

Secondary structure:

>Translated Secondary Structure
MKFIQYKTTKKDKFLKNEINNTTDERPTLHVSQIENRIIEGFGSCFNELGMKALNHLDKD
CCCEEECCCCHHHHHHHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
ERNKVLDQLFSTKGDCRFNLCRMPIGASDYATEWYSYNENENDFDMEKFSIQKDKRLLIP
HHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHCCCCCCCCCCCCHHHHHHCCCCEEECH
YIKEALKRNPNIILTASPWSPPTWMKTQKAYNFGTLRFEEKVLKAYALYFVKFVQAYEEE
HHHHHHHCCCCEEEEECCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC
GITIHQVHVQNEVVADQKFPSCRWTGDQLTDFIKNYLGPAFEKHNISSEIWLGTINAPEP
CCEEEEEEECCHHHCCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCCCEEEEEECCCCHH
YVEWLEDYTQDFDVYAGLVLRDTKAYKYVKGVGYQWAGKNAIQRSVEAFAEKRFIQTENE
HHHHHHHHHHCHHHHHCEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC
CGNGKNTWIYAEYVFNLFRHYIVNGVNGYMYWNAVLEPKGMSTWGWEQNSMITVNPETKE
CCCCCCCEEEHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCEEEECCCCCE
VMYNPEFYVMKHFSHFVQKGAKRLTTSGVDSVDTVAFRNPDESIIIVISNKNDDSRIANI
EEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEEEEEEECCCCCCEEEEE
EFTGEIFEVELEGHSFNTIVLE
EEECEEEEEEECCCCCEEEEEC
>Mature Secondary Structure
MKFIQYKTTKKDKFLKNEINNTTDERPTLHVSQIENRIIEGFGSCFNELGMKALNHLDKD
CCCEEECCCCHHHHHHHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
ERNKVLDQLFSTKGDCRFNLCRMPIGASDYATEWYSYNENENDFDMEKFSIQKDKRLLIP
HHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHCCCCCCCCCCCCHHHHHHCCCCEEECH
YIKEALKRNPNIILTASPWSPPTWMKTQKAYNFGTLRFEEKVLKAYALYFVKFVQAYEEE
HHHHHHHCCCCEEEEECCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC
GITIHQVHVQNEVVADQKFPSCRWTGDQLTDFIKNYLGPAFEKHNISSEIWLGTINAPEP
CCEEEEEEECCHHHCCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCCCEEEEEECCCCHH
YVEWLEDYTQDFDVYAGLVLRDTKAYKYVKGVGYQWAGKNAIQRSVEAFAEKRFIQTENE
HHHHHHHHHHCHHHHHCEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC
CGNGKNTWIYAEYVFNLFRHYIVNGVNGYMYWNAVLEPKGMSTWGWEQNSMITVNPETKE
CCCCCCCEEEHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCEEEECCCCCE
VMYNPEFYVMKHFSHFVQKGAKRLTTSGVDSVDTVAFRNPDESIIIVISNKNDDSRIANI
EEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEEEEEEECCCCCCEEEEE
EFTGEIFEVELEGHSFNTIVLE
EEECEEEEEEECCCCCEEEEEC
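
The secondary structure blocks interleave sequence lines with state lines, where H, E and C presumably denote helix, strand and coil (the usual convention, though the record does not say so). The sketch below summarises a state string as percentages. It is a hypothetical helper and expects only the state lines, joined into one string.

```python
from collections import Counter


def ss_composition(states: str) -> dict[str, float]:
    """Percentage of H, E and C states in a secondary structure string."""
    counts = Counter(states)
    unexpected = set(counts) - set("HEC")
    if not states or unexpected:
        raise ValueError(f"expected a non-empty string of H/E/C, got {sorted(unexpected)}")
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


def state_lines(block: list[str]) -> str:
    """Join every second line of an interleaved block (the state lines)."""
    return "".join(line.strip() for line in block[1::2])


# Toy interleaved block (hypothetical data).
block = ["MKFIQ", "HHHHE", "YKTTK", "ECCCC"]
print(ss_composition(state_lines(block)))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```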

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: D-glucosyl-N-acylsphingosine; H2O

Specific reaction: D-glucosyl-N-acylsphingosine + H2O = D-glucose + N-acylsphingosine
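
The reaction equation carries the same information as the Substrates and Products fields. The sketch below splits the equation on ` = ` and ` + ` so the two can be cross-checked. It assumes those separators are always surrounded by spaces; `parse_reaction` is an illustrative name.

```python
def parse_reaction(equation: str) -> tuple[list[str], list[str]]:
    """Split 'A + B = C + D' into (substrates, products)."""
    left, right = equation.split(" = ", 1)

    def terms(side: str) -> list[str]:
        return [t.strip() for t in side.split(" + ")]

    return terms(left), terms(right)


substrates, products = parse_reaction(
    "D-glucosyl-N-acylsphingosine + H2O = D-glucose + N-acylsphingosine"
)
print(substrates)  # ['D-glucosyl-N-acylsphingosine', 'H2O']
print(products)    # ['D-glucose', 'N-acylsphingosine']
```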

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA