Definition Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession NC_012563
Length 4,155,278

The map label for this gene is yugT [H]

Identifier: 226948933

GI number: 226948933

Start: 1880285

End: 1881949

Strand: Direct

Name: yugT [H]

Synonym: CLM_1846

Alternate gene names: 226948933

Gene position: 1880285-1881949 (Clockwise)

Preceding gene: 226948932

Following gene: 226948934

Centisome position: 45.25
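
The coordinate fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record does not state either definition, but both reproduce the listed values. The function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(1880285, 1881949)
print(length)                                   # 1665 bases
print(length // 3 - 1)                          # 554 residues (555 codons minus the stop codon)
print(round(centisome(1880285, 4155278), 2))    # 45.25
```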

GC content: 28.23

Gene sequence:

>1665_bases
ATGAATAAGACATGGTGGAAAGAAGCTGTTGCATATCAAATATATCCAAGAAGTTTTAAGGATTCAAATGATGATGGTAT
AGGAGATATAGAGGGTATAATTTCAAAATTAGATTATTTAAAAGATTTAGGTATAGATATAATTTGGATTTGTCCTATGT
ATAAGTCTCCAAATGATGACAACGGTTACGATATAAGCGATTATAAAGCTATAATGGACGAATTTGGAACTATGGAGGAT
TTTGATAAGTTACTACAAAAGGCCCATGAAAAAGGTATGAAACTTATAATAGATTTAGTTATAAATCATACCAGTGATGA
GCATAAATGGTTCATTGAATCAAGATCTTCTAAAGATAATCCCAAACGTGATTTTTATATATGGCGTGATGGTAAAGATG
GAAAGGAACCTAACAATTGGGAAAGTATATTTAAAGGTTCTGCCTGGGAATATGACTATAATACAGAACAATATTTTCTT
CACTTATTTAGTAAAAAGCAACCAGATTTAAATTGGGAAAATGAAAATGTTAGAAAAGAATTATATAAAATGATTAACTG
GTGGTTAGATAAGGGAATTGATGGATTTAGAGTAGATGCTATAAGTCATATAAAAAAAGAAAAGGGATTAAAAGACATAC
ATAATCCAAAAAATTTAGATTATGTTCCTTCTTTTGAAAAACATATGAATGTAGAGGGAATTCAAAAATATCTTAAAGAA
TTAAAGGAAAATACCTTTGATAAATATGACATAATAACTGTGGGAGAAGCCAATGGAGTAAATATAAGTCAAGCTCCTCA
ATGGGTAGGAGAAAAAGATGGCAAATTTAATATGATATTTCAGTTTGAACATCTAGATCTTTGGGATGTAGATCATAAAG
AACAGTCTACAATAAAAAAATTAAAAGAAGTATTAAGCAAATGGCAGGAAGGTTTAGAAGGAGTTGGATGGAATGCTTTG
TTTATAGAAAATCATGATATTCAAAGGGTAGTCTCAACTTTAGGAGATGATAAAAACTTTTGGGAAGAAAGTTCAAAAGC
CTTAGCTCTTATGTATTTTATGCAAAAGGGGACTCCATTTATATATCAAGGACAAGAAATAGGAATGACTAATGTTAAAT
TTGAAGGTATTGAAGATTATAATGATATAAAAACTATAAATATTTATAAAGAAAAAATAAGAAAAGGTATACCAAAAGAT
CAGGCCCTCAAATATGTATGGGAAACTTCAAGAGATAACTCAAGAACACCAATGCAATGGGATACCACAGAAAATGCTGG
ATTTTCAAAAGAAAAACCTTGGATGAAAGTTAATCCGAACTATGTAGATATAAACGCTAGGGAACAAGAAAATAACCTAA
ATTCTATTTTGAACTTTTATAAAAAAATTATAAGAGTAAAGAAAGAAAATGAGGCACTTATATATGGAAAATATAATTTG
ATTTTAGCGCATCATGAACAAATATATGCTTACACAAGAACTTTACGAAATGAAAAATTTATAGTAATTGCTAATTTAAC
AAATAAGGAAGCTAAATATACTTATAAAAGAGAAAAACTAAATTATAAAGGATTGATAATTTCAAACTATTCAATAGAGA
AACATGAGGATATAACAGAAATATTATTAAAGCCTTTTGAAGCGAGACTTTATAAAATAGTTTGA
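
A GC content figure such as the one listed above can be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the whole 1665-base gene (28.23% of 1665 corresponds to about 470 G or C bases). The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in wrapped FASTA text) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Tiny illustrative input; pass the full gene sequence to check the record.
print(gc_percent("ATGC"))  # 50.0
```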

Upstream 100 bases:

>100_bases
AAGGTATTATTTTTTTATGTTTTGCGCAAACGATTGCTCAAATATTTATCAAGAGTATGCAAAAATTACTTAAAACTATA
TTACAAATGGAGGATTATTA

Downstream 100 bases:

>100_bases
AATTTAAATTAATATAGATTATAAATATTGGGGGGATAAAAATGAAAAGGAATAAGCTAGTATCTTTTGATTTTTGGCAA
AAATTTGGAAAGACACTATT

Product: glycosyl hydrolase, family 13

Products: NA

Alternate protein names: Oligosaccharide alpha-1,6-glucosidase 3; Sucrase-isomaltase 3; Isomaltase 3 [H]

Number of amino acids: Translated: 554; Mature: 554

Protein sequence:

>554_residues
MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDDNGYDISDYKAIMDEFGTMED
FDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDNPKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFL
HLFSKKQPDLNWENENVRKELYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE
LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKKLKEVLSKWQEGLEGVGWNAL
FIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPFIYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKD
QALKYVWETSRDNSRTPMQWDTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL
ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITEILLKPFEARLYKIV
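
The protein sequence above follows from the gene sequence by codon-wise translation. The sketch below uses the standard genetic code, an assumption that is reasonable for a bacterial gene because the bacterial table differs from the standard one only in its permitted start codons. It checks the first eight and last six codons of the gene against the protein. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence codon by codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


print(translate("ATGAATAAGACATGGTGGAAAGAA"))  # MNKTWWKE (start of the protein)
print(translate("CTTTATAAAATAGTTTGA"))        # LYKIV*   (end of the protein plus stop)
```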

Sequences:

>Translated_554_residues
MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDDNGYDISDYKAIMDEFGTMED
FDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDNPKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFL
HLFSKKQPDLNWENENVRKELYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE
LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKKLKEVLSKWQEGLEGVGWNAL
FIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPFIYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKD
QALKYVWETSRDNSRTPMQWDTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL
ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITEILLKPFEARLYKIV
>Mature_554_residues
MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDDNGYDISDYKAIMDEFGTMED
FDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDNPKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFL
HLFSKKQPDLNWENENVRKELYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE
LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKKLKEVLSKWQEGLEGVGWNAL
FIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPFIYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKD
QALKYVWETSRDNSRTPMQWDTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL
ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITEILLKPFEARLYKIV

Specific function: Unknown

COG id: COG0366

COG function: function code G; Glycosidases

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 13 family [H]

Homologues:

Organism=Homo sapiens, GI187423904, Length=528, Percent_Identity=32.5757575757576, Blast_Score=261, Evalue=1e-69,
Organism=Escherichia coli, GI1790687, Length=551, Percent_Identity=39.3829401088929, Blast_Score=435, Evalue=1e-123,
Organism=Escherichia coli, GI1786604, Length=387, Percent_Identity=24.031007751938, Blast_Score=92, Evalue=1e-19,
Organism=Escherichia coli, GI87081873, Length=284, Percent_Identity=26.056338028169, Blast_Score=66, Evalue=5e-12,
Organism=Caenorhabditis elegans, GI25147709, Length=492, Percent_Identity=24.1869918699187, Blast_Score=125, Evalue=6e-29,
Organism=Caenorhabditis elegans, GI32565753, Length=202, Percent_Identity=32.1782178217822, Blast_Score=119, Evalue=4e-27,
Organism=Saccharomyces cerevisiae, GI6322245, Length=578, Percent_Identity=39.7923875432526, Blast_Score=391, Evalue=1e-109,
Organism=Saccharomyces cerevisiae, GI6319776, Length=584, Percent_Identity=39.554794520548, Blast_Score=372, Evalue=1e-103,
Organism=Saccharomyces cerevisiae, GI6321731, Length=584, Percent_Identity=39.3835616438356, Blast_Score=371, Evalue=1e-103,
Organism=Saccharomyces cerevisiae, GI6324416, Length=576, Percent_Identity=39.2361111111111, Blast_Score=364, Evalue=1e-101,
Organism=Saccharomyces cerevisiae, GI6322241, Length=576, Percent_Identity=39.0625, Blast_Score=360, Evalue=1e-100,
Organism=Saccharomyces cerevisiae, GI6322021, Length=576, Percent_Identity=39.0625, Blast_Score=360, Evalue=1e-100,
Organism=Saccharomyces cerevisiae, GI6321726, Length=576, Percent_Identity=38.3680555555556, Blast_Score=358, Evalue=1e-100,
Organism=Drosophila melanogaster, GI24583745, Length=526, Percent_Identity=35.361216730038, Blast_Score=296, Evalue=3e-80,
Organism=Drosophila melanogaster, GI24583747, Length=527, Percent_Identity=35.2941176470588, Blast_Score=293, Evalue=3e-79,
Organism=Drosophila melanogaster, GI24583749, Length=527, Percent_Identity=35.2941176470588, Blast_Score=293, Evalue=3e-79,
Organism=Drosophila melanogaster, GI221330053, Length=598, Percent_Identity=31.2709030100334, Blast_Score=275, Evalue=7e-74,
Organism=Drosophila melanogaster, GI24586589, Length=595, Percent_Identity=31.2605042016807, Blast_Score=262, Evalue=5e-70,
Organism=Drosophila melanogaster, GI24586597, Length=594, Percent_Identity=31.8181818181818, Blast_Score=262, Evalue=6e-70,
Organism=Drosophila melanogaster, GI24586599, Length=535, Percent_Identity=31.7757009345794, Blast_Score=260, Evalue=2e-69,
Organism=Drosophila melanogaster, GI24586593, Length=543, Percent_Identity=32.0441988950276, Blast_Score=259, Evalue=2e-69,
Organism=Drosophila melanogaster, GI24586587, Length=584, Percent_Identity=30.8219178082192, Blast_Score=257, Evalue=2e-68,
Organism=Drosophila melanogaster, GI45549022, Length=534, Percent_Identity=34.0823970037453, Blast_Score=255, Evalue=5e-68,
Organism=Drosophila melanogaster, GI24586591, Length=204, Percent_Identity=51.4705882352941, Blast_Score=227, Evalue=2e-59,
Organism=Drosophila melanogaster, GI281360393, Length=488, Percent_Identity=29.0983606557377, Blast_Score=186, Evalue=4e-47,
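
Homologue lines in the layout above can be turned into structured records for sorting or filtering. This is a rough sketch that assumes every line uses the same comma-separated `key=value` layout with a bare `GI...` token and an optional trailing comma. The function name `parse_homologue` is hypothetical.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            record[key] = value
        elif token.startswith("GI"):
            record["GI"] = token[2:]
    # Convert numeric fields when they are present.
    for key, cast in (("Length", int), ("Blast_Score", int),
                      ("Percent_Identity", float), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1790687, Length=551, "
        "Percent_Identity=39.3829401088929, Blast_Score=435, Evalue=1e-123,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Records parsed this way can be ranked by `Blast_Score` or filtered by `Evalue`.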

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013780
- InterPro:   IPR006047
- InterPro:   IPR006589
- InterPro:   IPR017853
- InterPro:   IPR013781 [H]

Pfam domain/function: PF00128 Alpha-amylase [H]

EC number: 3.2.1.10 [H]

Molecular weight: Translated: 65701; Mature: 65701

Theoretical pI: Translated: 6.23; Mature: 6.23

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
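
The percentages above are residue counts divided by the protein length. For 554 residues, 1 Cys and 13 Met give 0.2% and 2.3% when rounded to one decimal place. A minimal sketch of that calculation follows; the helper name `residue_percent` is illustrative and the demo input is a made-up peptide.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return 100.0 * hits / len(protein)


# Made-up five-residue peptide, used only to show the call pattern.
print(residue_percent("MKCLM", "C"))    # 20.0
print(residue_percent("MKCLM", "M"))    # 40.0
print(residue_percent("MKCLM", "CM"))   # 60.0

# Arithmetic consistent with the record: 1 Cys and 13 Met in 554 residues.
print(round(100 * 1 / 554, 1), round(100 * 13 / 554, 1), round(100 * 14 / 554, 1))
```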

Secondary structure:

>Translated Secondary Structure
MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDD
CCCHHHHHHHEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCC
NGYDISDYKAIMDEFGTMEDFDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDN
CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHEEEHHHHCCCCCCCEEEEECCCCCCC
PKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFLHLFSKKQPDLNWENENVRKE
CCCCEEEEECCCCCCCCCCHHHHHCCCCEECCCCHHHHHHHHHHCCCCCCCCCCHHHHHH
LYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE
HHHHHHHHHHCCCCCEEHHHHHHHHHHCCCHHHCCCCCCCCCCCHHHHCCHHHHHHHHHH
LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKK
HHHCCCCCEEEEEECCCCCCCCCCCCHHCCCCCCCEEEEEEEECCCEECCCCCHHHHHHH
LKEVLSKWQEGLEGVGWNALFIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPF
HHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCE
IYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKDQALKYVWETSRDNSRTPMQW
EEECCCCCCCCEEECCCCCCCCCEEHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCEEE
DTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL
CCCCCCCCCCCCCCEEECCCEEEEECCHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEE
ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITE
EEEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHEEHHHCCCCCEEEEECCCCHHHHHHHH
ILLKPFEARLYKIV
HHHHHHHHHHHCCC
>Mature Secondary Structure
MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDD
CCCHHHHHHHEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCC
NGYDISDYKAIMDEFGTMEDFDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDN
CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHEEEHHHHCCCCCCCEEEEECCCCCCC
PKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFLHLFSKKQPDLNWENENVRKE
CCCCEEEEECCCCCCCCCCHHHHHCCCCEECCCCHHHHHHHHHHCCCCCCCCCCHHHHHH
LYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE
HHHHHHHHHHCCCCCEEHHHHHHHHHHCCCHHHCCCCCCCCCCCHHHHCCHHHHHHHHHH
LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKK
HHHCCCCCEEEEEECCCCCCCCCCCCHHCCCCCCCEEEEEEEECCCEECCCCCHHHHHHH
LKEVLSKWQEGLEGVGWNALFIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPF
HHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCE
IYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKDQALKYVWETSRDNSRTPMQW
EEECCCCCCCCEEECCCCCCCCCEEHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCEEE
DTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL
CCCCCCCCCCCCCCEEECCCEEEEECCHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEE
ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITE
EEEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHEEHHHCCCCCEEEEECCCCHHHHHHHH
ILLKPFEARLYKIV
HHHHHHHHHHHCCC
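
The prediction strings above can be summarised as per-state fractions. The sketch below assumes the conventional three-state alphabet (H = helix, E = strand, C = coil), which the record itself does not define. The function name is hypothetical.

```python
from collections import Counter


def state_percentages(prediction: str) -> dict:
    """Percentage of positions assigned to each secondary-structure state."""
    prediction = "".join(prediction.split()).upper()
    if not prediction:
        raise ValueError("empty prediction")
    counts = Counter(prediction)
    return {state: 100.0 * n / len(prediction) for state, n in counts.items()}


# Last prediction line of the record (14 positions): 11 H and 3 C.
print(state_percentages("HHHHHHHHHHHCCC"))
```

To summarise the whole protein, concatenate only the prediction lines (every second line of the block) before calling the function.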

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9274030; 9384377 [H]