| Definition | Clostridium botulinum A str. Hall, complete genome. |
|---|---|
| Accession | NC_009698 |
| Length | 3,760,560 |
Click here to switch to the map view.
The map label for this gene is yugT [H]
Identifier: 153936226
GI number: 153936226
Start: 1708069
End: 1709733
Strand: Direct
Name: yugT [H]
Synonym: CLC_1632
Alternate gene names: 153936226
Gene position: 1708069-1709733 (Clockwise)
Preceding gene: 153935077
Following gene: 153935681
Centisome position: 45.42
GC content: 28.17
Gene sequence:
>1665_bases ATGAATAAGACATGGTGGAAAGAAGCTGTTGCATATCAAATATATCCAAGAAGTTTTAAGGATTCAAATGATGATGGTAT AGGAGATATAGAGGGTATAATTTCAAAATTAGATTATTTAAAAGATTTAGGTATAGATATAATTTGGATTTGTCCTATGT ATAAGTCTCCAAATGATGACAACGGTTACGATATAAGCGATTATAAAGCTATAATGGACGAATTTGGAACTATGGAGGAT TTTGATAAGTTACTACAAAAGGCCCATGAAAAAGGTATGAAGCTTATAATAGATTTAGTTATAAATCATACCAGTGATGA GCATAAATGGTTCATTGAATCAAGATCTTCTAAAGATAATCCAAAACGTGATTTTTATATATGGCGTGATGGTAAAGATG GAAAGGAACCTAACAATTGGGAAAGTATATTTAAAGGTTCTGCCTGGGAATATGACTATAATACAGAACAATATTTTCTT CACTTATTTAGTAAAAAGCAACCAGATTTAAATTGGGAAAATGAAAATGTTAGAAATGAATTATATAAAATGATTAACTG GTGGTTAGATAAGGGAATTGATGGATTTAGAGTAGATGCTATAAGTCATATAAAAAAAGAAAAGGGATTAAAAGACATAC ATAATCCAAAAAATTTAGATTATGTTCCTTCTTTTGAAAAACATATGAATGTAGAGGGAATTCAAAAATATCTTAAAGAA TTAAAGGAAAATACCTTTGATAAATATGACATAATAACTGTGGGAGAAGCCAATGGAGTAAATATAAGTCAAGCTCCTCA ATGGGTAGGAGAAAAAGATGGCAAATTTAATATGATATTTCAGTTTGAACATCTAGATCTTTGGGATGTAGATCATAAAG AACAGTCTACAATAAAAAAATTAAAAGAAGTATTAAGCAAATGGCAGGAAGGTTTAGAAGGAGTTGGATGGAATGCTTTG TTTATAGAAAATCATGATATTCAAAGGGTAGTCTCAACTTTAGGAGATGATAAAAACTTTTGGGAAGAAAGTTCAAAAGC CTTAGCTCTTATGTATTTTATGCAAAAGGGGACTCCATTTATATATCAAGGACAAGAAATAGGAATGACTAATGTTAAAT TTGAAGGTATTGAAGATTATAATGATATAAAAACTATAAATATTTATAAAGAAAAAATAAGAAAAGGTATACCAAAAGAT CAAGCCCTCAAATATGTATGGGAAACTTCAAGAGATAACTCAAGAACACCAATGCAATGGGATACCACAGAAAATGCTGG ATTTTCAAAAGAAAAACCTTGGATGAAAGTTAATCCGAACTATGTAGATATAAACGCTAGGGAACAAGAAAATAACCTAA ATTCTATTTTGAACTTTTATAAAAAAATTATAAGAGTAAAGAAAGAAAATGAGGCACTTATATATGGAAAATATAATTTG ATTTTAGCGCATCATGAACAAATATATGCTTACACAAGAACTTTACGAAATGAAAAATTTATAGTAATTGCTAATTTAAC AAATAAGGAAGCTAAATATACTTATAAAAGAGAAAAACTAAATTATAAAGGATTGATAATTTCAAACTATTCAATAGAGA AACATGAGGATATAACAGAAATATTATTAAAGCCTTTTGAAGCGAGACTTTATAAAATAGTTTGA
Upstream 100 bases:
>100_bases AAGGTATTATTTTTTTATGTTTTGCGCAAACGATTGCTCAAATATTTATCAAGAGTATGCAAAAATTACTTAAAACTATA TTACAAATGGAGGATTATTA
Downstream 100 bases:
>100_bases AATTTAAATTAATATAGATTATAAATATTGGGGGGATAAAAATGAAAAGGAATAAGCTAGTATCTTTTGATTTTTGGCAA AAATTTGGAAAGACACTATT
Product: glycosy hydrolase family protein
Products: NA
Alternate protein names: Oligosaccharide alpha-1,6-glucosidase 3; Sucrase-isomaltase 3; Isomaltase 3 [H]
Number of amino acids: Translated: 554; Mature: 554
Protein sequence:
>554_residues MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDDNGYDISDYKAIMDEFGTMED FDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDNPKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFL HLFSKKQPDLNWENENVRNELYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKKLKEVLSKWQEGLEGVGWNAL FIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPFIYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKD QALKYVWETSRDNSRTPMQWDTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITEILLKPFEARLYKIV
Sequences:
>Translated_554_residues MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDDNGYDISDYKAIMDEFGTMED FDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDNPKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFL HLFSKKQPDLNWENENVRNELYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKKLKEVLSKWQEGLEGVGWNAL FIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPFIYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKD QALKYVWETSRDNSRTPMQWDTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITEILLKPFEARLYKIV >Mature_554_residues MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDDNGYDISDYKAIMDEFGTMED FDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDNPKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFL HLFSKKQPDLNWENENVRNELYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKKLKEVLSKWQEGLEGVGWNAL FIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPFIYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKD QALKYVWETSRDNSRTPMQWDTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITEILLKPFEARLYKIV
Specific function: Unknown
COG id: COG0366
COG function: function code G; Glycosidases
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 13 family [H]
Homologues:
Organism=Homo sapiens, GI187423904, Length=528, Percent_Identity=32.5757575757576, Blast_Score=260, Evalue=3e-69, Organism=Escherichia coli, GI1790687, Length=551, Percent_Identity=39.3829401088929, Blast_Score=434, Evalue=1e-123, Organism=Escherichia coli, GI1786604, Length=387, Percent_Identity=24.2894056847545, Blast_Score=94, Evalue=2e-20, Organism=Caenorhabditis elegans, GI25147709, Length=492, Percent_Identity=24.1869918699187, Blast_Score=124, Evalue=9e-29, Organism=Caenorhabditis elegans, GI32565753, Length=202, Percent_Identity=32.1782178217822, Blast_Score=119, Evalue=3e-27, Organism=Saccharomyces cerevisiae, GI6322245, Length=578, Percent_Identity=39.6193771626298, Blast_Score=389, Evalue=1e-109, Organism=Saccharomyces cerevisiae, GI6319776, Length=584, Percent_Identity=39.554794520548, Blast_Score=370, Evalue=1e-103, Organism=Saccharomyces cerevisiae, GI6321731, Length=584, Percent_Identity=39.3835616438356, Blast_Score=370, Evalue=1e-103, Organism=Saccharomyces cerevisiae, GI6324416, Length=576, Percent_Identity=39.0625, Blast_Score=362, Evalue=1e-101, Organism=Saccharomyces cerevisiae, GI6322241, Length=576, Percent_Identity=38.8888888888889, Blast_Score=358, Evalue=1e-99, Organism=Saccharomyces cerevisiae, GI6322021, Length=576, Percent_Identity=38.8888888888889, Blast_Score=358, Evalue=1e-99, Organism=Saccharomyces cerevisiae, GI6321726, Length=576, Percent_Identity=38.1944444444444, Blast_Score=357, Evalue=4e-99, Organism=Drosophila melanogaster, GI24583745, Length=526, Percent_Identity=35.171102661597, Blast_Score=293, Evalue=1e-79, Organism=Drosophila melanogaster, GI24583747, Length=527, Percent_Identity=35.2941176470588, Blast_Score=292, Evalue=5e-79, Organism=Drosophila melanogaster, GI24583749, Length=527, Percent_Identity=35.2941176470588, Blast_Score=292, Evalue=5e-79, Organism=Drosophila melanogaster, GI221330053, Length=598, Percent_Identity=31.2709030100334, Blast_Score=275, Evalue=8e-74, Organism=Drosophila melanogaster, GI24586589, Length=595, Percent_Identity=31.2605042016807, Blast_Score=261, Evalue=8e-70, Organism=Drosophila melanogaster, GI24586597, Length=597, Percent_Identity=31.4907872696817, Blast_Score=261, Evalue=9e-70, Organism=Drosophila melanogaster, GI24586599, Length=535, Percent_Identity=31.7757009345794, Blast_Score=260, Evalue=2e-69, Organism=Drosophila melanogaster, GI24586593, Length=543, Percent_Identity=32.0441988950276, Blast_Score=259, Evalue=3e-69, Organism=Drosophila melanogaster, GI45549022, Length=534, Percent_Identity=34.2696629213483, Blast_Score=258, Evalue=9e-69, Organism=Drosophila melanogaster, GI24586587, Length=584, Percent_Identity=30.8219178082192, Blast_Score=256, Evalue=2e-68, Organism=Drosophila melanogaster, GI24586591, Length=204, Percent_Identity=51.4705882352941, Blast_Score=227, Evalue=2e-59, Organism=Drosophila melanogaster, GI281360393, Length=488, Percent_Identity=29.0983606557377, Blast_Score=186, Evalue=5e-47,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013780 - InterPro: IPR006047 - InterPro: IPR006589 - InterPro: IPR017853 - InterPro: IPR013781 [H]
Pfam domain/function: PF00128 Alpha-amylase [H]
EC number: =3.2.1.10 [H]
Molecular weight: Translated: 65687; Mature: 65687
Theoretical pI: Translated: 6.10; Mature: 6.10
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein) 2.3 %Met (Translated Protein) 2.5 %Cys+Met (Translated Protein) 0.2 %Cys (Mature Protein) 2.3 %Met (Mature Protein) 2.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDD CCCHHHHHHHEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCC NGYDISDYKAIMDEFGTMEDFDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDN CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHEEEHHHHCCCCCCCEEEEECCCCCCC PKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFLHLFSKKQPDLNWENENVRNE CCCCEEEEECCCCCCCCCCHHHHHCCCCEECCCCHHHHHHHHHHCCCCCCCCCCCHHHHH LYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE HHHHHHHHHHCCCCCEEHHHHHHHHHHCCCHHHCCCCCCCCCCCHHHHCCHHHHHHHHHH LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKK HHHCCCCCEEEEEECCCCCCCCCCCCHHCCCCCCCEEEEEEEECCCEECCCCCHHHHHHH LKEVLSKWQEGLEGVGWNALFIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPF HHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCE IYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKDQALKYVWETSRDNSRTPMQW EEECCCCCCCCEEECCCCCCCCCEEHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCEEE DTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL CCCCCCCCCCCCCCEEECCCEEEEECCHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEE ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITE EEEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHEEHHHCCCCCEEEEECCCCHHHHHHHH ILLKPFEARLYKIV HHHHHHHHHHHCCC >Mature Secondary Structure MNKTWWKEAVAYQIYPRSFKDSNDDGIGDIEGIISKLDYLKDLGIDIIWICPMYKSPNDD CCCHHHHHHHEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCC NGYDISDYKAIMDEFGTMEDFDKLLQKAHEKGMKLIIDLVINHTSDEHKWFIESRSSKDN CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHEEEHHHHCCCCCCCEEEEECCCCCCC PKRDFYIWRDGKDGKEPNNWESIFKGSAWEYDYNTEQYFLHLFSKKQPDLNWENENVRNE CCCCEEEEECCCCCCCCCCHHHHHCCCCEECCCCHHHHHHHHHHCCCCCCCCCCCHHHHH LYKMINWWLDKGIDGFRVDAISHIKKEKGLKDIHNPKNLDYVPSFEKHMNVEGIQKYLKE HHHHHHHHHHCCCCCEEHHHHHHHHHHCCCHHHCCCCCCCCCCCHHHHCCHHHHHHHHHH LKENTFDKYDIITVGEANGVNISQAPQWVGEKDGKFNMIFQFEHLDLWDVDHKEQSTIKK HHHCCCCCEEEEEECCCCCCCCCCCCHHCCCCCCCEEEEEEEECCCEECCCCCHHHHHHH LKEVLSKWQEGLEGVGWNALFIENHDIQRVVSTLGDDKNFWEESSKALALMYFMQKGTPF HHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCE IYQGQEIGMTNVKFEGIEDYNDIKTINIYKEKIRKGIPKDQALKYVWETSRDNSRTPMQW EEECCCCCCCCEEECCCCCCCCCEEHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCEEE DTTENAGFSKEKPWMKVNPNYVDINAREQENNLNSILNFYKKIIRVKKENEALIYGKYNL CCCCCCCCCCCCCCEEECCCEEEEECCHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEE ILAHHEQIYAYTRTLRNEKFIVIANLTNKEAKYTYKREKLNYKGLIISNYSIEKHEDITE EEEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHEEHHHCCCCCEEEEECCCCHHHHHHHH ILLKPFEARLYKIV HHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9274030; 9384377 [H]