Definition | Bacteroides vulgatus ATCC 8482 chromosome, complete genome. |
---|---|
Accession | NC_009614 |
Length | 5,163,189 |
Click here to switch to the map view.
The map label for this gene is yicI [H]
Identifier: 150005448
GI number: 150005448
Start: 3721756
End: 3724002
Strand: Reverse
Name: yicI [H]
Synonym: BVU_2931
Alternate gene names: 150005448
Gene position: 3724002-3721756 (Counterclockwise)
Preceding gene: 150005449
Following gene: 150005447
Centisome position: 72.13
GC content: 49.71
Gene sequence:
>2247_bases ATGAAACCAACCAATTACCACTTATTCGATTTTCTGGACTTTGATCCCGATCTGTCAAGAGACGAATCATTATGGAAAGC ATACAAGCCCACCTCTGTCTATGAAAAAGACGGGGATATCTGTATAAACGTCCCCTTTCAAAAGCAAGTGCTCTCTAACG ATATGGCGCCCGATACGACATCGCCCCGTGAAGAGTATACGCTGGTCATCCGTCAGTACACCTCAGGTATCACCCGGCTC TTTATCGGATTCGGTGAAGAAAGCATGACGGATCAGTCGGAAATGCTCCAATTCAGTGACCGGGTAAAGAAACTCCCTTT GCAAGTGACACAAACAGAAGGGGAATGGTTGATCACTACCCAAGACGGAATCCAACGGGCACTTATCCATGTAAAGCCCC CGGTACTGGACCGTTGGAGCGAACTGCTGCCCGATCCTCAAGAGACCTTGGATCTTCGCCTTTACCCCGACGGCAAACGT GAAATCCGACTGGCTGCCTACGACCACTTCTCCCCTCCCCGTTACGATGCGCTGCCACTGGCTTTCTGCAAACGAAACGG AGTGAAGGAACGTGCCACCCTTTCCTTTGAATCAAAACCCGATGAATGTTTTGCCGGAACAGGGGAACGCTTTGCCAAAA TGGACTTAAGCGGACAAACGTTCTTCTTAAAGAACCAGGACGGACAAGGTGTGAACAACCGCCGTACCTATAAGAATATT CCTTTCTACCTCTCCAGCCGGATGTATGGAACATTCTATCACACCTGTGCCCACAGCAAACTGTCACTGGCCGGGCAATC CACCCGCTCGGTACAGTTCCTAAGCGACCAGGCCATGCTGGATGTTTTCGTCATAGCAGGCGACACGATGGAAGAAATCC TCCGGGGTTATCGGGATCTGACCGGATATCCTTCCATGCCTCCCCTCTGGAGTTTCGGCATCTGGATGAGCCGCATGACC TACTTCAGCGCTGATGAAGTAAATGAGATATGCGACCGGATGCGTGCCGAACATTACCCCTGCGATGTCATCCATCTGGA TACCGGCTGGTTCAAGACCGACTGGCTGTGCGAATGGAAATTCAACGAAGAACGCTTTCCCGATCCCAAAGGGTTCATCC AAGGACTGAAGAAAAAAGGATATCGCGTGTCCTTATGGCAACTCCCCTACGTGGCGGAGAACGCCGAACAGATAGACGAA GCACGCAAAAACGATTATATAGCTCCGCTGACAAAGAAACAAGATTCGGAAGGTTCCAATTTCTCCGCTTTGGACTATGC CGGAACCATTGACTTCACTTATCCCCAAGCAACCGAATGGTATAAAGGACTGCTGAAGAATCTGCTGGATATGGGCGTGA CCTGCATCAAAACCGATTTTGGAGAAAACATCCACATGGATGCCCTCTATAAAGGCATGAAACCCGAATTGCTGAACAAC CTGTATGCATTGCTTTATCAGAAAGCCGCTTATGAAATCACAAAAGACGTAACCGGTGACGGCATCGTATGGGCACGCTC GGCATGGGCGGGATGCCAACGCTATCCCCTGCATTGGGGAGGTGACTCATGCAGTTCATGGGATGGGATGGCAGGCTCGC TGAAAGGCGGCTTGCATTTCGGGCTTTCCGGCTTCGCTTTCTGGAGTCATGATGTCCCCGGATTCCACACATTGCCCAAC TTCATGAACTCTATAGTGGACGATGATGTATATATGCGCTGGACACAATTCGGTGTATTCTCCTCCCACATCCGTTATCA CGGAACAAACAAACGCGAGCCGTGGCATTATCCCGCTATCGCACCTATGATAAAAAAGTGGTGGAAGTTGCGCTATACGC TTATACCTTATATCGTAGAACAAAGCCGTAAGGCCATCGCAAGCGGAGCACCTCTCTTGCAAGCTCTGATTTTTCATCAC CCCGAAGACAAATTGTGCTGGCACATCGACGATGAATACTATTTTGGTAACGACTTCCTAGTAGCCCCGGTCATGAACAG CGAGAACCGCCGGGATGTCTATCTGCCCGAAGGAAAATGGGTGAATTTCTTCACAGGTGAACGCTTGGAAGGAGGACGCT GGCTGAAGAATCTGGATGTCCCTCTGGACGAAATGCCCGTATATGTACGTCAAGGAGCGACCATCCCCGTTTATCCGGAT GAAGTGGAATGCACGGACGATATGGATTTAAGCAAAAGCATCGGTCTACACATAGACCCTCATTTTAAAGGAATATTTAA GAACTGA
Upstream 100 bases:
>100_bases CCGCAAGTAGATGAAAGTATAAGACAAAAGGCAGGCAGGGTCGCTGTAGGTGGGGTGTCACAACCAAGAATCATTATAAA AACAAAATAACAAAAAGAGA
Downstream 100 bases:
>100_bases CGAATATGGAAACTTGGAAAACGAATTTAGACGAGACAAAGAAACGATATATAGATTGGTGGAACCATAAAGGAATCATA CTGAACATGTGGGAGCACTT
Product: alpha-glycosidase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 748; Mature: 748
Protein sequence:
>748_residues MKPTNYHLFDFLDFDPDLSRDESLWKAYKPTSVYEKDGDICINVPFQKQVLSNDMAPDTTSPREEYTLVIRQYTSGITRL FIGFGEESMTDQSEMLQFSDRVKKLPLQVTQTEGEWLITTQDGIQRALIHVKPPVLDRWSELLPDPQETLDLRLYPDGKR EIRLAAYDHFSPPRYDALPLAFCKRNGVKERATLSFESKPDECFAGTGERFAKMDLSGQTFFLKNQDGQGVNNRRTYKNI PFYLSSRMYGTFYHTCAHSKLSLAGQSTRSVQFLSDQAMLDVFVIAGDTMEEILRGYRDLTGYPSMPPLWSFGIWMSRMT YFSADEVNEICDRMRAEHYPCDVIHLDTGWFKTDWLCEWKFNEERFPDPKGFIQGLKKKGYRVSLWQLPYVAENAEQIDE ARKNDYIAPLTKKQDSEGSNFSALDYAGTIDFTYPQATEWYKGLLKNLLDMGVTCIKTDFGENIHMDALYKGMKPELLNN LYALLYQKAAYEITKDVTGDGIVWARSAWAGCQRYPLHWGGDSCSSWDGMAGSLKGGLHFGLSGFAFWSHDVPGFHTLPN FMNSIVDDDVYMRWTQFGVFSSHIRYHGTNKREPWHYPAIAPMIKKWWKLRYTLIPYIVEQSRKAIASGAPLLQALIFHH PEDKLCWHIDDEYYFGNDFLVAPVMNSENRRDVYLPEGKWVNFFTGERLEGGRWLKNLDVPLDEMPVYVRQGATIPVYPD EVECTDDMDLSKSIGLHIDPHFKGIFKN
Sequences:
>Translated_748_residues MKPTNYHLFDFLDFDPDLSRDESLWKAYKPTSVYEKDGDICINVPFQKQVLSNDMAPDTTSPREEYTLVIRQYTSGITRL FIGFGEESMTDQSEMLQFSDRVKKLPLQVTQTEGEWLITTQDGIQRALIHVKPPVLDRWSELLPDPQETLDLRLYPDGKR EIRLAAYDHFSPPRYDALPLAFCKRNGVKERATLSFESKPDECFAGTGERFAKMDLSGQTFFLKNQDGQGVNNRRTYKNI PFYLSSRMYGTFYHTCAHSKLSLAGQSTRSVQFLSDQAMLDVFVIAGDTMEEILRGYRDLTGYPSMPPLWSFGIWMSRMT YFSADEVNEICDRMRAEHYPCDVIHLDTGWFKTDWLCEWKFNEERFPDPKGFIQGLKKKGYRVSLWQLPYVAENAEQIDE ARKNDYIAPLTKKQDSEGSNFSALDYAGTIDFTYPQATEWYKGLLKNLLDMGVTCIKTDFGENIHMDALYKGMKPELLNN LYALLYQKAAYEITKDVTGDGIVWARSAWAGCQRYPLHWGGDSCSSWDGMAGSLKGGLHFGLSGFAFWSHDVPGFHTLPN FMNSIVDDDVYMRWTQFGVFSSHIRYHGTNKREPWHYPAIAPMIKKWWKLRYTLIPYIVEQSRKAIASGAPLLQALIFHH PEDKLCWHIDDEYYFGNDFLVAPVMNSENRRDVYLPEGKWVNFFTGERLEGGRWLKNLDVPLDEMPVYVRQGATIPVYPD EVECTDDMDLSKSIGLHIDPHFKGIFKN >Mature_748_residues MKPTNYHLFDFLDFDPDLSRDESLWKAYKPTSVYEKDGDICINVPFQKQVLSNDMAPDTTSPREEYTLVIRQYTSGITRL FIGFGEESMTDQSEMLQFSDRVKKLPLQVTQTEGEWLITTQDGIQRALIHVKPPVLDRWSELLPDPQETLDLRLYPDGKR EIRLAAYDHFSPPRYDALPLAFCKRNGVKERATLSFESKPDECFAGTGERFAKMDLSGQTFFLKNQDGQGVNNRRTYKNI PFYLSSRMYGTFYHTCAHSKLSLAGQSTRSVQFLSDQAMLDVFVIAGDTMEEILRGYRDLTGYPSMPPLWSFGIWMSRMT YFSADEVNEICDRMRAEHYPCDVIHLDTGWFKTDWLCEWKFNEERFPDPKGFIQGLKKKGYRVSLWQLPYVAENAEQIDE ARKNDYIAPLTKKQDSEGSNFSALDYAGTIDFTYPQATEWYKGLLKNLLDMGVTCIKTDFGENIHMDALYKGMKPELLNN LYALLYQKAAYEITKDVTGDGIVWARSAWAGCQRYPLHWGGDSCSSWDGMAGSLKGGLHFGLSGFAFWSHDVPGFHTLPN FMNSIVDDDVYMRWTQFGVFSSHIRYHGTNKREPWHYPAIAPMIKKWWKLRYTLIPYIVEQSRKAIASGAPLLQALIFHH PEDKLCWHIDDEYYFGNDFLVAPVMNSENRRDVYLPEGKWVNFFTGERLEGGRWLKNLDVPLDEMPVYVRQGATIPVYPD EVECTDDMDLSKSIGLHIDPHFKGIFKN
Specific function: Can catalyzes the transfer of alpha-xylosyl residue from alpha-xyloside to xylose, glucose, mannose, fructose, maltose, isomaltose, nigerose, kojibiose, sucrose and trehalose [H]
COG id: COG1501
COG function: function code G; Alpha-glucosidases, family 31 of glycosyl hydrolases
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 31 family [H]
Homologues:
Organism=Homo sapiens, GI88900491, Length=501, Percent_Identity=27.9441117764471, Blast_Score=167, Evalue=3e-41, Organism=Homo sapiens, GI38202257, Length=524, Percent_Identity=27.4809160305344, Blast_Score=167, Evalue=3e-41, Organism=Homo sapiens, GI66346737, Length=588, Percent_Identity=26.3605442176871, Blast_Score=162, Evalue=8e-40, Organism=Homo sapiens, GI119393895, Length=600, Percent_Identity=24.8333333333333, Blast_Score=149, Evalue=8e-36, Organism=Homo sapiens, GI119393893, Length=600, Percent_Identity=24.8333333333333, Blast_Score=149, Evalue=8e-36, Organism=Homo sapiens, GI119393891, Length=600, Percent_Identity=24.8333333333333, Blast_Score=149, Evalue=8e-36, Organism=Homo sapiens, GI221316699, Length=557, Percent_Identity=22.262118491921, Blast_Score=146, Evalue=9e-35, Organism=Homo sapiens, GI157364974, Length=585, Percent_Identity=23.0769230769231, Blast_Score=133, Evalue=8e-31, Organism=Homo sapiens, GI310115361, Length=247, Percent_Identity=31.1740890688259, Blast_Score=126, Evalue=1e-28, Organism=Homo sapiens, GI153791946, Length=384, Percent_Identity=23.9583333333333, Blast_Score=109, Evalue=1e-23, Organism=Escherichia coli, GI2367256, Length=542, Percent_Identity=35.7933579335793, Blast_Score=346, Evalue=3e-96, Organism=Escherichia coli, GI2367323, Length=614, Percent_Identity=25.4071661237785, Blast_Score=170, Evalue=3e-43, Organism=Caenorhabditis elegans, GI71991189, Length=504, Percent_Identity=26.984126984127, Blast_Score=155, Evalue=1e-37, Organism=Caenorhabditis elegans, GI17560800, Length=479, Percent_Identity=24.8434237995825, Blast_Score=137, Evalue=2e-32, Organism=Caenorhabditis elegans, GI17560798, Length=479, Percent_Identity=24.8434237995825, Blast_Score=137, Evalue=2e-32, Organism=Caenorhabditis elegans, GI71985706, Length=246, Percent_Identity=26.8292682926829, Blast_Score=105, Evalue=7e-23, Organism=Caenorhabditis elegans, GI32563849, Length=260, Percent_Identity=28.8461538461538, Blast_Score=104, Evalue=2e-22, Organism=Saccharomyces cerevisiae, GI6319706, Length=589, Percent_Identity=23.7691001697793, Blast_Score=147, Evalue=6e-36, Organism=Drosophila melanogaster, GI24650054, Length=433, Percent_Identity=28.175519630485, Blast_Score=137, Evalue=3e-32, Organism=Drosophila melanogaster, GI24643749, Length=485, Percent_Identity=23.0927835051546, Blast_Score=118, Evalue=1e-26, Organism=Drosophila melanogaster, GI24643753, Length=485, Percent_Identity=23.0927835051546, Blast_Score=118, Evalue=1e-26, Organism=Drosophila melanogaster, GI24643751, Length=485, Percent_Identity=23.0927835051546, Blast_Score=118, Evalue=1e-26, Organism=Drosophila melanogaster, GI24643746, Length=485, Percent_Identity=23.0927835051546, Blast_Score=118, Evalue=1e-26, Organism=Drosophila melanogaster, GI21357605, Length=485, Percent_Identity=23.0927835051546, Blast_Score=118, Evalue=1e-26, Organism=Drosophila melanogaster, GI28571438, Length=387, Percent_Identity=22.7390180878553, Blast_Score=101, Evalue=2e-21, Organism=Drosophila melanogaster, GI28571440, Length=384, Percent_Identity=22.3958333333333, Blast_Score=100, Evalue=4e-21,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013785 - InterPro: IPR011013 - InterPro: IPR000322 - InterPro: IPR017853 [H]
Pfam domain/function: PF01055 Glyco_hydro_31 [H]
EC number: NA
Molecular weight: Translated: 86398; Mature: 86398
Theoretical pI: Translated: 5.62; Mature: 5.62
Prosite motif: PS00290 IG_MHC
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.6 %Cys (Translated Protein) 2.9 %Met (Translated Protein) 4.5 %Cys+Met (Translated Protein) 1.6 %Cys (Mature Protein) 2.9 %Met (Mature Protein) 4.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKPTNYHLFDFLDFDPDLSRDESLWKAYKPTSVYEKDGDICINVPFQKQVLSNDMAPDTT CCCCCEEEEEEECCCCCCCCCHHHHHHCCCCCCEECCCCEEEECCHHHHHHCCCCCCCCC SPREEYTLVIRQYTSGITRLFIGFGEESMTDQSEMLQFSDRVKKLPLQVTQTEGEWLITT CCHHHHEEHHHHHHCCCEEEEEECCCHHCCCHHHHHHHHHHHHHCCEEEEECCCCEEEEE QDGIQRALIHVKPPVLDRWSELLPDPQETLDLRLYPDGKREIRLAAYDHFSPPRYDALPL HHHHHHHHHCCCCCHHHHHHHHCCCCHHHEEEEECCCCCCEEEEEEECCCCCCCCCCCHH AFCKRNGVKERATLSFESKPDECFAGTGERFAKMDLSGQTFFLKNQDGQGVNNRRTYKNI HHHHCCCCCCCEEEEECCCCCHHHCCCCCHHEEECCCCCEEEEECCCCCCCCCCCCCCCC PFYLSSRMYGTFYHTCAHSKLSLAGQSTRSVQFLSDQAMLDVFVIAGDTMEEILRGYRDL CEEEECCCCHHHHHHHCCCHHEECCCCCCCEEECCCCCEEEEEEECCCHHHHHHHHHHHC TGYPSMPPLWSFGIWMSRMTYFSADEVNEICDRMRAEHYPCDVIHLDTGWFKTDWLCEWK CCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCEEEECCCCEECCEEEEEE FNEERFPDPKGFIQGLKKKGYRVSLWQLPYVAENAEQIDEARKNDYIAPLTKKQDSEGSN CCCCCCCCHHHHHHHHHHCCCEEEEEECCCHHCCHHHHHHHHHCCCCCCCCCCCCCCCCC FSALDYAGTIDFTYPQATEWYKGLLKNLLDMGVTCIKTDFGENIHMDALYKGMKPELLNN EEEEEECCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHCCCCHHHHHH LYALLYQKAAYEITKDVTGDGIVWARSAWAGCQRYPLHWGGDSCSSWDGMAGSLKGGLHF HHHHHHHHHHHHHHHCCCCCCEEEECCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE GLSGFAFWSHDVPGFHTLPNFMNSIVDDDVYMRWTQFGVFSSHIRYHGTNKREPWHYPAI CCCEEEEECCCCCCHHHHHHHHHHHHCCCHHEEHHHHHHHHHHEEECCCCCCCCCCCCHH APMIKKWWKLRYTLIPYIVEQSRKAIASGAPLLQALIFHHPEDKLCWHIDDEYYFGNDFL HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEECCCEEECCCEE VAPVMNSENRRDVYLPEGKWVNFFTGERLEGGRWLKNLDVPLDEMPVYVRQGATIPVYPD EEEECCCCCCCEEECCCCCEEEEECCCCCCCCCHHHCCCCCHHHCCCHHHCCCEECCCCC EVECTDDMDLSKSIGLHIDPHFKGIFKN CCCCCCCCCCHHCCCCEECCCCCCCCCC >Mature Secondary Structure MKPTNYHLFDFLDFDPDLSRDESLWKAYKPTSVYEKDGDICINVPFQKQVLSNDMAPDTT CCCCCEEEEEEECCCCCCCCCHHHHHHCCCCCCEECCCCEEEECCHHHHHHCCCCCCCCC SPREEYTLVIRQYTSGITRLFIGFGEESMTDQSEMLQFSDRVKKLPLQVTQTEGEWLITT CCHHHHEEHHHHHHCCCEEEEEECCCHHCCCHHHHHHHHHHHHHCCEEEEECCCCEEEEE QDGIQRALIHVKPPVLDRWSELLPDPQETLDLRLYPDGKREIRLAAYDHFSPPRYDALPL HHHHHHHHHCCCCCHHHHHHHHCCCCHHHEEEEECCCCCCEEEEEEECCCCCCCCCCCHH AFCKRNGVKERATLSFESKPDECFAGTGERFAKMDLSGQTFFLKNQDGQGVNNRRTYKNI HHHHCCCCCCCEEEEECCCCCHHHCCCCCHHEEECCCCCEEEEECCCCCCCCCCCCCCCC PFYLSSRMYGTFYHTCAHSKLSLAGQSTRSVQFLSDQAMLDVFVIAGDTMEEILRGYRDL CEEEECCCCHHHHHHHCCCHHEECCCCCCCEEECCCCCEEEEEEECCCHHHHHHHHHHHC TGYPSMPPLWSFGIWMSRMTYFSADEVNEICDRMRAEHYPCDVIHLDTGWFKTDWLCEWK CCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCEEEECCCCEECCEEEEEE FNEERFPDPKGFIQGLKKKGYRVSLWQLPYVAENAEQIDEARKNDYIAPLTKKQDSEGSN CCCCCCCCHHHHHHHHHHCCCEEEEEECCCHHCCHHHHHHHHHCCCCCCCCCCCCCCCCC FSALDYAGTIDFTYPQATEWYKGLLKNLLDMGVTCIKTDFGENIHMDALYKGMKPELLNN EEEEEECCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHCCCCHHHHHH LYALLYQKAAYEITKDVTGDGIVWARSAWAGCQRYPLHWGGDSCSSWDGMAGSLKGGLHF HHHHHHHHHHHHHHHCCCCCCEEEECCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE GLSGFAFWSHDVPGFHTLPNFMNSIVDDDVYMRWTQFGVFSSHIRYHGTNKREPWHYPAI CCCEEEEECCCCCCHHHHHHHHHHHHCCCHHEEHHHHHHHHHHEEECCCCCCCCCCCCHH APMIKKWWKLRYTLIPYIVEQSRKAIASGAPLLQALIFHHPEDKLCWHIDDEYYFGNDFL HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEECCCEEECCCEE VAPVMNSENRRDVYLPEGKWVNFFTGERLEGGRWLKNLDVPLDEMPVYVRQGATIPVYPD EEEECCCCCCCEEECCCCCEEEEECCCCCCCCCHHHCCCCCHHHCCCHHHCCCEECCCCC EVECTDDMDLSKSIGLHIDPHFKGIFKN CCCCCCCCCCHHCCCCEECCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7686882; 9278503 [H]