Definition Bacillus licheniformis ATCC 14580, complete genome.
Accession NC_006322
Length 4,222,645

The map label for this gene is bglC [H]

Identifier: 52787876

GI number: 52787876

Start: 4028632

End: 4030050

Strand: Reverse

Name: bglC [H]

Synonym: BLi04199

Alternate gene names: 52787876

Gene position: 4030050-4028632 (Counterclockwise)

Preceding gene: 52787877

Following gene: 52787875

Centisome position: 95.44
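The coordinate fields above are related by simple arithmetic, sketched below. Two things are assumptions rather than statements from the record: that Start and End are 1-based inclusive coordinates, and that the centisome position is the gene's 5' coordinate as a percentage of genome length. Under those assumptions the reported 95.44 is reproduced from the End coordinate (4030050), which is where a reverse-strand gene begins; the Start coordinate would give 95.41. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_222_645
START, END = 4_028_632, 4_030_050  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))        # 1419, matching the ">1419_bases" header
print(centisome(END, GENOME_LENGTH))  # 95.44 (5' end of a reverse-strand gene)
```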

GC content: 48.06
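GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of the calculation follows; `gc_content` is a hypothetical helper, not the tool that produced this record. For a 1419-base gene, the reported 48.06 would correspond to 682 G or C bases. That count is inferred from the percentage, not taken from the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


print(gc_content("ATGC"))        # 50.0
print(gc_content("ATGAAATATA"))  # 10.0 (first ten bases of the gene)
```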

Gene sequence:

>1419_bases
ATGAAATATAAAACATTAGCTCAATTTCCGAAAGATTTTTTGTGGGGCGCGTCTACTTCCGCTTATCAAGTGGAAGGCGC
TTGGGACGAAGATGGAAAAGGGCCTTCTGTCATCGATGCGCGCGAAAGCTACCCGGAAGGGACAACCGATTTTAAAGTCG
CAAGCGACCATTACCAGCGCTATAAAGAAGATATCGCTTTATTTGCGGAAATGGGCTTCAAAGCGTACCGTTTTTCGATT
GCCTGGACGCGCATCATTCCGGACGGGGACGGTGATATTAATCCGAAGGGAATCGAATTTTACAGCCGTTTGATCGATGA
ACTTCTAAAGTATGGAATCGAACCAATTGTTACGATGTACCACTTTGATCTGCCGAATGCTTTGCAGAAAAAAGGCGGCT
GGTCGGACAGGGCCACGATTGATGCTTTTGAAAAGTATGCGAAGGTCCTTTTTGAAAGCTACGGTGACCGCGTCAAATAC
TGGCTGACCATCAATGAGCAAAATATGATGATCCTCCACGGATCTGCACTCGGTACACTCGATCCGAACTTGGAAAATCC
GAAAAAAGAGCTTTATCAGCAAAACCATCACATGCTCGTCGCACAGGCGAAAGCGATCAAGCTTTGCCATGAGATGCTGC
CGGAAGCAAAAATCGGTCCTGCGCCGAATATTGCGCTCATCTATCCCGCTTCTTCGAAACCGGAGGACGTGCTGGCGGCT
TTTAACTATAATGCGATCCGAAACTGGCTTTACTTGGATATGGCCGTATTCGGACGGTACAATACAACAGCGTGGGCATA
TATGAAAGAAAAAGGCTGCACACCGGTCATCGCTGAAGGGGATATGGACATTCTGCGGTCGGCCAAGCCGGATTTTATCG
CGTTTAACTACTATACATCGCAAACGGCTGAAGCAAGCAGGGGTGATGGCAGCGACACGGCTGCTCGAGGCGGAGACCAG
CATTTGCAGACGGGAGAAGAAGGCGTATATAGGGGAAGCAGCAATCCGCACCTAAAGAAAAACGCATTTGGCTGGGAGAT
CGACCCTGTCGGTTTCCGTTCGACGCTGCGCGAAATTTACGACCGCTACCAGCTGCCGCTGATCGTCACTGAGAACGGCC
TCGGCGCGTTTGATCAGCTTGAAGACGGAGATGTCGTAAATGACGATTACCGCATCGATTATTTAAAAGAGCATATCAAG
CAAATTCAGCTGGCAATCACGGATGGAGTCGATGTTTTCGGCTACTCCCCATGGTCTGCCATCGACTTAATTTCGACCCA
TCAAGGCTGTTCAAAACGCTACGGATTTATTTATGTGAACCGCGATGAATTTGATTTGAAAGACTTGCGCCGCATTCGCA
AAAAAAGCTTTTACTGGTATAAAAACCTGATTGCTACAAACGGCGAAACACTCGATTAA

Upstream 100 bases:

>100_bases
TCCGGTCAATACACGGATGTCATTCCAACGGATCAAAAAAAGGTGAAAACTGAAGAACGGATCATCACTTTAATTTCATC
ACCAAGGGAGGAAAATAATC

Downstream 100 bases:

>100_bases
ACAAAGAAAGAGGATGTTCAGATGATCTCTGAATATCCTCTTTTTTGGTTTGAAAATGCTCATTTTAAAATTTGCAGAAA
ATTTTAAAAATATGAATGTA

Product: hypothetical protein

Products: NA

Alternate protein names: 6-phospho-beta-glucosidase [H]

Number of amino acids: Translated: 472; Mature: 472

Protein sequence:

>472_residues
MKYKTLAQFPKDFLWGASTSAYQVEGAWDEDGKGPSVIDARESYPEGTTDFKVASDHYQRYKEDIALFAEMGFKAYRFSI
AWTRIIPDGDGDINPKGIEFYSRLIDELLKYGIEPIVTMYHFDLPNALQKKGGWSDRATIDAFEKYAKVLFESYGDRVKY
WLTINEQNMMILHGSALGTLDPNLENPKKELYQQNHHMLVAQAKAIKLCHEMLPEAKIGPAPNIALIYPASSKPEDVLAA
FNYNAIRNWLYLDMAVFGRYNTTAWAYMKEKGCTPVIAEGDMDILRSAKPDFIAFNYYTSQTAEASRGDGSDTAARGGDQ
HLQTGEEGVYRGSSNPHLKKNAFGWEIDPVGFRSTLREIYDRYQLPLIVTENGLGAFDQLEDGDVVNDDYRIDYLKEHIK
QIQLAITDGVDVFGYSPWSAIDLISTHQGCSKRYGFIYVNRDEFDLKDLRRIRKKSFYWYKNLIATNGETLD
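The gene sequence is listed in coding orientation (it starts with ATG and ends with the stop codon TAA), so it can be translated directly even though the gene lies on the reverse strand. Its 1419 bases form 473 codons: 472 amino acids plus the stop. The sketch below assumes the standard genetic code and checks only the first and last codons of the gene; `translate` is an illustrative helper.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAATATAAAACATTAGCTCAATTTCCG"))  # MKYKTLAQFP
print(translate("CTCGATTAA"))                        # LD
print(1419 // 3 - 1)                                 # 472
```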

Sequences:

>Translated_472_residues
MKYKTLAQFPKDFLWGASTSAYQVEGAWDEDGKGPSVIDARESYPEGTTDFKVASDHYQRYKEDIALFAEMGFKAYRFSI
AWTRIIPDGDGDINPKGIEFYSRLIDELLKYGIEPIVTMYHFDLPNALQKKGGWSDRATIDAFEKYAKVLFESYGDRVKY
WLTINEQNMMILHGSALGTLDPNLENPKKELYQQNHHMLVAQAKAIKLCHEMLPEAKIGPAPNIALIYPASSKPEDVLAA
FNYNAIRNWLYLDMAVFGRYNTTAWAYMKEKGCTPVIAEGDMDILRSAKPDFIAFNYYTSQTAEASRGDGSDTAARGGDQ
HLQTGEEGVYRGSSNPHLKKNAFGWEIDPVGFRSTLREIYDRYQLPLIVTENGLGAFDQLEDGDVVNDDYRIDYLKEHIK
QIQLAITDGVDVFGYSPWSAIDLISTHQGCSKRYGFIYVNRDEFDLKDLRRIRKKSFYWYKNLIATNGETLD
>Mature_472_residues
MKYKTLAQFPKDFLWGASTSAYQVEGAWDEDGKGPSVIDARESYPEGTTDFKVASDHYQRYKEDIALFAEMGFKAYRFSI
AWTRIIPDGDGDINPKGIEFYSRLIDELLKYGIEPIVTMYHFDLPNALQKKGGWSDRATIDAFEKYAKVLFESYGDRVKY
WLTINEQNMMILHGSALGTLDPNLENPKKELYQQNHHMLVAQAKAIKLCHEMLPEAKIGPAPNIALIYPASSKPEDVLAA
FNYNAIRNWLYLDMAVFGRYNTTAWAYMKEKGCTPVIAEGDMDILRSAKPDFIAFNYYTSQTAEASRGDGSDTAARGGDQ
HLQTGEEGVYRGSSNPHLKKNAFGWEIDPVGFRSTLREIYDRYQLPLIVTENGLGAFDQLEDGDVVNDDYRIDYLKEHIK
QIQLAITDGVDVFGYSPWSAIDLISTHQGCSKRYGFIYVNRDEFDLKDLRRIRKKSFYWYKNLIATNGETLD

Specific function: Is able to catalyze the hydrolysis of aryl-phospho-beta-D-glucosides such as 4-methylumbelliferyl-phospho-beta-D-glucopyranoside (MUG-P), phosphoarbutin and phosphosalicin. Is not essential for growth on arbutin and salicin as the sole carbon source [H]

COG id: COG2723

COG function: function code G; Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 1 family [H]

Homologues:

Organism=Homo sapiens, GI32481206, Length=480, Percent_Identity=33.5416666666667, Blast_Score=233, Evalue=3e-61,
Organism=Homo sapiens, GI110681710, Length=483, Percent_Identity=33.5403726708075, Blast_Score=220, Evalue=2e-57,
Organism=Homo sapiens, GI13273313, Length=487, Percent_Identity=31.6221765913758, Blast_Score=216, Evalue=3e-56,
Organism=Homo sapiens, GI28376633, Length=474, Percent_Identity=28.9029535864979, Blast_Score=183, Evalue=4e-46,
Organism=Homo sapiens, GI24497614, Length=489, Percent_Identity=30.6748466257669, Blast_Score=167, Evalue=1e-41,
Organism=Homo sapiens, GI190360571, Length=98, Percent_Identity=44.8979591836735, Blast_Score=85, Evalue=2e-16,
Organism=Escherichia coli, GI2367174, Length=487, Percent_Identity=42.0944558521561, Blast_Score=387, Evalue=1e-109,
Organism=Escherichia coli, GI2367270, Length=479, Percent_Identity=44.8851774530271, Blast_Score=383, Evalue=1e-107,
Organism=Escherichia coli, GI1789070, Length=493, Percent_Identity=42.393509127789, Blast_Score=368, Evalue=1e-103,
Organism=Caenorhabditis elegans, GI17552856, Length=479, Percent_Identity=32.1503131524008, Blast_Score=212, Evalue=3e-55,
Organism=Caenorhabditis elegans, GI17539390, Length=486, Percent_Identity=30.0411522633745, Blast_Score=197, Evalue=1e-50,
Organism=Drosophila melanogaster, GI21356577, Length=487, Percent_Identity=30.5954825462012, Blast_Score=209, Evalue=2e-54,
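Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be parsed mechanically. The sketch below is one way to do that; `parse_homologue` and the `GI` dictionary key are illustrative names, not part of the record format. The long Percent_Identity decimals look like an unrounded ratio of identical positions to alignment length (for the first Escherichia coli hit, about 205 of 487), though the record does not state this.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in (f.strip() for f in line.split(",")):
        if not field:
            continue  # skip the empty field left by the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields from strings.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI2367174, Length=487, "
        "Percent_Identity=42.0944558521561, Blast_Score=387, Evalue=1e-109,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```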

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001360
- InterPro:   IPR018120
- InterPro:   IPR017853
- InterPro:   IPR013781 [H]

Pfam domain/function: PF00232 Glyco_hydro_1 [H]

EC number: 3.2.1.86 [H]

Molecular weight: Translated: 53821; Mature: 53821

Theoretical pI: Translated: 5.15; Mature: 5.15

Prosite motif: PS00572 GLYCOSYL_HYDROL_F1_1; PS00653 GLYCOSYL_HYDROL_F1_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
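The percentages above are residue counts divided by the protein length of 472. They are consistent with 3 Cys and 10 Met residues, which is also why the combined figure (13/472, or 2.8%) is not simply the sum of the two rounded values (0.6 + 2.1 = 2.7). A minimal sketch follows, with `residue_percent` as a hypothetical helper and one-decimal rounding assumed:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein, to one decimal."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


# Toy sequence: one Cys and one Met out of four residues.
print(residue_percent("MKCA", "C"))   # 25.0
print(residue_percent("MKCA", "CM"))  # 50.0

# The record's figures from the assumed counts of 3 Cys and 10 Met in 472 residues.
print(round(100 * 3 / 472, 1), round(100 * 10 / 472, 1), round(100 * 13 / 472, 1))
# 0.6 2.1 2.8
```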

Secondary structure:

>Translated Secondary Structure
MKYKTLAQFPKDFLWGASTSAYQVEGAWDEDGKGPSVIDARESYPEGTTDFKVASDHYQR
CCCCHHHHCCHHHHCCCCCCCEEEECCCCCCCCCCCEECCHHCCCCCCCCEEECHHHHHH
YKEDIALFAEMGFKAYRFSIAWTRIIPDGDGDINPKGIEFYSRLIDELLKYGIEPIVTMY
HHHHHHHHHHHCCEEEEEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEHEE
HFDLPNALQKKGGWSDRATIDAFEKYAKVLFESYGDRVKYWLTINEQNMMILHGSALGTL
ECCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEEECCCCCCC
DPNLENPKKELYQQNHHMLVAQAKAIKLCHEMLPEAKIGPAPNIALIYPASSKPEDVLAA
CCCCCCHHHHHHHCCCCEEEHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCHHHHHH
FNYNAIRNWLYLDMAVFGRYNTTAWAYMKEKGCTPVIAEGDMDILRSAKPDFIAFNYYTS
CCHHHHHHHHHHHHHHHCCCCCEEEEEEHHCCCCCEEECCCHHHHHCCCCCEEEEEEECC
QTAEASRGDGSDTAARGGDQHLQTGEEGVYRGSSNPHLKKNAFGWEIDPVGFRSTLREIY
CCCCCCCCCCCCHHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH
DRYQLPLIVTENGLGAFDQLEDGDVVNDDYRIDYLKEHIKQIQLAITDGVDVFGYSPWSA
HHHCCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHEEHHEECCCCEECCCCCHH
IDLISTHQGCSKRYGFIYVNRDEFDLKDLRRIRKKSFYWYKNLIATNGETLD
HHHHHHCCCHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure
MKYKTLAQFPKDFLWGASTSAYQVEGAWDEDGKGPSVIDARESYPEGTTDFKVASDHYQR
CCCCHHHHCCHHHHCCCCCCCEEEECCCCCCCCCCCEECCHHCCCCCCCCEEECHHHHHH
YKEDIALFAEMGFKAYRFSIAWTRIIPDGDGDINPKGIEFYSRLIDELLKYGIEPIVTMY
HHHHHHHHHHHCCEEEEEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEHEE
HFDLPNALQKKGGWSDRATIDAFEKYAKVLFESYGDRVKYWLTINEQNMMILHGSALGTL
ECCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEEECCCCCCC
DPNLENPKKELYQQNHHMLVAQAKAIKLCHEMLPEAKIGPAPNIALIYPASSKPEDVLAA
CCCCCCHHHHHHHCCCCEEEHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCHHHHHH
FNYNAIRNWLYLDMAVFGRYNTTAWAYMKEKGCTPVIAEGDMDILRSAKPDFIAFNYYTS
CCHHHHHHHHHHHHHHHCCCCCEEEEEEHHCCCCCEEECCCHHHHHCCCCCEEEEEEECC
QTAEASRGDGSDTAARGGDQHLQTGEEGVYRGSSNPHLKKNAFGWEIDPVGFRSTLREIY
CCCCCCCCCCCCHHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH
DRYQLPLIVTENGLGAFDQLEDGDVVNDDYRIDYLKEHIKQIQLAITDGVDVFGYSPWSA
HHHCCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHEEHHEECCCCEECCCCCHH
IDLISTHQGCSKRYGFIYVNRDEFDLKDLRRIRKKSFYWYKNLIATNGETLD
HHHHHHCCCHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCC
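The prediction strings pair one state letter with each residue. Assuming the usual three-state legend (H = helix, E = strand, C = coil), which the record does not spell out, the composition of a prediction can be summarized as below. `state_composition` is an illustrative helper shown on a toy string, not on the full prediction.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state letter, to one decimal."""
    counts = Counter(states)
    return {s: round(100 * n / len(states), 1) for s, n in counts.items()}


# Toy prediction: 2 coil, 4 helix, 2 strand positions.
print(state_composition("CCHHHHEE"))  # {'C': 25.0, 'H': 50.0, 'E': 25.0}
```

Applied to the concatenated prediction lines, a mix of H and E states would be in line with the "Alpha Beta" structure class reported for this protein.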

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7704255; 8969502; 9384377; 2841296 [H]