| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is glgX [H]
Identifier: 157162910
GI number: 157162910
Start: 3616383
End: 3618356
Strand: Reverse
Name: glgX [H]
Synonym: EcHS_A3631
Alternate gene names: 157162910
Gene position: 3618356-3616383 (Counterclockwise)
Preceding gene: 157162911
Following gene: 157162909
Centisome position: 77.92
GC content: 54.15
Gene sequence:
>1974_bases ATGACACAACTCGCCATTGGCAAACCCGCTCCCCTCGGCGCGCATTACGACGGTCAGGGCGTCAACTTCACACTTTTCTC CGTTCATGCCGAGCGGGTAGAACTGTGTGTCTTTGACGCCAATGGCCAGGAACATCGCTATGACTTGCCAGGGCACAGTG GCGACATTTGGCACGGTTATCTGCCGGATGCGCGCCCGGGTTTGCGTTATGGTTATCGCGTTCATGGCCCCTGGCAACCC GCCGAGGGGCATCGCTTTAACCCGGCGAAGTTGTTGATTGATCCTTGCGCGCGGCAAATTGACGGGGAGTTTAAAGATAA CCCGCTGCTGCACGCCGGTCATAATGAACCTGACTATCGCGACAACGCCGCCATTGCGCCGAAATGCGTAGTGGTGGTTG ATCACTATGACTGGGAAGATGATGCCCCGCCGCGCACGCCGTGGGGCAGCACCATCATTTATGAAGCCCATGTCAAAGGA TTAACGTACTTGCACCCGGAGATCCCGGTCGAGATCCGTGGCACTTATAAAGCCCTCGGGCATCCGGTGATGATCAACTA TTTGAAACAATTGGGCATTACCGCGCTGGAACTGCTGCCAGTGGCGCAGTTTGCCAGTGAACCACGTCTGCAACGCATGG GGCTAAGTAACTACTGGGGTTACAACCCGGTGGCGATGTTTGCGCTGCATCCGGCGTATGCCTGCTCGCCAGAAACGGCG CTGGATGAGTTTCGCGATGCAATCAAAGCACTGCATAAAGCGGGTATCGAAGTCATTCTTGATATCGTGCTCAACCATAG TGCGGAACTGGACCTCGACGGCCCGTTATTCTCGCTGCGTGGGATCGATAACCGTAGCTATTATTGGATAAGAGAAGACG GCGATTATCACAACTGGACCGGTTGCGGCAACACGCTCAATTTGAGTCATCCGGCGGTGGTGGATTATGCCAGCGCCTGC CTGCGTTATTGGGTAGAAACCTGCCACGTCGATGGTTTCCGCTTTGATCTGGCGGCAGTCATGGGCCGTACGCCAGAGTT CCGTCAGGATGCGCCGTTGTTTACCGCTATCCAGAACTGCCCGGTGCTCTCGCAGGTGAAGTTAATTGCTGAACCGTGGG ATATCGCTCCTGGTGGTTATCAGGTGGGAAATTTCCCGCCGCTGTTTGCCGAGTGGAACGATCATTTCCGCGATGCTGCC CGTCGTTTCTGGCTGCATTATGATTTGCCTCTGGGGGCGTTTGCCGGGCGTTTTGCTGCCTCCAGCGATGTTTTTAAACG TAATGGTCGTCTGCCGAGTGCCGCGATTAATCTCGTCACCGCACATGACGGTTTTACGCTTCGCGACTGCGTTTGCTTCA ACCATAAACACAATGAAGCAAACGGAGAAGAAAATCGCGACGGGACCAACAACAATTACAGTAACAATCATGGTAAAGAA GGGTTAGGTGGTACTCTTGATCTGGTTGAACGGCGGCGCGACAGCATTCATGCCCTGTTAACAACGTTGTTGCTCTCCCA GGGCACGCCGATGTTACTGGCCGGTGACGAACATGGTCACAGCCAGCATGGCAATAACAATGCCTACTGTCAGGATAACC AATTAACCTGGTTGGACTGGTCGCAGGCAAGCAGTGGTTTAACCGCATTTACCGCCGCGTTAATCCATCTGCGCAAGCGT ATTCCCGCTTTGGTGGAGAATCGCTGGTGGGAAGAAGGCGACGGCAATGTCCGTTGGCTAAATCGATATGCTCAACCTTT AAGCACGGATGAGTGGCAAAACGGGCCGAAACAGCTGCAAATTCTGCTCTCGGATCGCTTTTTGATCGCAATTAACGCCA CGCTTGAGGTAACAGAGATTGTTTTACCTGCTGGGGAGTGGCACGCCATTCCCCCATTCGCTGGAGAGGATAACCCAGTG ATTACGGCTGTCTGGCAGGGACCTGCACACGGATTGTGTGTGTTCCAGAGATGA
Upstream 100 bases:
>100_bases GCACGGTACACAGCGATGAGATTGCCAGCCACGGTCGTCAGCATTCACTAAGCCTGACGCTACCACCGCTGGCCACTATC TGGCTGGTTCGGGAGGCAGA
Downstream 100 bases:
>100_bases TAAAAAAGGAGTTAGTCATGGTTAGTTTAGAGAAGAACGATCACTTAATGTTGGCGCGCCAGCTGCCATTGAAATCTGTT GCCCTGATACTGGCGGGAGG
Product: glycogen debranching enzyme
Products: NA
Alternate protein names: Glycogen operon protein GlgX [H]
Number of amino acids: Translated: 657; Mature: 656
Protein sequence:
>657_residues MTQLAIGKPAPLGAHYDGQGVNFTLFSVHAERVELCVFDANGQEHRYDLPGHSGDIWHGYLPDARPGLRYGYRVHGPWQP AEGHRFNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYRDNAAIAPKCVVVVDHYDWEDDAPPRTPWGSTIIYEAHVKG LTYLHPEIPVEIRGTYKALGHPVMINYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPVAMFALHPAYACSPETA LDEFRDAIKALHKAGIEVILDIVLNHSAELDLDGPLFSLRGIDNRSYYWIREDGDYHNWTGCGNTLNLSHPAVVDYASAC LRYWVETCHVDGFRFDLAAVMGRTPEFRQDAPLFTAIQNCPVLSQVKLIAEPWDIAPGGYQVGNFPPLFAEWNDHFRDAA RRFWLHYDLPLGAFAGRFAASSDVFKRNGRLPSAAINLVTAHDGFTLRDCVCFNHKHNEANGEENRDGTNNNYSNNHGKE GLGGTLDLVERRRDSIHALLTTLLLSQGTPMLLAGDEHGHSQHGNNNAYCQDNQLTWLDWSQASSGLTAFTAALIHLRKR IPALVENRWWEEGDGNVRWLNRYAQPLSTDEWQNGPKQLQILLSDRFLIAINATLEVTEIVLPAGEWHAIPPFAGEDNPV ITAVWQGPAHGLCVFQR
Sequences:
>Translated_657_residues MTQLAIGKPAPLGAHYDGQGVNFTLFSVHAERVELCVFDANGQEHRYDLPGHSGDIWHGYLPDARPGLRYGYRVHGPWQP AEGHRFNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYRDNAAIAPKCVVVVDHYDWEDDAPPRTPWGSTIIYEAHVKG LTYLHPEIPVEIRGTYKALGHPVMINYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPVAMFALHPAYACSPETA LDEFRDAIKALHKAGIEVILDIVLNHSAELDLDGPLFSLRGIDNRSYYWIREDGDYHNWTGCGNTLNLSHPAVVDYASAC LRYWVETCHVDGFRFDLAAVMGRTPEFRQDAPLFTAIQNCPVLSQVKLIAEPWDIAPGGYQVGNFPPLFAEWNDHFRDAA RRFWLHYDLPLGAFAGRFAASSDVFKRNGRLPSAAINLVTAHDGFTLRDCVCFNHKHNEANGEENRDGTNNNYSNNHGKE GLGGTLDLVERRRDSIHALLTTLLLSQGTPMLLAGDEHGHSQHGNNNAYCQDNQLTWLDWSQASSGLTAFTAALIHLRKR IPALVENRWWEEGDGNVRWLNRYAQPLSTDEWQNGPKQLQILLSDRFLIAINATLEVTEIVLPAGEWHAIPPFAGEDNPV ITAVWQGPAHGLCVFQR >Mature_656_residues TQLAIGKPAPLGAHYDGQGVNFTLFSVHAERVELCVFDANGQEHRYDLPGHSGDIWHGYLPDARPGLRYGYRVHGPWQPA EGHRFNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYRDNAAIAPKCVVVVDHYDWEDDAPPRTPWGSTIIYEAHVKGL TYLHPEIPVEIRGTYKALGHPVMINYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPVAMFALHPAYACSPETAL DEFRDAIKALHKAGIEVILDIVLNHSAELDLDGPLFSLRGIDNRSYYWIREDGDYHNWTGCGNTLNLSHPAVVDYASACL RYWVETCHVDGFRFDLAAVMGRTPEFRQDAPLFTAIQNCPVLSQVKLIAEPWDIAPGGYQVGNFPPLFAEWNDHFRDAAR RFWLHYDLPLGAFAGRFAASSDVFKRNGRLPSAAINLVTAHDGFTLRDCVCFNHKHNEANGEENRDGTNNNYSNNHGKEG LGGTLDLVERRRDSIHALLTTLLLSQGTPMLLAGDEHGHSQHGNNNAYCQDNQLTWLDWSQASSGLTAFTAALIHLRKRI PALVENRWWEEGDGNVRWLNRYAQPLSTDEWQNGPKQLQILLSDRFLIAINATLEVTEIVLPAGEWHAIPPFAGEDNPVI TAVWQGPAHGLCVFQR
Specific function: Hydrolyzes the alpha-1,6-glucosidic linkages in glycogen which has first been partially depolymerized by phosphorylase. Shows only very little activity with native glycogen [H]
COG id: COG1523
COG function: function code G; Type II secretory pathway, pullulanase PulA and related glycosidases
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 13 family [H]
Homologues:
Organism=Escherichia coli, GI2367229, Length=657, Percent_Identity=99.6955859969559, Blast_Score=1358, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006047 - InterPro: IPR004193 - InterPro: IPR017853 - InterPro: IPR013781 - InterPro: IPR022844 - InterPro: IPR011837 - InterPro: IPR013783 - InterPro: IPR014756 [H]
Pfam domain/function: PF00128 Alpha-amylase; PF02922 CBM_48 [H]
EC number: 3.2.1.- [C]
Molecular weight: Translated: 73619; Mature: 73488
Theoretical pI: Translated: 6.00; Mature: 6.00
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 0.9 %Met (Translated Protein) 2.7 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 0.8 %Met (Mature Protein) 2.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTQLAIGKPAPLGAHYDGQGVNFTLFSVHAERVELCVFDANGQEHRYDLPGHSGDIWHGY CCCCCCCCCCCCCCCCCCCCCEEEEEEEEECEEEEEEECCCCCCCEEECCCCCCCEEECC LPDARPGLRYGYRVHGPWQPAEGHRFNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYR CCCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCC DNAAIAPKCVVVVDHYDWEDDAPPRTPWGSTIIYEAHVKGLTYLHPEIPVEIRGTYKALG CCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEECCCCCEEECCHHHHHC HPVMINYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPVAMFALHPAYACSPETA CHHHHHHHHHHCHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCEEEEEECCCCCCCCHHH LDEFRDAIKALHKAGIEVILDIVLNHSAELDLDGPLFSLRGIDNRSYYWIREDGDYHNWT HHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCEEEEECCCCCEEEEEECCCCCCCCC GCGNTLNLSHPAVVDYASACLRYWVETCHVDGFRFDLAAVMGRTPEFRQDAPLFTAIQNC CCCCEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHCC PVLSQVKLIAEPWDIAPGGYQVGNFPPLFAEWNDHFRDAARRFWLHYDLPLGAFAGRFAA CCHHHHHHHCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHEEEEECCCHHHHHHHHCC SSDVFKRNGRLPSAAINLVTAHDGFTLRDCVCFNHKHNEANGEENRDGTNNNYSNNHGKE CCHHHHHCCCCCHHHEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC GLGGTLDLVERRRDSIHALLTTLLLSQGTPMLLAGDEHGHSQHGNNNAYCQDNQLTWLDW CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCEEECCCEEEEEH SQASSGLTAFTAALIHLRKRIPALVENRWWEEGDGNVRWLNRYAQPLSTDEWQNGPKQLQ HCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHCCCCCCHHHCCCHHHEE ILLSDRFLIAINATLEVTEIVLPAGEWHAIPPFAGEDNPVITAVWQGPAHGLCVFQR EEEECCEEEEEECEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEECC >Mature Secondary Structure TQLAIGKPAPLGAHYDGQGVNFTLFSVHAERVELCVFDANGQEHRYDLPGHSGDIWHGY CCCCCCCCCCCCCCCCCCCCEEEEEEEEECEEEEEEECCCCCCCEEECCCCCCCEEECC LPDARPGLRYGYRVHGPWQPAEGHRFNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYR CCCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCC DNAAIAPKCVVVVDHYDWEDDAPPRTPWGSTIIYEAHVKGLTYLHPEIPVEIRGTYKALG CCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEECCCCCEEECCHHHHHC HPVMINYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPVAMFALHPAYACSPETA CHHHHHHHHHHCHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCEEEEEECCCCCCCCHHH LDEFRDAIKALHKAGIEVILDIVLNHSAELDLDGPLFSLRGIDNRSYYWIREDGDYHNWT HHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCEEEEECCCCCEEEEEECCCCCCCCC GCGNTLNLSHPAVVDYASACLRYWVETCHVDGFRFDLAAVMGRTPEFRQDAPLFTAIQNC CCCCEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHCC PVLSQVKLIAEPWDIAPGGYQVGNFPPLFAEWNDHFRDAARRFWLHYDLPLGAFAGRFAA CCHHHHHHHCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHEEEEECCCHHHHHHHHCC SSDVFKRNGRLPSAAINLVTAHDGFTLRDCVCFNHKHNEANGEENRDGTNNNYSNNHGKE CCHHHHHCCCCCHHHEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC GLGGTLDLVERRRDSIHALLTTLLLSQGTPMLLAGDEHGHSQHGNNNAYCQDNQLTWLDW CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCEEECCCEEEEEH SQASSGLTAFTAALIHLRKRIPALVENRWWEEGDGNVRWLNRYAQPLSTDEWQNGPKQLQ HCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHCCCCCCHHHCCCHHHEE ILLSDRFLIAINATLEVTEIVLPAGEWHAIPPFAGEDNPVITAVWQGPAHGLCVFQR EEEECCEEEEEECEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEECC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Glycosylases; Glycosidases [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA