Definition: Mycobacterium avium subsp. paratuberculosis K-10, complete genome.
Accession: NC_002944
Length: 4,829,781

The map label for this gene is glgE [H]

Identifier: 41408531

GI number: 41408531

Start: 2730831

End: 2732969

Strand: Direct

Name: glgE [H]

Synonym: MAP2433

Alternate gene names: 41408531

Gene position: 2730831-2732969 (Clockwise)

Preceding gene: 41408529

Following gene: 41408532

Centisome position: 56.54
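
The coordinates above fix the gene length, and they appear to fix the centisome position as well. The record does not state how the centisome value is derived; the sketch below assumes it is the start coordinate expressed as a percentage of the genome length, which reproduces the listed figure. The function names are illustrative and not part of the record.

```python
GENOME_LENGTH = 4_829_781          # "Length" field of NC_002944
START, END = 2_730_831, 2_732_969  # "Start" and "End" fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 2139, matching the ">2139_bases" header
print(centisome(START, GENOME_LENGTH))  # 56.54, matching "Centisome position"
```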

GC content (%): 68.12

Gene sequence:

>2139_bases
TTGCGAACCGCCAGGCGGGCGCGAAGGACCCGGGCAGATCGGAACGGAGTGTTGGTGCCCGGTCGTGTCGAGATCGATAA
CGTCCAGCCGGTCGTCTCCTGTGGGACATATCCCGCCAAGGCCGTCGTCGGCGAGGTGGTGCCGGTCAGCGCCTCGGTGT
GGCGCGAGGGACACGAGGCGGTGGCCGCGACGCTGGTGGTGCGCTACCTGGGACCGGCCTATCCGCCGGTGACCGAGACG
CGCCGGGTCAAGGCGGTGCAGGCACCGGTCGCCGACCGCGAGGGCAGGGTCGACCAGCAACGGATCAAGCCGCTGTCGCT
GCCGATGACGATGGGCCCCGAACCCTACGTCTTCCACGGTCAGTTCACCCCCGACCAGGTCGGATTATGGACCTTCCGGG
TGGACGGCTGGGGCGACCCGATCCACTCCTGGCGGCACGGGCTGGTCGCCAAGCTGGACGCCGGCCAGGGCGAGACGGAG
CTGTCCAACGATCTGCTGGTCGGCGCGGAGCTGTTCGAGCGCGCCGCCACGGGTGTGCCGCGCGCCCGGCGCGAACCGCT
GCTGGCCGCCGCGGCCGCGCTGCGCACCGCCGGGGACCCGGTCACCCGGACGGCGCTGGCGCTGGCGCCGGAAATCGAGG
AGATCCTGGCGGAGTATCCGCTGCGCGATCTGCTGACCCGCGGCGAGCAGTACGGGGTCTGGGTGGATCGGCCGCTGGCC
CGCTTCGGCTCCTGGTATGAGATGTTCCCCCGCTCGACCGGCGGCTGGGACGACGACGGCAACCCCGTGCACGGCACCTT
CGCGACGGCGGCCGCGGCGCTGCCGCGCATCGCGGCGATGGGCTTCGACGTCGTCTACCTGCCGCCGATCCATCCCATCG
GGAAGGTACATCGCAAGGGGCGCAACAACTCTCCCACCGCCGGCCCGACGGATGTGGGCTCGCCGTGGGCGATCGGCAGC
GACGAGGGCGGCCACGACGCCGTCCACCCGGATCTGGGCACCATCGAGGATTTCGACGCGTTCGTCGCCCGGGCCCGCGA
ACTGGGCATGGAGGTGGCGCTCGACCTGGCGCTGCAATGCGCGCCGGATCATCCGTGGGCGCGCGAGCACCGCAACTGGT
TCACCGAATTGCCGGACGGCACCATCGCGTACGCGGAGAACCCGCCCAAGAAGTACCAGGACATCTATCCGCTCAACTTC
GACAACGACCCGGCCGGTCTCTATGACGAGGTGCTGCGGGTGGTCCGGCACTGGATCGACCACGGCGTCAAGTTCTTTCG
CGTCGACAACCCGCACACCAAACCGCCGGACTTCTGGGCCTGGCTGATCGCCGCGGTCAAGGGCATCGATCCCGATGTGC
TGTTCCTGTCCGAGGCCTTCACCCCGCCGGTCCGGCAAAACGGGCTGACCAAACTGGGCTTCACCCAGTCCTATACGTAC
TTCACCTGGCGCACCGCGAAGTGGGAGCTGACCGAGTTCGGCAACGACATCGCCGCGCTGGCCGATTTCCGCCGGCCCAA
CCTGTTCGTCAACACACCCGACATCCTGCACGCCATCCTGCAGCACAACGGTCCGGGGATGTTCGCCATCCGCGCGGTGC
TGGCCGCCACGATGGGTCCGGCCTGGGGGGTGTACTCGGGCTACGAACTGTTCGAGCACCGCGCGGTGCGCGAGGGCAGC
GAGGAGTACCTGAACTCCGAGAAGTACGAACTGCGGCCCCGCGATTTCGCCGGCGCGCTCGCCGAGGGAAGGTCGCTCGA
GCCATTCATCACGCAGCTCAACACGATTCGGCGGCTCCACCCCGCGCTGCAACAGCTGCGCACCATCCATTTCCACGGCG
TGGACAACGACGCGCTGCTCGCCTACAGCAAGTTCGACCCGGCCACCGGCGACTGCGTGCTGGTGGTGGTGACGCTCAAC
GCATTCGGCCCCGAGGAAGCGACGCTGTTTTTAGACATGGCGGCATTGGGTATGGAGCCTTACGAGCGCTTTTGGGTGCG
CGACGAGATCACCGGCCAGGAATTCCAGTGGGGGCAAGCCAATTACGTTCGCATCGACCCGGCACAGGCGGTCGCTCACG
TCATCAACATGCCGCTCATCCCCGATGAGGCTCGAATGACCTTGCTACGCAGGCGCTGA
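
The GC content listed for this gene can in principle be recomputed from the sequence block above. The following is a minimal sketch, assuming the value is simply the percentage of G and C bases over the full gene sequence; `gc_content` is a hypothetical helper name, and the short input in the usage line is a toy example rather than the gene itself.

```python
def gc_content(fasta_text: str) -> float:
    """Percentage of G and C bases in a FASTA-style block.

    Header lines (starting with '>') and line breaks are ignored.
    """
    seq = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    ).upper()
    if not seq:
        raise ValueError("no sequence data found")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input: 4 of 6 bases are G or C.
print(gc_content(">toy\nGGCC\nAT\n"))
```

Pasting the full ">2139_bases" block into `gc_content` would be the way to check the listed value.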

Upstream 100 bases:

>100_bases
ACCCAGAAAGTTCCCCACCACAGGAAGTTTCCCGACGAGCGGCATTCCGCACGGAGCACACCGCGGGTGGGCCGCCCACA
ATCGGAAACCCCGGTGCCGA

Downstream 100 bases:

>100_bases
ACGCACGACGAAACGGACCGGAGGTGAAAGGGACATGAGCCACGCCGATCAACTCGCTCGGACGCACCTGGCGCCCGATC
CTGCGGACCTGTCGCGCCTG

Product: hypothetical protein

Products: D-Glucose 6-phosphate; D-Glucose [C]

Alternate protein names: NA

Number of amino acids: Translated: 712; Mature: 712
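
The residue count is consistent with the gene length: 2139 bases make 713 codons, and the gene sequence ends in the stop codon TGA, leaving 712 translated residues. A small sketch of that arithmetic, assuming the listed gene sequence is a complete coding sequence that includes its stop codon (`residues_from_cds` is an illustrative name):

```python
def residues_from_cds(n_bases: int) -> int:
    """Number of translated residues for a CDS that includes its stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the terminal stop codon


print(residues_from_cds(2139))  # 712
```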

Protein sequence:

>712_residues
MRTARRARRTRADRNGVLVPGRVEIDNVQPVVSCGTYPAKAVVGEVVPVSASVWREGHEAVAATLVVRYLGPAYPPVTET
RRVKAVQAPVADREGRVDQQRIKPLSLPMTMGPEPYVFHGQFTPDQVGLWTFRVDGWGDPIHSWRHGLVAKLDAGQGETE
LSNDLLVGAELFERAATGVPRARREPLLAAAAALRTAGDPVTRTALALAPEIEEILAEYPLRDLLTRGEQYGVWVDRPLA
RFGSWYEMFPRSTGGWDDDGNPVHGTFATAAAALPRIAAMGFDVVYLPPIHPIGKVHRKGRNNSPTAGPTDVGSPWAIGS
DEGGHDAVHPDLGTIEDFDAFVARARELGMEVALDLALQCAPDHPWAREHRNWFTELPDGTIAYAENPPKKYQDIYPLNF
DNDPAGLYDEVLRVVRHWIDHGVKFFRVDNPHTKPPDFWAWLIAAVKGIDPDVLFLSEAFTPPVRQNGLTKLGFTQSYTY
FTWRTAKWELTEFGNDIAALADFRRPNLFVNTPDILHAILQHNGPGMFAIRAVLAATMGPAWGVYSGYELFEHRAVREGS
EEYLNSEKYELRPRDFAGALAEGRSLEPFITQLNTIRRLHPALQQLRTIHFHGVDNDALLAYSKFDPATGDCVLVVVTLN
AFGPEEATLFLDMAALGMEPYERFWVRDEITGQEFQWGQANYVRIDPAQAVAHVINMPLIPDEARMTLLRRR

Sequences:

>Translated_712_residues
MRTARRARRTRADRNGVLVPGRVEIDNVQPVVSCGTYPAKAVVGEVVPVSASVWREGHEAVAATLVVRYLGPAYPPVTET
RRVKAVQAPVADREGRVDQQRIKPLSLPMTMGPEPYVFHGQFTPDQVGLWTFRVDGWGDPIHSWRHGLVAKLDAGQGETE
LSNDLLVGAELFERAATGVPRARREPLLAAAAALRTAGDPVTRTALALAPEIEEILAEYPLRDLLTRGEQYGVWVDRPLA
RFGSWYEMFPRSTGGWDDDGNPVHGTFATAAAALPRIAAMGFDVVYLPPIHPIGKVHRKGRNNSPTAGPTDVGSPWAIGS
DEGGHDAVHPDLGTIEDFDAFVARARELGMEVALDLALQCAPDHPWAREHRNWFTELPDGTIAYAENPPKKYQDIYPLNF
DNDPAGLYDEVLRVVRHWIDHGVKFFRVDNPHTKPPDFWAWLIAAVKGIDPDVLFLSEAFTPPVRQNGLTKLGFTQSYTY
FTWRTAKWELTEFGNDIAALADFRRPNLFVNTPDILHAILQHNGPGMFAIRAVLAATMGPAWGVYSGYELFEHRAVREGS
EEYLNSEKYELRPRDFAGALAEGRSLEPFITQLNTIRRLHPALQQLRTIHFHGVDNDALLAYSKFDPATGDCVLVVVTLN
AFGPEEATLFLDMAALGMEPYERFWVRDEITGQEFQWGQANYVRIDPAQAVAHVINMPLIPDEARMTLLRRR
>Mature_712_residues
MRTARRARRTRADRNGVLVPGRVEIDNVQPVVSCGTYPAKAVVGEVVPVSASVWREGHEAVAATLVVRYLGPAYPPVTET
RRVKAVQAPVADREGRVDQQRIKPLSLPMTMGPEPYVFHGQFTPDQVGLWTFRVDGWGDPIHSWRHGLVAKLDAGQGETE
LSNDLLVGAELFERAATGVPRARREPLLAAAAALRTAGDPVTRTALALAPEIEEILAEYPLRDLLTRGEQYGVWVDRPLA
RFGSWYEMFPRSTGGWDDDGNPVHGTFATAAAALPRIAAMGFDVVYLPPIHPIGKVHRKGRNNSPTAGPTDVGSPWAIGS
DEGGHDAVHPDLGTIEDFDAFVARARELGMEVALDLALQCAPDHPWAREHRNWFTELPDGTIAYAENPPKKYQDIYPLNF
DNDPAGLYDEVLRVVRHWIDHGVKFFRVDNPHTKPPDFWAWLIAAVKGIDPDVLFLSEAFTPPVRQNGLTKLGFTQSYTY
FTWRTAKWELTEFGNDIAALADFRRPNLFVNTPDILHAILQHNGPGMFAIRAVLAATMGPAWGVYSGYELFEHRAVREGS
EEYLNSEKYELRPRDFAGALAEGRSLEPFITQLNTIRRLHPALQQLRTIHFHGVDNDALLAYSKFDPATGDCVLVVVTLN
AFGPEEATLFLDMAALGMEPYERFWVRDEITGQEFQWGQANYVRIDPAQAVAHVINMPLIPDEARMTLLRRR

Specific function: Could be involved in glycogen catabolism [H]

COG id: COG0366

COG function: function code G; Glycosidases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 13 family. GlgE subfamily [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR021828
- InterPro:   IPR006047
- InterPro:   IPR017853
- InterPro:   IPR013781 [H]

Pfam domain/function: PF00128 Alpha-amylase; PF11896 DUF3416 [H]

EC number: 3.2.1.93 [C]

Molecular weight: Translated: 79354; Mature: 79354

Theoretical pI: Translated: 5.89; Mature: 5.89

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
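
The percentages above are residue fractions of the protein sequence. A minimal sketch of how such values could be computed, assuming they are counts of C and M divided by the sequence length and rounded to one decimal place (the record does not state the rounding rule); `cys_met_content` is a hypothetical helper and the input shown is a toy sequence:

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()  # drop line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy input: 1 Cys and 1 Met in 10 residues.
print(cys_met_content("MCAAAAAAAA"))
```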

Secondary structure:

>Translated Secondary Structure
MRTARRARRTRADRNGVLVPGRVEIDNVQPVVSCGTYPAKAVVGEVVPVSASVWREGHEA
CCCHHHHHHHHCCCCCEEECCEEEECCCCHHHHCCCCCHHHHHHCCCCCCHHHHHHHHHH
VAATLVVRYLGPAYPPVTETRRVKAVQAPVADREGRVDQQRIKPLSLPMTMGPEPYVFHG
HHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCCCCCCCEEEEC
QFTPDQVGLWTFRVDGWGDPIHSWRHGLVAKLDAGQGETELSNDLLVGAELFERAATGVP
CCCCCCCEEEEEEECCCCCHHHHHHCCCEEEECCCCCCCHHCCCEEEHHHHHHHHHCCCC
RARREPLLAAAAALRTAGDPVTRTALALAPEIEEILAEYPLRDLLTRGEQYGVWVDRPLA
CHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCCCEEECCHHH
RFGSWYEMFPRSTGGWDDDGNPVHGTFATAAAALPRIAAMGFDVVYLPPIHPIGKVHRKG
HHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHCC
RNNSPTAGPTDVGSPWAIGSDEGGHDAVHPDLGTIEDFDAFVARARELGMEVALDLALQC
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCHHHHHHHHHEE
APDHPWAREHRNWFTELPDGTIAYAENPPKKYQDIYPLNFDNDPAGLYDEVLRVVRHWID
CCCCCHHHHHHHHHHCCCCCEEEECCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH
HGVKFFRVDNPHTKPPDFWAWLIAAVKGIDPDVLFLSEAFTPPVRQNGLTKLGFTQSYTY
CCEEEEEECCCCCCCHHHHHHHHHHHHCCCCCEEEHHHHCCCCHHHCCCEECCCCCCEEE
FTWRTAKWELTEFGNDIAALADFRRPNLFVNTPDILHAILQHNGPGMFAIRAVLAATMGP
EEEEECEEEHHHHCCHHHHHHHCCCCCEEECCHHHHHHHHHCCCCCHHHHHHHHHHHCCC
AWGVYSGYELFEHRAVREGSEEYLNSEKYELRPRDFAGALAEGRSLEPFITQLNTIRRLH
CCCCCHHHHHHHHHHHHCCCHHHHCCCCEECCCHHHHHHHHCCCCCCHHHHHHHHHHHHH
PALQQLRTIHFHGVDNDALLAYSKFDPATGDCVLVVVTLNAFGPEEATLFLDMAALGMEP
HHHHHHHEEEEECCCCCCEEEEECCCCCCCCEEEEEEEECCCCCCCCEEEEEHHHHCCCH
YERFWVRDEITGQEFQWGQANYVRIDPAQAVAHVINMPLIPDEARMTLLRRR
HHHHEECCCCCCCCCCCCCCCEEEECHHHHHHHHHCCCCCCCHHHHHHHHCC
>Mature Secondary Structure
MRTARRARRTRADRNGVLVPGRVEIDNVQPVVSCGTYPAKAVVGEVVPVSASVWREGHEA
CCCHHHHHHHHCCCCCEEECCEEEECCCCHHHHCCCCCHHHHHHCCCCCCHHHHHHHHHH
VAATLVVRYLGPAYPPVTETRRVKAVQAPVADREGRVDQQRIKPLSLPMTMGPEPYVFHG
HHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCCCCCCCEEEEC
QFTPDQVGLWTFRVDGWGDPIHSWRHGLVAKLDAGQGETELSNDLLVGAELFERAATGVP
CCCCCCCEEEEEEECCCCCHHHHHHCCCEEEECCCCCCCHHCCCEEEHHHHHHHHHCCCC
RARREPLLAAAAALRTAGDPVTRTALALAPEIEEILAEYPLRDLLTRGEQYGVWVDRPLA
CHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCCCEEECCHHH
RFGSWYEMFPRSTGGWDDDGNPVHGTFATAAAALPRIAAMGFDVVYLPPIHPIGKVHRKG
HHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHCC
RNNSPTAGPTDVGSPWAIGSDEGGHDAVHPDLGTIEDFDAFVARARELGMEVALDLALQC
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCHHHHHHHHHEE
APDHPWAREHRNWFTELPDGTIAYAENPPKKYQDIYPLNFDNDPAGLYDEVLRVVRHWID
CCCCCHHHHHHHHHHCCCCCEEEECCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH
HGVKFFRVDNPHTKPPDFWAWLIAAVKGIDPDVLFLSEAFTPPVRQNGLTKLGFTQSYTY
CCEEEEEECCCCCCCHHHHHHHHHHHHCCCCCEEEHHHHCCCCHHHCCCEECCCCCCEEE
FTWRTAKWELTEFGNDIAALADFRRPNLFVNTPDILHAILQHNGPGMFAIRAVLAATMGP
EEEEECEEEHHHHCCHHHHHHHCCCCCEEECCHHHHHHHHHCCCCCHHHHHHHHHHHCCC
AWGVYSGYELFEHRAVREGSEEYLNSEKYELRPRDFAGALAEGRSLEPFITQLNTIRRLH
CCCCCHHHHHHHHHHHHCCCHHHHCCCCEECCCHHHHHHHHCCCCCCHHHHHHHHHHHHH
PALQQLRTIHFHGVDNDALLAYSKFDPATGDCVLVVVTLNAFGPEEATLFLDMAALGMEP
HHHHHHHEEEEECCCCCCEEEEECCCCCCCCEEEEEEEECCCCCCCCEEEEEHHHHCCCH
YERFWVRDEITGQEFQWGQANYVRIDPAQAVAHVINMPLIPDEARMTLLRRR
HHHHEECCCCCCCCCCCCCCCEEEECHHHHHHHHHCCCCCCCHHHHHHHHCC
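
The secondary-structure blocks above alternate a residue line with a state line of the same length. The sketch below summarises such a state string, assuming the common three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out. `state_composition` is an illustrative name and the input is a toy string.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a state string."""
    s = "".join(states.split()).upper()  # join wrapped lines
    if not s:
        raise ValueError("empty state string")
    counts = Counter(s)
    return {state: round(100.0 * n / len(s), 1) for state, n in counts.items()}


# Toy input: 4 helix, 2 strand, 4 coil positions.
print(state_composition("HHHHEECCCC"))
```

To apply it to this record, only the state lines (every second line of a block) should be concatenated and passed in.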

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: H2O; alpha,alpha'-Trehalose 6-phosphate [C]

Specific reaction: H2O + alpha,alpha'-Trehalose 6-phosphate --> D-Glucose 6-phosphate + D-Glucose [C]

General reaction: O-Glycosyl bond hydrolysis [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12788972 [H]