Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

Click here to switch to the map view.

The map label for this gene is aglA [H]

Identifier: 209400134

GI number: 209400134

Start: 4756075

End: 4757691

Strand: Reverse

Name: aglA [H]

Synonym: ECH74115_5114

Alternate gene names: 209400134

Gene position: 4757691-4756075 (Counterclockwise)

Preceding gene: 209396187

Following gene: 209396857

Centisome position: 85.38

GC content: 52.07

Gene sequence:

>1617_bases
ATGCTCAGTCAAATTCAACGCTTTGGCGGCGCGATGTTCACGCCAGTGCTGCTGTTTCCCTTCGCCGGGATTGTGGTGGG
TCTTGCCATCTTGCTGCAAAACCCGATGTTTGTCGGGGAATCACTGACCGATCCGAACAGTTTATTCGCGCAAATCGTAC
ACATTATTGAAGAGGGCGGTTGGACGGTATTCCGTAATATGCCGCTGATTTTTGCTGTCGGTTTACCCATTGGCCTTGCT
AAGCAAGCGCAGGGGCGTGCTTGTCTGGCGGTGATGGTGAGTTTCCTGACCTGGAACTATTTCATCAATGCGATGGGAAT
GACCTGGGGAAGCTACTTCGGCGTCGATTTCACTCAGGACGCGGTGGCAGGTAGCGGTCTGACAATGATGGCCGGGATTA
AAACCCTCGATACCAGCATTATCGGCGCAATTATCATTTCCGGCATTGTGACGGCGCTGCATAACCGTCTGTTCGATAAA
AAACTGCCGGTTTTTCTCGGCATTTTCCAGGGGACGTCTTATGTGGTGATTATCGCCTTCCTGGTGATGATCCCCTGTGC
CTGGCTGACATTGCTCGGCTGGCCAAAAGTACAAATGGGGATTGAATCTCTGCAAGCGTTCCTGCGTTCGGCGGGTGCGC
TTGGGGTGTGGGTTTACACCTTCCTCGAACGTATTCTGATCCCAACCGGTTTACACCACTTCATCTACGGACCGTTTATC
TTTGGTCCGGCAGCTGTTGAAGGCGGAATTCAGATGTACTGGGCGCAGCATCTGCAAGAGTTCAGTTTGAGCGCCGAGCC
GCTGAAATCGTTGTTCCCGGAAGGAGGTTTTGCCCTGCACGGTAACTCAAAAATCTTTGGTGCCGTGGGCATTTCTTTAG
CGATGTACTTCACTGCCGCACCGGAAAATCGGGTAAAAGTGGCGGGTTTGCTGATTCCCGCAACCTTAACCGCCATGCTG
GTGGGAATTACCGAACCGCTGGAATTTACCTTCCTGTTCATTTCACCGTTGCTGTTTGCGGTACACGCTGTGCTTGCGGC
CTCAATGTCGACCGTGATGTATCTCTTTGGTGTGGTGGGCAACATGGGCGGAGGTCTGATTGACCAGGTTTTACCGCAAA
ACTGGATCCCGATGTTCAGCAACCACGCGGATATGATGCTGACCCAAATCGCCATTGGGTTGTGCTTTACCCTGCTGTAC
TTCGTGGTTTTCCGCACCCTGATTCTGCAATTCAACATGTGCACGCCGGGACGTGAAGATGCGGAAGTGAAACTCTACTC
AAAAGCCGAATACAAAGCCTCGCGAGGCCAAACCACCGCGGCAGAGCCAAAAAAAGAGCTGGATCAGGCTGCCGGTATCC
TGCAAGCCCTGGGCGGGGTCGGCAATATCTCCAGCATTAACAATTGTGCGACGCGTCTACGCATTGCACTGCATGACATG
TCACAAACGCTGGATGACGAAGTCTTTAAAAAGCTGGGAGCGCACGGCGTCTTCCGTAGTGGCGATGCCATTCAGGTGAT
CATTGGTCTGCATGTATCCCAGCTGCGTGAACAGCTCGATAGCTTAATTAATTCTCATCAATCAGCAGAAAATGTTGCCA
TTACGGAGGCAGTATAA

Upstream 100 bases:

>100_bases
CGTCATGTTCTGCCGATCGCATCTTCCTATCCTCGCTCCAGGCCTGCCGCATAACCAATCAGGCTTCCTACTTACAGAAT
TGAGAAAAGAGGATGTGGAA

Downstream 100 bases:

>100_bases
TGACCAAATTCTCAGTGGTTGTCGCAGGCGGTGGAAGCACCTTTACGCCAGGCATCGTGTTGATGCTCCTGGCGAATCAG
GACCGTTTCCCGCTTCGTGC

Product: PTS system, alpha-glucoside-specific IIBC component

Products: NA

Alternate protein names: Alpha-glucoside permease IIC component; PTS system alpha-glucoside-specific EIIC component; Alpha-glucoside-specific phosphotransferase enzyme IIB component; PTS system alpha-glucoside-specific EIIB component [H]

Number of amino acids: Translated: 538; Mature: 538

Protein sequence:

>538_residues
MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA
KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK
KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI
FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML
VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLY
FVVFRTLILQFNMCTPGREDAEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM
SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV

Sequences:

>Translated_538_residues
MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA
KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK
KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI
FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML
VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLY
FVVFRTLILQFNMCTPGREDAEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM
SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV
>Mature_538_residues
MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA
KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK
KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI
FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML
VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLY
FVVFRTLILQFNMCTPGREDAEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM
SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV

Specific function: Involved in the transport and simultaneous phosphorylation at O-6 of the glucosyl moiety of sucrose and its five linkage-isomeric alpha-D-glucosyl-D-fructoses. Can also transport maltose, isomaltose and maltitol, phosphorylating at O-6 of their non-reduci

COG id: COG1263

COG function: function code G; Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Probable) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PTS EIIC type-1 domain [H]

Homologues:

Organism=Escherichia coli, GI1787908, Length=544, Percent_Identity=32.7205882352941, Blast_Score=231, Evalue=7e-62,
Organism=Escherichia coli, GI1787343, Length=530, Percent_Identity=30.7547169811321, Blast_Score=227, Evalue=2e-60,
Organism=Escherichia coli, GI1786894, Length=530, Percent_Identity=31.8867924528302, Blast_Score=169, Evalue=5e-43,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018113
- InterPro:   IPR004719
- InterPro:   IPR001996
- InterPro:   IPR003352
- InterPro:   IPR013013
- InterPro:   IPR011535
- InterPro:   IPR010975 [H]

Pfam domain/function: PF00367 PTS_EIIB; PF02378 PTS_EIIC [H]

EC number: =2.7.1.69 [H]

Molecular weight: Translated: 58346; Mature: 58346

Theoretical pI: Translated: 6.51; Mature: 6.51

Prosite motif: PS01035 PTS_EIIB_TYPE_1_CYS ; PS51098 PTS_EIIB_TYPE_1 ; PS51103 PTS_EIIC_TYPE_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
4.1 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
4.1 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGG
CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC
WTVFRNMPLIFAVGLPIGLAKQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQD
EEEEECCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCHH
AVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDKKLPVFLGIFQGTSYVVIIAF
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHH
LVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH
FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAA
CCHHHHHCHHHHHHHHHHHHHCCCHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHCCC
PENRVKVAGLLIPATLTAMLVGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVG
CCCCEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLYFVVFRTLILQFNMCTPGRED
CCCCHHHHHHCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCC
AEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM
CCEEEEECHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH
SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV
HHHHHHHHHHHHCCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCC
>Mature Secondary Structure
MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGG
CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC
WTVFRNMPLIFAVGLPIGLAKQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQD
EEEEECCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCHH
AVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDKKLPVFLGIFQGTSYVVIIAF
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHH
LVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH
FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAA
CCHHHHHCHHHHHHHHHHHHHCCCHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHCCC
PENRVKVAGLLIPATLTAMLVGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVG
CCCCEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLYFVVFRTLILQFNMCTPGRED
CCCCHHHHHHCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCC
AEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM
CCEEEEECHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH
SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV
HHHHHHHHHHHHCCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11473129; 11322729 [H]