| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is aglA [H]
Identifier: 209400134
GI number: 209400134
Start: 4756075
End: 4757691
Strand: Reverse
Name: aglA [H]
Synonym: ECH74115_5114
Alternate gene names: 209400134
Gene position: 4757691-4756075 (Counterclockwise)
Preceding gene: 209396187
Following gene: 209396857
Centisome position: 85.38
GC content: 52.07
Gene sequence:
>1617_bases ATGCTCAGTCAAATTCAACGCTTTGGCGGCGCGATGTTCACGCCAGTGCTGCTGTTTCCCTTCGCCGGGATTGTGGTGGG TCTTGCCATCTTGCTGCAAAACCCGATGTTTGTCGGGGAATCACTGACCGATCCGAACAGTTTATTCGCGCAAATCGTAC ACATTATTGAAGAGGGCGGTTGGACGGTATTCCGTAATATGCCGCTGATTTTTGCTGTCGGTTTACCCATTGGCCTTGCT AAGCAAGCGCAGGGGCGTGCTTGTCTGGCGGTGATGGTGAGTTTCCTGACCTGGAACTATTTCATCAATGCGATGGGAAT GACCTGGGGAAGCTACTTCGGCGTCGATTTCACTCAGGACGCGGTGGCAGGTAGCGGTCTGACAATGATGGCCGGGATTA AAACCCTCGATACCAGCATTATCGGCGCAATTATCATTTCCGGCATTGTGACGGCGCTGCATAACCGTCTGTTCGATAAA AAACTGCCGGTTTTTCTCGGCATTTTCCAGGGGACGTCTTATGTGGTGATTATCGCCTTCCTGGTGATGATCCCCTGTGC CTGGCTGACATTGCTCGGCTGGCCAAAAGTACAAATGGGGATTGAATCTCTGCAAGCGTTCCTGCGTTCGGCGGGTGCGC TTGGGGTGTGGGTTTACACCTTCCTCGAACGTATTCTGATCCCAACCGGTTTACACCACTTCATCTACGGACCGTTTATC TTTGGTCCGGCAGCTGTTGAAGGCGGAATTCAGATGTACTGGGCGCAGCATCTGCAAGAGTTCAGTTTGAGCGCCGAGCC GCTGAAATCGTTGTTCCCGGAAGGAGGTTTTGCCCTGCACGGTAACTCAAAAATCTTTGGTGCCGTGGGCATTTCTTTAG CGATGTACTTCACTGCCGCACCGGAAAATCGGGTAAAAGTGGCGGGTTTGCTGATTCCCGCAACCTTAACCGCCATGCTG GTGGGAATTACCGAACCGCTGGAATTTACCTTCCTGTTCATTTCACCGTTGCTGTTTGCGGTACACGCTGTGCTTGCGGC CTCAATGTCGACCGTGATGTATCTCTTTGGTGTGGTGGGCAACATGGGCGGAGGTCTGATTGACCAGGTTTTACCGCAAA ACTGGATCCCGATGTTCAGCAACCACGCGGATATGATGCTGACCCAAATCGCCATTGGGTTGTGCTTTACCCTGCTGTAC TTCGTGGTTTTCCGCACCCTGATTCTGCAATTCAACATGTGCACGCCGGGACGTGAAGATGCGGAAGTGAAACTCTACTC AAAAGCCGAATACAAAGCCTCGCGAGGCCAAACCACCGCGGCAGAGCCAAAAAAAGAGCTGGATCAGGCTGCCGGTATCC TGCAAGCCCTGGGCGGGGTCGGCAATATCTCCAGCATTAACAATTGTGCGACGCGTCTACGCATTGCACTGCATGACATG TCACAAACGCTGGATGACGAAGTCTTTAAAAAGCTGGGAGCGCACGGCGTCTTCCGTAGTGGCGATGCCATTCAGGTGAT CATTGGTCTGCATGTATCCCAGCTGCGTGAACAGCTCGATAGCTTAATTAATTCTCATCAATCAGCAGAAAATGTTGCCA TTACGGAGGCAGTATAA
Upstream 100 bases:
>100_bases CGTCATGTTCTGCCGATCGCATCTTCCTATCCTCGCTCCAGGCCTGCCGCATAACCAATCAGGCTTCCTACTTACAGAAT TGAGAAAAGAGGATGTGGAA
Downstream 100 bases:
>100_bases TGACCAAATTCTCAGTGGTTGTCGCAGGCGGTGGAAGCACCTTTACGCCAGGCATCGTGTTGATGCTCCTGGCGAATCAG GACCGTTTCCCGCTTCGTGC
Product: PTS system, alpha-glucoside-specific IIBC component
Products: NA
Alternate protein names: Alpha-glucoside permease IIC component; PTS system alpha-glucoside-specific EIIC component; Alpha-glucoside-specific phosphotransferase enzyme IIB component; PTS system alpha-glucoside-specific EIIB component [H]
Number of amino acids: Translated: 538; Mature: 538
Protein sequence:
>538_residues MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLY FVVFRTLILQFNMCTPGREDAEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV
Sequences:
>Translated_538_residues MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLY FVVFRTLILQFNMCTPGREDAEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV >Mature_538_residues MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLY FVVFRTLILQFNMCTPGREDAEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV
Specific function: Involved in the transport and simultaneous phosphorylation at O-6 of the glucosyl moiety of sucrose and its five linkage-isomeric alpha-D-glucosyl-D-fructoses. Can also transport maltose, isomaltose and maltitol, phosphorylating at O-6 of their non-reduci
COG id: COG1263
COG function: function code G; Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Probable) [H]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 PTS EIIC type-1 domain [H]
Homologues:
Organism=Escherichia coli, GI1787908, Length=544, Percent_Identity=32.7205882352941, Blast_Score=231, Evalue=7e-62, Organism=Escherichia coli, GI1787343, Length=530, Percent_Identity=30.7547169811321, Blast_Score=227, Evalue=2e-60, Organism=Escherichia coli, GI1786894, Length=530, Percent_Identity=31.8867924528302, Blast_Score=169, Evalue=5e-43,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR018113 - InterPro: IPR004719 - InterPro: IPR001996 - InterPro: IPR003352 - InterPro: IPR013013 - InterPro: IPR011535 - InterPro: IPR010975 [H]
Pfam domain/function: PF00367 PTS_EIIB; PF02378 PTS_EIIC [H]
EC number: =2.7.1.69 [H]
Molecular weight: Translated: 58346; Mature: 58346
Theoretical pI: Translated: 6.51; Mature: 6.51
Prosite motif: PS01035 PTS_EIIB_TYPE_1_CYS ; PS51098 PTS_EIIB_TYPE_1 ; PS51103 PTS_EIIC_TYPE_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein) 4.1 %Met (Translated Protein) 5.0 %Cys+Met (Translated Protein) 0.9 %Cys (Mature Protein) 4.1 %Met (Mature Protein) 5.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGG CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC WTVFRNMPLIFAVGLPIGLAKQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQD EEEEECCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCHH AVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDKKLPVFLGIFQGTSYVVIIAF HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHH LVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAA CCHHHHHCHHHHHHHHHHHHHCCCHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHCCC PENRVKVAGLLIPATLTAMLVGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVG CCCCEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH NMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLYFVVFRTLILQFNMCTPGRED CCCCHHHHHHCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCC AEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM CCEEEEECHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV HHHHHHHHHHHHCCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCC >Mature Secondary Structure MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGG CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC WTVFRNMPLIFAVGLPIGLAKQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQD EEEEECCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCHH AVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDKKLPVFLGIFQGTSYVVIIAF HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHH LVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGPFI HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAA CCHHHHHCHHHHHHHHHHHHHCCCHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHCCC PENRVKVAGLLIPATLTAMLVGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVG CCCCEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH NMGGGLIDQVLPQNWIPMFSNHADMMLTQIAIGLCFTLLYFVVFRTLILQFNMCTPGRED CCCCHHHHHHCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCC AEVKLYSKAEYKASRGQTTAAEPKKELDQAAGILQALGGVGNISSINNCATRLRIALHDM CCEEEEECHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH SQTLDDEVFKKLGAHGVFRSGDAIQVIIGLHVSQLREQLDSLINSHQSAENVAITEAV HHHHHHHHHHHHCCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11473129; 11322729 [H]