Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is fryA [H]
Identifier: 157161856
GI number: 157161856
Start: 2533392
End: 2535887
Strand: Reverse
Name: fryA [H]
Synonym: EcHS_A2520
Alternate gene names: 157161856
Gene position: 2535887-2533392 (Counterclockwise)
Preceding gene: 157161857
Following gene: 157161852
Centisome position: 54.61
GC content: 55.21
Gene sequence:
>2496_bases ATGTTAACGATTCAATTTCTCTGTCCTTTGCCTAACGGTCTACATGCTCGTCCGGCGTGGGAACTTAAAGAACAGTGCAG CCAGTGGCAAAGCGAAATCACTTTTATTAACCATCGCCAGAACGCAAAGGCAGATGCGAAAAGCTCGCTGGCGCTGATTG GCACCGGCACCCTATTTAATGACAGTTGCAGCCTGAACATTAGCGGCAGCGATGAAGAGCAGGCGCGGCGCGTGCTGGAA GAGTACATCCAGGTGCGCTTTATCGACAGCGACAGCGTTCAGCCTACGCAGGCAGAACTGACGGCGCATCCGCTGCCGCG TTCATTAAGCCGCCTTAACCCGGATTTACTGTACGGCAATGTGCTGGCAAGCGGCGTCGGCGTGGGTACGCTGACCCTGT TACAAAGCGACAGCCTCGACAGTTATCGGGCAATCCCCGCCAGTGCGCAAGATTCCACCCGGCTGGAGCACAGCCTGGCA ACGCTTGCCGAGCAACTGAACCAGCAATTGCGTGAGCGTGACGGCGAAAGCAAAACTATCCTCAGCGCCCATTTGTCGCT GATTCAGGATGATGAATTTGCAGGCAATATCCGTCGCCTGATGACAGAACAGCATCAGGGGCTGGGGGCGGCGATCATCA GCAATATGGAGCAGGTTTGCGCCAAACTTTCTGCCTCTGCCAGCGATTATCTGCGCGAACGTGTTAGCGACATTCGCGAT ATCAGCGAACAGTTGCTGCATATCACCTGGCCGGAACTGAAGCCGCGCAACAAGCTGGTGCTTGAAAAACCGACCATTCT GGTGGCTGAAGATTTAACCCCAAGCCAGTTTTTGAGCCTTGATTTGAAAAATCTTGCGGGCATGATTCTGGAGAAAACCG GGCGCACCTCGCATACACTGATTCTGGCCCGTGCCTCGGCGATCCCGGTACTGAGTGGCTTGCCGCTGGATGCGATTGCC CGTTATGCCGGGCAACCTGCAGTGCTTGACGCCCAGTGCGGCGTGCTGGCGATTAACCCGAATGACGCGGTGAGCGGTTA TTATCAGGTCGCGCAGACGCTGGCGGATAAACGCCAAAAACAACAGGCGCAGGCTGCCGCGCAGCTGGCCTATTCCCGTG ATAACAAGCGTATTGATATTGCGGCGAATATCGGCACCGCTCTGGAAGCGCCAGGCGCGTTTGCCAACGGCGCGGAAGGT GTCGGGCTGTTCCGTACCGAAATGCTCTATATGGATCGCGACAGCGCGCCGGACGAGCAGGAGCAATTTGAAGCCTACCA GCAGGTGCTACTGGCGGCGGGCGACAAGCCGATTATCTTCCGCACGATGGACATCGGCGGCGATAAAAGCATTCCTTATC TGAATATTCCCCAGGAAGAGAACCCGTTCCTCGGCTATCGCGCGGTACGTATTTACCCGGAATTTGCTGGCCTGTTCCGC ACTCAACTGCGGGCCATTTTGCGCGCCGCCAGTTTCGGCAACGCCCAGTTGATGATCCCGATGGTTCACAGCCTCGATCA GATCTTATGGGTGAAAGGCGAGATCCAAAAAGCGATCGTTGAGCTTAAGCGCGATGGCCTGCGTCATGCAGAGACGATTA CGCTTGGGATCATGGTGGAAGTTCCGTCGGTGTGCTACATCATCGACCACTTCTGCGATGAGGTCGATTTCTTCAGTATC GGCTCCAACGATATGACCCAGTATCTGTATGCGGTCGATCGTAATAACCCGCGCGTATCGCCGCTATATAACCCGATTAC GCCATCGTTCCTGCGCATGTTGCAGCAAATAGTTACCACTGCGCATCAGCGGGGCAAATGGGTAGGCATTTGCGGTGAAC TGGGCGGTGAAAGCCGTTATCTGCCGCTACTGCTTGGGCTGGGCCTGGATGAGCTGAGTATGAGTAGCCCGCGTATTCCG GCGGTGAAAAGCCAGCTTCGTCAACTGGATAGCGAGGCGTGTCGGGAACTGGCGCGTCAGGCATGTGAATGCCGCAGTGC GCAGGAAATTGAAGCGTTACTCACCGCCTTTACGCCGGAAGAAGACGTTCGCCCACTGCTGGCGCTGGAGAATATCTTTG TTGATCAGGATTTTAGCAATAAAGAGCAGGCGATCCAGTTCCTGTGCGGCAACCTCGGCGTTAACGGGCGCACTGAACAT CCGTTTGAGCTGGAAGAAGATGTCTGGCAGCGGGAAGAGATTGTTACCACCGGCGTTGGTTTTGGCGTAGCGATCCCGCA CACCAAATCTCAGTGGATCCGTCATTCCAGTATCAGCATTGCCCGGCTGGCGAAACCGATTGGCTGGCAGTCAGAAATGG GCGAAGTCGAACTGGTGATCATGCTGACGCTGGGTGCTAACGAAGGGATGAATCATGTGAAAGTCTTCTCGCAGCTGGCG CGTAAACTGGTGAATAAAAACTTCCGCCAGTCGTTGTTTGCCGCGCAAGATGCACAAAGTATCCTGACGCTGCTGGAAAC AGAATTAACCTTCTGA
Upstream 100 bases:
>100_bases GATGCAGCAACTTTTATCTGCCCTTATTCAACGTCTTACGCGTGAGACGGTTGTTCAACTGACGGATTTCAGATGATCTC CTGATTAACCCGGAGCGGTT
Downstream 100 bases:
>100_bases CGTTAGCCCTGAAAACGGGCGCTGTACTCTCCCGGCGTCAGACCAAACTGACGCCGGAAAACGCGACAAAAATAGTCGCT ATCCGGAAAACCGCAACGCT
Product: multiphosphoryl transfer protein 1
Products: NA
Alternate protein names: MTP; Phosphoenolpyruvate-protein phosphotransferase; Phosphotransferase system enzyme I; Phosphocarrier protein HPr; Protein H; Fructose-like phosphotransferase enzyme IIA component; PTS system fructose-like EIIA component [H]
Number of amino acids: Translated: 831; Mature: 831
Protein sequence:
>831_residues MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEITFINHRQNAKADAKSSLALIGTGTLFNDSCSLNISGSDEEQARRVLE EYIQVRFIDSDSVQPTQAELTAHPLPRSLSRLNPDLLYGNVLASGVGVGTLTLLQSDSLDSYRAIPASAQDSTRLEHSLA TLAEQLNQQLRERDGESKTILSAHLSLIQDDEFAGNIRRLMTEQHQGLGAAIISNMEQVCAKLSASASDYLRERVSDIRD ISEQLLHITWPELKPRNKLVLEKPTILVAEDLTPSQFLSLDLKNLAGMILEKTGRTSHTLILARASAIPVLSGLPLDAIA RYAGQPAVLDAQCGVLAINPNDAVSGYYQVAQTLADKRQKQQAQAAAQLAYSRDNKRIDIAANIGTALEAPGAFANGAEG VGLFRTEMLYMDRDSAPDEQEQFEAYQQVLLAAGDKPIIFRTMDIGGDKSIPYLNIPQEENPFLGYRAVRIYPEFAGLFR TQLRAILRAASFGNAQLMIPMVHSLDQILWVKGEIQKAIVELKRDGLRHAETITLGIMVEVPSVCYIIDHFCDEVDFFSI GSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVTTAHQRGKWVGICGELGGESRYLPLLLGLGLDELSMSSPRIP AVKSQLRQLDSEACRELARQACECRSAQEIEALLTAFTPEEDVRPLLALENIFVDQDFSNKEQAIQFLCGNLGVNGRTEH PFELEEDVWQREEIVTTGVGFGVAIPHTKSQWIRHSSISIARLAKPIGWQSEMGEVELVIMLTLGANEGMNHVKVFSQLA RKLVNKNFRQSLFAAQDAQSILTLLETELTF
Sequences:
>Translated_831_residues MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEITFINHRQNAKADAKSSLALIGTGTLFNDSCSLNISGSDEEQARRVLE EYIQVRFIDSDSVQPTQAELTAHPLPRSLSRLNPDLLYGNVLASGVGVGTLTLLQSDSLDSYRAIPASAQDSTRLEHSLA TLAEQLNQQLRERDGESKTILSAHLSLIQDDEFAGNIRRLMTEQHQGLGAAIISNMEQVCAKLSASASDYLRERVSDIRD ISEQLLHITWPELKPRNKLVLEKPTILVAEDLTPSQFLSLDLKNLAGMILEKTGRTSHTLILARASAIPVLSGLPLDAIA RYAGQPAVLDAQCGVLAINPNDAVSGYYQVAQTLADKRQKQQAQAAAQLAYSRDNKRIDIAANIGTALEAPGAFANGAEG VGLFRTEMLYMDRDSAPDEQEQFEAYQQVLLAAGDKPIIFRTMDIGGDKSIPYLNIPQEENPFLGYRAVRIYPEFAGLFR TQLRAILRAASFGNAQLMIPMVHSLDQILWVKGEIQKAIVELKRDGLRHAETITLGIMVEVPSVCYIIDHFCDEVDFFSI GSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVTTAHQRGKWVGICGELGGESRYLPLLLGLGLDELSMSSPRIP AVKSQLRQLDSEACRELARQACECRSAQEIEALLTAFTPEEDVRPLLALENIFVDQDFSNKEQAIQFLCGNLGVNGRTEH PFELEEDVWQREEIVTTGVGFGVAIPHTKSQWIRHSSISIARLAKPIGWQSEMGEVELVIMLTLGANEGMNHVKVFSQLA RKLVNKNFRQSLFAAQDAQSILTLLETELTF >Mature_831_residues MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEITFINHRQNAKADAKSSLALIGTGTLFNDSCSLNISGSDEEQARRVLE EYIQVRFIDSDSVQPTQAELTAHPLPRSLSRLNPDLLYGNVLASGVGVGTLTLLQSDSLDSYRAIPASAQDSTRLEHSLA TLAEQLNQQLRERDGESKTILSAHLSLIQDDEFAGNIRRLMTEQHQGLGAAIISNMEQVCAKLSASASDYLRERVSDIRD ISEQLLHITWPELKPRNKLVLEKPTILVAEDLTPSQFLSLDLKNLAGMILEKTGRTSHTLILARASAIPVLSGLPLDAIA RYAGQPAVLDAQCGVLAINPNDAVSGYYQVAQTLADKRQKQQAQAAAQLAYSRDNKRIDIAANIGTALEAPGAFANGAEG VGLFRTEMLYMDRDSAPDEQEQFEAYQQVLLAAGDKPIIFRTMDIGGDKSIPYLNIPQEENPFLGYRAVRIYPEFAGLFR TQLRAILRAASFGNAQLMIPMVHSLDQILWVKGEIQKAIVELKRDGLRHAETITLGIMVEVPSVCYIIDHFCDEVDFFSI GSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVTTAHQRGKWVGICGELGGESRYLPLLLGLGLDELSMSSPRIP AVKSQLRQLDSEACRELARQACECRSAQEIEALLTAFTPEEDVRPLLALENIFVDQDFSNKEQAIQFLCGNLGVNGRTEH PFELEEDVWQREEIVTTGVGFGVAIPHTKSQWIRHSSISIARLAKPIGWQSEMGEVELVIMLTLGANEGMNHVKVFSQLA RKLVNKNFRQSLFAAQDAQSILTLLETELTF
Specific function: Multifunctional protein that includes general (non sugar-specific) and sugar-specific components of the phosphoenolpyruvate-dependent sugar phosphotransferase system (sugar PTS). This major carbohydrate active-transport system catalyzes the phosphorylatio
COG id: COG1080
COG function: function code G; Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
Gene ontology:
Cell location: Cytoplasm (Probable) [H]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 PTS EIIA type-2 domain [H]
Homologues:
Organism=Escherichia coli, GI1788726, Length=831, Percent_Identity=100, Blast_Score=1709, Evalue=0.0, Organism=Escherichia coli, GI48994992, Length=834, Percent_Identity=45.083932853717, Blast_Score=703, Evalue=0.0, Organism=Escherichia coli, GI1788756, Length=575, Percent_Identity=39.1304347826087, Blast_Score=394, Evalue=1e-110, Organism=Escherichia coli, GI1789193, Length=524, Percent_Identity=32.2519083969466, Blast_Score=255, Evalue=9e-69, Organism=Escherichia coli, GI1787994, Length=422, Percent_Identity=26.0663507109005, Blast_Score=105, Evalue=1e-23, Organism=Escherichia coli, GI1786951, Length=136, Percent_Identity=27.2058823529412, Blast_Score=69, Evalue=8e-13,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008279 - InterPro: IPR006318 - InterPro: IPR023151 - InterPro: IPR000121 - InterPro: IPR016152 - InterPro: IPR002178 - InterPro: IPR005698 - InterPro: IPR000032 - InterPro: IPR004715 - InterPro: IPR008731 - InterPro: IPR015813 [H]
Pfam domain/function: PF05524 PEP-utilisers_N; PF00391 PEP-utilizers; PF02896 PEP-utilizers_C; PF00381 PTS-HPr; PF00359 PTS_EIIA_2 [H]
EC number: =2.7.3.9 [H]
Molecular weight: Translated: 92131; Mature: 92131
Theoretical pI: Translated: 4.97; Mature: 4.97
Prosite motif: PS51094 PTS_EIIA_TYPE_2 ; PS00742 PEP_ENZYMES_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 3.4 %Cys+Met (Translated Protein) 1.4 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEITFINHRQNAKADAKSSLALIGTGTLFN CEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCEEEECCCCCCCCCCCCEEEEEECEEEC DSCSLNISGSDEEQARRVLEEYIQVRFIDSDSVQPTQAELTAHPLPRSLSRLNPDLLYGN CCEEEEECCCCHHHHHHHHHHHHHEEEECCCCCCCCHHHHCCCCCHHHHHHCCHHHEEHH VLASGVGVGTLTLLQSDSLDSYRAIPASAQDSTRLEHSLATLAEQLNQQLRERDGESKTI HHHCCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH LSAHLSLIQDDEFAGNIRRLMTEQHQGLGAAIISNMEQVCAKLSASASDYLRERVSDIRD HHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH ISEQLLHITWPELKPRNKLVLEKPTILVAEDLTPSQFLSLDLKNLAGMILEKTGRTSHTL HHHHHHEEECCCCCCCCCEEEECCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCEEE ILARASAIPVLSGLPLDAIARYAGQPAVLDAQCGVLAINPNDAVSGYYQVAQTLADKRQK EEEECCCCCCCCCCCHHHHHHHCCCCCEEECCCCEEEECCCHHHHHHHHHHHHHHHHHHH QQAQAAAQLAYSRDNKRIDIAANIGTALEAPGAFANGAEGVGLFRTEMLYMDRDSAPDEQ HHHHHHHHHHHCCCCCEEEEEECCCCHHCCCCCCCCCCCCCCHHHHHEEEECCCCCCCHH EQFEAYQQVLLAAGDKPIIFRTMDIGGDKSIPYLNIPQEENPFLGYRAVRIYPEFAGLFR HHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCEEECCCCCCCCCCEEEEEEEHHHHHHHH TQLRAILRAASFGNAQLMIPMVHSLDQILWVKGEIQKAIVELKRDGLRHAETITLGIMVE HHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHEEEEEEEEE VPSVCYIIDHFCDEVDFFSIGSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVTT CCHHHHHHHHHHCCCCEEECCCCHHHHHHHHEECCCCCCCCCCCCCCHHHHHHHHHHHHH AHQRGKWVGICGELGGESRYLPLLLGLGLDELSMSSPRIPAVKSQLRQLDSEACRELARQ HHHCCCEEEEECCCCCCCCEEHHHCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH ACECRSAQEIEALLTAFTPEEDVRPLLALENIFVDQDFSNKEQAIQFLCGNLGVNGRTEH HHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCCC PFELEEDVWQREEIVTTGVGFGVAIPHTKSQWIRHSSISIARLAKPIGWQSEMGEVELVI CCCCHHHHHHHHHHHHCCCCCCEECCCCHHHHHHHCCCHHHHHHCCCCCCCCCCCEEEEE MLTLGANEGMNHVKVFSQLARKLVNKNFRQSLFAAQDAQSILTLLETELTF EEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEITFINHRQNAKADAKSSLALIGTGTLFN CEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCEEEECCCCCCCCCCCCEEEEEECEEEC DSCSLNISGSDEEQARRVLEEYIQVRFIDSDSVQPTQAELTAHPLPRSLSRLNPDLLYGN CCEEEEECCCCHHHHHHHHHHHHHEEEECCCCCCCCHHHHCCCCCHHHHHHCCHHHEEHH VLASGVGVGTLTLLQSDSLDSYRAIPASAQDSTRLEHSLATLAEQLNQQLRERDGESKTI HHHCCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH LSAHLSLIQDDEFAGNIRRLMTEQHQGLGAAIISNMEQVCAKLSASASDYLRERVSDIRD HHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH ISEQLLHITWPELKPRNKLVLEKPTILVAEDLTPSQFLSLDLKNLAGMILEKTGRTSHTL HHHHHHEEECCCCCCCCCEEEECCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCEEE ILARASAIPVLSGLPLDAIARYAGQPAVLDAQCGVLAINPNDAVSGYYQVAQTLADKRQK EEEECCCCCCCCCCCHHHHHHHCCCCCEEECCCCEEEECCCHHHHHHHHHHHHHHHHHHH QQAQAAAQLAYSRDNKRIDIAANIGTALEAPGAFANGAEGVGLFRTEMLYMDRDSAPDEQ HHHHHHHHHHHCCCCCEEEEEECCCCHHCCCCCCCCCCCCCCHHHHHEEEECCCCCCCHH EQFEAYQQVLLAAGDKPIIFRTMDIGGDKSIPYLNIPQEENPFLGYRAVRIYPEFAGLFR HHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCEEECCCCCCCCCCEEEEEEEHHHHHHHH TQLRAILRAASFGNAQLMIPMVHSLDQILWVKGEIQKAIVELKRDGLRHAETITLGIMVE HHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHEEEEEEEEE VPSVCYIIDHFCDEVDFFSIGSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVTT CCHHHHHHHHHHCCCCEEECCCCHHHHHHHHEECCCCCCCCCCCCCCHHHHHHHHHHHHH AHQRGKWVGICGELGGESRYLPLLLGLGLDELSMSSPRIPAVKSQLRQLDSEACRELARQ HHHCCCEEEEECCCCCCCCEEHHHCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH ACECRSAQEIEALLTAFTPEEDVRPLLALENIFVDQDFSNKEQAIQFLCGNLGVNGRTEH HHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCCC PFELEEDVWQREEIVTTGVGFGVAIPHTKSQWIRHSSISIARLAKPIGWQSEMGEVELVI CCCCHHHHHHHHHHHHCCCCCCEECCCCHHHHHHHCCCHHHHHHCCCCCCCCCCCEEEEE MLTLGANEGMNHVKVFSQLARKLVNKNFRQSLFAAQDAQSILTLLETELTF EEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]