Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is pheP [H]

Identifier: 157160068

GI number: 157160068

Start: 638053

End: 639429

Strand: Direct

Name: pheP [H]

Synonym: EcHS_A0623

Alternate gene names: 157160068

Gene position: 638053-639429 (Clockwise)

Preceding gene: 157160067

Following gene: 157160074

Centisome position: 13.74
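
The gene length and centisome position can be cross-checked from the coordinates listed above. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length (the record does not state its convention; this assumption reproduces the listed value). The constant and function names are illustrative only.

```python
# Coordinates copied from this record (assumed 1-based, inclusive).
GENOME_LENGTH = 4_643_538
GENE_START = 638_053
GENE_END = 639_429


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(GENE_START, GENE_END)
print(length)                                           # 1377 bases
print(length // 3 - 1)                                  # 458 codons, excluding the stop codon
print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 13.74
```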

GC content (%): 52.58
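
GC content is the percentage of bases in the gene sequence that are G or C. The sketch below shows one way to compute it, assuming an unambiguous DNA string (A, C, G, T only); the function name `gc_content` is hypothetical and not part of this database. Applying it to the full 1377-base gene sequence from this record should give a value comparable to the one listed.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove line breaks and spaces so FASTA-style wrapped text can be pasted in.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))  # 50.0
```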

Gene sequence:

>1377_bases
GTGAAAAACGCGTCAACCGTATCGGAAGATACTGCGTCGAATCAAGAGCCGACGCTTCATCGCGGATTACATAACCGTCA
TATTCAACTGATTGCGTTGGGTGGCGCAATTGGTACTGGTCTGTTTCTTGGCATTGGCCCGGCGATTCAGATGGCGGGTC
CGGCTGTATTGCTGGGCTACGGCGTCGCCGGGATCATCGCTTTCCTGATTATGCGCCAGCTTGGCGAAATGGTGGTTGAG
GAGCCGGTATCCGGTTCATTTGCCCACTTTGCCTATAAATACTGGGGACCGTTTGCGGGCTTCCTCTCTGGCTGGAACTA
CTGGGTAATGTTCGTGCTGGTGGGAATGGCAGAGCTGACCGCTGCGGGCATCTATATGCAGTACTGGTTCCCGGATGTTC
CAACGTGGATTTGGGCTGCCGCCTTCTTTATTATCATCAACGCCGTTAACCTGGTGAACGTGCGCTTATATGGCGAAACC
GAGTTCTGGTTTGCGCTGATTAAAGTGCTGGCGATCATCGGTATGATCGGCTTTGGCCTGTGGCTGCTGTTTTCTGGTCA
CGGCGGCGAGAAAGCCAGTATCGACAACCTCTGGCGCTACGGTGGTTTCTTCGCCACCGGCTGGAATGGGCTGATTTTGT
CGCTGGCGGTAATTATGTTCTCCTTCGGCGGTCTGGAGCTGATTGGGATTACTGCCGCTGAAGCGCGCGATCCGGAAAAA
AGCATTCCAAAAGCGGTAAATCAGGTGGTGTATCGCATCCTGCTGTTTTACATCGGTTCACTGGTGGTTTTACTGGCGCT
CTATCCGTGGATGGAAGTGAAATCCAACAGTAGCCCGTTTGTGATGATTTTCCATAATCTCGACAGCAACGTGGTAGCTT
CTGCGCTGAACTTCGTCATTCTGGTAGCATCGCTGTCAGTGTATAACAGCGGGGTTTACTCTAACAGCCGCATGCTGTTT
GGCCTTTCTGTGCAGGGTAATGCGCCGAAGTTTTTGACTCGCGTCAGCCGTCGCGGTGTGCCGATTAACTCGCTGATGCT
TTCCGGAGCGATCACTTCGCTGGTGGTGTTAATCAACTATCTGCTGCCGCAAAAAGCGTTTGGTCTGCTGATGGCGCTGG
TGGTAGCAACGCTGCTGTTGAACTGGATTATGATCTGTCTGGCGCATCTGCGTTTTCGTGCAGCGATGCGACGTCAGGGG
CGTGAAACACAGTTTAAGGCGCTGCTTTATCCGTTCGGCAACTATCTTTGCATCGCCTTCCTCGGCATGATTTTGCTGCT
GATGTGCACGATGGATGATATGCGCTTGTCAGCGATCCTGCTGCCGGTGTGGATTGTATTCCTGTTTGTGGCATTTAAAA
CGCTGCGTCGGAAATAA
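
The relationship between the gene sequence above and the 458-residue protein in this record can be illustrated with a small translation routine. This is a sketch only: it assumes the standard codon assignments, and it assumes that an initial ATG, GTG, or TTG codon is read as methionine, as is usual for bacterial start codons (this gene begins with GTG). All names are illustrative.

```python
# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating GTG or TTG is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAAAAACGCGTCAACCGTATCGGAAGAT"))  # MKNASTVSED
# Last 18 bases (final five sense codons plus the TAA stop).
print(translate("ACGCTGCGTCGGAAATAA"))              # TLRRK
```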

Upstream 100 bases:

>100_bases
AGCAGGATACCCCGTTTAACCGTGTGGATTGTGTCTTGCGACGATGGGCACTAAATGTTAAAAGGTGCCCCTCAACAAAA
AAGACACACAGGGGAAAGGC

Downstream 100 bases:

>100_bases
GGCATTCACGCTACATCCGACAAAACGATGTCCGCTCTCATCCATTCGATGAGAGCGGTTTTTTTAATTACTGCTTAAAT
GCACCCGCCAGAGAGCGAAT

Product: phenylalanine transporter

Products: Proton [Cytoplasm]; L-phenylalanine [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 458; Mature: 458

Protein sequence:

>458_residues
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE
EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET
EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK
SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF
GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG
RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK

Sequences:

>Translated_458_residues
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE
EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET
EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK
SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF
GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG
RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK
>Mature_458_residues
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE
EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET
EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK
SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF
GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG
RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK

Specific function: Permease involved in the transport of phenylalanine across the cytoplasmic membrane [H]

COG id: COG1113

COG function: function code E; Gamma-aminobutyrate permease and related permeases

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the amino acid-polyamine-organocation (APC) superfamily. Amino acid transporter (AAT) (TC 2.A.3.1) family [H]

Homologues:

Organism=Homo sapiens, GI110347453, Length=290, Percent_Identity=27.5862068965517, Blast_Score=79, Evalue=1e-14,
Organism=Homo sapiens, GI181337167, Length=405, Percent_Identity=24.1975308641975, Blast_Score=74, Evalue=3e-13,
Organism=Homo sapiens, GI258645169, Length=441, Percent_Identity=21.7687074829932, Blast_Score=72, Evalue=1e-12,
Organism=Homo sapiens, GI115648063, Length=460, Percent_Identity=21.9565217391304, Blast_Score=69, Evalue=1e-11,
Organism=Homo sapiens, GI115648022, Length=460, Percent_Identity=21.9565217391304, Blast_Score=69, Evalue=1e-11,
Organism=Escherichia coli, GI1786789, Length=458, Percent_Identity=99.5633187772926, Blast_Score=914, Evalue=0.0,
Organism=Escherichia coli, GI1786302, Length=434, Percent_Identity=64.0552995391705, Blast_Score=593, Evalue=1e-171,
Organism=Escherichia coli, GI48994972, Length=429, Percent_Identity=44.0559440559441, Blast_Score=369, Evalue=1e-103,
Organism=Escherichia coli, GI1786602, Length=452, Percent_Identity=44.9115044247788, Blast_Score=353, Evalue=1e-98,
Organism=Escherichia coli, GI1790653, Length=453, Percent_Identity=37.9690949227373, Blast_Score=337, Evalue=8e-94,
Organism=Escherichia coli, GI87081915, Length=425, Percent_Identity=36.4705882352941, Blast_Score=300, Evalue=9e-83,
Organism=Escherichia coli, GI1788480, Length=418, Percent_Identity=37.799043062201, Blast_Score=271, Evalue=7e-74,
Organism=Escherichia coli, GI1789017, Length=430, Percent_Identity=36.7441860465116, Blast_Score=271, Evalue=7e-74,
Organism=Escherichia coli, GI87081708, Length=447, Percent_Identity=35.1230425055928, Blast_Score=269, Evalue=3e-73,
Organism=Escherichia coli, GI87081869, Length=175, Percent_Identity=28.5714285714286, Blast_Score=79, Evalue=8e-16,
Organism=Escherichia coli, GI87082023, Length=366, Percent_Identity=23.4972677595628, Blast_Score=68, Evalue=1e-12,
Organism=Escherichia coli, GI2367353, Length=370, Percent_Identity=21.3513513513514, Blast_Score=63, Evalue=4e-11,
Organism=Caenorhabditis elegans, GI17540018, Length=382, Percent_Identity=23.5602094240838, Blast_Score=72, Evalue=9e-13,
Organism=Caenorhabditis elegans, GI17531343, Length=391, Percent_Identity=21.9948849104859, Blast_Score=67, Evalue=2e-11,
Organism=Saccharomyces cerevisiae, GI6320772, Length=442, Percent_Identity=35.972850678733, Blast_Score=253, Evalue=5e-68,
Organism=Saccharomyces cerevisiae, GI6324061, Length=443, Percent_Identity=35.4401805869075, Blast_Score=246, Evalue=4e-66,
Organism=Saccharomyces cerevisiae, GI6324059, Length=450, Percent_Identity=34.4444444444444, Blast_Score=216, Evalue=4e-57,
Organism=Saccharomyces cerevisiae, GI6322892, Length=424, Percent_Identity=35.8490566037736, Blast_Score=215, Evalue=9e-57,
Organism=Saccharomyces cerevisiae, GI6319824, Length=449, Percent_Identity=32.293986636971, Blast_Score=214, Evalue=2e-56,
Organism=Saccharomyces cerevisiae, GI6324990, Length=441, Percent_Identity=32.4263038548753, Blast_Score=213, Evalue=5e-56,
Organism=Saccharomyces cerevisiae, GI6324924, Length=448, Percent_Identity=31.4732142857143, Blast_Score=206, Evalue=4e-54,
Organism=Saccharomyces cerevisiae, GI6321629, Length=416, Percent_Identity=31.9711538461538, Blast_Score=206, Evalue=4e-54,
Organism=Saccharomyces cerevisiae, GI6324553, Length=399, Percent_Identity=33.3333333333333, Blast_Score=202, Evalue=9e-53,
Organism=Saccharomyces cerevisiae, GI6320717, Length=484, Percent_Identity=27.8925619834711, Blast_Score=198, Evalue=2e-51,
Organism=Saccharomyces cerevisiae, GI6321053, Length=439, Percent_Identity=31.8906605922551, Blast_Score=192, Evalue=1e-49,
Organism=Saccharomyces cerevisiae, GI6319543, Length=436, Percent_Identity=32.1100917431193, Blast_Score=191, Evalue=1e-49,
Organism=Saccharomyces cerevisiae, GI6320251, Length=488, Percent_Identity=28.2786885245902, Blast_Score=187, Evalue=3e-48,
Organism=Saccharomyces cerevisiae, GI6322967, Length=395, Percent_Identity=31.3924050632911, Blast_Score=186, Evalue=8e-48,
Organism=Saccharomyces cerevisiae, GI6319542, Length=480, Percent_Identity=26.6666666666667, Blast_Score=177, Evalue=4e-45,
Organism=Saccharomyces cerevisiae, GI6324981, Length=416, Percent_Identity=32.6923076923077, Blast_Score=173, Evalue=6e-44,
Organism=Saccharomyces cerevisiae, GI6319608, Length=413, Percent_Identity=29.7820823244552, Blast_Score=172, Evalue=8e-44,
Organism=Saccharomyces cerevisiae, GI6320364, Length=487, Percent_Identity=27.7207392197125, Blast_Score=134, Evalue=2e-32,
Organism=Drosophila melanogaster, GI24666159, Length=422, Percent_Identity=22.7488151658768, Blast_Score=88, Evalue=1e-17,
Organism=Drosophila melanogaster, GI221512776, Length=422, Percent_Identity=22.7488151658768, Blast_Score=88, Evalue=1e-17,
Organism=Drosophila melanogaster, GI24667468, Length=427, Percent_Identity=22.7166276346604, Blast_Score=69, Evalue=7e-12,
Organism=Drosophila melanogaster, GI221331183, Length=351, Percent_Identity=23.6467236467236, Blast_Score=66, Evalue=5e-11,
Organism=Drosophila melanogaster, GI17647653, Length=351, Percent_Identity=23.6467236467236, Blast_Score=66, Evalue=5e-11,
Organism=Drosophila melanogaster, GI24664379, Length=351, Percent_Identity=23.6467236467236, Blast_Score=66, Evalue=5e-11,
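
Each homologue entry above is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below shows one way such a line might be parsed into typed fields; it assumes the layout seen in this record (no commas inside organism names, an optional trailing comma), and the function name `parse_homologue` is hypothetical.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from this record into a dictionary."""
    record = {}
    # Drop the trailing comma, then split into individual fields.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI110347453
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI110347453, Length=290, "
    "Percent_Identity=27.5862068965517, Blast_Score=79, Evalue=1e-14,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
# Homo sapiens 110347453 27.59
```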

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004841
- InterPro:   IPR002293
- InterPro:   IPR004840 [H]

Pfam domain/function: PF00324 AA_permease [H]

EC number: NA

Molecular weight (Da): Translated: 50678; Mature: 50678

Theoretical pI: Translated: 9.78; Mature: 9.78

Prosite motif: PS00218 AMINO_ACID_PERMEASE_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7% Cys     (Translated Protein)
4.4% Met     (Translated Protein)
5.0% Cys+Met (Translated Protein)
0.7% Cys     (Mature Protein)
4.4% Met     (Mature Protein)
5.0% Cys+Met (Mature Protein)
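
The Cys/Met percentages above follow directly from residue counts in the 458-residue protein sequence given in this record. A minimal sketch of that calculation, assuming percentages are rounded to one decimal place (the function name `cys_met_content` is illustrative):

```python
# Protein sequence copied from this record (translated and mature forms are identical).
PROTEIN = (
    "MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE"
    "EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET"
    "EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK"
    "SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF"
    "GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG"
    "RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK"
)


def cys_met_content(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


print(len(PROTEIN))              # 458
print(cys_met_content(PROTEIN))  # (0.7, 4.4, 5.0)
```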

Secondary structure:

>Translated Secondary Structure
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGY
CCCCCCCCCCCCCCCCCHHHHCCCCCCEEEEEECCHHHHHHHHHCCHHHHHCCCHHHHHH
GVAGIIAFLIMRQLGEMVVEEPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELT
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH
AAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGETEFWFALIKVLAIIGMIGFGL
HHHHHHEECCCCCHHHHHHHHHHHHHHHHHEEEEEEECCHHHHHHHHHHHHHHHHHHHHH
WLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK
HHHCCCCCCCCCCHHHHHHHCCEEECCHHHHHHHHHHHHHHCCCEEEEEEEHHHCCCCHH
SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCEEEEEECCCHHHHHHHHHHHH
LVASLSVYNSGVYSNSRMLFGLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINY
HHHHHHHHHCCCCCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH
LLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQGRETQFKALLYPFGNYLCIAF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
LGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGY
CCCCCCCCCCCCCCCCCHHHHCCCCCCEEEEEECCHHHHHHHHHCCHHHHHCCCHHHHHH
GVAGIIAFLIMRQLGEMVVEEPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELT
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH
AAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGETEFWFALIKVLAIIGMIGFGL
HHHHHHEECCCCCHHHHHHHHHHHHHHHHHEEEEEEECCHHHHHHHHHHHHHHHHHHHHH
WLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK
HHHCCCCCCCCCCHHHHHHHCCEEECCHHHHHHHHHHHHHHCCCEEEEEEEHHHCCCCHH
SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCEEEEEECCCHHHHHHHHHHHH
LVASLSVYNSGVYSNSRMLFGLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINY
HHHHHHHHHCCCCCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH
LLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQGRETQFKALLYPFGNYLCIAF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
LGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Proton [Periplasm]; L-phenylalanine [Periplasm] [C]

Specific reaction: Proton [Periplasm] + L-phenylalanine [Periplasm] = Proton [Cytoplasm] + L-phenylalanine [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 1711024; 8905232; 9278503; 8626334 [H]