Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is pheP [H]
Identifier: 157160068
GI number: 157160068
Start: 638053
End: 639429
Strand: Direct
Name: pheP [H]
Synonym: EcHS_A0623
Alternate gene names: 157160068
Gene position: 638053-639429 (Clockwise)
Preceding gene: 157160067
Following gene: 157160074
Centisome position: 13.74
GC content: 52.58
Gene sequence:
>1377_bases GTGAAAAACGCGTCAACCGTATCGGAAGATACTGCGTCGAATCAAGAGCCGACGCTTCATCGCGGATTACATAACCGTCA TATTCAACTGATTGCGTTGGGTGGCGCAATTGGTACTGGTCTGTTTCTTGGCATTGGCCCGGCGATTCAGATGGCGGGTC CGGCTGTATTGCTGGGCTACGGCGTCGCCGGGATCATCGCTTTCCTGATTATGCGCCAGCTTGGCGAAATGGTGGTTGAG GAGCCGGTATCCGGTTCATTTGCCCACTTTGCCTATAAATACTGGGGACCGTTTGCGGGCTTCCTCTCTGGCTGGAACTA CTGGGTAATGTTCGTGCTGGTGGGAATGGCAGAGCTGACCGCTGCGGGCATCTATATGCAGTACTGGTTCCCGGATGTTC CAACGTGGATTTGGGCTGCCGCCTTCTTTATTATCATCAACGCCGTTAACCTGGTGAACGTGCGCTTATATGGCGAAACC GAGTTCTGGTTTGCGCTGATTAAAGTGCTGGCGATCATCGGTATGATCGGCTTTGGCCTGTGGCTGCTGTTTTCTGGTCA CGGCGGCGAGAAAGCCAGTATCGACAACCTCTGGCGCTACGGTGGTTTCTTCGCCACCGGCTGGAATGGGCTGATTTTGT CGCTGGCGGTAATTATGTTCTCCTTCGGCGGTCTGGAGCTGATTGGGATTACTGCCGCTGAAGCGCGCGATCCGGAAAAA AGCATTCCAAAAGCGGTAAATCAGGTGGTGTATCGCATCCTGCTGTTTTACATCGGTTCACTGGTGGTTTTACTGGCGCT CTATCCGTGGATGGAAGTGAAATCCAACAGTAGCCCGTTTGTGATGATTTTCCATAATCTCGACAGCAACGTGGTAGCTT CTGCGCTGAACTTCGTCATTCTGGTAGCATCGCTGTCAGTGTATAACAGCGGGGTTTACTCTAACAGCCGCATGCTGTTT GGCCTTTCTGTGCAGGGTAATGCGCCGAAGTTTTTGACTCGCGTCAGCCGTCGCGGTGTGCCGATTAACTCGCTGATGCT TTCCGGAGCGATCACTTCGCTGGTGGTGTTAATCAACTATCTGCTGCCGCAAAAAGCGTTTGGTCTGCTGATGGCGCTGG TGGTAGCAACGCTGCTGTTGAACTGGATTATGATCTGTCTGGCGCATCTGCGTTTTCGTGCAGCGATGCGACGTCAGGGG CGTGAAACACAGTTTAAGGCGCTGCTTTATCCGTTCGGCAACTATCTTTGCATCGCCTTCCTCGGCATGATTTTGCTGCT GATGTGCACGATGGATGATATGCGCTTGTCAGCGATCCTGCTGCCGGTGTGGATTGTATTCCTGTTTGTGGCATTTAAAA CGCTGCGTCGGAAATAA
Upstream 100 bases:
>100_bases AGCAGGATACCCCGTTTAACCGTGTGGATTGTGTCTTGCGACGATGGGCACTAAATGTTAAAAGGTGCCCCTCAACAAAA AAGACACACAGGGGAAAGGC
Downstream 100 bases:
>100_bases GGCATTCACGCTACATCCGACAAAACGATGTCCGCTCTCATCCATTCGATGAGAGCGGTTTTTTTAATTACTGCTTAAAT GCACCCGCCAGAGAGCGAAT
Product: phenylalanine transporter
Products: Proton [Cytoplasm]; L-phenylalanine [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 458; Mature: 458
Protein sequence:
>458_residues MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK
Sequences:
>Translated_458_residues MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK >Mature_458_residues MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK
Specific function: Permease that is involved in the transport across the cytoplasmic membrane of phenylalanine [H]
COG id: COG1113
COG function: function code E; Gamma-aminobutyrate permease and related permeases
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the amino acid-polyamine-organocation (APC) superfamily. Amino acid transporter (AAT) (TC 2.A.3.1) family [H]
Homologues:
Organism=Homo sapiens, GI110347453, Length=290, Percent_Identity=27.5862068965517, Blast_Score=79, Evalue=1e-14, Organism=Homo sapiens, GI181337167, Length=405, Percent_Identity=24.1975308641975, Blast_Score=74, Evalue=3e-13, Organism=Homo sapiens, GI258645169, Length=441, Percent_Identity=21.7687074829932, Blast_Score=72, Evalue=1e-12, Organism=Homo sapiens, GI115648063, Length=460, Percent_Identity=21.9565217391304, Blast_Score=69, Evalue=1e-11, Organism=Homo sapiens, GI115648022, Length=460, Percent_Identity=21.9565217391304, Blast_Score=69, Evalue=1e-11, Organism=Escherichia coli, GI1786789, Length=458, Percent_Identity=99.5633187772926, Blast_Score=914, Evalue=0.0, Organism=Escherichia coli, GI1786302, Length=434, Percent_Identity=64.0552995391705, Blast_Score=593, Evalue=1e-171, Organism=Escherichia coli, GI48994972, Length=429, Percent_Identity=44.0559440559441, Blast_Score=369, Evalue=1e-103, Organism=Escherichia coli, GI1786602, Length=452, Percent_Identity=44.9115044247788, Blast_Score=353, Evalue=1e-98, Organism=Escherichia coli, GI1790653, Length=453, Percent_Identity=37.9690949227373, Blast_Score=337, Evalue=8e-94, Organism=Escherichia coli, GI87081915, Length=425, Percent_Identity=36.4705882352941, Blast_Score=300, Evalue=9e-83, Organism=Escherichia coli, GI1788480, Length=418, Percent_Identity=37.799043062201, Blast_Score=271, Evalue=7e-74, Organism=Escherichia coli, GI1789017, Length=430, Percent_Identity=36.7441860465116, Blast_Score=271, Evalue=7e-74, Organism=Escherichia coli, GI87081708, Length=447, Percent_Identity=35.1230425055928, Blast_Score=269, Evalue=3e-73, Organism=Escherichia coli, GI87081869, Length=175, Percent_Identity=28.5714285714286, Blast_Score=79, Evalue=8e-16, Organism=Escherichia coli, GI87082023, Length=366, Percent_Identity=23.4972677595628, Blast_Score=68, Evalue=1e-12, Organism=Escherichia coli, GI2367353, Length=370, Percent_Identity=21.3513513513514, Blast_Score=63, Evalue=4e-11, Organism=Caenorhabditis elegans, GI17540018, Length=382, Percent_Identity=23.5602094240838, Blast_Score=72, Evalue=9e-13, Organism=Caenorhabditis elegans, GI17531343, Length=391, Percent_Identity=21.9948849104859, Blast_Score=67, Evalue=2e-11, Organism=Saccharomyces cerevisiae, GI6320772, Length=442, Percent_Identity=35.972850678733, Blast_Score=253, Evalue=5e-68, Organism=Saccharomyces cerevisiae, GI6324061, Length=443, Percent_Identity=35.4401805869075, Blast_Score=246, Evalue=4e-66, Organism=Saccharomyces cerevisiae, GI6324059, Length=450, Percent_Identity=34.4444444444444, Blast_Score=216, Evalue=4e-57, Organism=Saccharomyces cerevisiae, GI6322892, Length=424, Percent_Identity=35.8490566037736, Blast_Score=215, Evalue=9e-57, Organism=Saccharomyces cerevisiae, GI6319824, Length=449, Percent_Identity=32.293986636971, Blast_Score=214, Evalue=2e-56, Organism=Saccharomyces cerevisiae, GI6324990, Length=441, Percent_Identity=32.4263038548753, Blast_Score=213, Evalue=5e-56, Organism=Saccharomyces cerevisiae, GI6324924, Length=448, Percent_Identity=31.4732142857143, Blast_Score=206, Evalue=4e-54, Organism=Saccharomyces cerevisiae, GI6321629, Length=416, Percent_Identity=31.9711538461538, Blast_Score=206, Evalue=4e-54, Organism=Saccharomyces cerevisiae, GI6324553, Length=399, Percent_Identity=33.3333333333333, Blast_Score=202, Evalue=9e-53, Organism=Saccharomyces cerevisiae, GI6320717, Length=484, Percent_Identity=27.8925619834711, Blast_Score=198, Evalue=2e-51, Organism=Saccharomyces cerevisiae, GI6321053, Length=439, Percent_Identity=31.8906605922551, Blast_Score=192, Evalue=1e-49, Organism=Saccharomyces cerevisiae, GI6319543, Length=436, Percent_Identity=32.1100917431193, Blast_Score=191, Evalue=1e-49, Organism=Saccharomyces cerevisiae, GI6320251, Length=488, Percent_Identity=28.2786885245902, Blast_Score=187, Evalue=3e-48, Organism=Saccharomyces cerevisiae, GI6322967, Length=395, Percent_Identity=31.3924050632911, Blast_Score=186, Evalue=8e-48, Organism=Saccharomyces cerevisiae, GI6319542, Length=480, Percent_Identity=26.6666666666667, Blast_Score=177, Evalue=4e-45, Organism=Saccharomyces cerevisiae, GI6324981, Length=416, Percent_Identity=32.6923076923077, Blast_Score=173, Evalue=6e-44, Organism=Saccharomyces cerevisiae, GI6319608, Length=413, Percent_Identity=29.7820823244552, Blast_Score=172, Evalue=8e-44, Organism=Saccharomyces cerevisiae, GI6320364, Length=487, Percent_Identity=27.7207392197125, Blast_Score=134, Evalue=2e-32, Organism=Drosophila melanogaster, GI24666159, Length=422, Percent_Identity=22.7488151658768, Blast_Score=88, Evalue=1e-17, Organism=Drosophila melanogaster, GI221512776, Length=422, Percent_Identity=22.7488151658768, Blast_Score=88, Evalue=1e-17, Organism=Drosophila melanogaster, GI24667468, Length=427, Percent_Identity=22.7166276346604, Blast_Score=69, Evalue=7e-12, Organism=Drosophila melanogaster, GI221331183, Length=351, Percent_Identity=23.6467236467236, Blast_Score=66, Evalue=5e-11, Organism=Drosophila melanogaster, GI17647653, Length=351, Percent_Identity=23.6467236467236, Blast_Score=66, Evalue=5e-11, Organism=Drosophila melanogaster, GI24664379, Length=351, Percent_Identity=23.6467236467236, Blast_Score=66, Evalue=5e-11,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004841 - InterPro: IPR002293 - InterPro: IPR004840 [H]
Pfam domain/function: PF00324 AA_permease [H]
EC number: NA
Molecular weight: Translated: 50678; Mature: 50678
Theoretical pI: Translated: 9.78; Mature: 9.78
Prosite motif: PS00218 AMINO_ACID_PERMEASE_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 4.4 %Met (Translated Protein) 5.0 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 4.4 %Met (Mature Protein) 5.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGY CCCCCCCCCCCCCCCCCHHHHCCCCCCEEEEEECCHHHHHHHHHCCHHHHHCCCHHHHHH GVAGIIAFLIMRQLGEMVVEEPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELT HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH AAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGETEFWFALIKVLAIIGMIGFGL HHHHHHEECCCCCHHHHHHHHHHHHHHHHHEEEEEEECCHHHHHHHHHHHHHHHHHHHHH WLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK HHHCCCCCCCCCCHHHHHHHCCEEECCHHHHHHHHHHHHHHCCCEEEEEEEHHHCCCCHH SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVI HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCEEEEEECCCHHHHHHHHHHHH LVASLSVYNSGVYSNSRMLFGLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINY HHHHHHHHHCCCCCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH LLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQGRETQFKALLYPFGNYLCIAF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH LGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC >Mature Secondary Structure MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGY CCCCCCCCCCCCCCCCCHHHHCCCCCCEEEEEECCHHHHHHHHHCCHHHHHCCCHHHHHH GVAGIIAFLIMRQLGEMVVEEPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELT HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH AAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGETEFWFALIKVLAIIGMIGFGL HHHHHHEECCCCCHHHHHHHHHHHHHHHHHEEEEEEECCHHHHHHHHHHHHHHHHHHHHH WLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK HHHCCCCCCCCCCHHHHHHHCCEEECCHHHHHHHHHHHHHHCCCEEEEEEEHHHCCCCHH SIPKAVNQVVYRILLFYIGSLVVLLALYPWMEVKSNSSPFVMIFHNLDSNVVASALNFVI HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCEEEEEECCCHHHHHHHHHHHH LVASLSVYNSGVYSNSRMLFGLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINY HHHHHHHHHCCCCCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH LLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQGRETQFKALLYPFGNYLCIAF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH LGMILLLMCTMDDMRLSAILLPVWIVFLFVAFKTLRRK HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: Proton [Periplasm]; L-phenylalanine [Periplasm] [C]
Specific reaction: Proton [Periplasm] + L-phenylalanine [Periplasm] = Proton [Cytoplasm] + L-phenylalanine [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 1711024; 8905232; 9278503; 8626334 [H]