| Definition | Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome. |
|---|---|
| Accession | NC_004631 |
| Length | 4,791,961 |
The map label for this gene is aroP [H]
Identifier: 29140697
GI number: 29140697
Start: 176518
End: 177888
Strand: Reverse
Name: aroP [H]
Synonym: t0156
Alternate gene names: 29140697
Gene position: 177888-176518 (Counterclockwise)
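The Start and End values can be cross-checked against the 1371-base gene sequence and the 456-residue product. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this, but the arithmetic agrees with the sequence label); the function name `gene_length` is illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    lo, hi = sorted((start, end))
    return hi - lo + 1


length = gene_length(176518, 177888)
codons = length // 3          # includes the terminal stop codon
residues = codons - 1         # amino acids actually encoded

print(length, codons, residues)  # 1371 457 456
```

Because the gene lies on the reverse strand, the position is written 177888-176518: transcription begins at the higher coordinate.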
Preceding gene: 29140703
Following gene: 29140696
Centisome position: 3.71
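A centisome position expresses a coordinate as a percentage of the chromosome length. The sketch below is an assumption about how the value was derived, since the record does not give a formula; it reproduces 3.71 when the gene's first transcribed base (177888) is used together with the chromosome length of 4,791,961 from the header. The function name `centisome` is illustrative.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


print(round(centisome(177888, 4791961), 2))  # 3.71
print(round(centisome(176518, 4791961), 2))  # 3.68
```

Only the higher coordinate matches the listed value, which is consistent with the database measuring from the start of the reverse-strand gene.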
GC content: 55.8
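GC content is the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute it; the function name `gc_content` is illustrative, and the database's exact method (for example, how ambiguous bases are handled) is not stated in the record. Whitespace is ignored so that the space-separated sequence blocks used in this record can be pasted in directly.

```python
def gc_content(seq: str) -> float:
    """Percentage of G+C among the unambiguous bases (A, C, G, T) in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


print(gc_content("ATGC"))       # 50.0
print(gc_content("GGCC AATT"))  # 50.0 (the space is ignored)
```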
Gene sequence:
>1371_bases ATGGACAGTCAGCAGCACGGCGAGCAGCTCAAGCGCGGCCTGAAAAACCGCCATATTCAGCTTATCGCATTGGGCGGCGC GATAGGGACAGGCTTGTTCCTGGGAAGCGCCTCCGTTATTCAGTCGGCGGGGCCAGGAATTATCCTGGGCTATGCGATTG CAGGCTTTATTGCCTTTCTGATTATGCGCCAGCTGGGAGAGATGGTGGTTGAAGAACCGGTCGCCGGCTCTTTTAGCCAC TTCGCCTATAAATACTGGGGCGGCTTTGCCGGATTCGCTTCCGGCTGGAACTACTGGGTGCTGTACGTATTAGTTGCCAT GGCGGAACTGACCGCAGTGGGTAAATATATCCAGTTCTGGTATCCGGAGATCCCCACCTGGGCTTCTGCGGCAGCCTTCT TTGTGATTATCAACGCCATCAACCTGACCAACGTTAAAGTGTTCGGTGAGATGGAGTTTTGGTTCGCCATTATTAAAGTG ATCGCCGTCATCGCTATGATTCTGTTCGGCGCGTGGTTACTCTTCAGCGATACCGCTGGTCCGCAGGCCACCGTGCGCAA CCTGTGGGAACAGGGCGGCTTTCTGCCGCATGGCTGGACCGGGCTGGTGATGATGATGGCCATTATTATGTTTTCGTTCG GCGGACTGGAGCTGGTAGGGATCACCGCCGCTGAAGCCGATAACCCGGAACAGAGCATCCCCAAAGCTACCAATCAGGTT ATCTATCGCATCCTGATTTTTTATATCGGCTCGCTGGCGGTTCTGCTTTCGCTGCTGCCATGGACTCGCGTCACCGCCGA TACCAGCCCGTTTGTCCTTATCTTCCATGAGCTGGGCGATACGTTTGTCGCCAATGCGCTGAATATCGTGGTTCTCACCG CCGCGCTATCGGTTTATAACAGCTGCGTTTACTGCAACAGCCGTATGCTGTTCGGTCTGGCCCAGCAGGGAAATGCGCCG AAAGCGCTGCTGAACGTCGACAAACGCGGCGTGCCGGTGAGCTCCATTTTGGTTTCCGCCGTCGTGACGGCGTTGTGCGT GCTGCTCAACTACCTGGCCCCAGAGTCCGCATTCGGACTGCTGATGGCGCTGGTGGTCTCCGCGCTGGTCATTAACTGGG CGATGATTAGCCTTGCCCATATGATGTTCCGTCGCGCGAAACAGCAGCAGGGCGTGAAAACCCGCTTCCCGGCCCTGTTT TATCCGTTCGGCAACGTTCTCTGCCTGCTGTTTATGGCGGCGGTATTGATCATTATGCTGATGACGCCAGGTATGGCGAT CTCCGTGTGGCTGATTCCGGTATGGCTGCTCATCCTGGGCGTCGGCTACCTGTGTAAAGAAAAAACGGCAAAAACGGTGA AAGCCCACTGA
Upstream 100 bases:
>100_bases TCAATACAAAAAGCGGAATTGCAAACTTACACACGCACTACTGCATCGTTCAAAATAAGCGCGTTATGAATAAACCGCGC CCGATACACGAGGTTTTATG
Downstream 100 bases:
>100_bases TCGTGGCGTTACTTTTCCGCCCTCCCCGTCCGGGGAGGGTGTTATCTGCCTCACAGTTTCCCACGTATAACCATTTTATC CATATCAGCGGCGTGATCCT
Product: aromatic amino acid transporter
Products: L-tryptophan [Cytoplasm]; Proton [Cytoplasm]; Proton [Cytoplasm]; L-tyrosine [Cytoplasm]; Proton [Cytoplasm]; L-phenylalanine [Cytoplasm] [C]
Alternate protein names: General aromatic amino acid permease [H]
Number of amino acids: Translated: 456; Mature: 456
Protein sequence:
>456_residues MDSQQHGEQLKRGLKNRHIQLIALGGAIGTGLFLGSASVIQSAGPGIILGYAIAGFIAFLIMRQLGEMVVEEPVAGSFSH FAYKYWGGFAGFASGWNYWVLYVLVAMAELTAVGKYIQFWYPEIPTWASAAAFFVIINAINLTNVKVFGEMEFWFAIIKV IAVIAMILFGAWLLFSDTAGPQATVRNLWEQGGFLPHGWTGLVMMMAIIMFSFGGLELVGITAAEADNPEQSIPKATNQV IYRILIFYIGSLAVLLSLLPWTRVTADTSPFVLIFHELGDTFVANALNIVVLTAALSVYNSCVYCNSRMLFGLAQQGNAP KALLNVDKRGVPVSSILVSAVVTALCVLLNYLAPESAFGLLMALVVSALVINWAMISLAHMMFRRAKQQQGVKTRFPALF YPFGNVLCLLFMAAVLIIMLMTPGMAISVWLIPVWLLILGVGYLCKEKTAKTVKAH
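The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below uses the standard genetic code, which assigns the same amino acids to sense codons as the bacterial code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGACAGTCAGCAGCAC"))  # first 18 bases of the gene -> MDSQQH
print(translate("AAAGCCCACTGA"))        # last 12 bases of the gene  -> KAH
```

The two fragments match the first six and last three residues of the listed protein.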
Sequences:
>Translated_456_residues MDSQQHGEQLKRGLKNRHIQLIALGGAIGTGLFLGSASVIQSAGPGIILGYAIAGFIAFLIMRQLGEMVVEEPVAGSFSH FAYKYWGGFAGFASGWNYWVLYVLVAMAELTAVGKYIQFWYPEIPTWASAAAFFVIINAINLTNVKVFGEMEFWFAIIKV IAVIAMILFGAWLLFSDTAGPQATVRNLWEQGGFLPHGWTGLVMMMAIIMFSFGGLELVGITAAEADNPEQSIPKATNQV IYRILIFYIGSLAVLLSLLPWTRVTADTSPFVLIFHELGDTFVANALNIVVLTAALSVYNSCVYCNSRMLFGLAQQGNAP KALLNVDKRGVPVSSILVSAVVTALCVLLNYLAPESAFGLLMALVVSALVINWAMISLAHMMFRRAKQQQGVKTRFPALF YPFGNVLCLLFMAAVLIIMLMTPGMAISVWLIPVWLLILGVGYLCKEKTAKTVKAH
>Mature_456_residues MDSQQHGEQLKRGLKNRHIQLIALGGAIGTGLFLGSASVIQSAGPGIILGYAIAGFIAFLIMRQLGEMVVEEPVAGSFSH FAYKYWGGFAGFASGWNYWVLYVLVAMAELTAVGKYIQFWYPEIPTWASAAAFFVIINAINLTNVKVFGEMEFWFAIIKV IAVIAMILFGAWLLFSDTAGPQATVRNLWEQGGFLPHGWTGLVMMMAIIMFSFGGLELVGITAAEADNPEQSIPKATNQV IYRILIFYIGSLAVLLSLLPWTRVTADTSPFVLIFHELGDTFVANALNIVVLTAALSVYNSCVYCNSRMLFGLAQQGNAP KALLNVDKRGVPVSSILVSAVVTALCVLLNYLAPESAFGLLMALVVSALVINWAMISLAHMMFRRAKQQQGVKTRFPALF YPFGNVLCLLFMAAVLIIMLMTPGMAISVWLIPVWLLILGVGYLCKEKTAKTVKAH
Specific function: Permease that is involved in the transport across the cytoplasmic membrane of the aromatic amino acids (phenylalanine, tyrosine, and tryptophan) [H]
COG id: COG1113
COG function: function code E; Gamma-aminobutyrate permease and related permeases
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the amino acid-polyamine-organocation (APC) superfamily. Amino acid transporter (AAT) (TC 2.A.3.1) family [H]
Homologues:
Organism=Homo sapiens, GI258645169, Length=435, Percent_Identity=22.9885057471264, Blast_Score=81, Evalue=2e-15
Organism=Homo sapiens, GI181337167, Length=280, Percent_Identity=26.4285714285714, Blast_Score=76, Evalue=8e-14
Organism=Homo sapiens, GI110347453, Length=360, Percent_Identity=22.7777777777778, Blast_Score=75, Evalue=9e-14
Organism=Homo sapiens, GI258614003, Length=436, Percent_Identity=22.2477064220184, Blast_Score=73, Evalue=6e-13
Organism=Homo sapiens, GI258614005, Length=436, Percent_Identity=22.2477064220184, Blast_Score=73, Evalue=6e-13
Organism=Escherichia coli, GI1786302, Length=456, Percent_Identity=92.1052631578947, Blast_Score=852, Evalue=0.0
Organism=Escherichia coli, GI1786789, Length=439, Percent_Identity=64.2369020501139, Blast_Score=591, Evalue=1e-170
Organism=Escherichia coli, GI1786602, Length=453, Percent_Identity=42.6048565121413, Blast_Score=343, Evalue=1e-95
Organism=Escherichia coli, GI1790653, Length=464, Percent_Identity=39.2241379310345, Blast_Score=340, Evalue=8e-95
Organism=Escherichia coli, GI48994972, Length=437, Percent_Identity=39.3592677345538, Blast_Score=327, Evalue=1e-90
Organism=Escherichia coli, GI87081915, Length=403, Percent_Identity=38.2133995037221, Blast_Score=296, Evalue=2e-81
Organism=Escherichia coli, GI87081708, Length=465, Percent_Identity=34.4086021505376, Blast_Score=288, Evalue=4e-79
Organism=Escherichia coli, GI1789017, Length=434, Percent_Identity=35.0230414746544, Blast_Score=268, Evalue=7e-73
Organism=Escherichia coli, GI1788480, Length=418, Percent_Identity=34.4497607655502, Blast_Score=250, Evalue=2e-67
Organism=Escherichia coli, GI87082023, Length=357, Percent_Identity=22.4089635854342, Blast_Score=64, Evalue=2e-11
Organism=Caenorhabditis elegans, GI17532491, Length=401, Percent_Identity=26.1845386533666, Blast_Score=81, Evalue=1e-15
Organism=Caenorhabditis elegans, GI17531343, Length=378, Percent_Identity=24.3386243386243, Blast_Score=75, Evalue=9e-14
Organism=Caenorhabditis elegans, GI17533459, Length=289, Percent_Identity=24.9134948096886, Blast_Score=67, Evalue=1e-11
Organism=Saccharomyces cerevisiae, GI6320772, Length=457, Percent_Identity=35.4485776805252, Blast_Score=239, Evalue=5e-64
Organism=Saccharomyces cerevisiae, GI6324061, Length=451, Percent_Identity=33.0376940133038, Blast_Score=231, Evalue=1e-61
Organism=Saccharomyces cerevisiae, GI6324924, Length=456, Percent_Identity=32.6754385964912, Blast_Score=214, Evalue=2e-56
Organism=Saccharomyces cerevisiae, GI6321053, Length=420, Percent_Identity=30.952380952381, Blast_Score=206, Evalue=4e-54
Organism=Saccharomyces cerevisiae, GI6324553, Length=398, Percent_Identity=32.6633165829146, Blast_Score=202, Evalue=8e-53
Organism=Saccharomyces cerevisiae, GI6324059, Length=446, Percent_Identity=33.8565022421525, Blast_Score=198, Evalue=1e-51
Organism=Saccharomyces cerevisiae, GI6322892, Length=398, Percent_Identity=32.9145728643216, Blast_Score=198, Evalue=2e-51
Organism=Saccharomyces cerevisiae, GI6320717, Length=407, Percent_Identity=30.4668304668305, Blast_Score=195, Evalue=1e-50
Organism=Saccharomyces cerevisiae, GI6319824, Length=405, Percent_Identity=29.6296296296296, Blast_Score=186, Evalue=5e-48
Organism=Saccharomyces cerevisiae, GI6324990, Length=421, Percent_Identity=29.6912114014252, Blast_Score=186, Evalue=5e-48
Organism=Saccharomyces cerevisiae, GI6321629, Length=407, Percent_Identity=29.7297297297297, Blast_Score=185, Evalue=1e-47
Organism=Saccharomyces cerevisiae, GI6319543, Length=401, Percent_Identity=30.4239401496259, Blast_Score=174, Evalue=2e-44
Organism=Saccharomyces cerevisiae, GI6320251, Length=405, Percent_Identity=28.8888888888889, Blast_Score=171, Evalue=2e-43
Organism=Saccharomyces cerevisiae, GI6319608, Length=443, Percent_Identity=28.4424379232506, Blast_Score=166, Evalue=9e-42
Organism=Saccharomyces cerevisiae, GI6319542, Length=406, Percent_Identity=28.8177339901478, Blast_Score=164, Evalue=3e-41
Organism=Saccharomyces cerevisiae, GI6322967, Length=399, Percent_Identity=28.5714285714286, Blast_Score=158, Evalue=2e-39
Organism=Saccharomyces cerevisiae, GI6324981, Length=396, Percent_Identity=28.030303030303, Blast_Score=140, Evalue=3e-34
Organism=Saccharomyces cerevisiae, GI6320364, Length=500, Percent_Identity=25.8, Blast_Score=119, Evalue=1e-27
Organism=Drosophila melanogaster, GI24667468, Length=416, Percent_Identity=25.4807692307692, Blast_Score=87, Evalue=2e-17
Organism=Drosophila melanogaster, GI221512776, Length=368, Percent_Identity=22.554347826087, Blast_Score=72, Evalue=6e-13
Organism=Drosophila melanogaster, GI24666159, Length=368, Percent_Identity=22.554347826087, Blast_Score=72, Evalue=6e-13
Organism=Drosophila melanogaster, GI24668806, Length=428, Percent_Identity=21.2616822429907, Blast_Score=70, Evalue=3e-12
Organism=Drosophila melanogaster, GI21356285, Length=428, Percent_Identity=21.2616822429907, Blast_Score=70, Evalue=3e-12
Organism=Drosophila melanogaster, GI24668802, Length=428, Percent_Identity=21.2616822429907, Blast_Score=70, Evalue=3e-12
Organism=Drosophila melanogaster, GI161077963, Length=387, Percent_Identity=25.5813953488372, Blast_Score=69, Evalue=6e-12
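Each homologue entry repeats the same six fields (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so the list can be turned into records for sorting or filtering. The sketch below is one possible parser, not a tool supplied by the database; the names `HIT` and `parse_hits` are illustrative. Whitespace is normalised first so that entries broken across lines are still matched.

```python
import re

# One BLAST hit as written in the Homologues field of this record.
HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> list[dict]:
    """Parse the Homologues field into a list of dictionaries, one per hit."""
    text = " ".join(text.split())  # collapse line breaks and repeated spaces
    return [
        {
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        }
        for m in HIT.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI258645169, Length=435, "
    "Percent_Identity=22.9885057471264, Blast_Score=81, Evalue=2e-15, "
    "Organism=Escherichia coli, GI1786302, Length=456, "
    "Percent_Identity=92.1052631578947, Blast_Score=852, Evalue=0.0,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"], best["score"])  # Escherichia coli 1786302 852
```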
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004841
- InterPro: IPR002293
- InterPro: IPR004840 [H]
Pfam domain/function: PF00324 AA_permease [H]
EC number: NA
Molecular weight: Translated: 49744; Mature: 49744
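A molecular weight of this kind is usually estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below follows that approach; it is an assumption about the method, since the record does not say how 49744 was computed, and small differences from the listed value are possible. The names `RESIDUE_MASS`, `WATER` and `molecular_weight` are illustrative.

```python
# Average masses (Da) of amino acid residues, i.e. each amino acid minus one water.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight (Da) of an unmodified polypeptide."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


print(round(molecular_weight("MDSQQH"), 1))  # first six residues of the protein
```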
Theoretical pI: Translated: 8.85; Mature: 8.85
Prosite motif: PS00218 AMINO_ACID_PERMEASE_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein); 4.2 %Met (Translated Protein); 5.3 %Cys+Met (Translated Protein); 1.1 %Cys (Mature Protein); 4.2 %Met (Mature Protein); 5.3 %Cys+Met (Mature Protein)
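The Cys and Met percentages are residue counts divided by the protein length. The sketch below shows the calculation; the function name `residue_percent` is illustrative. The protein sequence listed in this record contains 5 cysteines in 456 residues, which rounds to the stated 1.1 %.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    return 100.0 * sum(aa in wanted for aa in seq) / len(seq)


print(residue_percent("MCAA", "C"))   # 25.0
print(residue_percent("MCAA", "CM"))  # 50.0
print(round(100.0 * 5 / 456, 1))      # 1.1, the Cys value for this protein
```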
Secondary structure:
>Translated Secondary Structure
MDSQQHGEQLKRGLKNRHIQLIALGGAIGTGLFLGSASVIQSAGPGIILGYAIAGFIAFL
CCCHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHCCHHHHHCCCCCEEHHHHHHHHHHHH
IMRQLGEMVVEEPVAGSFSHFAYKYWGGFAGFASGWNYWVLYVLVAMAELTAVGKYIQFW
HHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
YPEIPTWASAAAFFVIINAINLTNVKVFGEMEFWFAIIKVIAVIAMILFGAWLLFSDTAG
CCCCCCHHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
PQATVRNLWEQGGFLPHGWTGLVMMMAIIMFSFGGLELVGITAAEADNPEQSIPKATNQV
CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHCCHHHHHH
IYRILIFYIGSLAVLLSLLPWTRVTADTSPFVLIFHELGDTFVANALNIVVLTAALSVYN
HHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCEEEHHHHCHHHHHHHHHHHHHHHHHHHHH
SCVYCNSRMLFGLAQQGNAPKALLNVDKRGVPVSSILVSAVVTALCVLLNYLAPESAFGL
HHHHCCCHHHHHHHCCCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHH
LMALVVSALVINWAMISLAHMMFRRAKQQQGVKTRFPALFYPFGNVLCLLFMAAVLIIML
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHH
MTPGMAISVWLIPVWLLILGVGYLCKEKTAKTVKAH
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MDSQQHGEQLKRGLKNRHIQLIALGGAIGTGLFLGSASVIQSAGPGIILGYAIAGFIAFL
CCCHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHCCHHHHHCCCCCEEHHHHHHHHHHHH
IMRQLGEMVVEEPVAGSFSHFAYKYWGGFAGFASGWNYWVLYVLVAMAELTAVGKYIQFW
HHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
YPEIPTWASAAAFFVIINAINLTNVKVFGEMEFWFAIIKVIAVIAMILFGAWLLFSDTAG
CCCCCCHHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
PQATVRNLWEQGGFLPHGWTGLVMMMAIIMFSFGGLELVGITAAEADNPEQSIPKATNQV
CHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHCCHHHHHH
IYRILIFYIGSLAVLLSLLPWTRVTADTSPFVLIFHELGDTFVANALNIVVLTAALSVYN
HHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCEEEHHHHCHHHHHHHHHHHHHHHHHHHHH
SCVYCNSRMLFGLAQQGNAPKALLNVDKRGVPVSSILVSAVVTALCVLLNYLAPESAFGL
HHHHCCCHHHHHHHCCCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHH
LMALVVSALVINWAMISLAHMMFRRAKQQQGVKTRFPALFYPFGNVLCLLFMAAVLIIML
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHH
MTPGMAISVWLIPVWLLILGVGYLCKEKTAKTVKAH
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: L-tryptophan [Periplasm]; Proton [Periplasm]; Proton [Periplasm]; L-tyrosine [Periplasm]; Proton [Periplasm]; L-phenylalanine [Periplasm] [C]
Specific reaction: L-tryptophan [Periplasm] + Proton [Periplasm] = L-tryptophan [Cytoplasm] + Proton [Cytoplasm]; Proton [Periplasm] + L-tyrosine [Periplasm] = Proton [Cytoplasm] + L-tyrosine [Cytoplasm]; Proton [Periplasm] + L-phenylalanine [Periplasm] = Proton [Cytoplasm] +
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]