Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is proP [H]

Identifier: 157163576

GI number: 157163576

Start: 4354523

End: 4356025

Strand: Direct

Name: proP [H]

Synonym: EcHS_A4351

Alternate gene names: 157163576

Gene position: 4354523-4356025 (Clockwise)

Preceding gene: 157163575

Following gene: 157163585

Centisome position: 93.78
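The gene length and centisome position can be checked from the coordinates listed above. This is a minimal sketch: it assumes the centisome value is the gene start expressed as a percentage of the genome length, which reproduces the reported figure but is not stated in the record. The function names are illustrative only.

```python
GENOME_LENGTH = 4_643_538  # "Length" field of NC_009800
GENE_START = 4_354_523     # "Start" field
GENE_END = 4_356_025       # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature spanning start..end, both ends inclusive."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))                # 1503
print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 93.78
```

The 1503-base length agrees with the `>1503_bases` header of the gene sequence.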

GC content: 50.17

Gene sequence:

>1503_bases
ATGCTGAAAAGGAAAAAAGTAAAACCGATTACCCTTCGTGATGTCACCATTATTGATGACGGTAAACTGCGTAAAGCCAT
TACCGCAGCATCACTGGGTAATGCAATGGAATGGTTTGATTTTGGTGTTTATGGTTTTGTTGCTTACGCATTAGGTAAAG
TTTTTTTCCCGGGGGCTGACCCCAGCGTGCAGATGGTTGCTGCACTTGCCACGTTCTCCGTTCCCTTTCTGATTCGACCG
CTTGGCGGACTCTTCTTTGGTATGTTGGGCGATAAATATGGTCGCCAGAAGATCCTCGCTATCACTATTGTGATTATGTC
GATCAGTACGTTCTGTATTGGCTTAATACCGTCCTACGACACGATTGGTATTTGGGCACCGATTCTGCTGTTGATCTGTA
AGATGGCACAAGGTTTCTCGGTCGGCGGTGAATATACCGGGGCGTCGATATTTGTTGCGGAATACTCCCCTGACCGTAAA
CGTGGCTTTATGGGCAGCTGGCTGGACTTTGGTTCTATTGCCGGGTTTGTGCTGGGTGCTGGCGTAGTGGTGTTAATTTC
GACCATTGTCGGCGAAGCGAACTTCCTCGACTGGGGCTGGCGTATTCCGTTCTTTATTGCTCTGCCGTTAGGGATTATCG
GGCTTTACCTGCGCCATGCGCTGGAAGAAACTCCGGCGTTCCAGCAGCATGTCGATAAACTGGAACAGGGCGACCGCGAA
GGTTTGCAGGATGGCCCGAAAGTCTCGTTTAAAGAGATTGCCACCAAACACTGGCGCAGCCTGTTGACATGTATTGGTCT
GGTAATTGCCACCAACGTGACTTACTACATGTTGCTGACCTATATGCCGAGTTATTTGTCGCATAACCTGCATTACTCCG
AAGACCACGGGGTGCTGATTATTATCGCCATTATGATCGGTATGCTGTTTGTCCAGCCGGTGATGGGCTTGCTGAGTGAC
CGTTTTGGCCGTCGTCCGTTTGTGCTACTTGGTAGTGTTGCCCTGTTTGTGTTGGCGATCCCGGCGTTTATTCTGATTAA
CAGTAACGTCATCGGCCTGATTTTTGCCGGGTTACTGATGCTGGCGGTGATCCTTAACTGCTTTACGGGCGTTATGGCTT
CTACCTTGCCAGCGATGTTCCCGACGCATATCCGTTACAGCGCGCTGGCGGCGGCATTTAATATTTCGGTGCTGGTTGCC
GGTCTGACGCCAACACTGGCGGCCTGGCTGGTCGAAAGCTCGCAGAATCTGATGATGCCTGCCTATTACCTGATGGTAGT
GGCGGTGATTGGTTTAATCACCGGCGTAACCATGAAAGAGACGGCAAATCGTCCGTTGAAAGGTGCGACACCGGCGGCGT
CAGATATACAGGAAGCGAAGGAAATTCTCGTCGAGCATTACGATAATATCGAGCAGAAAATCGATAATATTGACCACGAG
ATTGCCGATTTGCAGGCGAAACGTACCCGCCTGGTGCAGCAACATCCGCGAATTGATGAATAA
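
The GC content field reported above can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, not part of any tool referenced by this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks (as in the wrapped sequence above) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input; paste the 1503-base gene sequence here to check the reported value.
print(gc_content("ATGC"))  # 50.0
```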

Upstream 100 bases:

>100_bases
GTTACAGAGATTGCATCCTGCAATTCCCGCTCCCCTTTTGCGGCCGTCGCGCTGATTTTTCTGGCGTTTGCGGAAATGGG
CCAACTCTGCGAGGAAAGCT

Downstream 100 bases:

>100_bases
GCTGAAACGGATGGCCTGATGTGACGCTGTCTTATCAGGCCAATTGAACTCTTAAGGTTCACTTAATCTCTGACGCGCAT
ACTCTCCTCCAGGTTAACGG

Product: proline/glycine betaine transporter

Products: betaine [Cytoplasm]; Proton [Cytoplasm]; L-proline [Cytoplasm] [C]

Alternate protein names: Proline porter II; PPII [H]

Number of amino acids: Translated: 500; Mature: 500

Protein sequence:

>500_residues
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA
GLTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
IADLQAKRTRLVQQHPRIDE
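
The protein sequence above follows from the gene sequence by translation: 1503 bases give 501 codons, of which the last (TAA) is a stop codon, leaving 500 residues. A minimal sketch using the standard genetic code is shown below; the table is built from the conventional TCAG ordering, and `translate` is an illustrative name.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the gene sequence above.
print(translate("ATGCTGAAAAGGAAAAAAGTAAAACCGATT"))  # MLKRKKVKPI
print(translate("CGAATTGATGAATAA"))                 # RIDE
```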

Sequences:

>Translated_500_residues
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA
GLTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
IADLQAKRTRLVQQHPRIDE
>Mature_500_residues
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA
GLTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
IADLQAKRTRLVQQHPRIDE

Specific function: Proton symporter that senses osmotic shifts and responds by importing osmolytes such as proline, glycine betaine, stachydrine, pipecolic acid, ectoine and taurine. It is both an osmosensor and an osmoregulator which is available to participate early in th

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the major facilitator superfamily. Sugar transporter (TC 2.A.1.1) family [H]

Homologues:

Organism=Escherichia coli, GI1790550, Length=500, Percent_Identity=99.4, Blast_Score=1012, Evalue=0.0,
Organism=Escherichia coli, GI1788292, Length=443, Percent_Identity=31.8284424379233, Blast_Score=224, Evalue=1e-59,
Organism=Escherichia coli, GI1789941, Length=452, Percent_Identity=32.9646017699115, Blast_Score=208, Evalue=8e-55,
Organism=Escherichia coli, GI1788942, Length=433, Percent_Identity=30.2540415704388, Blast_Score=191, Evalue=1e-49,
Organism=Escherichia coli, GI87082231, Length=483, Percent_Identity=23.6024844720497, Blast_Score=96, Evalue=7e-21,
Organism=Escherichia coli, GI87082404, Length=389, Percent_Identity=24.6786632390745, Blast_Score=90, Evalue=3e-19,
Organism=Saccharomyces cerevisiae, GI6323512, Length=444, Percent_Identity=24.3243243243243, Blast_Score=66, Evalue=1e-11,
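
The homologue lines above share a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A small parser along the following lines could turn them into records for sorting or filtering; the function name and the choice of output keys are assumptions made for this sketch.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI token, stored without the prefix
    # Convert the numeric fields where present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788292, Length=443, "
    "Percent_Identity=31.8284424379233, Blast_Score=224, Evalue=1e-59,"
)
print(hit["Organism"], hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```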

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004736
- InterPro:   IPR020846
- InterPro:   IPR016196
- InterPro:   IPR015041
- InterPro:   IPR005828
- InterPro:   IPR005829 [H]

Pfam domain/function: PF08946 Osmo_CC; PF00083 Sugar_tr [H]

EC number: NA

Molecular weight: Translated: 54834; Mature: 54834
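
A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that method and a commonly used table of average residue masses; the record does not say which table was used, so the result may differ slightly from the reported value.

```python
# Average residue masses in daltons (amino acid minus one water); assumed table.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of a protein sequence in daltons."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Free glycine as a sanity check (about 75.07 Da).
print(round(molecular_weight("G"), 2))
```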

Theoretical pI: Translated: 7.65; Mature: 7.65

Prosite motif: PS50850 MFS; PS00216 SUGAR_TRANSPORT_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
4.6 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
4.6 %Cys+Met (Mature Protein)
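
These percentages are residue counts divided by the protein length; for a 500-residue protein, 0.8 % corresponds to 4 Cys and 3.8 % to 19 Met. A minimal sketch of the calculation, with an illustrative function name:

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys, Met and Cys+Met residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": 100.0 * cys / n,
        "Met": 100.0 * met / n,
        "Cys+Met": 100.0 * (cys + met) / n,
    }


# Toy input: one Cys and one Met in ten residues.
print(cys_met_content("MCAAAAAAAA"))
```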

Secondary structure:

>Translated Secondary Structure
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGAD
CCCCCCCCCEEEEEEEEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCC
PSVQMVAALATFSVPFLIRPLGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC
TIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRKRGFMGSWLDFGSIAGFVLGA
HHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHCCCCCCHHHHHHHHHHHHHH
GVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLI
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH
IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLM
HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVAGLTPTLAAWLVESSQNLMMP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHHCCCCCHHH
AYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
HHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IADLQAKRTRLVQQHPRIDE
HHHHHHHHHHHHHHCCCCCC
>Mature Secondary Structure
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGAD
CCCCCCCCCEEEEEEEEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCC
PSVQMVAALATFSVPFLIRPLGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC
TIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRKRGFMGSWLDFGSIAGFVLGA
HHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHCCCCCCHHHHHHHHHHHHHH
GVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLI
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH
IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLM
HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVAGLTPTLAAWLVESSQNLMMP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHHCCCCCHHH
AYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
HHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IADLQAKRTRLVQQHPRIDE
HHHHHHHHHHHHHHCCCCCC
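
The state strings above can be summarised as per-state percentages, which is one way to see why the record lists the structure class as Alpha. The sketch assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in counts.items()}


# Toy input; concatenate the H/E/C lines above to summarise the full prediction.
print(state_composition("HHHHEECC"))
```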

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: betaine [Periplasm]; Proton [Periplasm]; L-proline [Periplasm] [C]

Specific reaction: Proton [Periplasm] + betaine [Periplasm] = Proton [Cytoplasm] + betaine [Cytoplasm]; Proton [Periplasm] + L-proline [Periplasm] = Proton [Cytoplasm] + L-proline [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]