Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is proP [H]
Identifier: 157163576
GI number: 157163576
Start: 4354523
End: 4356025
Strand: Direct
Name: proP [H]
Synonym: EcHS_A4351
Alternate gene names: 157163576
Gene position: 4354523-4356025 (Clockwise)
Preceding gene: 157163575
Following gene: 157163585
Centisome position: 93.78
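The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the start coordinate expressed as a percentage of the genome length; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values copied from the record above.
start, end, genome_length = 4354523, 4356025, 4643538

length = gene_length(start, end)
print(length)                                      # 1503
print(length == 3 * (500 + 1))                     # True: 500 codons plus one stop codon
print(round(centisome(start, genome_length), 2))   # 93.78
```

Under these assumptions the computed values agree with the reported gene sequence length (1503 bases), the 500-residue product, and the centisome position of 93.78.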
GC content: 50.17
Gene sequence:
>1503_bases ATGCTGAAAAGGAAAAAAGTAAAACCGATTACCCTTCGTGATGTCACCATTATTGATGACGGTAAACTGCGTAAAGCCAT TACCGCAGCATCACTGGGTAATGCAATGGAATGGTTTGATTTTGGTGTTTATGGTTTTGTTGCTTACGCATTAGGTAAAG TTTTTTTCCCGGGGGCTGACCCCAGCGTGCAGATGGTTGCTGCACTTGCCACGTTCTCCGTTCCCTTTCTGATTCGACCG CTTGGCGGACTCTTCTTTGGTATGTTGGGCGATAAATATGGTCGCCAGAAGATCCTCGCTATCACTATTGTGATTATGTC GATCAGTACGTTCTGTATTGGCTTAATACCGTCCTACGACACGATTGGTATTTGGGCACCGATTCTGCTGTTGATCTGTA AGATGGCACAAGGTTTCTCGGTCGGCGGTGAATATACCGGGGCGTCGATATTTGTTGCGGAATACTCCCCTGACCGTAAA CGTGGCTTTATGGGCAGCTGGCTGGACTTTGGTTCTATTGCCGGGTTTGTGCTGGGTGCTGGCGTAGTGGTGTTAATTTC GACCATTGTCGGCGAAGCGAACTTCCTCGACTGGGGCTGGCGTATTCCGTTCTTTATTGCTCTGCCGTTAGGGATTATCG GGCTTTACCTGCGCCATGCGCTGGAAGAAACTCCGGCGTTCCAGCAGCATGTCGATAAACTGGAACAGGGCGACCGCGAA GGTTTGCAGGATGGCCCGAAAGTCTCGTTTAAAGAGATTGCCACCAAACACTGGCGCAGCCTGTTGACATGTATTGGTCT GGTAATTGCCACCAACGTGACTTACTACATGTTGCTGACCTATATGCCGAGTTATTTGTCGCATAACCTGCATTACTCCG AAGACCACGGGGTGCTGATTATTATCGCCATTATGATCGGTATGCTGTTTGTCCAGCCGGTGATGGGCTTGCTGAGTGAC CGTTTTGGCCGTCGTCCGTTTGTGCTACTTGGTAGTGTTGCCCTGTTTGTGTTGGCGATCCCGGCGTTTATTCTGATTAA CAGTAACGTCATCGGCCTGATTTTTGCCGGGTTACTGATGCTGGCGGTGATCCTTAACTGCTTTACGGGCGTTATGGCTT CTACCTTGCCAGCGATGTTCCCGACGCATATCCGTTACAGCGCGCTGGCGGCGGCATTTAATATTTCGGTGCTGGTTGCC GGTCTGACGCCAACACTGGCGGCCTGGCTGGTCGAAAGCTCGCAGAATCTGATGATGCCTGCCTATTACCTGATGGTAGT GGCGGTGATTGGTTTAATCACCGGCGTAACCATGAAAGAGACGGCAAATCGTCCGTTGAAAGGTGCGACACCGGCGGCGT CAGATATACAGGAAGCGAAGGAAATTCTCGTCGAGCATTACGATAATATCGAGCAGAAAATCGATAATATTGACCACGAG ATTGCCGATTTGCAGGCGAAACGTACCCGCCTGGTGCAGCAACATCCGCGAATTGATGAATAA
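A GC content figure like the one reported for this gene can be recomputed from the sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper, not part of the database. It strips the whitespace that separates the 80-base blocks in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (such as the block separators in the record) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Toy input; paste the full 1503-base gene sequence here to compare
# the result against the value reported in the record.
print(gc_content("ATGC GGCC"))  # 75.0
```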
Upstream 100 bases:
>100_bases GTTACAGAGATTGCATCCTGCAATTCCCGCTCCCCTTTTGCGGCCGTCGCGCTGATTTTTCTGGCGTTTGCGGAAATGGG CCAACTCTGCGAGGAAAGCT
Downstream 100 bases:
>100_bases GCTGAAACGGATGGCCTGATGTGACGCTGTCTTATCAGGCCAATTGAACTCTTAAGGTTCACTTAATCTCTGACGCGCAT ACTCTCCTCCAGGTTAACGG
Product: proline/glycine betaine transporter
Products: betaine [Cytoplasm]; Proton [Cytoplasm]; L-proline [Cytoplasm] [C]
Alternate protein names: Proline porter II; PPII [H]
Number of amino acids: Translated: 500; Mature: 500
Protein sequence:
>500_residues MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA GLTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE IADLQAKRTRLVQQHPRIDE
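The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch using the standard codon assignments (which the bacterial translation table shares for elongation); `translate` and `CODON_TABLE` are illustrative names, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 27 and last 15 bases of the gene sequence in this record.
print(translate("ATGCTGAAAAGGAAAAAAGTAAAACCG"))  # MLKRKKVKP
print(translate("CGAATTGATGAATAA"))              # RIDE
```

The two fragments reproduce the N-terminal and C-terminal residues of the protein sequence listed above.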
Sequences:
>Translated_500_residues MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA GLTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE IADLQAKRTRLVQQHPRIDE
>Mature_500_residues MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA GLTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE IADLQAKRTRLVQQHPRIDE
Specific function: Proton symporter that senses osmotic shifts and responds by importing osmolytes such as proline, glycine betaine, stachydrine, pipecolic acid, ectoine and taurine. It is both an osmosensor and an osmoregulator which is available to participate early in th
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the major facilitator superfamily. Sugar transporter (TC 2.A.1.1) family [H]
Homologues:
Organism=Escherichia coli, GI1790550, Length=500, Percent_Identity=99.4, Blast_Score=1012, Evalue=0.0
Organism=Escherichia coli, GI1788292, Length=443, Percent_Identity=31.8284424379233, Blast_Score=224, Evalue=1e-59
Organism=Escherichia coli, GI1789941, Length=452, Percent_Identity=32.9646017699115, Blast_Score=208, Evalue=8e-55
Organism=Escherichia coli, GI1788942, Length=433, Percent_Identity=30.2540415704388, Blast_Score=191, Evalue=1e-49
Organism=Escherichia coli, GI87082231, Length=483, Percent_Identity=23.6024844720497, Blast_Score=96, Evalue=7e-21
Organism=Escherichia coli, GI87082404, Length=389, Percent_Identity=24.6786632390745, Blast_Score=90, Evalue=3e-19
Organism=Saccharomyces cerevisiae, GI6323512, Length=444, Percent_Identity=24.3243243243243, Blast_Score=66, Evalue=1e-11
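The homologue entries follow a regular `key=value` pattern, so they can be parsed into structured records for filtering or sorting. The sketch below is one possible approach, assuming every hit carries the same six fields in the same order as shown above; `HIT_RE` and `parse_hits` are hypothetical names.

```python
import re

# One BLAST hit as written in the record; fields assumed to appear in this order.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\deE.+-]+)"
)


def parse_hits(text: str) -> list[dict]:
    """Extract homologue hits from the flattened or line-per-hit text."""
    hits = []
    for m in HIT_RE.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Escherichia coli, GI1790550, Length=500, Percent_Identity=99.4, "
    "Blast_Score=1012, Evalue=0.0, "
    "Organism=Saccharomyces cerevisiae, GI6323512, Length=444, "
    "Percent_Identity=24.3243243243243, Blast_Score=66, Evalue=1e-11,"
)
for hit in parse_hits(sample):
    print(hit["organism"], hit["gi"], round(hit["identity"], 1), hit["evalue"])
```

Because the pattern matches each hit independently, it works whether the hits are separated by commas on one line or placed one per line.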
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004736
- InterPro: IPR020846
- InterPro: IPR016196
- InterPro: IPR015041
- InterPro: IPR005828
- InterPro: IPR005829 [H]
Pfam domain/function: PF08946 Osmo_CC; PF00083 Sugar_tr [H]
EC number: NA
Molecular weight: Translated: 54834; Mature: 54834
Theoretical pI: Translated: 7.65; Mature: 7.65
Prosite motif: PS50850 MFS; PS00216 SUGAR_TRANSPORT_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
3.8 %Met (Translated Protein)
4.6 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
3.8 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
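The Cys/Met percentages are simple residue frequencies. Taking the reported values at face value for a 500-residue protein, 0.8 % corresponds to 4 cysteines and 3.8 % to 19 methionines. A minimal sketch of the calculation follows, assuming the percentage is the residue count divided by the protein length and rounded to one decimal place; `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(cleaned.count(r) for r in residues.upper())
    return round(100.0 * count / len(cleaned), 1)


# Toy 10-residue peptide: one Cys and one Met.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Passing the full protein sequence from this record with `"C"`, `"M"`, and `"CM"` would allow the reported figures to be checked.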
Secondary structure:
>Translated Secondary Structure
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGAD
CCCCCCCCCEEEEEEEEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCC
PSVQMVAALATFSVPFLIRPLGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC
TIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRKRGFMGSWLDFGSIAGFVLGA
HHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHCCCCCCHHHHHHHHHHHHHH
GVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLI
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH
IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLM
HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVAGLTPTLAAWLVESSQNLMMP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHHCCCCCHHH
AYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
HHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IADLQAKRTRLVQQHPRIDE
HHHHHHHHHHHHHHCCCCCC
>Mature Secondary Structure
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGAD
CCCCCCCCCEEEEEEEEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCC
PSVQMVAALATFSVPFLIRPLGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC
TIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRKRGFMGSWLDFGSIAGFVLGA
HHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHCCCCCCHHHHHHHHHHHHHH
GVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC
GLQDGPKVSFKEIATKHWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLI
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH
IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLM
HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVAGLTPTLAAWLVESSQNLMMP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHHCCCCCHHH
AYYLMVVAVIGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDNIDHE
HHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IADLQAKRTRLVQQHPRIDE
HHHHHHHHHHHHHHCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: betaine [Periplasm]; Proton [Periplasm]; L-proline [Periplasm] [C]
Specific reaction:
Proton [Periplasm] + betaine [Periplasm] = Proton [Cytoplasm] + betaine [Cytoplasm]
Proton [Periplasm] + L-proline [Periplasm] = Proton [Cytoplasm] + L-proline [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]