Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

The map label for this gene is proP [H]

Identifier: 29144459

GI number: 29144459

Start: 4360738

End: 4362240

Strand: Direct

Name: proP [H]

Synonym: t4197

Alternate gene names: 29144459

Gene position: 4360738-4362240 (Clockwise)

Preceding gene: 29144452

Following gene: 29144467

Centisome position: 91.0
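
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; neither convention is stated in this record, and the function names are illustrative only.

```python
# Values copied from the record; the conventions are assumptions (see lead-in).
GENOME_LENGTH = 4_791_961   # "Length" field
START, END = 4_360_738, 4_362_240


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start, genome_length):
    """Start position as a percentage of the genome length."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # 1503
print(round(centisome(START, GENOME_LENGTH), 1))   # 91.0
```

Under these assumptions the results agree with the 1503-base gene sequence and the listed centisome position of 91.0.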

GC content: 52.96
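
A GC content figure like the one above is normally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the value is a plain percentage over the 1503-base gene sequence; the function name is hypothetical, and the example uses only the first 12 bases of the gene rather than the full sequence.

```python
def gc_percent(dna):
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# First 12 bases of the gene sequence in this record.
print(round(gc_percent("ATGCTGAAAAGG"), 2))  # 41.67
```

Passing the full gene sequence, with line breaks removed, would be the way to compare against the reported value.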

Gene sequence:

>1503_bases
ATGCTGAAAAGGAAAAAAATAAAACCGATTACACTGGGCGATGTGACCATCATTGATGATGGTAAACTTCGCAAAGCGAT
TACCGCCGCGTCGCTGGGCAACGCGATGGAGTGGTTTGATTTTGGTGTTTATGGATTTGTTGCCTACGCGTTGGGTAAAG
TCTTTTTCCCCGGCGCCGATCCCAGCGTCCAGATGATTGCCGCGCTGGCCACGTTTTCCGTTCCCTTCCTGATTCGTCCG
CTCGGCGGGTTATTCTTTGGTATGCTCGGCGATAAATACGGGCGCCAGAAGATCCTGGCGATCACGATTGTGATTATGTC
GATCAGTACCTTCTGTATCGGGTTAATCCCCTCTTACGCGACGATCGGTATCTGGGCGCCAATACTGTTGTTGCTGTGTA
AAATGGCGCAGGGCTTCTCGGTTGGCGGGGAATATACCGGCGCGTCGATCTTTGTCGCGGAATATTCGCCGGATCGTAAA
CGCGGATTTATGGGAAGCTGGCTGGATTTTGGTTCCATCGCCGGGTTCGTGCTGGGCGCGGGCGTGGTGGTCTTAATCTC
GACGATTGTCGGCGAGGAGAATTTCCTTGAGTGGGGCTGGCGTATTCCGTTCTTTATCGCCCTGCCATTGGGGATTATTG
GTCTCTACTTACGCCATGCGCTGGAGGAGACGCCAGCGTTTCAGCAGCACGTGGATAAACTGGAGCAGGGCGACCGCGAA
GGGTTGCAGGATGGGCCGAAAGTCTCCTTTAAAGAGATTGCCACCAAACACTGGCGTAGCCTGTTGTCATGTATCGGTCT
GGTGATTGCCACCAACGTGACCTACTACATGCTGCTCACCTACATGCCGAGCTACCTGTCGCATAACCTGCACTATTCTG
AAGATCACGGCGTGTTGATTATCATCGCCATTATGATCGGGATGCTGTTTGTGCAGCCGGTGATGGGGCTGCTGAGCGAC
CGTTTCGGTCGACGTCCATTTGTGATTATGGGCAGCATTGCGCTGTTCGCGCTGGCGATCCCGGCCTTCATCCTGATTAA
CAGTAACGTTATTGGCCTGATTTTTGCCGGTTTGTTGATGCTGGCGGTGATTCTGAACTGCTTTACCGGGGTGATGGCCT
CGACATTGCCGGCGATGTTTCCGACGCATATTCGTTACAGCGCGCTGGCGGCGGCTTTTAATATCTCTGTATTGATTGCC
GGTCTGACGCCAACGCTGGCGGCCTGGCTGGTGGAAAGCTCGCAGGATCTGATGATACCGGCGTATTATTTGATGGTCAT
CGCGGTGATAGGCTTGATTACCGGTATTTCCATGAAAGAGACGGCCAATCGTCCGTTAAAAGGCGCAACGCCAGCGGCGT
CGGACATCCAGGAAGCGAAGGAAATTCTGGGCGAGCATTACGATAATATTGAGCAGAAAATCGACGACATCGATCAGGAA
ATTGCGGAGCTGCAGGTCAAACGTTCGCGTCTGGTACAGCAACATCCGCGTATCGATGAATAA

Upstream 100 bases:

>100_bases
ATAGCGTTCGCGCCCCTCCCTCCGCTCGACGGCGACGCTGGCGCGGTATGCCAGTGCCCGCCGTATATAGCGCTACAGGG
CTTAGCCTATGAGGACAGCT

Downstream 100 bases:

>100_bases
ATTTCGCGCTTAAGGTTCGCTTAATCTCTCGCGGGCATACTCTCCTCCATACCTTTGGAGGAGAGCGTCATGAAAAGCTA
TATTTATAAAAGTTTGACGA

Product: proline/glycine betaine transporter

Products: betaine [Cytoplasm]; Proton [Cytoplasm]; L-proline [Cytoplasm] [C]

Alternate protein names: Proline porter II; PPII [H]

Number of amino acids: Translated: 500; Mature: 500

Protein sequence:

>500_residues
MLKRKKIKPITLGDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMIAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKHWRSLLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVIMGSIALFALAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLIA
GLTPTLAAWLVESSQDLMIPAYYLMVIAVIGLITGISMKETANRPLKGATPAASDIQEAKEILGEHYDNIEQKIDDIDQE
IAELQVKRSRLVQQHPRIDE
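
The protein sequence can be checked against the gene sequence by translating codons with the standard genetic code. The sketch below is illustrative and not the method used by this database: it builds the standard codon table, then translates only the first ten and last five codons of the gene, written out with spaces for readability. The function and variable names are assumptions.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATG CTG AAA AGG AAA AAA ATA AAA CCG ATT".replace(" ", "")
last_codons = "CGT ATC GAT GAA TAA".replace(" ", "")
print(translate(first_codons))  # MLKRKKIKPI
print(translate(last_codons))   # RIDE
```

The 1503 bases correspond to 501 codons, that is 500 residues plus the TAA stop codon, which matches the stated number of amino acids.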

Sequences:

>Translated_500_residues
MLKRKKIKPITLGDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMIAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKHWRSLLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVIMGSIALFALAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLIA
GLTPTLAAWLVESSQDLMIPAYYLMVIAVIGLITGISMKETANRPLKGATPAASDIQEAKEILGEHYDNIEQKIDDIDQE
IAELQVKRSRLVQQHPRIDE
>Mature_500_residues
MLKRKKIKPITLGDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMIAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKHWRSLLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVIMGSIALFALAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLIA
GLTPTLAAWLVESSQDLMIPAYYLMVIAVIGLITGISMKETANRPLKGATPAASDIQEAKEILGEHYDNIEQKIDDIDQE
IAELQVKRSRLVQQHPRIDE

Specific function: Proton symporter that senses osmotic shifts and responds by importing osmolytes such as proline, glycine betaine, stachydrine, pipecolic acid, ectoine and taurine. It is both an osmosensor and an osmoregulator which is available to participate early in th

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the major facilitator superfamily. Sugar transporter (TC 2.A.1.1) family [H]

Homologues:

Organism=Escherichia coli, GI1790550, Length=500, Percent_Identity=95, Blast_Score=978, Evalue=0.0,
Organism=Escherichia coli, GI1788292, Length=443, Percent_Identity=32.0541760722348, Blast_Score=225, Evalue=6e-60,
Organism=Escherichia coli, GI1789941, Length=431, Percent_Identity=34.5707656612529, Blast_Score=212, Evalue=5e-56,
Organism=Escherichia coli, GI1788942, Length=436, Percent_Identity=30.2752293577982, Blast_Score=187, Evalue=1e-48,
Organism=Escherichia coli, GI87082231, Length=487, Percent_Identity=23.4086242299795, Blast_Score=96, Evalue=6e-21,
Organism=Escherichia coli, GI87082404, Length=390, Percent_Identity=25.6410256410256, Blast_Score=89, Evalue=7e-19,
Organism=Saccharomyces cerevisiae, GI6323512, Length=459, Percent_Identity=24.8366013071895, Blast_Score=66, Evalue=1e-11,
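
The homologue lines above follow a regular comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal parsing sketch, assuming every line follows exactly that layout (the function name and the returned dictionary keys are hypothetical):

```python
def parse_homologue(line):
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields; the names are taken from the lines themselves.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788292, Length=443, "
    "Percent_Identity=32.0541760722348, Blast_Score=225, Evalue=6e-60,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1788292 32.1
```

Rounding the identity to one decimal place at display time keeps the stored value intact while making the listing easier to read.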

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004736
- InterPro:   IPR020846
- InterPro:   IPR016196
- InterPro:   IPR015041
- InterPro:   IPR005828
- InterPro:   IPR005829 [H]

Pfam domain/function: PF08946 Osmo_CC; PF00083 Sugar_tr [H]

EC number: NA

Molecular weight: Translated: 54770; Mature: 54770

Theoretical pI: Translated: 6.79; Mature: 6.79

Prosite motif: PS50850 MFS; PS00216 SUGAR_TRANSPORT_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
4.6 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
4.6 %Cys+Met (Mature Protein)
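
Percentages of this kind are residue counts divided by the protein length. For a 500-residue protein, 0.8 % corresponds to 4 residues, 3.8 % to 19 and 4.6 % to 23. A minimal sketch of the calculation follows, using a hypothetical helper and only the first 20 residues of the protein as input:

```python
def residue_percent(protein, residues):
    """Percentage of the protein made up of the given one-letter residues."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


# First 20 residues of the protein sequence in this record.
fragment = "MLKRKKIKPITLGDVTIIDD"
print(residue_percent(fragment, "C"))    # 0.0
print(residue_percent(fragment, "M"))    # 5.0
print(residue_percent(fragment, "CM"))   # 5.0
```

Running the same helper over the full 500-residue sequence would be the way to compare against the values listed above.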

Secondary structure:

>Translated Secondary Structure
MLKRKKIKPITLGDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGAD
CCCCCCCCCEEECCEEEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCC
PSVQMIAALATFSVPFLIRPLGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
TIGIWAPILLLLCKMAQGFSVGGEYTGASIFVAEYSPDRKRGFMGSWLDFGSIAGFVLGA
HHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHCCCCCCHHHHHHHHHHHHHH
GVVVLISTIVGEENFLEWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
HHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC
GLQDGPKVSFKEIATKHWRSLLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLI
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH
IIAIMIGMLFVQPVMGLLSDRFGRRPFVIMGSIALFALAIPAFILINSNVIGLIFAGLLM
HHHHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLIAGLTPTLAAWLVESSQDLMIP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHHCCCCCCHH
AYYLMVIAVIGLITGISMKETANRPLKGATPAASDIQEAKEILGEHYDNIEQKIDDIDQE
HHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IAELQVKRSRLVQQHPRIDE
HHHHHHHHHHHHHHCCCCCC
>Mature Secondary Structure
MLKRKKIKPITLGDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGAD
CCCCCCCCCEEECCEEEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCC
PSVQMIAALATFSVPFLIRPLGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
TIGIWAPILLLLCKMAQGFSVGGEYTGASIFVAEYSPDRKRGFMGSWLDFGSIAGFVLGA
HHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHCCCCCCHHHHHHHHHHHHHH
GVVVLISTIVGEENFLEWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
HHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC
GLQDGPKVSFKEIATKHWRSLLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLI
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH
IIAIMIGMLFVQPVMGLLSDRFGRRPFVIMGSIALFALAIPAFILINSNVIGLIFAGLLM
HHHHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLIAGLTPTLAAWLVESSQDLMIP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHHCCCCCCHH
AYYLMVIAVIGLITGISMKETANRPLKGATPAASDIQEAKEILGEHYDNIEQKIDDIDQE
HHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IAELQVKRSRLVQQHPRIDE
HHHHHHHHHHHHHHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha
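
A structure class such as "Alpha" can be related to the predicted secondary structure strings by tallying the state letters. The sketch below assumes the usual three-state coding (H = helix, E = strand, C = coil), which this record does not spell out, and uses only the final 20-position segment of the prediction as input. The function name is hypothetical.

```python
from collections import Counter


def state_percentages(prediction):
    """Percentage of H, E and C states in a three-state prediction string."""
    if not prediction:
        raise ValueError("empty prediction")
    counts = Counter(prediction)
    return {state: 100.0 * counts[state] / len(prediction) for state in "HEC"}


# Final segment of the prediction (residues IAELQVKRSRLVQQHPRIDE):
# 14 helix positions followed by 6 coil positions.
last_segment = "H" * 14 + "C" * 6
print(state_percentages(last_segment))
# {'H': 70.0, 'E': 0.0, 'C': 30.0}
```

Concatenating all prediction lines before calling the function would give the whole-protein composition.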

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: betaine [Periplasm]; Proton [Periplasm]; L-proline [Periplasm] [C]

Specific reaction: Proton [Periplasm] + betaine [Periplasm] = Proton [Cytoplasm] + betaine [Cytoplasm]; Proton [Periplasm] + L-proline [Periplasm] = Proton [Cytoplasm] + L-proline [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]