Definition Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome.
Accession NC_008769
Length 4,374,522

The map label for this gene is betP [C]

Identifier: 121636840

GI number: 121636840

Start: 1052562

End: 1054343

Strand: Direct

Name: betP [C]

Synonym: BCG_0969

Alternate gene names: 121636840

Gene position: 1052562-1054343 (Clockwise)

Preceding gene: 121636835

Following gene: 121636841

Centisome position: 24.06
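
The centisome position appears to be the gene start coordinate expressed as a percentage of the genome length. That reading is an assumption inferred from the numbers in this record, not a documented definition. A minimal Python sketch with hypothetical helper names (`centisome`, `gene_length`) reproduces the value above and the 1782-base gene length:

```python
def centisome(start: int, genome_length: int) -> float:
    """Return a start coordinate as a percentage of the genome length.

    Assumption: this is how the record's 'Centisome position' is derived.
    """
    return 100.0 * start / genome_length


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive start/end coordinates."""
    return end - start + 1


# Values copied from this record.
GENOME_LENGTH = 4_374_522
START, END = 1_052_562, 1_054_343

print(round(centisome(START, GENOME_LENGTH), 2))  # 24.06
print(gene_length(START, END))                    # 1782
```

The 1782 bases correspond to 594 codons: 593 amino acids plus the stop codon.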

GC content: 61.56

Gene sequence:

>1782_bases
ATGTCAGCGAAAGAACGCGGTGACCAGAACGCCGTCGTCGACGCCCTGCGGAGTATTCAGCCCGCAGTCTTCATTCCGGC
TTCAGTGGTCATCGTCGCCATGATCGTCGTTTCCGTGGTGTACTCGAGCGTCGCCGAGAATGCGTTCGTTCGGCTGAACT
CCGCGATCACCGGCGGCGTCGGGTGGTGGTACATCCTGGTTGCCACCGGGTTTGTGGTATTCGCGCTGTACTGCGGCATT
TCCCGGATTGGCACTATCCGGCTGGGCCGCGACGATGAGCTCCCCGAGTTCAGCTTCTGGGCATGGCTGGCAATGCTGTT
TAGTGCCGGTATGGGTATCGGCCTGGTCTTCTACGGGGTGGCCGAGCCGCTCAGCCACTACCTGCGGCCACCGCGGTCAC
GCGGCGTGCCCGCGCTTACTGATGCGGCGGCTAACCAGGCGATGGCGCTGACAGTGTTCCACTGGGGCCTGCACGCCTGG
GCAATTTATGTCGTGGTTGGCCTCGGTATGGCGTACATGACCTATCGGCGGGGTCGCCCCTTGTCGGTGCGCTGGCTGCT
GGAGCCGGTCGTGGGTCGGGGCCGTGTAGAGGGCGCCTTGGGGCACGCGGTGGACGTCATCGCCATTGTCGGAACACTCT
TTGGTGTCGCCACGTCACTGGGCTTCGGTATCACTCAGATCGCCTCCGGCCTGGAATATCTCGGCTGGATCCGGGTGGAC
AACTGGTGGATGGTCGGCATGATCGCCGCCATCACCGCCACTGCGACGGCGTCGGTGGTCAGTGGGGTCAGCAAGGGTTT
GAAGTGGCTGTCGAACATCAATATGGCGCTGGCCGCCGCATTGGCCCTGTTCGTGTTGTTGCTCGGGCCGACACTTTTCT
TGCTGCAGTCGTGGGTGCAAAATTTGGGAGGCTACGTCCAGTCGCTTCCGCAATTCATGCTGCGCACCGCGCCGTTCTCG
CACGACGGCTGGCTCGGCGACTGGACTATCTTCTACTGGGGTTGGTGGATCAGCTGGGCTCCGTTTGTCGGGATGTTCAT
CGCGCGGATTTCGCGGGGACGGACGATCCGGGAGTTCATCGGGGCGGTGCTGCTCGTTCCCACCGTGATCGCCTCGCTAT
GGTTTACGATCTTCGGTGACTCGGCGTTGTTGCGGCAACGCAACAACGGCGACATGCTCGTCAACGGGGCGGTAGACACC
AACACATCGCTTTTCCGATTGCTGGACGGTTTGCCTATCGGGGCTATTACCAGCGTTCTTGCTGTGCTGGTGATCGTGTT
CTTCTTCGTTACGTCGTCGGACTCCGGTTCGTTGGTCATCGACATCTTGTCAGCGGGTGGTGAGCTGGACCCGCCCAAGC
TGACCAGGGTCTACTGGGCGGTGTTGGAGGGGGTAGCCGCGGCCGTTTTGCTCCTGATCGGAGGTGCTGGGTCACTGACC
GCGTTGCGGACGGCCGCTATTGCCACGGCCCTGCCGTTCTCAATCGTCATGGTGGTGGCGTGCTATGCGATGACCAAAGC
GTTCCACTTCGACCTGGCCGCCACACCTAGGCTGCTGCACGTCACCGTGCCTGACGTGGTTGCGGCAGGAAACCGGCGAC
GCCACGATATCTCGGCGACGCTGTCGGGGCTCATTGCCGTCCGTGATGTCGATAGCGGCACATATATAGTCCACCCCGAC
ACCGGCGCTCTCACCGTCACTGCACCACCAGATCCGTTGGACGATCATGTTTTTGAGTCTGATCGGCACGTAACGCGAAG
AAACACAACATCATCGAGATGA
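
The GC content field can be checked against the sequence above by counting G and C bases. The following is a minimal sketch; the function name `gc_content` is hypothetical, and it assumes the FASTA header line has already been removed from the input.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a multi-line sequence
    block can be passed in directly (without its '>' header line).
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence in this record.
print(gc_content("ATGTCAGCGAAAGAACGCGG"))  # 55.0
```

Passing the full 1782-base sequence should give a value comparable to the GC content reported for this gene.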

Upstream 100 bases:

>100_bases
TTACTCAGCATGGTGCACAGGTCTGTGCTTGTCTGGTTGATGGTGATTTGGCGTTGCGGTGGCCGTGATGAGGACGCGGT
GAGAAACGGAGCTTGAAGAT

Downstream 100 bases:

>100_bases
TGTGTTATCGACCTGCCGGGTCGCCGCTGCCTGGACCGGAGCCGGCTACTTCCGGTAAACGCGCACCGCTGGATGAATCG
CCGCGGCATGAGAAGCTCGA

Product: putative glycine betaine transport integral membrane protein betP

Products: Proton [Cytoplasm]; choline [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 593; Mature: 592

Protein sequence:

>593_residues
MSAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSVAENAFVRLNSAITGGVGWWYILVATGFVVFALYCGI
SRIGTIRLGRDDELPEFSFWAWLAMLFSAGMGIGLVFYGVAEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAW
AIYVVVGLGMAYMTYRRGRPLSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVATSLGFGITQIASGLEYLGWIRVD
NWWMVGMIAAITATATASVVSGVSKGLKWLSNINMALAAALALFVLLLGPTLFLLQSWVQNLGGYVQSLPQFMLRTAPFS
HDGWLGDWTIFYWGWWISWAPFVGMFIARISRGRTIREFIGAVLLVPTVIASLWFTIFGDSALLRQRNNGDMLVNGAVDT
NTSLFRLLDGLPIGAITSVLAVLVIVFFFVTSSDSGSLVIDILSAGGELDPPKLTRVYWAVLEGVAAAVLLLIGGAGSLT
ALRTAAIATALPFSIVMVVACYAMTKAFHFDLAATPRLLHVTVPDVVAAGNRRRHDISATLSGLIAVRDVDSGTYIVHPD
TGALTVTAPPDPLDDHVFESDRHVTRRNTTSSR
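
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments and stops at the first stop codon. The names `translate` and `mature` are hypothetical, and treating the mature form as the translated form without its initiator methionine is an inference from the 593 and 592 residue counts in this record.

```python
from itertools import product

# Standard codon assignments, in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop a leading methionine (assumed initiator Met removal)."""
    return protein[1:] if protein.startswith("M") else protein


# First 24 bases of the gene sequence in this record.
print(translate("ATGTCAGCGAAAGAACGCGGTGAC"))  # MSAKERGD
```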

Sequences:

>Translated_593_residues
MSAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSVAENAFVRLNSAITGGVGWWYILVATGFVVFALYCGI
SRIGTIRLGRDDELPEFSFWAWLAMLFSAGMGIGLVFYGVAEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAW
AIYVVVGLGMAYMTYRRGRPLSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVATSLGFGITQIASGLEYLGWIRVD
NWWMVGMIAAITATATASVVSGVSKGLKWLSNINMALAAALALFVLLLGPTLFLLQSWVQNLGGYVQSLPQFMLRTAPFS
HDGWLGDWTIFYWGWWISWAPFVGMFIARISRGRTIREFIGAVLLVPTVIASLWFTIFGDSALLRQRNNGDMLVNGAVDT
NTSLFRLLDGLPIGAITSVLAVLVIVFFFVTSSDSGSLVIDILSAGGELDPPKLTRVYWAVLEGVAAAVLLLIGGAGSLT
ALRTAAIATALPFSIVMVVACYAMTKAFHFDLAATPRLLHVTVPDVVAAGNRRRHDISATLSGLIAVRDVDSGTYIVHPD
TGALTVTAPPDPLDDHVFESDRHVTRRNTTSSR
>Mature_592_residues
SAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSVAENAFVRLNSAITGGVGWWYILVATGFVVFALYCGIS
RIGTIRLGRDDELPEFSFWAWLAMLFSAGMGIGLVFYGVAEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAWA
IYVVVGLGMAYMTYRRGRPLSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVATSLGFGITQIASGLEYLGWIRVDN
WWMVGMIAAITATATASVVSGVSKGLKWLSNINMALAAALALFVLLLGPTLFLLQSWVQNLGGYVQSLPQFMLRTAPFSH
DGWLGDWTIFYWGWWISWAPFVGMFIARISRGRTIREFIGAVLLVPTVIASLWFTIFGDSALLRQRNNGDMLVNGAVDTN
TSLFRLLDGLPIGAITSVLAVLVIVFFFVTSSDSGSLVIDILSAGGELDPPKLTRVYWAVLEGVAAAVLLLIGGAGSLTA
LRTAAIATALPFSIVMVVACYAMTKAFHFDLAATPRLLHVTVPDVVAAGNRRRHDISATLSGLIAVRDVDSGTYIVHPDT
GALTVTAPPDPLDDHVFESDRHVTRRNTTSSR

Specific function: High-Affinity Uptake Of Choline Driven By A Proton-Motive Force. [C]

COG id: COG1292

COG function: function code M; Choline-glycine betaine transporter

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Probable)

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the BCCT transporter (TC 2.A.15) family

Homologues:

Organism=Escherichia coli, GI1786506, Length=499, Percent_Identity=37.2745490981964, Blast_Score=337, Evalue=1e-93
Organism=Escherichia coli, GI1788102, Length=468, Percent_Identity=30.7692307692308, Blast_Score=173, Evalue=3e-44
Organism=Escherichia coli, GI1786224, Length=504, Percent_Identity=26.7857142857143, Blast_Score=152, Evalue=8e-38

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y917_MYCTU (P63695)

Other databases:

- EMBL:   BX842574
- EMBL:   AE000516
- PIR:   E70582
- RefSeq:   NP_215432.1
- RefSeq:   NP_335375.1
- ProteinModelPortal:   P63695
- EnsemblBacteria:   EBMYCT00000002444
- EnsemblBacteria:   EBMYCT00000069746
- GeneID:   885172
- GeneID:   926257
- GenomeReviews:   AE000516_GR
- GenomeReviews:   AL123456_GR
- KEGG:   mtc:MT0942
- KEGG:   mtu:Rv0917
- TIGR:   MT0942
- TubercuList:   Rv0917
- GeneTree:   EBGT00050000018600
- HOGENOM:   HBG682943
- OMA:   RQRRDIS
- ProtClustDB:   CLSK790831
- InterPro:   IPR000060
- InterPro:   IPR018093
- TIGRFAMs:   TIGR00842

Pfam domain/function: PF02028 BCCT

EC number: NA

Molecular weight: Translated: 63869; Mature: 63738

Theoretical pI: Translated: 8.60; Mature: 8.60

Prosite motif: PS01303 BCCT

Important sites: NA

Signals:

None

Transmembrane regions:

12 predicted regions (coordinates not available)

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)
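
The Cys/Met figures are residue counts expressed as a percentage of sequence length. A minimal sketch follows, with the hypothetical helper name `residue_percent`. It assumes each figure is rounded independently, which would explain why the combined value can differ slightly from the sum of the two rounded values.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues.upper())
    return 100.0 * hits / len(seq)


# Toy sequence for illustration only (not from this record).
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```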

Secondary structure:

>Translated Secondary Structure
MSAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSVAENAFVRLNSAITGGV
CCCCCCCCHHHHHHHHHHCCCCEECHHHHHHHHHHHHHHHHHHHHHHHHEEECCHHCCCH
GWWYILVATGFVVFALYCGISRIGTIRLGRDDELPEFSFWAWLAMLFSAGMGIGLVFYGV
HHHHHHHHHHHHHHHHHHHHHHHCEEEECCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHH
AEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAWAIYVVVGLGMAYMTYRRGRP
HHHHHHHHCCCCCCCCCCHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
LSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVATSLGFGITQIASGLEYLGWIRVD
CCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCEEEEC
NWWMVGMIAAITATATASVVSGVSKGLKWLSNINMALAAALALFVLLLGPTLFLLQSWVQ
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH
NLGGYVQSLPQFMLRTAPFSHDGWLGDWTIFYWGWWISWAPFVGMFIARISRGRTIREFI
HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH
GAVLLVPTVIASLWFTIFGDSALLRQRNNGDMLVNGAVDTNTSLFRLLDGLPIGAITSVL
HHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCEEEECCCCCCHHHHHHHCCCCHHHHHHHH
AVLVIVFFFVTSSDSGSLVIDILSAGGELDPPKLTRVYWAVLEGVAAAVLLLIGGAGSLT
HHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHH
ALRTAAIATALPFSIVMVVACYAMTKAFHFDLAATPRLLHVTVPDVVAAGNRRRHDISAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCHHHHCCCCCCHHHHHH
LSGLIAVRDVDSGTYIVHPDTGALTVTAPPDPLDDHVFESDRHVTRRNTTSSR
HHHEEEEEECCCCEEEEECCCCEEEEECCCCCCCHHHHCCCCCHHHCCCCCCH
>Mature Secondary Structure 
SAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSVAENAFVRLNSAITGGV
CCCCCCCHHHHHHHHHHCCCCEECHHHHHHHHHHHHHHHHHHHHHHHHEEECCHHCCCH
GWWYILVATGFVVFALYCGISRIGTIRLGRDDELPEFSFWAWLAMLFSAGMGIGLVFYGV
HHHHHHHHHHHHHHHHHHHHHHHCEEEECCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHH
AEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAWAIYVVVGLGMAYMTYRRGRP
HHHHHHHHCCCCCCCCCCHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
LSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVATSLGFGITQIASGLEYLGWIRVD
CCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCEEEEC
NWWMVGMIAAITATATASVVSGVSKGLKWLSNINMALAAALALFVLLLGPTLFLLQSWVQ
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH
NLGGYVQSLPQFMLRTAPFSHDGWLGDWTIFYWGWWISWAPFVGMFIARISRGRTIREFI
HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH
GAVLLVPTVIASLWFTIFGDSALLRQRNNGDMLVNGAVDTNTSLFRLLDGLPIGAITSVL
HHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCEEEECCCCCCHHHHHHHCCCCHHHHHHHH
AVLVIVFFFVTSSDSGSLVIDILSAGGELDPPKLTRVYWAVLEGVAAAVLLLIGGAGSLT
HHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHH
ALRTAAIATALPFSIVMVVACYAMTKAFHFDLAATPRLLHVTVPDVVAAGNRRRHDISAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCHHHHCCCCCCHHHHHH
LSGLIAVRDVDSGTYIVHPDTGALTVTAPPDPLDDHVFESDRHVTRRNTTSSR
HHHEEEEEECCCCEEEEECCCCEEEEECCCCCCCHHHHCCCCCHHHCCCCCCH
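
The secondary structure lines use one letter per residue. Reading H, E and C as helix, strand and coil is the usual convention and is assumed here; the record does not define the letters. A short sketch with the hypothetical helper `ss_composition` summarizes the share of each state:

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Return the percentage of each state letter in a prediction string."""
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty prediction string")
    counts = Counter(s)
    return {state: 100.0 * n / len(s) for state, n in counts.items()}


# Toy prediction string for illustration only (not from this record).
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To apply this to the record, concatenate only the state lines (every second line of each block) before calling the function.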

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Proton [Periplasm]; choline [Periplasm] [C]

Specific reaction: Proton [Periplasm] + choline [Periplasm] = Proton [Cytoplasm] + choline [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9634230; 12218036