Definition Mycobacterium tuberculosis H37Ra, complete genome.
Accession NC_009525
Length 4,419,977

The map label for this gene is betA [C]

Identifier: 148660472

GI number: 148660472

Start: 798239

End: 799678

Strand: Direct

Name: betA [C]

Synonym: MRA_0705

Alternate gene names: 148660472

Gene position: 798239-799678 (Clockwise)

Preceding gene: 148660471

Following gene: 148660473

Centisome position: 18.06
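
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes that coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the genome length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start, end):
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start, genome_length):
    """Gene start as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(798239, 799678))   # length of the gene in bases
    print(centisome(798239, 4419977))    # position along the chromosome, in percent
```

Under these assumptions the values agree with the 1440-base gene sequence and the centisome position of 18.06 listed in this record.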

GC content: 63.61
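
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the record does not say whether ambiguous bases were handled specially, so the sketch simply counts G and C over the full length.

```python
def gc_percent(dna):
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = dna.upper()
    gc = dna.count("G") + dna.count("C")
    return round(100.0 * gc / len(dna), 2)


if __name__ == "__main__":
    # First ten bases of the gene sequence, used only as a small example.
    print(gc_percent("GTGACTGCGG"))
```

Applying the function to the full gene sequence, with line breaks removed, would be the way to compare against the value reported here.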

Gene sequence:

>1440_bases
GTGACTGCGGCGGTCCGGCATAGCGATGTGCTGGTCGTCGGTGCTGGAAGTGCTGGATCGGTTGTTGCCGAGCGTCTTTC
CATGGACTCGAGCTGTGTGGTGACCGTGCTTGAGGCTGGCCCCGGGCTGGCCGATCCGGGGTTGCTGGCTCAGACGGCCA
ATGGGTTGCAACTGCCGATCGGAGCTGGCAGCCCTCTGGTTGAGCGTTATCGGACGCGGCTCACCGATCGACCGGTTCGC
CACTTGCCGATCGTGCGGGGTGCGACGGTCGGCGGTTCCGGCGCAATCAACGGCGGCTATTTCTGCCGCGGACTGCCCAG
CGATTTCGACCGTGCCTCGATACCAGGCTGGGCATGGTCTGACGTTCTGGAGCACTTCCGGGCTATCGAGACAGATCTGG
ATTTCGAGACGCCTGTGCATGGCCGTAGTGGCCCCATCCCAGTTCGCCGCACACACGAAATGACTGGCATCACTGAAAGT
TTCATGGCTGCCGCAGAGGACGCAGGGTTCGCTTGGATCGCTGACCTCAACGATGTTGGGCCGGAAATGCCTTCGGGTGT
AGGCGCGGTCCCGCTCAACATCGTTAACGGCGTACGCACCAGCTCGGCGGTCGGCTATCTGATGCCCGCGCTGGGACGGC
CGAATCTGACACTGCTGGCCCGGACGCGGGCGGTGCGGTTGCGCTTTTCCGCCACCACCGCGGTGGGTGTCGACGCGATC
GGCCCAGGAGGCCCGGTAAGCCTGAGCGCTGACCGAATCGTATTGTGCGCCGGAGCGATTCAGTCAGCTCATCTGTTGAT
GCTCTCGGGCGTCGGCGAGGAGGAGGTGTTGCGATCCGCCGGTGTGAAGGTGCTTATGGCGTTGCCGGTTGGCATGGGCT
GCAGTGACCACCCGGAATGGGTGATGCCGACCAACTGGGCGGTGGCTGTCGATCGGCCGGTGTTAGAGGTGCTGCTGAGC
ACTCATGACGGCATCGAAATAAGGCCGTACACAGGCGGCTTCGTTGCGATGACCGGCGACGGTACAGCCGGGCATCGCGA
TTGGCCGCATATCGGGGTGGCGCTCATGCAGCCGCGGGCACGCGGACGCATCACGTTGGTCTCGAGTGATCCCCAGATAC
CAGTCCGCATCGAGCACCGATACGACAGTGAACCTGCCGATGTCGCGGCCCTGCGCCAGGGTAGCGCATTGGCCCACGAA
TTATGCGGTGCGGCAACGCGCATCGGTCCAGCCGTATGGGCGACATCGCAGCATCTGTGTGGTAGTGCCCCAATGGGCAC
CGACGATGACCCACGAGCCGTCGTCGACCCGAGGTGTCGGGTCCGCGGCATCGAAAACCTATGGGTGATAGACGGATCTG
TCCTTCCGTCGATCACCAGTCGCGGTCCACACGCAACGATCGTAATGCTGGGCCACCGCGCGGCCGAATTTGTTCAGTGA
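
The gene begins with GTG rather than ATG. The sketch below shows one way to translate such a coding sequence, assuming the standard genetic code and assuming that the first codon is read as methionine whatever its normal meaning. Both are assumptions about how the translated protein in this record was produced, and the helper name is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if i == 0:
            aa = "M"  # assumption: the initiator codon (here GTG) is read as Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGACTGCGGCGGTCCGGCATAGCGATGTG"))
```

The 1440 bases correspond to 480 codons, that is, 479 residues plus the terminal TGA stop codon.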

Upstream 100 bases:

>100_bases
CTGAAGCGCGTGGACGACTTGGCTTATGGCGCTGGCCTGTGGTACGGGGTGGTGCGCGAACGTAACATCGGCGCGCTCAA
GCCGCAGATTCGTACCTAGT

Downstream 100 bases:

>100_bases
CTTTCGTCGAGTGGGGCGACCACAGCGGTCGCTGCCGAATGTGCATTTCGGTCAGGCATTGAGCAGGGGACCGAATAGCG
TAGCTCCGCATCGGACTGCA

Product: putative dehydrogenase

Products: Betaine Aldehyde; Reduced Acceptor. [C]

Alternate protein names: ORF2 [H]

Number of amino acids: Translated: 479; Mature: 478

Sequences:

>Translated_479_residues
MTAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLADPGLLAQTANGLQLPIGAGSPLVERYRTRLTDRPVR
HLPIVRGATVGGSGAINGGYFCRGLPSDFDRASIPGWAWSDVLEHFRAIETDLDFETPVHGRSGPIPVRRTHEMTGITES
FMAAAEDAGFAWIADLNDVGPEMPSGVGAVPLNIVNGVRTSSAVGYLMPALGRPNLTLLARTRAVRLRFSATTAVGVDAI
GPGGPVSLSADRIVLCAGAIQSAHLLMLSGVGEEEVLRSAGVKVLMALPVGMGCSDHPEWVMPTNWAVAVDRPVLEVLLS
THDGIEIRPYTGGFVAMTGDGTAGHRDWPHIGVALMQPRARGRITLVSSDPQIPVRIEHRYDSEPADVAALRQGSALAHE
LCGAATRIGPAVWATSQHLCGSAPMGTDDDPRAVVDPRCRVRGIENLWVIDGSVLPSITSRGPHATIVMLGHRAAEFVQ
>Mature_478_residues
TAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLADPGLLAQTANGLQLPIGAGSPLVERYRTRLTDRPVRH
LPIVRGATVGGSGAINGGYFCRGLPSDFDRASIPGWAWSDVLEHFRAIETDLDFETPVHGRSGPIPVRRTHEMTGITESF
MAAAEDAGFAWIADLNDVGPEMPSGVGAVPLNIVNGVRTSSAVGYLMPALGRPNLTLLARTRAVRLRFSATTAVGVDAIG
PGGPVSLSADRIVLCAGAIQSAHLLMLSGVGEEEVLRSAGVKVLMALPVGMGCSDHPEWVMPTNWAVAVDRPVLEVLLST
HDGIEIRPYTGGFVAMTGDGTAGHRDWPHIGVALMQPRARGRITLVSSDPQIPVRIEHRYDSEPADVAALRQGSALAHEL
CGAATRIGPAVWATSQHLCGSAPMGTDDDPRAVVDPRCRVRGIENLWVIDGSVLPSITSRGPHATIVMLGHRAAEFVQ

Specific function: Can Catalyze The Oxidation Of Choline To Betaine Aldehyde And Betaine Aldehyde To Glycine Betaine At The Same Rate. [C]

COG id: COG2303

COG function: function code E; Choline dehydrogenase and related flavoproteins

Gene ontology:

Cell location: Membrane-Bound [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the GMC oxidoreductase family [H]

Homologues:

Organism=Homo sapiens, GI217272839, Length=542, Percent_Identity=29.1512915129151, Blast_Score=165, Evalue=9e-41,
Organism=Escherichia coli, GI1786503, Length=546, Percent_Identity=28.7545787545788, Blast_Score=154, Evalue=1e-38,
Organism=Caenorhabditis elegans, GI17532301, Length=551, Percent_Identity=26.3157894736842, Blast_Score=157, Evalue=1e-38,
Organism=Drosophila melanogaster, GI17137792, Length=551, Percent_Identity=29.7640653357532, Blast_Score=184, Evalue=1e-46,
Organism=Drosophila melanogaster, GI24642048, Length=580, Percent_Identity=28.448275862069, Blast_Score=179, Evalue=5e-45,
Organism=Drosophila melanogaster, GI24642055, Length=572, Percent_Identity=28.3216783216783, Blast_Score=165, Evalue=7e-41,
Organism=Drosophila melanogaster, GI24642039, Length=571, Percent_Identity=28.7215411558669, Blast_Score=160, Evalue=2e-39,
Organism=Drosophila melanogaster, GI24642037, Length=572, Percent_Identity=27.6223776223776, Blast_Score=144, Evalue=1e-34,
Organism=Drosophila melanogaster, GI24650267, Length=308, Percent_Identity=30.8441558441558, Blast_Score=128, Evalue=8e-30,
Organism=Drosophila melanogaster, GI24642059, Length=571, Percent_Identity=25.0437828371278, Blast_Score=126, Evalue=4e-29,
Organism=Drosophila melanogaster, GI18859993, Length=574, Percent_Identity=24.2160278745645, Blast_Score=119, Evalue=4e-27,
Organism=Drosophila melanogaster, GI24642035, Length=309, Percent_Identity=30.42071197411, Blast_Score=117, Evalue=1e-26,
Organism=Drosophila melanogaster, GI24642042, Length=303, Percent_Identity=31.023102310231, Blast_Score=115, Evalue=6e-26,
Organism=Drosophila melanogaster, GI18859995, Length=303, Percent_Identity=32.013201320132, Blast_Score=114, Evalue=1e-25,
Organism=Drosophila melanogaster, GI45549471, Length=313, Percent_Identity=29.7124600638978, Blast_Score=108, Evalue=8e-24,
Organism=Drosophila melanogaster, GI45551458, Length=313, Percent_Identity=29.7124600638978, Blast_Score=108, Evalue=9e-24,
Organism=Drosophila melanogaster, GI24642051, Length=301, Percent_Identity=30.2325581395349, Blast_Score=89, Evalue=4e-18,
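
Each homologue line is a comma-separated list of key=value fields plus a bare GI identifier. The sketch below is one possible way to parse such a line into typed values; the function name and the choice to round the identity to one decimal place are assumptions made for illustration.

```python
def parse_homologue(line):
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    fields = [f.strip() for f in line.strip().rstrip(",").split(",")]
    record = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = round(float(record["Percent_Identity"]), 1)
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


if __name__ == "__main__":
    line = ("Organism=Homo sapiens, GI217272839, Length=542, "
            "Percent_Identity=29.1512915129151, Blast_Score=165, Evalue=9e-41,")
    print(parse_homologue(line))
```

Parsed records can then be sorted by Evalue or Blast_Score to rank the hits.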

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR012132
- InterPro:   IPR000172
- InterPro:   IPR007867 [H]

Pfam domain/function: PF05199 GMC_oxred_C; PF00732 GMC_oxred_N [H]

EC number: 1.1.99.1 [C]

Molecular weight: Translated: 50292; Mature: 50161

Theoretical pI: Translated: 6.34; Mature: 6.34

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
4.2 %Cys+Met (Mature Protein)
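
Percentages of this kind can be derived by counting residues in the protein sequence. A minimal sketch follows, assuming the values are simple residue counts divided by sequence length and rounded to one decimal place; the function name is hypothetical.

```python
def residue_percent(sequence, residues):
    """Percentage of the sequence made up of the given one-letter residues."""
    sequence = sequence.upper()
    count = sum(sequence.count(r) for r in residues)
    return round(100.0 * count / len(sequence), 1)


if __name__ == "__main__":
    toy = "MTAAVRHSDVLVVGAGSAGSVVAERLSMDSSC"  # first 32 residues only
    print(residue_percent(toy, "C"))
    print(residue_percent(toy, "M"))
    print(residue_percent(toy, "CM"))
```

The listed percentages are consistent with 7 Cys and 14 Met in the 479-residue translated protein, and with 7 Cys and 13 Met in the 478-residue mature protein once the initiator methionine is removed.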

Secondary structure:

>Translated Secondary Structure
MTAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLADPGLLAQTANGLQLPI
CCCEEEECCEEEEECCCCHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHHHCCCCEEEEC
GAGSPLVERYRTRLTDRPVRHLPIVRGATVGGSGAINGGYFCRGLPSDFDRASIPGWAWS
CCCCHHHHHHHHHHCCCCHHHCCEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHH
DVLEHFRAIETDLDFETPVHGRSGPIPVRRTHEMTGITESFMAAAEDAGFAWIADLNDVG
HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCHHHHHHHHHCCCCEEEEECCHHCC
PEMPSGVGAVPLNIVNGVRTSSAVGYLMPALGRPNLTLLARTRAVRLRFSATTAVGVDAI
CCCCCCCCCCCHHHHCCCCCHHHHHHHHHCCCCCCEEEEEEEEEEEEEEECEEEECCCCC
GPGGPVSLSADRIVLCAGAIQSAHLLMLSGVGEEEVLRSAGVKVLMALPVGMGCSDHPEW
CCCCCCEECCCEEEEEECCCCCCEEEEEECCCHHHHHHHCCCEEEEEECCCCCCCCCCCE
VMPTNWAVAVDRPVLEVLLSTHDGIEIRPYTGGFVAMTGDGTAGHRDWPHIGVALMQPRA
ECCCCEEEEECHHHHHHHHHCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCEEEECCCC
RGRITLVSSDPQIPVRIEHRYDSEPADVAALRQGSALAHELCGAATRIGPAVWATSQHLC
CCEEEEEECCCCCEEEEECCCCCCCHHHHHHHCCCHHHHHHHHHHHHCCHHHCCCCHHHC
GSAPMGTDDDPRAVVDPRCRVRGIENLWVIDGSVLPSITSRGPHATIVMLGHRAAEFVQ
CCCCCCCCCCCCCEECCCHHEECCCEEEEECCCCCCHHHCCCCCEEEEEECCCHHHHCC
>Mature Secondary Structure 
TAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLADPGLLAQTANGLQLPI
CCEEEECCEEEEECCCCHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHHHCCCCEEEEC
GAGSPLVERYRTRLTDRPVRHLPIVRGATVGGSGAINGGYFCRGLPSDFDRASIPGWAWS
CCCCHHHHHHHHHHCCCCHHHCCEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHH
DVLEHFRAIETDLDFETPVHGRSGPIPVRRTHEMTGITESFMAAAEDAGFAWIADLNDVG
HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCHHHHHHHHHCCCCEEEEECCHHCC
PEMPSGVGAVPLNIVNGVRTSSAVGYLMPALGRPNLTLLARTRAVRLRFSATTAVGVDAI
CCCCCCCCCCCHHHHCCCCCHHHHHHHHHCCCCCCEEEEEEEEEEEEEEECEEEECCCCC
GPGGPVSLSADRIVLCAGAIQSAHLLMLSGVGEEEVLRSAGVKVLMALPVGMGCSDHPEW
CCCCCCEECCCEEEEEECCCCCCEEEEEECCCHHHHHHHCCCEEEEEECCCCCCCCCCCE
VMPTNWAVAVDRPVLEVLLSTHDGIEIRPYTGGFVAMTGDGTAGHRDWPHIGVALMQPRA
ECCCCEEEEECHHHHHHHHHCCCCCEEEECCCCEEEEECCCCCCCCCCCCCCEEEECCCC
RGRITLVSSDPQIPVRIEHRYDSEPADVAALRQGSALAHELCGAATRIGPAVWATSQHLC
CCEEEEEECCCCCEEEEECCCCCCCHHHHHHHCCCHHHHHHHHHHHHCCHHHCCCCHHHC
GSAPMGTDDDPRAVVDPRCRVRGIENLWVIDGSVLPSITSRGPHATIVMLGHRAAEFVQ
CCCCCCCCCCCCCEECCCHHEECCCEEEEECCCCCCHHHCCCCCEEEEEECCCHHHHCC
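
The prediction strings above pair each residue with a one-letter state. The sketch below summarizes such a string as percentages, assuming the common convention H = helix, E = strand, C = coil; the record does not define the letters, and the function name is illustrative.

```python
from collections import Counter


def ss_composition(ss):
    """Percentage of helix (H), strand (E) and coil (C) states in a prediction."""
    counts = Counter(ss.upper())
    total = len(ss)
    return {state: round(100.0 * counts.get(state, 0) / total, 1)
            for state in "HEC"}


if __name__ == "__main__":
    # First 20 states of the translated secondary structure.
    print(ss_composition("CCCEEEECCEEEEECCCCHH"))
```

Concatenating the state lines of one block (skipping the amino acid lines) and passing the result to the function gives the overall composition for the protein.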

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: Coenzyme Q [C]

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): 7 {choline}; 1.1 {phenazine} [C]
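
To give the Km values a concrete meaning, the sketch below evaluates the Michaelis-Menten rate as a fraction of Vmax. It assumes simple Michaelis-Menten kinetics, which the record does not confirm for this enzyme, and the function name is hypothetical.

```python
def fractional_rate(substrate_mM, km_mM):
    """Reaction rate as a fraction of Vmax under Michaelis-Menten kinetics."""
    return substrate_mM / (km_mM + substrate_mM)


if __name__ == "__main__":
    km_choline = 7.0  # mM, from the Km field
    for conc in (0.7, 7.0, 70.0):
        print(conc, round(fractional_rate(conc, km_choline), 3))
```

Under this model the enzyme runs at half its maximal rate when the choline concentration equals the Km of 7 mM.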

Substrates: Choline; Acceptor [C]

Specific reaction: Choline + Acceptor = Betaine Aldehyde + Reduced Acceptor. [C]

General reaction: Redox reaction [C]

Inhibitor: 2-Dimethylaminoethanol; Betaine aldehyde; Monoethanolamine; Semicarbazide [C]

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 7836301 [H]