Definition Mesorhizobium sp. BNC1, complete genome.
Accession NC_008254
Length 4,412,446

The map label for this gene is betA [H]

Identifier: 110636059

GI number: 110636059

Start: 4032908

End: 4034527

Strand: Reverse

Name: betA [H]

Synonym: Meso_3734

Alternate gene names: 110636059

Gene position: 4034527-4032908 (Counterclockwise)

Preceding gene: 110636060

Following gene: 110636044

Centisome position: 91.44
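
The coordinates and centisome position above can be cross-checked with simple arithmetic. This is a minimal sketch, not the database's own procedure. The function names are hypothetical, and using the higher coordinate (the 5' end of this reverse-strand gene) as the reference point is an assumption inferred from the listed value of 91.44.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 4_412_446          # "Length" field of the record
START, END = 4_032_908, 4_034_527  # "Start" and "End" fields

length = gene_length(START, END)
# Assumption: the reverse-strand gene is anchored at its higher coordinate.
centisome_position = centisome(END, GENOME_LENGTH)

print(length, centisome_position)
```

Under these assumptions the length agrees with the 1620-base gene sequence and the percentage agrees with the listed centisome position.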

GC content: 60.74
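
GC content is the percentage of G and C bases in the gene sequence; for a 1620-base gene, 60.74 corresponds to 984 G or C bases. A minimal sketch for recomputing it from a FASTA-style sequence body follows. The function name `gc_content` is hypothetical, and the sketch assumes the input contains only sequence lines (no `>` header).

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Consistency check of the listed value against the gene length.
implied_gc_bases = round(60.74 / 100 * 1620)
print(implied_gc_bases, round(100.0 * implied_gc_bases / 1620, 2))
```

Passing the full gene sequence from this record to `gc_content` should reproduce the listed figure if the record is internally consistent.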

Gene sequence:

>1620_bases
ATGAGCAACCAGGTCGAAGCCGACGAGGATTTCGGCACCTACGATCATATCGTGATTGGCGCCGGTAGCGCGGGATGCGT
GCTGGCGAACCGACTCACACGCGACGGATCGCGCAGGGTGCTGCTCCTGGAAGCCGGTGGCAGCGACAATTGGCACTGGA
TCCGAATTCCCATCGGGTATGTCTATTGTATCGGCAATCCGCGCACCGATTGGATGTACAAGACGGAGCCGGAGCCGGGA
TTGAACGGTCGATCCATCGGCTATCCGCGTGGGCGCGTGCTTGGCGGCTGCTCGTCAATCAACGGGATGATCTACATGCG
CGGACAGGCGCGTGACTACGATCACTGGCGTCAGCTCGGCAATGTGGGCTGGAGTTGGGAGGACGTACTGCCGCTGTTCA
AGCGAGCCGAGAACTATTATCGCGGTGAGGATGACTATCACGGCGCCGAGGGCGAACTTCGCGTAGAGAAACAGCGCCTG
CACTGGCCGATACTGGATGCCTTCCGTGATGCGGCCGAGGCGGCGGGCATCCCGCGAACCGAAGACTTCAACCGCGGAGA
CAACGAAGGTTGCGGCTATTTTGACGTCACGCAACGCGGCGGCTTCCGGTGGAACGCGGTGCGGGCCTTTCTTGCCCCTG
TTCGTAACCGGCCTAATCTTCGGATTCAGATCAATGCACAGGTGGACCGGCTCATTTTTGAAGGTAATCGCGCCACAGGC
GTGCGTTTCCGTCTCGGTGGCCGTGACCGGATAGCCAAGGCCCGGGCCGACATACTGCTCGCGGCCGGGGCTATCGGGTC
GCCTGTAATCCTACAGCGTTCCGGGATAGGCGATCCCGACCATCTGGCTGCTCTCGGAATCGAGACACGTCGCGCTCTGA
AAGGCGTGGGCGCGAACTTGCAGGATCACCTCCAGCTTCGCTGCATCTACGCGGTCAGCGGCGCCTCAACGCTTAATGCG
CGTGCCAGAACGCTTATTGGGAAGGGTATGATGGGGGTCGAATATCTCCTGCGACGCACCGGACCCCTGTCGATGGCACC
GAGCCAACTCGGTGCTTTCGCGCGATCAGGATCGCATGTCGAGTCAGCCGATCTGGAGTTCCATGTCCAACCGCTGTCGC
TTGATCGTTTCGGCGAGCCCCTGCATACGTTTCCGGCGATCACCGCCAGTGTCTGTCACCTACGGCCGGAAAGCAGGGGC
GTGGTTCGGATTCGCTCGAGCGAACCTTCCGAGCCTCCGGCGATCCAGCCCAATTATCTATCGACCGAGACCGACCGGGC
AGTTGCGGCGAGTGCCATTCGCCTCACCCGCCGGATCATGGCTCAGGAGCCGATGCGGCGTTACCAGCCTCAAGAATTGA
AGCCCGGAGGGGATGACGACAGTGAAGAGGCGCTCCGCCGCGCGGCCGGCGAGATTGGGACAACCATTTTCCATCCAGTC
GGCACAGCACGCATGGGAACTGACCCGGAGGCTGTGGTTGATCCAGAATTGCGTGTTTACGGTATAGACAATCTGCGCAT
TGCCGATGCCTCGATCATGCCCACGATCACGTCGGGCAACACAAATGCCCCGACGATGATGATTGCCGAAAAGGCAGCGC
AACTGCTTTGCGTTTCCTAA

Upstream 100 bases:

>100_bases
CAAGGCGTGTTTCAGCAACACCGCCGACCATGCCGACATTAGTCGACGCGGTGCCCTCGGCGGCCGCCTTTGCGCTCGCG
ATAATCAGGAAGAATCGGCG

Downstream 100 bases:

>100_bases
GATAGTCTTGCGGCAGCTCTGCCGATGACTGCAAACCGCGAGACCTTCGGTTCGGGTTCTGCGGGATGAGGCGCGGCGCT
ATAGCGGCGATATTTGCGGA

Product: glucose-methanol-choline oxidoreductase

Products: NA

Alternate protein names: CDH; CHD [H]

Number of amino acids: Translated: 539; Mature: 538
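
The two residue counts follow from the gene length. The sketch below assumes that the 1620 bases include the terminal stop codon (the gene sequence ends in TAA) and that the mature form differs only by removal of the initiator methionine, which is what the Mature sequence in this record shows. The function name is hypothetical.

```python
from typing import Tuple


def residue_counts(cds_length: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the length includes one stop codon and that the mature
    protein only loses its initiator methionine.
    """
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = cds_length // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(residue_counts(1620))
```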

Protein sequence:

>539_residues
MSNQVEADEDFGTYDHIVIGAGSAGCVLANRLTRDGSRRVLLLEAGGSDNWHWIRIPIGYVYCIGNPRTDWMYKTEPEPG
LNGRSIGYPRGRVLGGCSSINGMIYMRGQARDYDHWRQLGNVGWSWEDVLPLFKRAENYYRGEDDYHGAEGELRVEKQRL
HWPILDAFRDAAEAAGIPRTEDFNRGDNEGCGYFDVTQRGGFRWNAVRAFLAPVRNRPNLRIQINAQVDRLIFEGNRATG
VRFRLGGRDRIAKARADILLAAGAIGSPVILQRSGIGDPDHLAALGIETRRALKGVGANLQDHLQLRCIYAVSGASTLNA
RARTLIGKGMMGVEYLLRRTGPLSMAPSQLGAFARSGSHVESADLEFHVQPLSLDRFGEPLHTFPAITASVCHLRPESRG
VVRIRSSEPSEPPAIQPNYLSTETDRAVAASAIRLTRRIMAQEPMRRYQPQELKPGGDDDSEEALRRAAGEIGTTIFHPV
GTARMGTDPEAVVDPELRVYGIDNLRIADASIMPTITSGNTNAPTMMIAEKAAQLLCVS
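
The protein sequence is the conceptual translation of the gene sequence. As an illustration only, the sketch below translates DNA using the standard codon assignments; the `translate` helper is hypothetical, not the tool used to build this record, and it assumes the input is already in the coding (5' to 3') orientation, as the gene sequence above is.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGAGCAACCAGGTCGAAGCCGACGAGGAT"))
print(translate("CTGCTTTGCGTTTCCTAA"))
```

The two fragments correspond to the first ten and last five residues of the protein sequence listed above.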

Sequences:

>Translated_539_residues
MSNQVEADEDFGTYDHIVIGAGSAGCVLANRLTRDGSRRVLLLEAGGSDNWHWIRIPIGYVYCIGNPRTDWMYKTEPEPG
LNGRSIGYPRGRVLGGCSSINGMIYMRGQARDYDHWRQLGNVGWSWEDVLPLFKRAENYYRGEDDYHGAEGELRVEKQRL
HWPILDAFRDAAEAAGIPRTEDFNRGDNEGCGYFDVTQRGGFRWNAVRAFLAPVRNRPNLRIQINAQVDRLIFEGNRATG
VRFRLGGRDRIAKARADILLAAGAIGSPVILQRSGIGDPDHLAALGIETRRALKGVGANLQDHLQLRCIYAVSGASTLNA
RARTLIGKGMMGVEYLLRRTGPLSMAPSQLGAFARSGSHVESADLEFHVQPLSLDRFGEPLHTFPAITASVCHLRPESRG
VVRIRSSEPSEPPAIQPNYLSTETDRAVAASAIRLTRRIMAQEPMRRYQPQELKPGGDDDSEEALRRAAGEIGTTIFHPV
GTARMGTDPEAVVDPELRVYGIDNLRIADASIMPTITSGNTNAPTMMIAEKAAQLLCVS
>Mature_538_residues
SNQVEADEDFGTYDHIVIGAGSAGCVLANRLTRDGSRRVLLLEAGGSDNWHWIRIPIGYVYCIGNPRTDWMYKTEPEPGL
NGRSIGYPRGRVLGGCSSINGMIYMRGQARDYDHWRQLGNVGWSWEDVLPLFKRAENYYRGEDDYHGAEGELRVEKQRLH
WPILDAFRDAAEAAGIPRTEDFNRGDNEGCGYFDVTQRGGFRWNAVRAFLAPVRNRPNLRIQINAQVDRLIFEGNRATGV
RFRLGGRDRIAKARADILLAAGAIGSPVILQRSGIGDPDHLAALGIETRRALKGVGANLQDHLQLRCIYAVSGASTLNAR
ARTLIGKGMMGVEYLLRRTGPLSMAPSQLGAFARSGSHVESADLEFHVQPLSLDRFGEPLHTFPAITASVCHLRPESRGV
VRIRSSEPSEPPAIQPNYLSTETDRAVAASAIRLTRRIMAQEPMRRYQPQELKPGGDDDSEEALRRAAGEIGTTIFHPVG
TARMGTDPEAVVDPELRVYGIDNLRIADASIMPTITSGNTNAPTMMIAEKAAQLLCVS

Specific function: Can catalyze the oxidation of choline to betaine aldehyde and betaine aldehyde to glycine betaine [H]

COG id: COG2303

COG function: function code E; Choline dehydrogenase and related flavoproteins

Gene ontology:

Cell location: Membrane-Bound [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the GMC oxidoreductase family [H]

Homologues:

Organism=Homo sapiens, GI217272839, Length=532, Percent_Identity=42.6691729323308, Blast_Score=409, Evalue=1e-114
Organism=Escherichia coli, GI1786503, Length=531, Percent_Identity=41.2429378531073, Blast_Score=358, Evalue=1e-100
Organism=Caenorhabditis elegans, GI17532301, Length=542, Percent_Identity=37.2693726937269, Blast_Score=357, Evalue=7e-99
Organism=Drosophila melanogaster, GI24642042, Length=566, Percent_Identity=36.3957597173145, Blast_Score=308, Evalue=5e-84
Organism=Drosophila melanogaster, GI24642048, Length=581, Percent_Identity=34.7676419965577, Blast_Score=291, Evalue=8e-79
Organism=Drosophila melanogaster, GI45551458, Length=565, Percent_Identity=34.8672566371681, Blast_Score=286, Evalue=3e-77
Organism=Drosophila melanogaster, GI45549471, Length=565, Percent_Identity=34.8672566371681, Blast_Score=285, Evalue=4e-77
Organism=Drosophila melanogaster, GI24642055, Length=571, Percent_Identity=35.3765323992995, Blast_Score=285, Evalue=7e-77
Organism=Drosophila melanogaster, GI24642059, Length=581, Percent_Identity=34.9397590361446, Blast_Score=281, Evalue=6e-76
Organism=Drosophila melanogaster, GI17137792, Length=572, Percent_Identity=35.1398601398601, Blast_Score=281, Evalue=1e-75
Organism=Drosophila melanogaster, GI24642039, Length=570, Percent_Identity=34.2105263157895, Blast_Score=275, Evalue=5e-74
Organism=Drosophila melanogaster, GI18859995, Length=566, Percent_Identity=34.6289752650177, Blast_Score=269, Evalue=3e-72
Organism=Drosophila melanogaster, GI24642051, Length=570, Percent_Identity=33.5087719298246, Blast_Score=265, Evalue=6e-71
Organism=Drosophila melanogaster, GI24650267, Length=564, Percent_Identity=34.3971631205674, Blast_Score=263, Evalue=2e-70
Organism=Drosophila melanogaster, GI18859993, Length=566, Percent_Identity=33.0388692579505, Blast_Score=251, Evalue=7e-67
Organism=Drosophila melanogaster, GI24642037, Length=569, Percent_Identity=32.5131810193322, Blast_Score=242, Evalue=4e-64
Organism=Drosophila melanogaster, GI24642035, Length=306, Percent_Identity=40.1960784313725, Blast_Score=202, Evalue=3e-52
Organism=Drosophila melanogaster, GI24645930, Length=590, Percent_Identity=28.135593220339, Blast_Score=155, Evalue=1e-37
Organism=Drosophila melanogaster, GI24642057, Length=589, Percent_Identity=28.0135823429542, Blast_Score=145, Evalue=8e-35

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011533
- InterPro:   IPR012132
- InterPro:   IPR000172
- InterPro:   IPR007867 [H]

Pfam domain/function: PF05199 GMC_oxred_C; PF00732 GMC_oxred_N [H]

EC number: 1.1.99.1 [H]

Molecular weight: Translated: 59354; Mature: 59223

Theoretical pI: Translated: 7.85; Mature: 7.85

Prosite motif: PS00624 GMC_OXRED_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
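
The percentages above are residue counts divided by protein length. They are consistent with 7 Cys and 13 Met in the 539-residue translated protein, and 7 Cys and 12 Met in the 538-residue mature form. A minimal sketch of the calculation follows; the helper names are hypothetical and rounding to one decimal place is an assumption based on the precision shown.

```python
from typing import Dict


def percent(count: int, total: int) -> float:
    """Share of `count` in `total` as a percentage, to 1 decimal."""
    return round(100.0 * count / total, 1)


def cys_met_content(protein: str) -> Dict[str, float]:
    """Cys, Met and combined content of a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()
    cys, met = seq.count("C"), seq.count("M")
    return {
        "Cys": percent(cys, len(seq)),
        "Met": percent(met, len(seq)),
        "Cys+Met": percent(cys + met, len(seq)),
    }


# Counts assumed from the sequences in this record.
print(percent(7, 539), percent(13, 539), percent(20, 539))   # translated
print(percent(7, 538), percent(12, 538), percent(19, 538))   # mature
```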

Secondary structure:

>Translated Secondary Structure
MSNQVEADEDFGTYDHIVIGAGSAGCVLANRLTRDGSRRVLLLEAGGSDNWHWIRIPIGY
CCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHCCCCCCEEEEEECCCCCCEEEEEEEEEE
VYCIGNPRTDWMYKTEPEPGLNGRSIGYPRGRVLGGCSSINGMIYMRGQARDYDHWRQLG
EEECCCCCCCCEEECCCCCCCCCCCCCCCCCCEEECCCCCCCEEEEECCCCCHHHHHHHC
NVGWSWEDVLPLFKRAENYYRGEDDYHGAEGELRVEKQRLHWPILDAFRDAAEAAGIPRT
CCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEHHHCCCHHHHHHHHHHHHCCCCCC
EDFNRGDNEGCGYFDVTQRGGFRWNAVRAFLAPVRNRPNLRIQINAQVDRLIFEGNRATG
CCCCCCCCCCCCEEEECCCCCEEHHHHHHHHHHHCCCCCEEEEEECEEEEEEEECCCCCE
VRFRLGGRDRIAKARADILLAAGAIGSPVILQRSGIGDPDHLAALGIETRRALKGVGANL
EEEEECCHHHHHHHHHCEEEECCCCCCCEEEEECCCCCHHHHEECCHHHHHHHHHCCCCC
QDHLQLRCIYAVSGASTLNARARTLIGKGMMGVEYLLRRTGPLSMAPSQLGAFARSGSHV
CCCEEEEEEEEECCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCCC
ESADLEFHVQPLSLDRFGEPLHTFPAITASVCHLRPESRGVVRIRSSEPSEPPAIQPNYL
CCCCCEEEEEECCHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCC
STETDRAVAASAIRLTRRIMAQEPMRRYQPQELKPGGDDDSEEALRRAAGEIGTTIFHPV
CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCEEECCC
GTARMGTDPEAVVDPELRVYGIDNLRIADASIMPTITSGNTNAPTMMIAEKAAQLLCVS
CCCCCCCCCCCEECCCEEEEECCCEEEECCEEEEEEECCCCCCCEEHEEHHHHHHEECC
>Mature Secondary Structure 
SNQVEADEDFGTYDHIVIGAGSAGCVLANRLTRDGSRRVLLLEAGGSDNWHWIRIPIGY
CCCCCCCCCCCCCCEEEEECCCCCHHHHHHHCCCCCCEEEEEECCCCCCEEEEEEEEEE
VYCIGNPRTDWMYKTEPEPGLNGRSIGYPRGRVLGGCSSINGMIYMRGQARDYDHWRQLG
EEECCCCCCCCEEECCCCCCCCCCCCCCCCCCEEECCCCCCCEEEEECCCCCHHHHHHHC
NVGWSWEDVLPLFKRAENYYRGEDDYHGAEGELRVEKQRLHWPILDAFRDAAEAAGIPRT
CCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEHHHCCCHHHHHHHHHHHHCCCCCC
EDFNRGDNEGCGYFDVTQRGGFRWNAVRAFLAPVRNRPNLRIQINAQVDRLIFEGNRATG
CCCCCCCCCCCCEEEECCCCCEEHHHHHHHHHHHCCCCCEEEEEECEEEEEEEECCCCCE
VRFRLGGRDRIAKARADILLAAGAIGSPVILQRSGIGDPDHLAALGIETRRALKGVGANL
EEEEECCHHHHHHHHHCEEEECCCCCCCEEEEECCCCCHHHHEECCHHHHHHHHHCCCCC
QDHLQLRCIYAVSGASTLNARARTLIGKGMMGVEYLLRRTGPLSMAPSQLGAFARSGSHV
CCCEEEEEEEEECCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCCC
ESADLEFHVQPLSLDRFGEPLHTFPAITASVCHLRPESRGVVRIRSSEPSEPPAIQPNYL
CCCCCEEEEEECCHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCC
STETDRAVAASAIRLTRRIMAQEPMRRYQPQELKPGGDDDSEEALRRAAGEIGTTIFHPV
CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCEEECCC
GTARMGTDPEAVVDPELRVYGIDNLRIADASIMPTITSGNTNAPTMMIAEKAAQLLCVS
CCCCCCCCCCCEECCCEEEEECCCEEEECCEEEEEEECCCCCCCEEHEEHHHHHHEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA