Definition Mesorhizobium sp. BNC1, complete genome.
Accession NC_008254
Length 4,412,446


The map label for this gene is betA [H]

Identifier: 110636079

GI number: 110636079

Start: 4056934

End: 4058592

Strand: Reverse
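
Because the gene lies on the reverse strand, the coding sequence shown in this record reads in the opposite direction to the forward genome strand. The sketch below shows the usual reverse-complement conversion between the two orientations; the function name `reverse_complement` is illustrative and not part of the record, and it handles only the unambiguous bases A, C, G and T.

```python
# Complement map for unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# The last nine bases of the coding sequence in this record are GCGGCCTAA.
# Read along the forward strand, the same stretch appears reverse-complemented.
print(reverse_complement("GCGGCCTAA"))  # TTAGGCCGC
```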

Name: betA [H]

Synonym: Meso_3754

Alternate gene names: 110636079

Gene position: 4058592-4056934 (Counterclockwise)
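
The Start and End fields can be checked against the length of the gene sequence. The sketch below assumes 1-based, inclusive coordinates (an assumption about this record's convention, although it reproduces the 1659 bases listed under "Gene sequence"); `gene_length` is an illustrative name.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates.

    The order of the two coordinates does not matter, so a reverse-strand
    position written as end-start gives the same result.
    """
    low, high = sorted((start, end))
    return high - low + 1


print(gene_length(4056934, 4058592))  # 1659
print(gene_length(4058592, 4056934))  # 1659
```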

Preceding gene: 110636080

Following gene: 110636078

Centisome position: 91.98
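
The centisome position expresses a coordinate as a percentage of the genome length. The sketch below assumes the value is computed from the 5' end of the gene, which for this reverse-strand gene is coordinate 4058592; that choice is an inference from the numbers in this record, not a documented rule, and `centisome` is an illustrative name.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 4412446  # "Length" field of NC_008254 in this record

print(centisome(4058592, GENOME_LENGTH))  # 91.98, matches the record
print(centisome(4056934, GENOME_LENGTH))  # 91.94, using the other coordinate
```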

GC content: 62.09
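
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows; it assumes the record's value is taken over the gene sequence alone and rounded to two decimals, and `gc_content` is an illustrative name. The example uses the first twelve bases of the gene rather than the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First twelve bases of the gene sequence in this record: 5 of 12 are G or C.
print(gc_content("ATGAATGCTTCG"))  # 41.67
```

Passing the full 1659-base sequence, with line breaks removed, applies the same calculation to the whole gene.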

Gene sequence:

>1659_bases
ATGAATGCTTCGGATGCGTCGGTTTACGACTATATCGTGGTCGGGGCAGGATCCGCGGGCTGTGTGCTCGCGAACAGGTT
ATCGGAAAACCGTCAGCTGCGGATCTTGCTGATCGAGGCGGGCGGCTTGGACTGGAACCCCCTGATCCACATCCCCATGG
GTTGCGGAAAGCTGATCCGGACACACATGCATGGCTGGGGCCTGGTGGCGGAACCGGACGAAGGGCTTCTCGGCCGTCGT
GATCCCTGGCCGCGCGGGCGCGTCCTAGGCGGCACCTCGTCCATCAATGGGATGCTGTACGTTCGCGGAAACCCAAGTGA
CTACGATCTCTGGTCGCAGATGGGGAACCGGGGCTGGGCGTTCGACGACGTCTTCCCCTATTTTCTTCGCTCCGAAGGCA
ATGTCGACCGACGCGACCGCTGGCACGGCAATGACGGGCCGCTGGTCGTCCAGAAAGCGCGGTCGCAGCATCCGCTTTAC
GAAGCATTCGTTGAGAGCGGTGCGGCGGCCGGTTTTCCGCTCAACGATGATTTCAACGGGGCCCGCCAGGAGGGATTTGG
GCGCTACGATTTCACCATTGACCGGGGTCGGCGTTGTTCCTCGGCCGCCGCCTATCTAAACCCGGTGCGAGATCGTCCGA
ACCTGGATGTCATGACCTCCGCGCACGTATCTCGGATTTTGATCGAAGACGGCGCCGCAACTGGCGTGGAGTATCGTAGG
AAGCAGGAAACCAGGCGTGCAAATGCCACACGGGAAGTCATCGTTTCAGCCGGAGCGATTCATTCCCCTGCTATCCTGAT
GCGGTCCGGCATTGGTGATCCCGCCATACTGACCAGGTTCGGCATTCCAGTGCACATGTCACTGCCGGGCGTCGGCAAGA
ACCTTCAGGACCATATTTCCATCTCGGTCCAGTTCGGCTGCAATCGGCCGATCACGCTGCACAGCATGGCCCGTATCGAT
CGAGCGGCATTCATGATGACGCGAGCGGTTCTGTTTCGCACGGGGGAAGGCGCAGTTTTCCCCGCCGAGGCCGGCGCCTA
CACCCGCACCAGGCCCGACCTCGAATACCCGGATCTGGGCTGGGTGTTTTTCCTTGGGCTGGGCTCCTCACGGGTCCGCA
TCCCCTTCCTTTCGGCGCTGCGGCCAGATCCGCTTGAACAGGAGGGGTTCATGGTCAAACTGCTGTTGCTCAGGCCCGAG
AGCCGCGGCGAGATAACCCTCCGTTCGGCTGACCCAGCCGACGCGCCGGTGATCTACGCCAACGCACTTTCCGCGCCAAG
CGACGCGGAAGCCTTGATCAGAGGCGTGGAGCAGGTACGCCTGGTCGCCTCGAAGGCCCCGCTGTCCGAATTTATCAGCA
CCGAGCTCGGCCCTGGTACGGAAGCCGTTTCGTCGGCCCAGATCGAAAAATTCGTCCGCAGCACGGCGACCACCGGACAT
CATCAATCGGGCACATGCAAGATGGGATCGGACCCGATGGCAGTCGTGGACGACGAACTGCGTGTGCATGGCTTGCAGGG
GCTGCGTGTCGTGGACGCATCCATCATGCCCAACATCGTCAGCGGCAACATCAACGCCCCCGTCATGATGATCGCGGAGA
AGGCGTCCGACCTCATTCTCGGACGGGCCGCACGTCCGTTGGAGGCGCGAGCGGCCTAA
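
The gene sequence above is 1659 bases, that is 553 codons: 552 amino acids plus the TAA stop codon. The sketch below translates a DNA string with the standard codon assignments (the bacterial table uses the same codon-to-amino-acid mapping and differs only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGAATGCTTCGGATGCGTCGGTTTACGACTATATCGTGGTCGGGGCA"))
# MNASDASVYDYIVVGA
print(translate("TTGGAGGCGCGAGCGGCCTAA"))
# LEARAA
```

Both fragments agree with the start and end of the protein sequence listed in this record.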

Upstream 100 bases:

>100_bases
ATGACAGAACCGACGCCAACCATTACCCGAGGATCACATCGCTCGTCGGTCCTAATCCGGCGCCGGTCGGCCTCACAGAA
GGCAAATCTGGACTAAATCG

Downstream 100 bases:

>100_bases
CCCGGCGGCCGCGCCGGCAATATCCGGTCGGCGCCAGCAACATAGCTTTCCAGGAGGGTACAAATGGATGACGAGATTGT
CATTCTTCAGGCTACGAGTC

Product: glucose-methanol-choline oxidoreductase

Products: NA

Alternate protein names: CDH; CHD [H]

Number of amino acids: Translated: 552; Mature: 552

Protein sequence:

>552_residues
MNASDASVYDYIVVGAGSAGCVLANRLSENRQLRILLIEAGGLDWNPLIHIPMGCGKLIRTHMHGWGLVAEPDEGLLGRR
DPWPRGRVLGGTSSINGMLYVRGNPSDYDLWSQMGNRGWAFDDVFPYFLRSEGNVDRRDRWHGNDGPLVVQKARSQHPLY
EAFVESGAAAGFPLNDDFNGARQEGFGRYDFTIDRGRRCSSAAAYLNPVRDRPNLDVMTSAHVSRILIEDGAATGVEYRR
KQETRRANATREVIVSAGAIHSPAILMRSGIGDPAILTRFGIPVHMSLPGVGKNLQDHISISVQFGCNRPITLHSMARID
RAAFMMTRAVLFRTGEGAVFPAEAGAYTRTRPDLEYPDLGWVFFLGLGSSRVRIPFLSALRPDPLEQEGFMVKLLLLRPE
SRGEITLRSADPADAPVIYANALSAPSDAEALIRGVEQVRLVASKAPLSEFISTELGPGTEAVSSAQIEKFVRSTATTGH
HQSGTCKMGSDPMAVVDDELRVHGLQGLRVVDASIMPNIVSGNINAPVMMIAEKASDLILGRAARPLEARAA

Sequences:

>Translated_552_residues
MNASDASVYDYIVVGAGSAGCVLANRLSENRQLRILLIEAGGLDWNPLIHIPMGCGKLIRTHMHGWGLVAEPDEGLLGRR
DPWPRGRVLGGTSSINGMLYVRGNPSDYDLWSQMGNRGWAFDDVFPYFLRSEGNVDRRDRWHGNDGPLVVQKARSQHPLY
EAFVESGAAAGFPLNDDFNGARQEGFGRYDFTIDRGRRCSSAAAYLNPVRDRPNLDVMTSAHVSRILIEDGAATGVEYRR
KQETRRANATREVIVSAGAIHSPAILMRSGIGDPAILTRFGIPVHMSLPGVGKNLQDHISISVQFGCNRPITLHSMARID
RAAFMMTRAVLFRTGEGAVFPAEAGAYTRTRPDLEYPDLGWVFFLGLGSSRVRIPFLSALRPDPLEQEGFMVKLLLLRPE
SRGEITLRSADPADAPVIYANALSAPSDAEALIRGVEQVRLVASKAPLSEFISTELGPGTEAVSSAQIEKFVRSTATTGH
HQSGTCKMGSDPMAVVDDELRVHGLQGLRVVDASIMPNIVSGNINAPVMMIAEKASDLILGRAARPLEARAA
>Mature_552_residues
MNASDASVYDYIVVGAGSAGCVLANRLSENRQLRILLIEAGGLDWNPLIHIPMGCGKLIRTHMHGWGLVAEPDEGLLGRR
DPWPRGRVLGGTSSINGMLYVRGNPSDYDLWSQMGNRGWAFDDVFPYFLRSEGNVDRRDRWHGNDGPLVVQKARSQHPLY
EAFVESGAAAGFPLNDDFNGARQEGFGRYDFTIDRGRRCSSAAAYLNPVRDRPNLDVMTSAHVSRILIEDGAATGVEYRR
KQETRRANATREVIVSAGAIHSPAILMRSGIGDPAILTRFGIPVHMSLPGVGKNLQDHISISVQFGCNRPITLHSMARID
RAAFMMTRAVLFRTGEGAVFPAEAGAYTRTRPDLEYPDLGWVFFLGLGSSRVRIPFLSALRPDPLEQEGFMVKLLLLRPE
SRGEITLRSADPADAPVIYANALSAPSDAEALIRGVEQVRLVASKAPLSEFISTELGPGTEAVSSAQIEKFVRSTATTGH
HQSGTCKMGSDPMAVVDDELRVHGLQGLRVVDASIMPNIVSGNINAPVMMIAEKASDLILGRAARPLEARAA

Specific function: Can catalyze the oxidation of choline to betaine aldehyde and betaine aldehyde to glycine betaine [H]

COG id: COG2303

COG function: function code E; Choline dehydrogenase and related flavoproteins

Gene ontology:

Cell location: Membrane-Bound [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the GMC oxidoreductase family [H]

Homologues:

Organism=Homo sapiens, GI217272839, Length=547, Percent_Identity=44.9725776965265, Blast_Score=451, Evalue=1e-126,
Organism=Escherichia coli, GI1786503, Length=545, Percent_Identity=39.2660550458716, Blast_Score=390, Evalue=1e-110,
Organism=Caenorhabditis elegans, GI17532301, Length=552, Percent_Identity=38.4057971014493, Blast_Score=365, Evalue=1e-101,
Organism=Drosophila melanogaster, GI24642042, Length=564, Percent_Identity=40.6028368794326, Blast_Score=356, Evalue=2e-98,
Organism=Drosophila melanogaster, GI24642048, Length=563, Percent_Identity=38.0106571936057, Blast_Score=347, Evalue=1e-95,
Organism=Drosophila melanogaster, GI24642059, Length=568, Percent_Identity=39.4366197183099, Blast_Score=346, Evalue=3e-95,
Organism=Drosophila melanogaster, GI24650267, Length=569, Percent_Identity=39.1915641476274, Blast_Score=333, Evalue=2e-91,
Organism=Drosophila melanogaster, GI24642055, Length=578, Percent_Identity=38.0622837370242, Blast_Score=323, Evalue=1e-88,
Organism=Drosophila melanogaster, GI45549471, Length=563, Percent_Identity=37.6554174067496, Blast_Score=318, Evalue=7e-87,
Organism=Drosophila melanogaster, GI45551458, Length=563, Percent_Identity=37.6554174067496, Blast_Score=317, Evalue=1e-86,
Organism=Drosophila melanogaster, GI17137792, Length=551, Percent_Identity=39.3829401088929, Blast_Score=314, Evalue=8e-86,
Organism=Drosophila melanogaster, GI18859995, Length=561, Percent_Identity=38.5026737967914, Blast_Score=309, Evalue=3e-84,
Organism=Drosophila melanogaster, GI24642037, Length=569, Percent_Identity=36.0281195079086, Blast_Score=301, Evalue=7e-82,
Organism=Drosophila melanogaster, GI24642039, Length=566, Percent_Identity=37.8091872791519, Blast_Score=297, Evalue=2e-80,
Organism=Drosophila melanogaster, GI24642035, Length=579, Percent_Identity=36.6148531951641, Blast_Score=296, Evalue=2e-80,
Organism=Drosophila melanogaster, GI24642051, Length=569, Percent_Identity=35.1493848857645, Blast_Score=289, Evalue=4e-78,
Organism=Drosophila melanogaster, GI18859993, Length=578, Percent_Identity=33.9100346020761, Blast_Score=258, Evalue=8e-69,
Organism=Drosophila melanogaster, GI24645930, Length=573, Percent_Identity=31.064572425829, Blast_Score=179, Evalue=6e-45,
Organism=Drosophila melanogaster, GI24642057, Length=589, Percent_Identity=28.8624787775891, Blast_Score=172, Evalue=5e-43,
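
Each homologue line is a comma-separated list of `key=value` fields, plus a bare `GI` identifier. The sketch below turns one such line into a dictionary with numeric fields converted; the function name `parse_homologue` and the output key `GI` are illustrative choices, and the parser assumes no commas occur inside field values.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    # Convert the numeric fields used in this record.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1786503, Length=545, "
        "Percent_Identity=39.2660550458716, Blast_Score=390, Evalue=1e-110,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
# Escherichia coli 1786503 545 390
```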

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011533
- InterPro:   IPR012132
- InterPro:   IPR000172
- InterPro:   IPR007867 [H]

Pfam domain/function: PF05199 GMC_oxred_C; PF00732 GMC_oxred_N [H]

EC number: 1.1.99.1 [H]

Molecular weight: Translated: 60031; Mature: 60031

Theoretical pI: Translated: 7.41; Mature: 7.41

Prosite motif: PS00623 GMC_OXRED_1; PS00624 GMC_OXRED_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)
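
The percentages above are residue counts divided by the protein length. A minimal sketch of that calculation follows, assuming one-letter amino acid codes and rounding to one decimal as in the record; `residue_percent` is an illustrative name and the example sequence is a toy string, not the protein from this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("sequence is empty")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


toy = "MCAAAAAAAA"  # toy 10-residue sequence: one Met, one Cys
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```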

Secondary structure:

>Translated Secondary Structure
MNASDASVYDYIVVGAGSAGCVLANRLSENRQLRILLIEAGGLDWNPLIHIPMGCGKLIR
CCCCCCCEEEEEEEECCCCCHHHHHHHCCCCEEEEEEEEECCCCCCCEEEECCCHHHHHH
THMHGWGLVAEPDEGLLGRRDPWPRGRVLGGTSSINGMLYVRGNPSDYDLWSQMGNRGWA
HHHCCCCEEECCCCCCCCCCCCCCCCEEECCCCCCCEEEEEECCCCCHHHHHHHCCCCCC
FDDVFPYFLRSEGNVDRRDRWHGNDGPLVVQKARSQHPLYEAFVESGAAAGFPLNDDFNG
HHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCC
ARQEGFGRYDFTIDRGRRCSSAAAYLNPVRDRPNLDVMTSAHVSRILIEDGAATGVEYRR
HHHCCCCCEEEEECCCCCCHHHHHHHCCCCCCCCCEEEEHHHHEEEEEECCCCCCHHHHH
KQETRRANATREVIVSAGAIHSPAILMRSGIGDPAILTRFGIPVHMSLPGVGKNLQDHIS
HHHHHHHCCHHHHHEECCCCCCCHHHHHCCCCCCHHHHHCCCCEEEECCCCCCCCCCEEE
ISVQFGCNRPITLHSMARIDRAAFMMTRAVLFRTGEGAVFPAEAGAYTRTRPDLEYPDLG
EEEEECCCCCEEHHHHHHHHHHHHHHHHHHHEECCCCCEEECCCCCCCCCCCCCCCCCCC
WVFFLGLGSSRVRIPFLSALRPDPLEQEGFMVKLLLLRPESRGEITLRSADPADAPVIYA
EEEEEECCCCEEEECCHHHCCCCCCCCCCEEEEEEEECCCCCCCEEEECCCCCCCCEEEE
NALSAPSDAEALIRGVEQVRLVASKAPLSEFISTELGPGTEAVSSAQIEKFVRSTATTGH
ECCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCC
HQSGTCKMGSDPMAVVDDELRVHGLQGLRVVDASIMPNIVSGNINAPVMMIAEKASDLIL
CCCCCEECCCCCCEEECCCCEEECCCCEEEEEHHHCCHHHCCCCCCCEEEEEHHHHHHEE
GRAARPLEARAA
CCCCCCCCCCCC
>Mature Secondary Structure
MNASDASVYDYIVVGAGSAGCVLANRLSENRQLRILLIEAGGLDWNPLIHIPMGCGKLIR
CCCCCCCEEEEEEEECCCCCHHHHHHHCCCCEEEEEEEEECCCCCCCEEEECCCHHHHHH
THMHGWGLVAEPDEGLLGRRDPWPRGRVLGGTSSINGMLYVRGNPSDYDLWSQMGNRGWA
HHHCCCCEEECCCCCCCCCCCCCCCCEEECCCCCCCEEEEEECCCCCHHHHHHHCCCCCC
FDDVFPYFLRSEGNVDRRDRWHGNDGPLVVQKARSQHPLYEAFVESGAAAGFPLNDDFNG
HHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCC
ARQEGFGRYDFTIDRGRRCSSAAAYLNPVRDRPNLDVMTSAHVSRILIEDGAATGVEYRR
HHHCCCCCEEEEECCCCCCHHHHHHHCCCCCCCCCEEEEHHHHEEEEEECCCCCCHHHHH
KQETRRANATREVIVSAGAIHSPAILMRSGIGDPAILTRFGIPVHMSLPGVGKNLQDHIS
HHHHHHHCCHHHHHEECCCCCCCHHHHHCCCCCCHHHHHCCCCEEEECCCCCCCCCCEEE
ISVQFGCNRPITLHSMARIDRAAFMMTRAVLFRTGEGAVFPAEAGAYTRTRPDLEYPDLG
EEEEECCCCCEEHHHHHHHHHHHHHHHHHHHEECCCCCEEECCCCCCCCCCCCCCCCCCC
WVFFLGLGSSRVRIPFLSALRPDPLEQEGFMVKLLLLRPESRGEITLRSADPADAPVIYA
EEEEEECCCCEEEECCHHHCCCCCCCCCCEEEEEEEECCCCCCCEEEECCCCCCCCEEEE
NALSAPSDAEALIRGVEQVRLVASKAPLSEFISTELGPGTEAVSSAQIEKFVRSTATTGH
ECCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCC
HQSGTCKMGSDPMAVVDDELRVHGLQGLRVVDASIMPNIVSGNINAPVMMIAEKASDLIL
CCCCCEECCCCCCEEECCCCEEECCCCEEEEEHHHCCHHHCCCCCCCEEEEEHHHHHHEE
GRAARPLEARAA
CCCCCCCCCCCC
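
Each sequence line above is paired with a prediction line of the same length. The sketch below summarises such a prediction string as percentages per state; it assumes the common convention H = helix, E = strand, C = coil, which the record does not state explicitly, and `ss_composition` is an illustrative name.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    if not prediction:
        raise ValueError("prediction is empty")
    counts = Counter(prediction.upper())
    total = len(prediction)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}


# Final prediction line of this record (12 residues, all coil).
print(ss_composition("CCCCCCCCCCCC"))  # {'H': 0.0, 'E': 0.0, 'C': 100.0}
```

Concatenating all prediction lines of one block before calling the function gives the composition for the whole protein.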

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA