Definition Bacillus cereus AH820, complete genome.
Accession NC_011773
Length 5,302,683

The map label for this gene is mmgA [H]

Identifier: 218906523

GI number: 218906523

Start: 5132003

End: 5133286

Strand: Reverse

Name: mmgA [H]

Synonym: BCAH820_5437

Alternate gene names: 218906523

Gene position: 5133286-5132003 (Counterclockwise)

Preceding gene: 218906524

Following gene: 218906522

Centisome position: 96.81
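
The coordinate fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates, and it assumes the centisome position is the gene's own start (5133286 for this reverse-strand gene) expressed as a percentage of the genome length. The function names are hypothetical and not part of the source database.

```python
GENOME_LENGTH = 5_302_683           # "Length" field of NC_011773
START, END = 5_132_003, 5_133_286   # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)

print(gene_length(START, END))         # 1284
print(centisome(END, GENOME_LENGTH))   # 96.81
```

Under these assumptions the results agree with the 1284-base gene sequence and the reported centisome position of 96.81.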

GC content: 44.63

Gene sequence:

>1284_bases
ATGAATACAACTTTAGAGAGAGGTGCAATCATTGCACCTCTTCTCAGCGTAACAATGCCGAGCAAGCGCTCGGTGCTAAA
AAAAGAAATGGGGGAAAGAAACATGAGTAAAACAGTTATTTTAAGTGCTGCAAGAACACCGGTGGGGAAATTTGGAGGAT
CTTTAAAAGATGTAAAAGCAACAGAACTTGGAGGAATCGCAATTAAAGGAGCGCTTGAAAGAGCGAATGTTTCAGCTAGT
GATGTTGAGGAAGTTATATTCGGAACGGTTATTCAAGGTGGACAGGGACAAATTCCATCACGCCAAGCTGCAAGGGCTGC
TGGAATCCCTTGGGAAGTGCAGACGGAAACTGTCAATAAAGTTTGTGCATCAGGGCTTCGTGCGGTGACTTTAGCGGATC
AAATTATTCGTACTGGCGATCAATCACTAATTGTAGCTGGTGGTATGGAGTCGATGAGTAACAGTCCTTACATTTTACGA
GGAGCAAGATGGGGATACAGAATGGGTAACAACGAAGTCATTGATTTAAACGTTGCTGACGGTTTAACATGCGCATTTTC
AGGTATACACATGGGTGTTTATGGCGGAGAAGTTGCGAAGGAAGATGGGATTTCCCGCGAAGCGCAAGATGAATGGGCAT
ATCGTAGCCATCAGCGTGCAGTTTCAGCGAATAAAGAAGGACGTTTTGAAGAGGAAATCGTGCCAGTAACGATTCCACAA
AGAAAAGGCGATCCTATTGTCGTTGCAAAGGATGAGGCGCCGCGTGAAGATACAACAATTGAAAAACTAGCAAAATTAAA
ACCTGTATTTGATAAGACAGCGACGGTGACAGCTGGTAATGCACCAGGGCTAAACGATGGTGGCGCTGCACTTGTATTAA
TGAGCGAAGACAGAGCGAAGCAAGAAGGAAGAAAGCCGTTAGCGACAATTTTGGCACATACAGCAATTGCAGTGGAGTCT
AAAGATTTCCCGAGGACGCCAGGTTATGCAATTAACGCATTGCTTGAAAAAACAGGCAAGAAAATTGAAGACATCGATTT
ATTTGAGATTAATGAAGCATTTGCAGCGGTAGCAATTGCAAGTACAGAAATCGCAGGAATTGACCCAGAAAAATTGAATG
TAAATGGCGGCGCAGTGGCGATGGGACACCCGATTGGAGCAAGCGGAGCGCGCATTATCGTTACATTAATCCATGCACTT
AAGCAACGCGGCGGTGGAATTGGAATTGCTTCGATTTGTAGTGGTGGCGGTCAAGGCGATGCAGTGATGATTGAAGTTCA
CTAA
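
The GC content field is the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the short input is only the first nine bases of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a DNA sequence."""
    # Keep only nucleotide letters so FASTA line breaks are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in bases)
    return round(100 * gc / len(bases), 2)

print(gc_content("ATGAATACA"))  # 22.22 (1 G + 1 C out of 9 bases)
```

Applied to the full 1284-base gene sequence, the same function would be expected to give a value close to the reported 44.63; that figure is taken from the record and is not recomputed here.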

Upstream 100 bases:

>100_bases
AGCGAAAGAAGTAGAAGAGAAGGTTCAAACGCTTGATGTAACAGAGATTTTAGAACGCTCTGTTATCGGACAGAAGAAAG
AAGCGATGTAATCTGCAAGC

Downstream 100 bases:

>100_bases
TTAAGATTAGAAGGTATATCGTTTATTAAAAATTAAGGGGGAAGAAGAAATGGGTGTACAAAAAATTGTTGTAATTGGTG
CAGGACAAATGGGGTCAGGA

Product: acetyl-CoA acetyltransferase

Products: NA

Alternate protein names: Acetoacetyl-CoA thiolase [H]

Number of amino acids: Translated: 427; Mature: 427

Protein sequence:

>427_residues
MNTTLERGAIIAPLLSVTMPSKRSVLKKEMGERNMSKTVILSAARTPVGKFGGSLKDVKATELGGIAIKGALERANVSAS
DVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQTETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILR
GARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAYRSHQRAVSANKEGRFEEEIVPVTIPQ
RKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTATVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVES
KDFPRTPGYAINALLEKTGKKIEDIDLFEINEAFAAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHAL
KQRGGGIGIASICSGGGQGDAVMIEVH
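
The 1284-base gene sequence corresponds to 428 codons: 427 residues followed by a terminal stop codon (TAA). The sketch below shows how such a translation can be reproduced with the standard codon assignments; it is an illustration with hypothetical names, not the tool that generated this record.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop whitespace and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the gene sequence in this record
print(translate("ATGAATACAACTTTAGAGAGAGGT"))  # MNTTLERG
print(1284 // 3 - 1)                          # 427 residues once the stop codon is excluded
```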

Sequences:

>Translated_427_residues
MNTTLERGAIIAPLLSVTMPSKRSVLKKEMGERNMSKTVILSAARTPVGKFGGSLKDVKATELGGIAIKGALERANVSAS
DVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQTETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILR
GARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAYRSHQRAVSANKEGRFEEEIVPVTIPQ
RKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTATVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVES
KDFPRTPGYAINALLEKTGKKIEDIDLFEINEAFAAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHAL
KQRGGGIGIASICSGGGQGDAVMIEVH
>Mature_427_residues
MNTTLERGAIIAPLLSVTMPSKRSVLKKEMGERNMSKTVILSAARTPVGKFGGSLKDVKATELGGIAIKGALERANVSAS
DVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQTETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILR
GARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAYRSHQRAVSANKEGRFEEEIVPVTIPQ
RKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTATVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVES
KDFPRTPGYAINALLEKTGKKIEDIDLFEINEAFAAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHAL
KQRGGGIGIASICSGGGQGDAVMIEVH

Specific function: SHORT-CHAIN FATTY ACIDS METABOLISM. [C]

COG id: COG0183

COG function: function code I; Acetyl-CoA acetyltransferase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiolase family [H]

Homologues:

Organism=Homo sapiens, GI4557237, Length=415, Percent_Identity=46.7469879518072, Blast_Score=356, Evalue=3e-98,
Organism=Homo sapiens, GI148539872, Length=389, Percent_Identity=48.0719794344473, Blast_Score=338, Evalue=5e-93,
Organism=Homo sapiens, GI167614485, Length=391, Percent_Identity=39.386189258312, Blast_Score=274, Evalue=1e-73,
Organism=Homo sapiens, GI4501853, Length=393, Percent_Identity=37.6590330788804, Blast_Score=223, Evalue=2e-58,
Organism=Homo sapiens, GI4504327, Length=449, Percent_Identity=32.5167037861915, Blast_Score=207, Evalue=1e-53,
Organism=Homo sapiens, GI194353979, Length=391, Percent_Identity=29.923273657289, Blast_Score=134, Evalue=2e-31,
Organism=Escherichia coli, GI1788554, Length=392, Percent_Identity=50, Blast_Score=351, Evalue=6e-98,
Organism=Escherichia coli, GI87082165, Length=391, Percent_Identity=45.2685421994885, Blast_Score=337, Evalue=8e-94,
Organism=Escherichia coli, GI1787663, Length=400, Percent_Identity=37.75, Blast_Score=245, Evalue=3e-66,
Organism=Escherichia coli, GI48994986, Length=399, Percent_Identity=35.8395989974937, Blast_Score=216, Evalue=3e-57,
Organism=Escherichia coli, GI1788683, Length=414, Percent_Identity=30.9178743961353, Blast_Score=193, Evalue=2e-50,
Organism=Caenorhabditis elegans, GI25147385, Length=386, Percent_Identity=44.300518134715, Blast_Score=313, Evalue=1e-85,
Organism=Caenorhabditis elegans, GI133906874, Length=389, Percent_Identity=43.1876606683805, Blast_Score=292, Evalue=2e-79,
Organism=Caenorhabditis elegans, GI17535921, Length=393, Percent_Identity=40.2035623409669, Blast_Score=254, Evalue=9e-68,
Organism=Caenorhabditis elegans, GI17551802, Length=434, Percent_Identity=33.8709677419355, Blast_Score=209, Evalue=3e-54,
Organism=Caenorhabditis elegans, GI17535917, Length=396, Percent_Identity=34.8484848484849, Blast_Score=205, Evalue=3e-53,
Organism=Saccharomyces cerevisiae, GI6325229, Length=397, Percent_Identity=45.8438287153652, Blast_Score=330, Evalue=3e-91,
Organism=Saccharomyces cerevisiae, GI6322031, Length=423, Percent_Identity=34.9881796690307, Blast_Score=209, Evalue=8e-55,
Organism=Drosophila melanogaster, GI24655093, Length=391, Percent_Identity=47.3145780051151, Blast_Score=352, Evalue=2e-97,
Organism=Drosophila melanogaster, GI24640423, Length=404, Percent_Identity=44.3069306930693, Blast_Score=324, Evalue=9e-89,
Organism=Drosophila melanogaster, GI17648125, Length=388, Percent_Identity=39.6907216494845, Blast_Score=254, Evalue=7e-68,
Organism=Drosophila melanogaster, GI17137578, Length=433, Percent_Identity=30.715935334873, Blast_Score=174, Evalue=1e-43,
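
Each homologue line above is a comma-separated list of fields, most of them in `key=value` form, with the GI number given as a bare `GI…` token. A minimal parsing sketch follows, assuming that layout holds for every line; the function name and the dictionary keys are illustrative choices.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]     # bare token, e.g. GI1788554
    # Convert the numeric fields; the organism name and GI stay as strings.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record

hit = parse_homologue(
    "Organism=Escherichia coli, GI1788554, Length=392, "
    "Percent_Identity=50, Blast_Score=351, Evalue=6e-98,"
)
print(hit["Organism"], hit["GI"], hit["Percent_Identity"])  # Escherichia coli 1788554 50.0
```

With the lines parsed this way, hits can be sorted or filtered by `Blast_Score` or `Evalue` without further string handling.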

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002155
- InterPro:   IPR016039
- InterPro:   IPR016038
- InterPro:   IPR020615
- InterPro:   IPR020610
- InterPro:   IPR020617
- InterPro:   IPR020613
- InterPro:   IPR020616 [H]

Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N [H]

EC number: 2.3.1.9 [H]

Molecular weight: Translated: 44882; Mature: 44882

Theoretical pI: Translated: 6.53; Mature: 6.53

Prosite motif: PS00098 THIOLASE_1; PS00737 THIOLASE_2; PS00099 THIOLASE_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
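
The Cys/Met figures are residue counts expressed as a percentage of the protein length. A small sketch of the calculation, using a hypothetical helper and a toy ten-residue input rather than the full protein:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    seq = "".join(protein.split()).upper()   # ignore line breaks
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(aa in residues for aa in seq)
    return round(100 * hits / len(seq), 1)

toy = "MCAAAAAAAA"                   # 1 Met, 1 Cys, 8 Ala
print(residue_percent(toy, "C"))     # 10.0
print(residue_percent(toy, "M"))     # 10.0
print(residue_percent(toy, "CM"))    # 20.0
```

Applied to the 427-residue sequence in this record, the same helper should reproduce the 0.7, 2.6 and 3.3 values listed above.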

Secondary structure:

>Translated Secondary Structure
MNTTLERGAIIAPLLSVTMPSKRSVLKKEMGERNMSKTVILSAARTPVGKFGGSLKDVKA
CCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEEEECCCCHHHHCCCCCCCCH
TELGGIAIKGALERANVSASDVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQTETVNK
HHCCCEEEECHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCEEEHHHHHH
VCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILRGARWGYRMGNNEVIDLNVAD
HHHCCCHHHHHHHHHHHCCCCEEEEECCHHHCCCCCEEEEECCCCEECCCCCEEEEECCC
GLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAYRSHQRAVSANKEGRFEEEIVPVTIPQ
CCEEEECCEEEEECCCCHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECC
RKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTATVTAGNAPGLNDGGAALVLMSEDRAK
CCCCEEEEECCCCCCCCHHHHHHHHHCHHHCCCEEEECCCCCCCCCCCEEEEEECCCHHH
QEGRKPLATILAHTAIAVESKDFPRTPGYAINALLEKTGKKIEDIDLFEINEAFAAVAIA
HHCCCHHHHHHHHHHHEEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEHHHHHHEEEEE
STEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHALKQRGGGIGIASICSGGGQGD
CCEECCCCCCEEECCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEHHHHCCCCCCC
AVMIEVH
EEEEEEC
>Mature Secondary Structure
MNTTLERGAIIAPLLSVTMPSKRSVLKKEMGERNMSKTVILSAARTPVGKFGGSLKDVKA
CCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEEEECCCCHHHHCCCCCCCCH
TELGGIAIKGALERANVSASDVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQTETVNK
HHCCCEEEECHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCEEEHHHHHH
VCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILRGARWGYRMGNNEVIDLNVAD
HHHCCCHHHHHHHHHHHCCCCEEEEECCHHHCCCCCEEEEECCCCEECCCCCEEEEECCC
GLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAYRSHQRAVSANKEGRFEEEIVPVTIPQ
CCEEEECCEEEEECCCCHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECC
RKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTATVTAGNAPGLNDGGAALVLMSEDRAK
CCCCEEEEECCCCCCCCHHHHHHHHHCHHHCCCEEEECCCCCCCCCCCEEEEEECCCHHH
QEGRKPLATILAHTAIAVESKDFPRTPGYAINALLEKTGKKIEDIDLFEINEAFAAVAIA
HHCCCHHHHHHHHHHHEEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEHHHHHHEEEEE
STEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHALKQRGGGIGIASICSGGGQGD
CCEECCCCCCEEECCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEHHHHCCCCCCC
AVMIEVH
EEEEEEC
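
In the secondary structure blocks, each residue line is followed by a line of per-residue states; by the usual convention, assumed here, H is helix, E is strand and C is coil. The sketch below collapses such a state string into contiguous segments with 1-based residue positions. The function name is hypothetical.

```python
from itertools import groupby

def segments(states: str) -> list:
    """Collapse a per-residue state string into (state, first, last) runs, 1-based."""
    result = []
    pos = 1
    for state, run in groupby(states):
        n = len(list(run))               # length of this run of identical states
        result.append((state, pos, pos + n - 1))
        pos += n
    return result

print(segments("CCHHHEEC"))
# [('C', 1, 2), ('H', 3, 5), ('E', 6, 7), ('C', 8, 8)]
```

Concatenating the state lines of one block before calling `segments` gives positions relative to the full 427-residue protein.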

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8759838; 8969508; 9384377 [H]