Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

The map label for this gene is mmgA [H]

Identifier: 49188179

GI number: 49188179

Start: 5073470

End: 5074651

Strand: Reverse

Name: mmgA [H]

Synonym: BAS5193

Alternate gene names: 49188179

Gene position: 5074651-5073470 (Counterclockwise)

Preceding gene: 49188180

Following gene: 49188178

Centisome position: 97.05
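
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the first base of the gene in reading direction (the higher coordinate on the reverse strand) expressed as a percentage of the genome length. The record does not state that definition, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_228_663            # "Length" field of NC_005945
START, END = 5_073_470, 5_074_651    # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))         # 1182, matching the ">1182_bases" header
print(centisome(END, GENOME_LENGTH))   # 97.05
```

Using the lower coordinate (5073470) instead gives 97.03, which is why the higher coordinate is assumed to be the reference point here.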

GC content: 44.59

Gene sequence:

>1182_bases
ATGAGTAAAACAGTTATTTTAAGTGCTGCAAGAACACCGGTGGGGAAATTTGGAGGATCTTTAAAAGATGTAAAAGCAAC
AGAACTTGGAGGAATCGCAATTAAAGCAGCGCTTGAAAGAGCGAATGTTTCAGCTAGTGATGTTGAGGAAGTTATATTCG
GAACGGTTATTCAAGGTGGACAGGGACAAATTCCATCACGCCAAGCTGCAAGGGCTGCTGGAATCCCTTGGGAAGTGCAG
ACGGAAACTGTCAATAAAGTTTGTGCATCAGGGCTTCGTGCGGTGACTTTAGCGGATCAAATTATTCGTACTGGCGATCA
ATCACTAATTGTAGCTGGTGGTATGGAGTCGATGAGTAACAGTCCTTACATTTTACGAGGAGCAAGATGGGGATACAGAA
TGGGTAACAACGAAGTCATTGATTTAAACGTTGCTGACGGTTTAACATGCGCATTTTCAGGTATACACATGGGTGTTTAT
GGCGGAGAAGTTGCGAAGGAGGATGGAATTTCTCGCGAAGCGCAAGATGAATGGGCATATCGTAGCCATCAGCGTGCAGT
TTCAGCGCATAAAGAAGGACGTTTTGAAGAGGAAATCGTGCCAGTAACGATTCCACAAAGAAAAGGCGATCCTATTGTCG
TTGCAAAGGATGAGGCGCCGCGTGAAGATACAACAATTGAAAAACTAGCAAAATTAAAACCTGTATTTGATAAGACAGCG
ACGGTGACAGCTGGTAATGCACCAGGGCTAAACGATGGTGGCGCTGCACTTGTATTAATGAGCGAAGACAGAGCGAAGCA
AGAAGGAAGAAAGCCGTTAGCGACAATTTTGGCACATACAGCAATTGCAGTGGAGTCTAAAGATTTCCCGAGGACGCCAG
GTTATGCAATTAACGCATTGCTTGAAAAAACAGGAAAGAAAATTGAAGACATCGATTTATTTGAGATTAATGAAGCATTT
GCAGCGGTAGCAATTGCAAGTACAGAAATCGCAGGAATTGACCCAGAAAAATTGAATGTAAATGGCGGCGCAGTGGCGAT
GGGACATCCGATTGGAGCAAGCGGAGCGCGCATTATCGTTACACTAATCCATGCACTTAAGCAACGCGGCGGTGGAATTG
GAATTGCTTCGATTTGTAGTGGTGGCGGTCAAGGCGATGCAGTGATGATTGAAGTTCACTAA
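
A GC content figure such as the one reported above is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch under that assumption; the function name is hypothetical, and the record does not say how its own value was computed.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# The first 18 bases of the gene sequence, as a short demonstration.
print(round(gc_content("ATGAGTAAAACAGTTATT"), 2))   # 22.22
```

Passing the full 1182-base sequence (with the FASTA header line removed) should allow a comparison with the reported GC content field.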

Upstream 100 bases:

>100_bases
TGAATACAACTTTAGAGAGAGGTGCAATCATTGCACCTCTTCTCAGCGTAACAATGCCGAGCAAGCGCTCGGTGCTAAAA
AAGAAATGGGGGAAAGAAAC

Downstream 100 bases:

>100_bases
TTAAGATTAGAAGATATATAGTTTATTAAAAATTAAGGGGGAAGAAGAAATGGGTGTACAAAAAATTGTTGTAATTGGTG
CAGGACAAATGGGGTCAGGA

Product: acetyl-CoA acetyltransferase

Products: NA

Alternate protein names: Acetoacetyl-CoA thiolase [H]

Number of amino acids: Translated: 393; Mature: 392

Protein sequence:

>393_residues
MSKTVILSAARTPVGKFGGSLKDVKATELGGIAIKAALERANVSASDVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQ
TETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILRGARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVY
GGEVAKEDGISREAQDEWAYRSHQRAVSAHKEGRFEEEIVPVTIPQRKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTA
TVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVESKDFPRTPGYAINALLEKTGKKIEDIDLFEINEAF
AAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHALKQRGGGIGIASICSGGGQGDAVMIEVH
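
The relationship between the 1182-base gene sequence and the 393-residue protein (394 codons, the last being a stop) can be illustrated with a small translation routine. This sketch assumes the standard codon assignments, which for sense and stop codons are shared by the bacterial translation table 11, and treats the mature form as the translated form without its initiator methionine, as the two sequences in this record suggest. All names below are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature_form(protein: str) -> str:
    """Drop a leading initiator methionine (assumed processing step)."""
    return protein[1:] if protein.startswith("M") else protein


print(translate("ATGAGTAAAACAGTTATT"))   # MSKTVI (first six codons of the gene)
print(translate("GAAGTTCACTAA"))         # EVH (last three codons, then the TAA stop)
print(mature_form("MSKTVI"))             # SKTVI
```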

Sequences:

>Translated_393_residues
MSKTVILSAARTPVGKFGGSLKDVKATELGGIAIKAALERANVSASDVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQ
TETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILRGARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVY
GGEVAKEDGISREAQDEWAYRSHQRAVSAHKEGRFEEEIVPVTIPQRKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTA
TVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVESKDFPRTPGYAINALLEKTGKKIEDIDLFEINEAF
AAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHALKQRGGGIGIASICSGGGQGDAVMIEVH
>Mature_392_residues
SKTVILSAARTPVGKFGGSLKDVKATELGGIAIKAALERANVSASDVEEVIFGTVIQGGQGQIPSRQAARAAGIPWEVQT
ETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSNSPYILRGARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVYG
GEVAKEDGISREAQDEWAYRSHQRAVSAHKEGRFEEEIVPVTIPQRKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTAT
VTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVESKDFPRTPGYAINALLEKTGKKIEDIDLFEINEAFA
AVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIVTLIHALKQRGGGIGIASICSGGGQGDAVMIEVH

Specific function: Short-chain fatty acid metabolism. [C]

COG id: COG0183

COG function: function code I; Acetyl-CoA acetyltransferase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiolase family [H]

Homologues:

Organism=Homo sapiens, GI4557237, Length=392, Percent_Identity=47.9591836734694, Blast_Score=348, Evalue=4e-96,
Organism=Homo sapiens, GI148539872, Length=389, Percent_Identity=48.0719794344473, Blast_Score=338, Evalue=4e-93,
Organism=Homo sapiens, GI167614485, Length=391, Percent_Identity=39.386189258312, Blast_Score=273, Evalue=2e-73,
Organism=Homo sapiens, GI4501853, Length=393, Percent_Identity=37.9134860050891, Blast_Score=225, Evalue=7e-59,
Organism=Homo sapiens, GI4504327, Length=430, Percent_Identity=32.7906976744186, Blast_Score=197, Evalue=1e-50,
Organism=Homo sapiens, GI194353979, Length=391, Percent_Identity=30.1790281329923, Blast_Score=136, Evalue=4e-32,
Organism=Escherichia coli, GI1788554, Length=392, Percent_Identity=50.2551020408163, Blast_Score=351, Evalue=4e-98,
Organism=Escherichia coli, GI87082165, Length=391, Percent_Identity=45.5242966751918, Blast_Score=338, Evalue=4e-94,
Organism=Escherichia coli, GI1787663, Length=400, Percent_Identity=37.75, Blast_Score=246, Evalue=2e-66,
Organism=Escherichia coli, GI48994986, Length=399, Percent_Identity=35.8395989974937, Blast_Score=215, Evalue=4e-57,
Organism=Escherichia coli, GI1788683, Length=410, Percent_Identity=30.9756097560976, Blast_Score=193, Evalue=2e-50,
Organism=Caenorhabditis elegans, GI25147385, Length=386, Percent_Identity=44.559585492228, Blast_Score=315, Evalue=4e-86,
Organism=Caenorhabditis elegans, GI133906874, Length=387, Percent_Identity=43.6692506459948, Blast_Score=293, Evalue=9e-80,
Organism=Caenorhabditis elegans, GI17535921, Length=393, Percent_Identity=40.4580152671756, Blast_Score=256, Evalue=2e-68,
Organism=Caenorhabditis elegans, GI17535917, Length=396, Percent_Identity=35.6060606060606, Blast_Score=206, Evalue=2e-53,
Organism=Caenorhabditis elegans, GI17551802, Length=429, Percent_Identity=34.2657342657343, Blast_Score=206, Evalue=2e-53,
Organism=Saccharomyces cerevisiae, GI6325229, Length=397, Percent_Identity=45.5919395465995, Blast_Score=327, Evalue=2e-90,
Organism=Saccharomyces cerevisiae, GI6322031, Length=398, Percent_Identity=35.929648241206, Blast_Score=206, Evalue=4e-54,
Organism=Drosophila melanogaster, GI24655093, Length=391, Percent_Identity=47.3145780051151, Blast_Score=353, Evalue=8e-98,
Organism=Drosophila melanogaster, GI24640423, Length=392, Percent_Identity=45.1530612244898, Blast_Score=325, Evalue=3e-89,
Organism=Drosophila melanogaster, GI17648125, Length=388, Percent_Identity=39.9484536082474, Blast_Score=256, Evalue=3e-68,
Organism=Drosophila melanogaster, GI17137578, Length=426, Percent_Identity=30.9859154929577, Blast_Score=172, Evalue=4e-43,
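
The homologue entries above follow a regular comma-separated `key=value` layout, with a bare `GI…` identifier and a trailing comma. A hedged sketch of how one such line might be parsed into a typed record is shown below; the function name and the choice of a plain dictionary are assumptions made for illustration, not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]   # keep the identifier as a string
    # Convert the numeric fields; Evalue strings such as "4e-98" parse as floats.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1788554, Length=392, "
        "Percent_Identity=50.2551020408163, Blast_Score=351, Evalue=4e-98,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])   # Escherichia coli 1788554 351
```

Parsed records can then be sorted or filtered, for example by `Blast_Score` or by an `Evalue` threshold.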

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002155
- InterPro:   IPR016039
- InterPro:   IPR016038
- InterPro:   IPR020615
- InterPro:   IPR020610
- InterPro:   IPR020617
- InterPro:   IPR020613
- InterPro:   IPR020616 [H]

Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N [H]

EC number: 2.3.1.9 [H]

Molecular weight: Translated: 41164; Mature: 41033

Theoretical pI: Translated: 5.96; Mature: 5.96

Prosite motif: PS00098 THIOLASE_1 ; PS00737 THIOLASE_2 ; PS00099 THIOLASE_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
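
Percentages like those above are simple residue frequencies. The sketch below assumes each value is the count of the named residues divided by the sequence length, rounded to one decimal place; the function name is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Toy sequence: 1 Cys and 2 Met in 4 residues.
print(residue_percent("MACM", "C"))    # 25.0
print(residue_percent("MACM", "M"))    # 50.0
print(residue_percent("MACM", "CM"))   # 75.0
```

A manual tally of the translated sequence in this record gives 3 Cys and 8 Met in 393 residues, which is consistent with the reported 0.8 %, 2.0 % and 2.8 %. Removing the initiator methionine leaves 7 Met in 392 residues, consistent with the mature values of 1.8 % and 2.6 %.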

Secondary structure:

>Translated Secondary Structure
MSKTVILSAARTPVGKFGGSLKDVKATELGGIAIKAALERANVSASDVEEVIFGTVIQGG
CCCEEEEEECCCCHHHHCCCCCCCCHHHCCCCHHEEHHHHCCCCHHHHHHHHHHHHCCCC
QGQIPSRQAARAAGIPWEVQTETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSN
CCCCCCHHHHHHCCCCCEEEHHHHHHHHHCCCHHHHHHHHHHHCCCCEEEEECCHHHCCC
SPYILRGARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAY
CCEEEEECCCCEECCCCCEEEEECCCCCEEEECCEEEEECCCCHHHHCCCCCHHHHHHHH
RSHQRAVSAHKEGRFEEEIVPVTIPQRKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTA
HHHHHHHHHHHCCCCCCCEEEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHCHHHCCCE
TVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVESKDFPRTPGYAINAL
EEECCCCCCCCCCCEEEEEECCCHHHHHCCCHHHHHHHHHHEEEECCCCCCCCCHHHHHH
LEKTGKKIEDIDLFEINEAFAAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIV
HHHHCCCCCCCCEEEHHHHHHEEEEECCEECCCCCCEEECCCCEEEECCCCCCCCHHHHH
TLIHALKQRGGGIGIASICSGGGQGDAVMIEVH
HHHHHHHHCCCCEEEHHHHCCCCCCCEEEEEEC
>Mature Secondary Structure 
SKTVILSAARTPVGKFGGSLKDVKATELGGIAIKAALERANVSASDVEEVIFGTVIQGG
CCEEEEEECCCCHHHHCCCCCCCCHHHCCCCHHEEHHHHCCCCHHHHHHHHHHHHCCCC
QGQIPSRQAARAAGIPWEVQTETVNKVCASGLRAVTLADQIIRTGDQSLIVAGGMESMSN
CCCCCCHHHHHHCCCCCEEEHHHHHHHHHCCCHHHHHHHHHHHCCCCEEEEECCHHHCCC
SPYILRGARWGYRMGNNEVIDLNVADGLTCAFSGIHMGVYGGEVAKEDGISREAQDEWAY
CCEEEEECCCCEECCCCCEEEEECCCCCEEEECCEEEEECCCCHHHHCCCCCHHHHHHHH
RSHQRAVSAHKEGRFEEEIVPVTIPQRKGDPIVVAKDEAPREDTTIEKLAKLKPVFDKTA
HHHHHHHHHHHCCCCCCCEEEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHCHHHCCCE
TVTAGNAPGLNDGGAALVLMSEDRAKQEGRKPLATILAHTAIAVESKDFPRTPGYAINAL
EEECCCCCCCCCCCEEEEEECCCHHHHHCCCHHHHHHHHHHEEEECCCCCCCCCHHHHHH
LEKTGKKIEDIDLFEINEAFAAVAIASTEIAGIDPEKLNVNGGAVAMGHPIGASGARIIV
HHHHCCCCCCCCEEEHHHHHHEEEEECCEECCCCCCEEECCCCEEEECCCCCCCCHHHHH
TLIHALKQRGGGIGIASICSGGGQGDAVMIEVH
HHHHHHHHCCCCEEEHHHHCCCCCCCEEEEEEC
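
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The sketch below shows one way to pair them and summarise the state composition. It assumes the common convention that H, E and C denote helix, strand and coil, which the record itself does not spell out, and the function name is illustrative.

```python
from collections import Counter


def state_fractions(block: str) -> dict:
    """Fraction of residues in each state for an alternating residue/state block."""
    lines = [ln.strip() for ln in block.splitlines()]
    lines = [ln for ln in lines if ln and not ln.startswith(">")]
    residues = "".join(lines[0::2])   # 1st, 3rd, ... lines hold the sequence
    states = "".join(lines[1::2])     # 2nd, 4th, ... lines hold the states
    if not states or len(residues) != len(states):
        raise ValueError("residue and state lines do not line up")
    counts = Counter(states)
    return {state: counts[state] / len(states) for state in "HEC"}


# Toy block in the same layout as the record (not real prediction data).
toy = """>Toy Secondary Structure
MSKT
CCEE
VI
HH
"""
print(state_fractions(toy))
```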

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8759838; 8969508; 9384377 [H]