Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

The map label for this gene is guaB [H]

Identifier: 49183047

GI number: 49183047

Start: 15469

End: 16932

Strand: Direct

Name: guaB [H]

Synonym: BAS0011

Alternate gene names: 49183047

Gene position: 15469-16932 (Clockwise)

Preceding gene: 49183045

Following gene: 49183048

Centisome position: 0.3
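
The gene length and centisome position can be recomputed from the coordinates listed above. The sketch below assumes, since the record does not say so, that the centisome position is the gene start expressed as a percentage of the chromosome length and rounded to one decimal place. The function names are illustrative only.

```python
GENOME_LENGTH = 5_228_663    # "Length" field of NC_005945
START, END = 15_469, 16_932  # "Start" and "End" fields (1-based, inclusive)

def gene_length(start, end):
    """Number of bases in an inclusive coordinate range."""
    return end - start + 1

def centisome(start, genome_length):
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                    # 1464
print(round(centisome(START, GENOME_LENGTH), 1))  # 0.3
```

The length of 1464 bases agrees with the `>1464_bases` header of the gene sequence.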

GC content: 38.52
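
The GC content field is presumably the percentage of G and C bases in the gene sequence; the record does not state how it was calculated. A minimal sketch under that assumption, shown on a toy sequence rather than on guaB itself:

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks are ignored, so a multi-line
    FASTA body can be passed in directly.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# Toy input (not the guaB sequence): 4 of 8 bases are G or C.
print(gc_content("ATGC\nGGAT"))  # 50.0
```

For a 1464-base gene, a value of 38.52 corresponds to roughly 564 G or C bases.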

Gene sequence:

>1464_bases
ATGTGGGAATCTAAATTTGTTAAAGAAGGTCTGACTTTTGATGACGTATTACTTGTACCAGCAAAGTCAGATGTATTACC
AAGAGAAGTAAGTGTTAAAACAGTTTTATCTGAAAGCTTACAGTTAAACATCCCGTTAATTAGTGCAGGAATGGATACAG
TAACAGAAGCGGATATGGCTATTGCAATGGCTCGTCAAGGCGGTTTAGGAATTATTCATAAAAACATGTCTATTGAACAA
CAAGCCGAGCAAGTTGATAAAGTAAAACGTTCTGAAAGTGGCGTTATTTCAGACCCATTCTTTTTAACTCCAGAACATCA
AGTGTATGATGCAGAGCATCTTATGGGAAAATACCGTATCTCAGGTGTACCGGTTGTAAATAATTTAGATGAGCGAAAAT
TAGTTGGTATTATTACAAACCGTGATATGCGTTTTATCCAAGACTACTCAATCAAAATTTCCGACGTAATGACAAAAGAA
CAGCTAATTACAGCTCCAGTTGGTACAACGCTAAGTGAAGCTGAAAAGATCCTACAAAAGTATAAAATTGAAAAACTCCC
TCTTGTTGATAACAACGGTGTATTACAAGGGCTTATTACAATAAAAGATATTGAAAAAGTAATTGAATTCCCAAATTCTG
CGAAGGATAAGCAAGGGCGCTTATTAGTTGGAGCAGCAGTTGGTGTAACGGCTGATGCTATGACTCGTATCGACGCATTA
GTAAAAGCTAGCGTAGATGCAATCGTACTTGATACAGCTCACGGACATTCTCAAGGTGTTATTGATAAAGTAAAAGAAGT
TCGTGCAAAGTATCCATCATTAAATATTATCGCTGGAAATGTTGCTACTGCTGAAGCAACAAAAGCATTAATTGAAGCAG
GTGCAAACGTAGTTAAAGTTGGTATTGGACCAGGTTCTATCTGTACAACACGTGTTGTAGCCGGCGTTGGTGTACCACAA
TTAACAGCGGTTTATGATTGTGCAACAGAAGCTCGTAAACACGGTATTCCAGTTATTGCTGATGGTGGTATTAAATACTC
TGGTGATATGGTTAAAGCTTTAGCAGCAGGAGCACATGTTGTTATGCTAGGCAGTATGTTTGCTGGTGTTGCTGAAAGCC
CTGGTGAAACTGAAATTTATCAAGGTCGCCAATTTAAAGTATATCGCGGTATGGGTTCTGTCGGAGCGATGGAAAAAGGA
AGTAAAGATCGTTACTTCCAAGAAGGAAATAAAAAACTTGTTCCAGAAGGTATTGAAGGACGAGTACCATATAAAGGACC
TTTAGCAGATACAGTTCACCAATTAGTTGGTGGTTTACGTGCAGGTATGGGCTATTGCGGAGCACAAGATTTAGAATTCT
TACGTGAGAATGCACAATTTATTCGCATGTCAGGTGCTGGTTTACTTGAAAGCCATCCTCACCACGTACAAATTACAAAA
GAGGCTCCAAACTACTCATTATAA

Upstream 100 bases:

>100_bases
TATTGTCTAAAAATTCTAACATATATATACGTCCTTGACAGTATTTTAACCAATTGATAAGCTACTAATAATAATTTCTG
GTATCATGGGGGGAACAATT

Downstream 100 bases:

>100_bases
TGTCTTATATACAAATAGACGGAGATTTGATATCTCTGTCTATTTTTTTTTGATTATGTTAGAATAACGGTTATGTGAGT
ACATAGATATTGGGGGTAGC

Product: inosine 5'-monophosphate dehydrogenase

Products: NA

Alternate protein names: IMP dehydrogenase; IMPD; IMPDH [H]

Number of amino acids: Translated: 487; Mature: 487

Protein sequence:

>487_residues
MWESKFVKEGLTFDDVLLVPAKSDVLPREVSVKTVLSESLQLNIPLISAGMDTVTEADMAIAMARQGGLGIIHKNMSIEQ
QAEQVDKVKRSESGVISDPFFLTPEHQVYDAEHLMGKYRISGVPVVNNLDERKLVGIITNRDMRFIQDYSIKISDVMTKE
QLITAPVGTTLSEAEKILQKYKIEKLPLVDNNGVLQGLITIKDIEKVIEFPNSAKDKQGRLLVGAAVGVTADAMTRIDAL
VKASVDAIVLDTAHGHSQGVIDKVKEVRAKYPSLNIIAGNVATAEATKALIEAGANVVKVGIGPGSICTTRVVAGVGVPQ
LTAVYDCATEARKHGIPVIADGGIKYSGDMVKALAAGAHVVMLGSMFAGVAESPGETEIYQGRQFKVYRGMGSVGAMEKG
SKDRYFQEGNKKLVPEGIEGRVPYKGPLADTVHQLVGGLRAGMGYCGAQDLEFLRENAQFIRMSGAGLLESHPHHVQITK
EAPNYSL
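
The protein sequence corresponds to a direct translation of the gene sequence: 1464 bases give 488 codons, that is, 487 residues plus a stop codon. The sketch below shows one way to check this with the standard genetic code. The `translate` helper is illustrative and is not taken from the database.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 24 bases of the gene sequence listed above.
print(translate("ATGTGGGAATCTAAATTTGTTAAAGAAGGT"))  # MWESKFVKEG
print(translate("GAGGCTCCAAACTACTCATTATAA"))        # EAPNYSL
```

Both fragments match the start and end of the 487-residue protein sequence.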

Sequences:

>Translated_487_residues
MWESKFVKEGLTFDDVLLVPAKSDVLPREVSVKTVLSESLQLNIPLISAGMDTVTEADMAIAMARQGGLGIIHKNMSIEQ
QAEQVDKVKRSESGVISDPFFLTPEHQVYDAEHLMGKYRISGVPVVNNLDERKLVGIITNRDMRFIQDYSIKISDVMTKE
QLITAPVGTTLSEAEKILQKYKIEKLPLVDNNGVLQGLITIKDIEKVIEFPNSAKDKQGRLLVGAAVGVTADAMTRIDAL
VKASVDAIVLDTAHGHSQGVIDKVKEVRAKYPSLNIIAGNVATAEATKALIEAGANVVKVGIGPGSICTTRVVAGVGVPQ
LTAVYDCATEARKHGIPVIADGGIKYSGDMVKALAAGAHVVMLGSMFAGVAESPGETEIYQGRQFKVYRGMGSVGAMEKG
SKDRYFQEGNKKLVPEGIEGRVPYKGPLADTVHQLVGGLRAGMGYCGAQDLEFLRENAQFIRMSGAGLLESHPHHVQITK
EAPNYSL
>Mature_487_residues
MWESKFVKEGLTFDDVLLVPAKSDVLPREVSVKTVLSESLQLNIPLISAGMDTVTEADMAIAMARQGGLGIIHKNMSIEQ
QAEQVDKVKRSESGVISDPFFLTPEHQVYDAEHLMGKYRISGVPVVNNLDERKLVGIITNRDMRFIQDYSIKISDVMTKE
QLITAPVGTTLSEAEKILQKYKIEKLPLVDNNGVLQGLITIKDIEKVIEFPNSAKDKQGRLLVGAAVGVTADAMTRIDAL
VKASVDAIVLDTAHGHSQGVIDKVKEVRAKYPSLNIIAGNVATAEATKALIEAGANVVKVGIGPGSICTTRVVAGVGVPQ
LTAVYDCATEARKHGIPVIADGGIKYSGDMVKALAAGAHVVMLGSMFAGVAESPGETEIYQGRQFKVYRGMGSVGAMEKG
SKDRYFQEGNKKLVPEGIEGRVPYKGPLADTVHQLVGGLRAGMGYCGAQDLEFLRENAQFIRMSGAGLLESHPHHVQITK
EAPNYSL

Specific function: GMP biosynthesis from IMP; first step. [C]

COG id: COG0516

COG function: function code F; IMP dehydrogenase/GMP reductase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 CBS domains [H]

Homologues:

Organism=Homo sapiens, GI217035146, Length=453, Percent_Identity=45.4746136865342, Blast_Score=372, Evalue=1e-103,
Organism=Homo sapiens, GI156616279, Length=453, Percent_Identity=45.4746136865342, Blast_Score=370, Evalue=1e-102,
Organism=Homo sapiens, GI34328930, Length=453, Percent_Identity=45.4746136865342, Blast_Score=370, Evalue=1e-102,
Organism=Homo sapiens, GI34328928, Length=453, Percent_Identity=45.4746136865342, Blast_Score=370, Evalue=1e-102,
Organism=Homo sapiens, GI217035152, Length=447, Percent_Identity=45.413870246085, Blast_Score=362, Evalue=1e-100,
Organism=Homo sapiens, GI217035148, Length=453, Percent_Identity=44.5916114790287, Blast_Score=359, Evalue=3e-99,
Organism=Homo sapiens, GI66933016, Length=454, Percent_Identity=42.7312775330396, Blast_Score=355, Evalue=5e-98,
Organism=Homo sapiens, GI217035150, Length=453, Percent_Identity=42.6048565121413, Blast_Score=334, Evalue=1e-91,
Organism=Homo sapiens, GI156104880, Length=248, Percent_Identity=42.3387096774194, Blast_Score=178, Evalue=9e-45,
Organism=Homo sapiens, GI50541956, Length=246, Percent_Identity=40.650406504065, Blast_Score=172, Evalue=5e-43,
Organism=Homo sapiens, GI50541954, Length=246, Percent_Identity=40.650406504065, Blast_Score=172, Evalue=6e-43,
Organism=Homo sapiens, GI50541952, Length=246, Percent_Identity=40.650406504065, Blast_Score=172, Evalue=6e-43,
Organism=Homo sapiens, GI50541948, Length=246, Percent_Identity=40.650406504065, Blast_Score=172, Evalue=6e-43,
Organism=Escherichia coli, GI1788855, Length=486, Percent_Identity=55.9670781893004, Blast_Score=494, Evalue=1e-141,
Organism=Escherichia coli, GI1786293, Length=220, Percent_Identity=40.4545454545455, Blast_Score=164, Evalue=1e-41,
Organism=Caenorhabditis elegans, GI71994385, Length=501, Percent_Identity=35.7285429141717, Blast_Score=282, Evalue=2e-76,
Organism=Caenorhabditis elegans, GI71994389, Length=424, Percent_Identity=37.9716981132075, Blast_Score=269, Evalue=2e-72,
Organism=Caenorhabditis elegans, GI17560440, Length=241, Percent_Identity=40.6639004149378, Blast_Score=177, Evalue=2e-44,
Organism=Saccharomyces cerevisiae, GI6322012, Length=467, Percent_Identity=40.6852248394004, Blast_Score=344, Evalue=2e-95,
Organism=Saccharomyces cerevisiae, GI6323585, Length=458, Percent_Identity=40.8296943231441, Blast_Score=334, Evalue=2e-92,
Organism=Saccharomyces cerevisiae, GI6323464, Length=469, Percent_Identity=41.5778251599147, Blast_Score=333, Evalue=3e-92,
Organism=Saccharomyces cerevisiae, GI6319352, Length=341, Percent_Identity=40.4692082111437, Blast_Score=256, Evalue=6e-69,
Organism=Saccharomyces cerevisiae, GI6319353, Length=116, Percent_Identity=41.3793103448276, Blast_Score=86, Evalue=2e-17,
Organism=Drosophila melanogaster, GI24641071, Length=479, Percent_Identity=41.544885177453, Blast_Score=342, Evalue=3e-94,
Organism=Drosophila melanogaster, GI24641073, Length=479, Percent_Identity=41.544885177453, Blast_Score=342, Evalue=3e-94,
Organism=Drosophila melanogaster, GI28571163, Length=437, Percent_Identity=41.6475972540046, Blast_Score=307, Evalue=1e-83,
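
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows. The `parse_homologue` function and the `GI` dictionary key are hypothetical names introduced here for illustration.

```python
def parse_homologue(line):
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    # Drop the trailing comma, then split into individual fields.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert numeric fields from text.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

line = ("Organism=Escherichia coli, GI1788855, Length=486, "
        "Percent_Identity=55.9670781893004, Blast_Score=494, Evalue=1e-141,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1788855 56.0
```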

Paralogues:

None

Copy number: 600 molecules/cell in growth phase, minimal media (based on E. coli) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013785
- InterPro:   IPR000644
- InterPro:   IPR005990
- InterPro:   IPR018529
- InterPro:   IPR015875
- InterPro:   IPR001093 [H]

Pfam domain/function: PF00571 CBS; PF00478 IMPDH [H]

EC number: 1.1.1.205 [H]

Molecular weight: Translated: 52375; Mature: 52375

Theoretical pI: Translated: 6.75; Mature: 6.75

Prosite motif: PS00487 IMP_DH_GMP_RED

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
3.3 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
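
The Cys/Met percentages are presumably residue counts divided by the protein length; the record does not give the formula. A minimal sketch under that assumption, using a toy sequence rather than the guaB protein:

```python
def residue_percent(protein, residues):
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues)
    return 100.0 * count / len(seq)

# Toy 10-residue sequence (not guaB): 1 Cys and 2 Met.
toy = "MACKMGGGGG"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 20.0
print(residue_percent(toy, "CM"))  # 30.0
```

For a 487-residue protein, 0.6 %Cys corresponds to about 3 cysteines and 3.3 %Met to about 16 methionines.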

Secondary structure:

>Translated Secondary Structure
MWESKFVKEGLTFDDVLLVPAKSDVLPREVSVKTVLSESLQLNIPLISAGMDTVTEADMA
CCCHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHCCCEEECEEEECCCHHHHHHHHH
IAMARQGGLGIIHKNMSIEQQAEQVDKVKRSESGVISDPFFLTPEHQVYDAEHLMGKYRI
HHHHCCCCCEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCHHHHHHHHEE
SGVPVVNNLDERKLVGIITNRDMRFIQDYSIKISDVMTKEQLITAPVGTTLSEAEKILQK
CCCCEECCCCCCEEEEEEECCCEEEHHHCCEEHHHHHHHHHHEECCCCCCHHHHHHHHHH
YKIEKLPLVDNNGVLQGLITIKDIEKVIEFPNSAKDKQGRLLVGAAVGVTADAMTRIDAL
HCCCCCCEECCCCCEEEHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHH
VKASVDAIVLDTAHGHSQGVIDKVKEVRAKYPSLNIIAGNVATAEATKALIEAGANVVKV
HHCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCEEEE
GIGPGSICTTRVVAGVGVPQLTAVYDCATEARKHGIPVIADGGIKYSGDMVKALAAGAHV
ECCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEECCCEEECCHHHHHHHCCCHH
VMLGSMFAGVAESPGETEIYQGRQFKVYRGMGSVGAMEKGSKDRYFQEGNKKLVPEGIEG
HHHHHHHHHHCCCCCCCHHCCCCEEEEEECCCCCCCCCCCCCCHHHHCCCCEECCCCCCC
RVPYKGPLADTVHQLVGGLRAGMGYCGAQDLEFLRENAQFIRMSGAGLLESHPHHVQITK
CCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCEEEECCCCCCCCCCCEEEEEE
EAPNYSL
CCCCCCC
>Mature Secondary Structure
MWESKFVKEGLTFDDVLLVPAKSDVLPREVSVKTVLSESLQLNIPLISAGMDTVTEADMA
CCCHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHCCCEEECEEEECCCHHHHHHHHH
IAMARQGGLGIIHKNMSIEQQAEQVDKVKRSESGVISDPFFLTPEHQVYDAEHLMGKYRI
HHHHCCCCCEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCHHHHHHHHEE
SGVPVVNNLDERKLVGIITNRDMRFIQDYSIKISDVMTKEQLITAPVGTTLSEAEKILQK
CCCCEECCCCCCEEEEEEECCCEEEHHHCCEEHHHHHHHHHHEECCCCCCHHHHHHHHHH
YKIEKLPLVDNNGVLQGLITIKDIEKVIEFPNSAKDKQGRLLVGAAVGVTADAMTRIDAL
HCCCCCCEECCCCCEEEHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHH
VKASVDAIVLDTAHGHSQGVIDKVKEVRAKYPSLNIIAGNVATAEATKALIEAGANVVKV
HHCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCEEEE
GIGPGSICTTRVVAGVGVPQLTAVYDCATEARKHGIPVIADGGIKYSGDMVKALAAGAHV
ECCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEECCCEEECCHHHHHHHCCCHH
VMLGSMFAGVAESPGETEIYQGRQFKVYRGMGSVGAMEKGSKDRYFQEGNKKLVPEGIEG
HHHHHHHHHHCCCCCCCHHCCCCEEEEEECCCCCCCCCCCCCCHHHHCCCCEECCCCCCC
RVPYKGPLADTVHQLVGGLRAGMGYCGAQDLEFLRENAQFIRMSGAGLLESHPHHVQITK
CCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCEEEECCCCCCCCCCCEEEEEE
EAPNYSL
CCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11058132 [H]