Definition Clostridium botulinum A str. ATCC 3502, complete genome.
Accession NC_009495
Length 3,886,916

Click here to switch to the map view.

The map label for this gene is ydaM [C]

Identifier: 148379493

GI number: 148379493

Start: 1658182

End: 1660380

Strand: Direct

Name: ydaM [C]

Synonym: CBO1524

Alternate gene names: 148379493

Gene position: 1658182-1660380 (Clockwise)

Preceding gene: 148379491

Following gene: 148379494

Centisome position: 42.66

GC content: 28.33

Gene sequence:

>2199_bases
ATGCTTGCAGCAATAATAAAAAAAATTCAATTAATTTTAGAAACTAAAAAAAAGAATACATTTAAATATGAATGTATTAA
AATATGTTTAATGTATATAATTAGCGGATTTATTTGGATTTATTTTTCAGATAAAATTATAAAGAAATTTGTTAATGATA
AAGAGATGTTGATAATTATAAGTACATATAAAGGTTGGTTATATGTTATTATAACTGCACCAATTCTTTATTTAATAATA
AGAAGTATTCTAAAAAAAGTTTATTTAGCAGAAAAAAAACTCAATAAAAGCTATGAAGAGTTATTAGCGGTTAATGAAAA
ACTTGAATCCTATGTAAAGCGATTGACTAATTCCAAGGAAGAACTAAAAATTCAATATGATCAAACCATTGAAAGTGAAA
AGAAATTAAGCAAGAGTGAAGAAAGGTATAAAGCCCTTGTAAGTGAAATGCAACAAGGATTAGTACTTTTTCAAGGTAGT
GATAATGAGGAAGGAAAAATTATAAACTATAAACTTTTAGATTCAAATGCAAGTTATGAGAGATTAACAGGATTAAAAAA
AGAAGATATTTTAGGTAAAACTCTTTATGAGATATTCCCTAATATGGAAAAGAATCTGATTGAAAAAATCCAAAGAGTAG
CAATAACAGGACAATCTGTGCATTATCAACGCTATATAAAAGAAAAAGATAAATATTATGAAGCAATAGTGTATAGGCCT
AAAAAATTGCAATTTGCAGCAATTTTAACGGATATTACTGAAAGAAAATTTGCAGAGAAGGCCTTGAAAACCAGTGAATA
TAATTTTAGAAATATTTTTGAAAGTTCCTCAGATCCTATACTTATAACTCTAGATAATAAAGTTATTGATTGCAATTTAG
CTATGATCGAATTATTAGGATATGATTCAAAGTCATCTATACTTCATAAAAATCCAGTTCAGTTTTCTCCAGAGAAACAG
CCTAATGGAGAATCTTCTAAAGAAAAAGCTATTCAGGTATATAAGATTACTATGAAAAATAAGAAATATAAATTTGAATG
GTGGTTTAAAAGGGTTGATGGTACCTTATTGCCAGTAGAAGTTATGATGACAACTATATTACATAATGGGAAAAAAGTTT
TTCATTCTCTATGCAGAGATATTCGTGAAAGAAAAGAAATGGAAAATAAATTAGAATATTTAAGCTATCATGATCAACTA
ACAGGTTTATATAATAGAAGGTTTTTTGAAAACGAATTAAAGAGACTAGATGTGGAAGAAAATTTACCTTTGACTATTGT
TATGGCAGATGTTAATGGACTAAAGCTTGTAAATGATTCTTTTGGCCATGCCGCAGGAGACGAACTATTAAAAAAAGTAT
CAGAAATTATAAAAAGGGGATGTAGATATAATGATATTATTGCTAGACTTGGGGGAGATGAATTTGTAATTTTACTGCCT
AAGACAGATATATATGAAACAGAACAAATTGTTAAAAATATTAATGCTTTAGCTTTAAAGGAAACAGTAAGTGCTGTTAA
TATATCCATCTCCTTTGGATATGGAACTAAGAAGAAAGAAGAAGAAAAGATTGAAGAAATTTTAAAGAAAGCTGAAGATT
ATATGTATAAGAAAAAGCTTTTTGAGAGTCCAAGTATGAGAGGCAAAACTATAGGTGCTATAATTAGTACCCTTCATGAA
AAAAATAAAAGAGAAGAGGAACACTCTCATAGAGTCTCAAGGTTATGCCAAGATATGGGGCATGCTTTAGGATTAACTGA
AAGTGAGACAGAGGAACTAAAAACTATTGGTTTACTTCATGATATAGGAAAAATAGCTATAGAAGAAAATATACTAAACA
AGAGTGAAGAACTTACAGAGGATGAATGGCAAGAAATAAAACGACATTCAGAAATAGGATATAGAATACTTAACACGGTA
AATGATATGTTAGAAATATCAGAGTATGTATTATATCATCATGAAAGATGGGATGGAAAGGGATATCCTAAAGGTTTAAA
GGGAGAAGAGATACCACTTCAGTCAAGAATAATAACTATAATTGATGCTTATGATGCTATGACTAGCCAAAGAAGTTATA
GAAGCGCTTTACCAGAGGAGAGTGCTATAGAAGAGTTAAAAATAAATGCAGGCACTCAGTTTGATCCAGATCTTGTAAGA
ATATTTATTGAAAAAGTATTGAATAAATCTTTCTATTAA

Upstream 100 bases:

>100_bases
ATTTGTAAGTTACAAGTATGTAAAAAAATAGAGTTATACATATCAAGGAATAATGAATTCAGTAGAAGCTAATGAAACAC
AATAGTAATGGAGGCGTGTC

Downstream 100 bases:

>100_bases
AAAAGTATATTTTTGTACTTTTTAGATTTTCAGAGCTTTAGAATATAGATATATTTTTCGGTCATAATATATTTTGAATA
AAAAATATATTGAAGGAAGT

Product: putative sensory box-containing diguanylate cyclase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 732; Mature: 732

Protein sequence:

>732_residues
MLAAIIKKIQLILETKKKNTFKYECIKICLMYIISGFIWIYFSDKIIKKFVNDKEMLIIISTYKGWLYVIITAPILYLII
RSILKKVYLAEKKLNKSYEELLAVNEKLESYVKRLTNSKEELKIQYDQTIESEKKLSKSEERYKALVSEMQQGLVLFQGS
DNEEGKIINYKLLDSNASYERLTGLKKEDILGKTLYEIFPNMEKNLIEKIQRVAITGQSVHYQRYIKEKDKYYEAIVYRP
KKLQFAAILTDITERKFAEKALKTSEYNFRNIFESSSDPILITLDNKVIDCNLAMIELLGYDSKSSILHKNPVQFSPEKQ
PNGESSKEKAIQVYKITMKNKKYKFEWWFKRVDGTLLPVEVMMTTILHNGKKVFHSLCRDIRERKEMENKLEYLSYHDQL
TGLYNRRFFENELKRLDVEENLPLTIVMADVNGLKLVNDSFGHAAGDELLKKVSEIIKRGCRYNDIIARLGGDEFVILLP
KTDIYETEQIVKNINALALKETVSAVNISISFGYGTKKKEEEKIEEILKKAEDYMYKKKLFESPSMRGKTIGAIISTLHE
KNKREEEHSHRVSRLCQDMGHALGLTESETEELKTIGLLHDIGKIAIEENILNKSEELTEDEWQEIKRHSEIGYRILNTV
NDMLEISEYVLYHHERWDGKGYPKGLKGEEIPLQSRIITIIDAYDAMTSQRSYRSALPEESAIEELKINAGTQFDPDLVR
IFIEKVLNKSFY

Sequences:

>Translated_732_residues
MLAAIIKKIQLILETKKKNTFKYECIKICLMYIISGFIWIYFSDKIIKKFVNDKEMLIIISTYKGWLYVIITAPILYLII
RSILKKVYLAEKKLNKSYEELLAVNEKLESYVKRLTNSKEELKIQYDQTIESEKKLSKSEERYKALVSEMQQGLVLFQGS
DNEEGKIINYKLLDSNASYERLTGLKKEDILGKTLYEIFPNMEKNLIEKIQRVAITGQSVHYQRYIKEKDKYYEAIVYRP
KKLQFAAILTDITERKFAEKALKTSEYNFRNIFESSSDPILITLDNKVIDCNLAMIELLGYDSKSSILHKNPVQFSPEKQ
PNGESSKEKAIQVYKITMKNKKYKFEWWFKRVDGTLLPVEVMMTTILHNGKKVFHSLCRDIRERKEMENKLEYLSYHDQL
TGLYNRRFFENELKRLDVEENLPLTIVMADVNGLKLVNDSFGHAAGDELLKKVSEIIKRGCRYNDIIARLGGDEFVILLP
KTDIYETEQIVKNINALALKETVSAVNISISFGYGTKKKEEEKIEEILKKAEDYMYKKKLFESPSMRGKTIGAIISTLHE
KNKREEEHSHRVSRLCQDMGHALGLTESETEELKTIGLLHDIGKIAIEENILNKSEELTEDEWQEIKRHSEIGYRILNTV
NDMLEISEYVLYHHERWDGKGYPKGLKGEEIPLQSRIITIIDAYDAMTSQRSYRSALPEESAIEELKINAGTQFDPDLVR
IFIEKVLNKSFY
>Mature_732_residues
MLAAIIKKIQLILETKKKNTFKYECIKICLMYIISGFIWIYFSDKIIKKFVNDKEMLIIISTYKGWLYVIITAPILYLII
RSILKKVYLAEKKLNKSYEELLAVNEKLESYVKRLTNSKEELKIQYDQTIESEKKLSKSEERYKALVSEMQQGLVLFQGS
DNEEGKIINYKLLDSNASYERLTGLKKEDILGKTLYEIFPNMEKNLIEKIQRVAITGQSVHYQRYIKEKDKYYEAIVYRP
KKLQFAAILTDITERKFAEKALKTSEYNFRNIFESSSDPILITLDNKVIDCNLAMIELLGYDSKSSILHKNPVQFSPEKQ
PNGESSKEKAIQVYKITMKNKKYKFEWWFKRVDGTLLPVEVMMTTILHNGKKVFHSLCRDIRERKEMENKLEYLSYHDQL
TGLYNRRFFENELKRLDVEENLPLTIVMADVNGLKLVNDSFGHAAGDELLKKVSEIIKRGCRYNDIIARLGGDEFVILLP
KTDIYETEQIVKNINALALKETVSAVNISISFGYGTKKKEEEKIEEILKKAEDYMYKKKLFESPSMRGKTIGAIISTLHE
KNKREEEHSHRVSRLCQDMGHALGLTESETEELKTIGLLHDIGKIAIEENILNKSEELTEDEWQEIKRHSEIGYRILNTV
NDMLEISEYVLYHHERWDGKGYPKGLKGEEIPLQSRIITIIDAYDAMTSQRSYRSALPEESAIEELKINAGTQFDPDLVR
IFIEKVLNKSFY

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: Non Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HD domain [H]

Homologues:

Organism=Escherichia coli, GI87081881, Length=296, Percent_Identity=26.6891891891892, Blast_Score=114, Evalue=3e-26,
Organism=Escherichia coli, GI1788381, Length=176, Percent_Identity=32.3863636363636, Blast_Score=89, Evalue=7e-19,
Organism=Escherichia coli, GI87082007, Length=163, Percent_Identity=30.6748466257669, Blast_Score=86, Evalue=6e-18,
Organism=Escherichia coli, GI1786584, Length=216, Percent_Identity=27.7777777777778, Blast_Score=86, Evalue=1e-17,
Organism=Escherichia coli, GI1787262, Length=153, Percent_Identity=30.718954248366, Blast_Score=81, Evalue=3e-16,
Organism=Escherichia coli, GI145693134, Length=149, Percent_Identity=33.5570469798658, Blast_Score=79, Evalue=8e-16,
Organism=Escherichia coli, GI1787541, Length=205, Percent_Identity=27.8048780487805, Blast_Score=77, Evalue=5e-15,
Organism=Escherichia coli, GI87081974, Length=150, Percent_Identity=31.3333333333333, Blast_Score=68, Evalue=2e-12,
Organism=Escherichia coli, GI87081977, Length=101, Percent_Identity=38.6138613861386, Blast_Score=67, Evalue=3e-12,
Organism=Escherichia coli, GI1787816, Length=147, Percent_Identity=27.891156462585, Blast_Score=67, Evalue=4e-12,
Organism=Escherichia coli, GI1788085, Length=104, Percent_Identity=33.6538461538462, Blast_Score=67, Evalue=4e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001054
- InterPro:   IPR000160
- InterPro:   IPR003607 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 85351; Mature: 85351

Theoretical pI: Translated: 8.25; Mature: 8.25

Prosite motif: PS50112 PAS ; PS50113 PAC ; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MLAAIIKKIQLILETKKKNTFKYECIKICLMYIISGFIWIYFSDKIIKKFVNDKEMLIII
CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHEECHHHHHHHCCCCEEEEEE
STYKGWLYVIITAPILYLIIRSILKKVYLAEKKLNKSYEELLAVNEKLESYVKRLTNSKE
EECCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
ELKIQYDQTIESEKKLSKSEERYKALVSEMQQGLVLFQGSDNEEGKIINYKLLDSNASYE
HEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCEEEEEEECCCCCHH
RLTGLKKEDILGKTLYEIFPNMEKNLIEKIQRVAITGQSVHYQRYIKEKDKYYEAIVYRP
HHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC
KKLQFAAILTDITERKFAEKALKTSEYNFRNIFESSSDPILITLDNKVIDCNLAMIELLG
CHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCEEEEECCEEEECHHHHHHHHC
YDSKSSILHKNPVQFSPEKQPNGESSKEKAIQVYKITMKNKKYKFEWWFKRVDGTLLPVE
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHEEEEEEEECCCCEEHHHHHHHHCCCCCHHH
VMMTTILHNGKKVFHSLCRDIRERKEMENKLEYLSYHDQLTGLYNRRFFENELKRLDVEE
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
NLPLTIVMADVNGLKLVNDSFGHAAGDELLKKVSEIIKRGCRYNDIIARLGGDEFVILLP
CCCEEEEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCEEEEEC
KTDIYETEQIVKNINALALKETVSAVNISISFGYGTKKKEEEKIEEILKKAEDYMYKKKL
CCCCHHHHHHHHHHHHHHHHHHHHEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHH
FESPSMRGKTIGAIISTLHEKNKREEEHSHRVSRLCQDMGHALGLTESETEELKTIGLLH
HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH
DIGKIAIEENILNKSEELTEDEWQEIKRHSEIGYRILNTVNDMLEISEYVLYHHERWDGK
HHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
GYPKGLKGEEIPLQSRIITIIDAYDAMTSQRSYRSALPEESAIEELKINAGTQFDPDLVR
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCHHHHH
IFIEKVLNKSFY
HHHHHHHHCCCC
>Mature Secondary Structure
MLAAIIKKIQLILETKKKNTFKYECIKICLMYIISGFIWIYFSDKIIKKFVNDKEMLIII
CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHEECHHHHHHHCCCCEEEEEE
STYKGWLYVIITAPILYLIIRSILKKVYLAEKKLNKSYEELLAVNEKLESYVKRLTNSKE
EECCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
ELKIQYDQTIESEKKLSKSEERYKALVSEMQQGLVLFQGSDNEEGKIINYKLLDSNASYE
HEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCEEEEEEECCCCCHH
RLTGLKKEDILGKTLYEIFPNMEKNLIEKIQRVAITGQSVHYQRYIKEKDKYYEAIVYRP
HHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC
KKLQFAAILTDITERKFAEKALKTSEYNFRNIFESSSDPILITLDNKVIDCNLAMIELLG
CHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCEEEEECCEEEECHHHHHHHHC
YDSKSSILHKNPVQFSPEKQPNGESSKEKAIQVYKITMKNKKYKFEWWFKRVDGTLLPVE
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHEEEEEEEECCCCEEHHHHHHHHCCCCCHHH
VMMTTILHNGKKVFHSLCRDIRERKEMENKLEYLSYHDQLTGLYNRRFFENELKRLDVEE
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
NLPLTIVMADVNGLKLVNDSFGHAAGDELLKKVSEIIKRGCRYNDIIARLGGDEFVILLP
CCCEEEEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCEEEEEC
KTDIYETEQIVKNINALALKETVSAVNISISFGYGTKKKEEEKIEEILKKAEDYMYKKKL
CCCCHHHHHHHHHHHHHHHHHHHHEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHH
FESPSMRGKTIGAIISTLHEKNKREEEHSHRVSRLCQDMGHALGLTESETEELKTIGLLH
HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH
DIGKIAIEENILNKSEELTEDEWQEIKRHSEIGYRILNTVNDMLEISEYVLYHHERWDGK
HHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
GYPKGLKGEEIPLQSRIITIIDAYDAMTSQRSYRSALPEESAIEELKINAGTQFDPDLVR
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCHHHHH
IFIEKVLNKSFY
HHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9537320 [H]