Definition Clostridium botulinum A str. ATCC 3502, complete genome.
Accession NC_009495
Length 3,886,916

The map label for this gene is ydaN [H]

Identifier: 148379279

GI number: 148379279

Start: 1413747

End: 1415843

Strand: Direct

Name: ydaN [H]

Synonym: CBO1298

Alternate gene names: 148379279

Gene position: 1413747-1415843 (Clockwise)

Preceding gene: 148379278

Following gene: 148379280

Centisome position: 36.37
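
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the gene start expressed as a percentage of the genome length. The record does not state either definition, so both are assumptions, and the function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_886_916
GENE_START = 1_413_747
GENE_END = 1_415_843


def gene_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(GENE_START, GENE_END)
codons = length // 3  # includes the stop codon
print(length, codons, round(centisome(GENE_START, GENOME_LENGTH), 2))
```

Under these assumptions the gene spans 2097 bases, which is 699 codons (698 residues plus a stop codon), and the computed centisome value agrees with the 36.37 listed in the record.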

GC content: 25.75

Gene sequence:

>2097_bases
ATGAAGAAGAATTTAAGAATAATAATTATTATTATAATGCTTTTCCTAGGAAATATTATAAGTGGACAAAATGTAGTGGC
GGCTCCAAATAAATCTAAAAATTTTAGAGTAGAAAAAGATATGAAGATGGAGGGAGTATTTGGAAGTAATGTATTTTTCT
TTAACATAGATAAAAGTTGGACAGTAGATAATGCCTATTTAAACTTAATTTTTACAGAGAGTGATCTTTTAGATAAAACT
CAATCTACATTAACAGCTTATATAAATGATTTCCCAGTATATTCTATGAAGATTGGAGATAAAAAGAAATATAAAGAATC
CATAAAAATTAATATACCAAAGGATAAATTGATATCTGGATATAATGAAGTTAAAATTAAAGTTTATAGTAGAATATCAG
AAAAACCCTGTATAGATGATGTAAATAGTGGGAATTGGTTTATAATTCACAAAGGTTCTTATGTTCATATGGATTTTAAG
GATAGAGAAGATACAAAAACTCTTAAAGAATTTCCTTTCCCCTATTTAAAAGCCAGTGACGAAAATCCAGCTAACAGTAT
GATTATGCTTCCGGATAATTTTTCACAGGGAGAAATCACATCTGCTATGATGCTTTGTTCTAATTTTGGATCAAAAAGAA
AATCTGATAATGTAAATATGAAAGTTTATAAAGCTTCAGAGGCTAATTTAAAAAACAAATTAGATATAATTTTTATAGGA
AGTAAAAATAATACTCCCCTAGATTTGTTAACTTTACTGTCCAAAGAAGAGATAAATAGGCTAGATAAAGATGCTATTGT
TAAAGAGGTTATTTCACCCTATAATCCTAGTAAAAAGCTATTGCTTTTAATATCTAATAATGAAAAGAATATGATAAAGG
CTTCAAAATTATTATGTAGTAAAGACTATATGAAGCAAATAGATAAAGACACTATAATAGTTAATAATAGTATGGATGTA
GAAGATATAAAAGAGGAAAAAGCGAATAGAGTATCATTGTCAGATTTAGGATATGGCAATGTAAGTTTAAAAGGACCTTT
TAAACAGGAGGCAACCTTTAATTTGAATATTCCCAAAGATAGATTTATAAAAGAGGGATCAAAGGTAGTTATAAATAATA
GATATTCTAAAAATATAGATTTTGATAGATCTCTTATAACGGTTTATATAAATGATATTCCTATAGGTAGCAAAAAATTA
GACAGTAAAACAGCGGATAGTAATAGTTTTGAAATAAGTATTCCAAAAGATATTAGAAATAGTAGCAATTATGAAATAAA
GGTAGTGTTCAATTTAGAAATTAAAGATTTATTTTGTACTTTTAGAGAGGGCGAAAATCCTTGGGCTTATATATTAAATA
ATTCTTACATTTATACCCCTTATAAAGAGGGTAGGGATAATGTATTTGAAAATTATCCTAATCCATTTATATCTAATGGA
GGTATGAATGATTTGACATTAGTACTTTCAGATAATACTACTTCTGAAGAACTTAATTTTGCAGGAAATATAATGGCTAC
CATAGGGCATGATGTAGATACCAATAGGGGTAAATTTAATGCAGTAGCAGCAAAGGATTTATCATCTAAACTTAAGAAGG
GTAATTTAATAATCATAGGAACACCAGAGAGCAATTCTATTATAAAAAATTCAAATAAAAATTTATATATTAAATTTAAC
AAGAATTTTAATGGATTTCTATCCAATGAGAAAATGAAATTTTTTCAGGATTATAGTAGCAAGTTAGCCTCTATTCAACT
TATAGACTCCCCTTATAATAAAGAGAACAAAGCAATGATAGTAACATCAACTTATACTAGAGATTTAGCTTTAGCACAAA
AATATTTAAGTGATATATCTTTAGTAAAAAGTCTTAAAGGAAATGCTGTAACTATAGATAGAGATGGTGTAATGAATTAT
TCATATTTTGGAGATAAATATGATAAAGAAAAAGAAGAAAATAGTAATATCAGTAAATTTAAAAATATAACTTTAAATTC
TAATATAAAAAACCTATTAATATTTTTTGTATTTATAATGGTTATTTTAGTTGGGGGTTCTTTATTATTTATAAAAAAAT
ATAAAAAAAATAGTTAG
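
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length. The helper name `gc_content` is illustrative and is not taken from the record's source.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Usage: paste the full 2097-base sequence in place of this first line
# to compare against the GC content reported in the record.
first_line = (
    "ATGAAGAAGAATTTAAGAATAATAATTATTATTATAATGCTTTTCCTAGGAAATATTATAAGTGGACAAAATGTAGTGGC"
)
print(f"{gc_content(first_line):.2f}")
```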

Upstream 100 bases:

>100_bases
AACTTAAGCAGATGAATCCTGTGATACTACCGTCTTATAAGTTGGAAAAATCAAAAATCATAGCTGCTATTTGTTATGAT
TTTAGAAATCAGGAGTTAAT

Downstream 100 bases:

>100_bases
GATATATTATTACTAGTACAATAATTAAGATAACTTAATAAACAGAGTTCTTGGTTTTAGAGGGAGTTTTTACTCCCACT
AAAGCTTATTAACAGAATGC

Product: cellulose synthase domain protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 698; Mature: 698

Protein sequence:

>698_residues
MKKNLRIIIIIIMLFLGNIISGQNVVAAPNKSKNFRVEKDMKMEGVFGSNVFFFNIDKSWTVDNAYLNLIFTESDLLDKT
QSTLTAYINDFPVYSMKIGDKKKYKESIKINIPKDKLISGYNEVKIKVYSRISEKPCIDDVNSGNWFIIHKGSYVHMDFK
DREDTKTLKEFPFPYLKASDENPANSMIMLPDNFSQGEITSAMMLCSNFGSKRKSDNVNMKVYKASEANLKNKLDIIFIG
SKNNTPLDLLTLLSKEEINRLDKDAIVKEVISPYNPSKKLLLLISNNEKNMIKASKLLCSKDYMKQIDKDTIIVNNSMDV
EDIKEEKANRVSLSDLGYGNVSLKGPFKQEATFNLNIPKDRFIKEGSKVVINNRYSKNIDFDRSLITVYINDIPIGSKKL
DSKTADSNSFEISIPKDIRNSSNYEIKVVFNLEIKDLFCTFREGENPWAYILNNSYIYTPYKEGRDNVFENYPNPFISNG
GMNDLTLVLSDNTTSEELNFAGNIMATIGHDVDTNRGKFNAVAAKDLSSKLKKGNLIIIGTPESNSIIKNSNKNLYIKFN
KNFNGFLSNEKMKFFQDYSSKLASIQLIDSPYNKENKAMIVTSTYTRDLALAQKYLSDISLVKSLKGNAVTIDRDGVMNY
SYFGDKYDKEKEENSNISKFKNITLNSNIKNLLIFFVFIMVILVGGSLLFIKKYKKNS
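
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The sketch below builds the standard genetic code from the conventional TCAG ordering and stops at the first stop codon. It is an illustration and not the pipeline that produced this record; the names `CODON_TABLE` and `translate` are assumptions.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAAGAAGAATTTAAGAATAATAATTATT"))  # MKKNLRIIII
print(translate("AATAGTTAG"))                       # NS
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above, with TAG acting as the stop codon.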

Sequences:

>Translated_698_residues
MKKNLRIIIIIIMLFLGNIISGQNVVAAPNKSKNFRVEKDMKMEGVFGSNVFFFNIDKSWTVDNAYLNLIFTESDLLDKT
QSTLTAYINDFPVYSMKIGDKKKYKESIKINIPKDKLISGYNEVKIKVYSRISEKPCIDDVNSGNWFIIHKGSYVHMDFK
DREDTKTLKEFPFPYLKASDENPANSMIMLPDNFSQGEITSAMMLCSNFGSKRKSDNVNMKVYKASEANLKNKLDIIFIG
SKNNTPLDLLTLLSKEEINRLDKDAIVKEVISPYNPSKKLLLLISNNEKNMIKASKLLCSKDYMKQIDKDTIIVNNSMDV
EDIKEEKANRVSLSDLGYGNVSLKGPFKQEATFNLNIPKDRFIKEGSKVVINNRYSKNIDFDRSLITVYINDIPIGSKKL
DSKTADSNSFEISIPKDIRNSSNYEIKVVFNLEIKDLFCTFREGENPWAYILNNSYIYTPYKEGRDNVFENYPNPFISNG
GMNDLTLVLSDNTTSEELNFAGNIMATIGHDVDTNRGKFNAVAAKDLSSKLKKGNLIIIGTPESNSIIKNSNKNLYIKFN
KNFNGFLSNEKMKFFQDYSSKLASIQLIDSPYNKENKAMIVTSTYTRDLALAQKYLSDISLVKSLKGNAVTIDRDGVMNY
SYFGDKYDKEKEENSNISKFKNITLNSNIKNLLIFFVFIMVILVGGSLLFIKKYKKNS
>Mature_698_residues
MKKNLRIIIIIIMLFLGNIISGQNVVAAPNKSKNFRVEKDMKMEGVFGSNVFFFNIDKSWTVDNAYLNLIFTESDLLDKT
QSTLTAYINDFPVYSMKIGDKKKYKESIKINIPKDKLISGYNEVKIKVYSRISEKPCIDDVNSGNWFIIHKGSYVHMDFK
DREDTKTLKEFPFPYLKASDENPANSMIMLPDNFSQGEITSAMMLCSNFGSKRKSDNVNMKVYKASEANLKNKLDIIFIG
SKNNTPLDLLTLLSKEEINRLDKDAIVKEVISPYNPSKKLLLLISNNEKNMIKASKLLCSKDYMKQIDKDTIIVNNSMDV
EDIKEEKANRVSLSDLGYGNVSLKGPFKQEATFNLNIPKDRFIKEGSKVVINNRYSKNIDFDRSLITVYINDIPIGSKKL
DSKTADSNSFEISIPKDIRNSSNYEIKVVFNLEIKDLFCTFREGENPWAYILNNSYIYTPYKEGRDNVFENYPNPFISNG
GMNDLTLVLSDNTTSEELNFAGNIMATIGHDVDTNRGKFNAVAAKDLSSKLKKGNLIIIGTPESNSIIKNSNKNLYIKFN
KNFNGFLSNEKMKFFQDYSSKLASIQLIDSPYNKENKAMIVTSTYTRDLALAQKYLSDISLVKSLKGNAVTIDRDGVMNY
SYFGDKYDKEKEENSNISKFKNITLNSNIKNLLIFFVFIMVILVGGSLLFIKKYKKNS

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell membrane; Single-pass membrane protein (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI1789952, Length=402, Percent_Identity=24.6268656716418, Blast_Score=66, Evalue=1e-11
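
A homologue entry of this shape can be split into named fields for downstream filtering. The parser below is a hypothetical sketch: it assumes comma-separated `key=value` pairs plus a bare `GI<number>` token, as seen in the single entry above, and it tolerates a trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dict of string fields."""
    fields = {}
    for part in line.strip().strip(",").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            # Bare token such as GI1789952 carries the GI number.
            fields["GI"] = part[2:]
    return fields


entry = parse_homologue(
    "Organism=Escherichia coli, GI1789952, Length=402, "
    "Percent_Identity=24.6268656716418, Blast_Score=66, Evalue=1e-11,"
)
print(entry["Organism"], entry["GI"], float(entry["Evalue"]))
```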

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018513 [H]

Pfam domain/function: PF03170 BcsB [H]

EC number: NA

Molecular weight: Translated: 79485; Mature: 79485

Theoretical pI: Translated: 9.62; Mature: 9.62

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
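
The Cys/Met percentages follow from residue counts divided by protein length. The sketch below assumes each percentage is a count over the total number of residues, rounded to one decimal place. The helper `residue_percent` is illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return 100.0 * hits / len(protein)


# Toy example; substitute the 698-residue protein sequence from this record
# to compare against the reported Cys, Met and Cys+Met values.
toy = "MKCM"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

For a 698-residue protein, 4 cysteines give 100 * 4 / 698, which rounds to the 0.6 % reported above.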

Secondary structure:

>Translated Secondary Structure
MKKNLRIIIIIIMLFLGNIISGQNVVAAPNKSKNFRVEKDMKMEGVFGSNVFFFNIDKSW
CCCCCEEHHHHHHHHHHHHCCCCEEEECCCCCCCEEECCCCEECCEECCCEEEEECCCCE
TVDNAYLNLIFTESDLLDKTQSTLTAYINDFPVYSMKIGDKKKYKESIKINIPKDKLISG
EECCEEEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCHHHCCCCEEEECCHHHHHCC
YNEVKIKVYSRISEKPCIDDVNSGNWFIIHKGSYVHMDFKDREDTKTLKEFPFPYLKASD
CCEEEEEEEECCCCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHCCCCEEECCC
ENPANSMIMLPDNFSQGEITSAMMLCSNFGSKRKSDNVNMKVYKASEANLKNKLDIIFIG
CCCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCEEEEEEE
SKNNTPLDLLTLLSKEEINRLDKDAIVKEVISPYNPSKKLLLLISNNEKNMIKASKLLCS
CCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHC
KDYMKQIDKDTIIVNNSMDVEDIKEEKANRVSLSDLGYGNVSLKGPFKQEATFNLNIPKD
HHHHHHCCCCEEEEECCCCHHHHHHHHCCEEEEHHCCCCCEEECCCCCCCCEEEECCCHH
RFIKEGSKVVINNRYSKNIDFDRSLITVYINDIPIGSKKLDSKTADSNSFEISIPKDIRN
HHHCCCCEEEEECCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEEEEECCHHHCC
SSNYEIKVVFNLEIKDLFCTFREGENPWAYILNNSYIYTPYKEGRDNVFENYPNPFISNG
CCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCEEECCCCCCCCHHHHCCCCCCCCCC
GMNDLTLVLSDNTTSEELNFAGNIMATIGHDVDTNRGKFNAVAAKDLSSKLKKGNLIIIG
CCCEEEEEEECCCCCCHHCCCCCEEEEECCCCCCCCCCEEEEEHHHHHHHHCCCCEEEEE
TPESNSIIKNSNKNLYIKFNKNFNGFLSNEKMKFFQDYSSKLASIQLIDSPYNKENKAMI
CCCCCCEEECCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHEEEEEEECCCCCCCCEEE
VTSTYTRDLALAQKYLSDISLVKSLKGNAVTIDRDGVMNYSYFGDKYDKEKEENSNISKF
EEECCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEEHHCCCCCCCCHHCCCCHHHH
KNITLNSNIKNLLIFFVFIMVILVGGSLLFIKKYKKNS
EEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC
>Mature Secondary Structure
MKKNLRIIIIIIMLFLGNIISGQNVVAAPNKSKNFRVEKDMKMEGVFGSNVFFFNIDKSW
CCCCCEEHHHHHHHHHHHHCCCCEEEECCCCCCCEEECCCCEECCEECCCEEEEECCCCE
TVDNAYLNLIFTESDLLDKTQSTLTAYINDFPVYSMKIGDKKKYKESIKINIPKDKLISG
EECCEEEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCHHHCCCCEEEECCHHHHHCC
YNEVKIKVYSRISEKPCIDDVNSGNWFIIHKGSYVHMDFKDREDTKTLKEFPFPYLKASD
CCEEEEEEEECCCCCCCCCCCCCCCEEEEECCCEEEEECCCCCHHHHHHHCCCCEEECCC
ENPANSMIMLPDNFSQGEITSAMMLCSNFGSKRKSDNVNMKVYKASEANLKNKLDIIFIG
CCCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCEEEEEEE
SKNNTPLDLLTLLSKEEINRLDKDAIVKEVISPYNPSKKLLLLISNNEKNMIKASKLLCS
CCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHC
KDYMKQIDKDTIIVNNSMDVEDIKEEKANRVSLSDLGYGNVSLKGPFKQEATFNLNIPKD
HHHHHHCCCCEEEEECCCCHHHHHHHHCCEEEEHHCCCCCEEECCCCCCCCEEEECCCHH
RFIKEGSKVVINNRYSKNIDFDRSLITVYINDIPIGSKKLDSKTADSNSFEISIPKDIRN
HHHCCCCEEEEECCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEEEEECCHHHCC
SSNYEIKVVFNLEIKDLFCTFREGENPWAYILNNSYIYTPYKEGRDNVFENYPNPFISNG
CCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCEEECCCCCCCCHHHHCCCCCCCCCC
GMNDLTLVLSDNTTSEELNFAGNIMATIGHDVDTNRGKFNAVAAKDLSSKLKKGNLIIIG
CCCEEEEEEECCCCCCHHCCCCCEEEEECCCCCCCCCCEEEEEHHHHHHHHCCCCEEEEE
TPESNSIIKNSNKNLYIKFNKNFNGFLSNEKMKFFQDYSSKLASIQLIDSPYNKENKAMI
CCCCCCEEECCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHEEEEEEECCCCCCCCEEE
VTSTYTRDLALAQKYLSDISLVKSLKGNAVTIDRDGVMNYSYFGDKYDKEKEENSNISKF
EEECCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEEHHCCCCCCCCHHCCCCHHHH
KNITLNSNIKNLLIFFVFIMVILVGGSLLFIKKYKKNS
EEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC
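
The predicted secondary structure strings can be summarized as state fractions. The sketch below assumes the common three-state convention (H for helix, E for strand, C for coil), which the record does not spell out. The function name is illustrative.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each secondary structure state in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in sorted(counts.items())}


# Toy example; concatenate the H/E/C lines above (skipping the residue lines)
# to summarize the full prediction.
print(ss_composition("HHEC"))
```

A prediction with substantial helix and strand fractions would be consistent with the "Alpha Beta" structure class listed in the record.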

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]