Definition Clostridium botulinum A str. ATCC 3502, complete genome.
Accession NC_009495
Length 3,886,916

Click here to switch to the map view.

The map label for this gene is 148379473

Identifier: 148379473

GI number: 148379473

Start: 1632478

End: 1633047

Strand: Reverse

Name: 148379473

Synonym: CBO1502

Alternate gene names: NA

Gene position: 1633047-1632478 (Counterclockwise)

Preceding gene: 148379476

Following gene: 148379467

Centisome position: 42.01

GC content: 33.86

Gene sequence:

>570_bases
ATGAAAAAATTAAAAATAGAAGATCCTAAAAGTCATAAATTAAATGTTATATGTGTTAAATCTACATTAGATAAATCTGT
TGTTTCTAATGAACGTCATCATAGCCATAGTGGAGTATATGAGGGAGAGAGAAAAGCTGGAAAAATGCATGGTTTTGGCA
CATATACATACACTAATGGAACTAAATATGTAGGTTGTTGGAAAGAAAATATGATGCATGGTGAAGGTGTTTTACTTTGG
GCTTCTGGGGAAAAATATACTGGCAGTTGGCAAAATGATGAAAAACATGGATATGGCATATATACTTGGCCCGATGGCGA
AAGTTATGTTGGATATTGGGAACATGATTTAAAATCCGGTCAGGGCATTTATACTTGGTCTGATGGTGATGTTTATACTG
GTGATTGGATTTCTGATATGCGTCATGGACATGGCGTTTATGTTTGTAACCATGGAGATAAATATATTGGTCAATGGGTA
AATGATTTAAGACATGGAAAAGGTATGTATATTGAAGCTAATGGAGAAGTTTTTATGGGTGAATACAAAGAAGATGAGAG
GATTGAATAA

Upstream 100 bases:

>100_bases
AATTATTAAATGAAATAAAAAATCATTTGTGTACAATGAATTATTTTCCAATAAAATTAGATAAATATTTACTAATACTT
AGAAAGGTAAGGTATATAAT

Downstream 100 bases:

>100_bases
AAACTAAAATAAGATACCTCCAAACTATAACAGTTTAGAGGTATCTTTGTATTTAACAATAGGAATTGTGATTATCTTAA
AATATTTAGCCTAGAATAAA

Product: MORN repeat protein

Products: NA

Alternate protein names: Morn Repeat Protein; Phosphatidylinositol-4-Phosphate 5-Kinase; MORN Repeat Protein; MORN Domain-Containing Protein; Phosphatidylinositol 4-Phosphate 5-Kinase; MORN Repeat Family Protein; Peptidase; Morn Repeat-Containing Protein; Cytoplasmic Protein; MORN Motif-Containing Protein; PEGA Domain-Containing Protein; Signal Peptide; Morn Motif-Containing Protein

Number of amino acids: Translated: 189; Mature: 189

Protein sequence:

>189_residues
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW
ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYVCNHGDKYIGQWV
NDLRHGKGMYIEANGEVFMGEYKEDERIE

Sequences:

>Translated_189_residues
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW
ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYVCNHGDKYIGQWV
NDLRHGKGMYIEANGEVFMGEYKEDERIE
>Mature_189_residues
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW
ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYVCNHGDKYIGQWV
NDLRHGKGMYIEANGEVFMGEYKEDERIE

Specific function: Unknown

COG id: COG4642

COG function: function code S; Uncharacterized protein conserved in bacteria

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI18254456, Length=139, Percent_Identity=39.568345323741, Blast_Score=105, Evalue=2e-23,
Organism=Homo sapiens, GI40316935, Length=134, Percent_Identity=35.8208955223881, Blast_Score=95, Evalue=3e-20,
Organism=Homo sapiens, GI13376267, Length=147, Percent_Identity=37.4149659863946, Blast_Score=94, Evalue=5e-20,
Organism=Homo sapiens, GI157502187, Length=151, Percent_Identity=34.4370860927152, Blast_Score=87, Evalue=1e-17,
Organism=Homo sapiens, GI153792461, Length=151, Percent_Identity=34.4370860927152, Blast_Score=87, Evalue=1e-17,
Organism=Homo sapiens, GI299523224, Length=161, Percent_Identity=31.6770186335404, Blast_Score=80, Evalue=1e-15,
Organism=Homo sapiens, GI33359215, Length=161, Percent_Identity=31.6770186335404, Blast_Score=80, Evalue=1e-15,
Organism=Homo sapiens, GI21704281, Length=140, Percent_Identity=32.1428571428571, Blast_Score=75, Evalue=3e-14,
Organism=Homo sapiens, GI270265786, Length=185, Percent_Identity=29.1891891891892, Blast_Score=69, Evalue=2e-12,
Organism=Homo sapiens, GI225690502, Length=139, Percent_Identity=28.7769784172662, Blast_Score=68, Evalue=5e-12,
Organism=Homo sapiens, GI225690500, Length=139, Percent_Identity=28.7769784172662, Blast_Score=68, Evalue=5e-12,
Organism=Homo sapiens, GI21735575, Length=144, Percent_Identity=32.6388888888889, Blast_Score=67, Evalue=1e-11,
Organism=Homo sapiens, GI21704283, Length=148, Percent_Identity=30.4054054054054, Blast_Score=65, Evalue=3e-11,
Organism=Homo sapiens, GI149999376, Length=105, Percent_Identity=31.4285714285714, Blast_Score=65, Evalue=4e-11,
Organism=Homo sapiens, GI30520314, Length=105, Percent_Identity=31.4285714285714, Blast_Score=65, Evalue=4e-11,
Organism=Caenorhabditis elegans, GI25149766, Length=137, Percent_Identity=32.8467153284672, Blast_Score=69, Evalue=2e-12,
Organism=Drosophila melanogaster, GI24654677, Length=142, Percent_Identity=33.8028169014084, Blast_Score=83, Evalue=8e-17,
Organism=Drosophila melanogaster, GI19921216, Length=138, Percent_Identity=35.5072463768116, Blast_Score=83, Evalue=9e-17,
Organism=Drosophila melanogaster, GI24762835, Length=148, Percent_Identity=35.1351351351351, Blast_Score=70, Evalue=6e-13,
Organism=Drosophila melanogaster, GI24583069, Length=148, Percent_Identity=29.7297297297297, Blast_Score=68, Evalue=4e-12,
Organism=Drosophila melanogaster, GI24583071, Length=148, Percent_Identity=29.7297297297297, Blast_Score=68, Evalue=4e-12,
Organism=Drosophila melanogaster, GI21358361, Length=151, Percent_Identity=27.8145695364238, Blast_Score=66, Evalue=2e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 21716; Mature: 21716

Theoretical pI: Translated: 6.51; Mature: 6.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6 %Cys     (Translated Protein)
3.7 %Met     (Translated Protein)
5.3 %Cys+Met (Translated Protein)
1.6 %Cys     (Mature Protein)
3.7 %Met     (Mature Protein)
5.3 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNG
CCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECC
TKYVGCWKENMMHGEGVLLWASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSG
CEEEEEEHHCCCCCCEEEEEECCCEEECCCCCCCCCCCEEEECCCCCCEEEEEEHHCCCC
QGIYTWSDGDVYTGDWISDMRHGHGVYVCNHGDKYIGQWVNDLRHGKGMYIEANGEVFMG
CEEEEECCCCEEECHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCEEEC
EYKEDERIE
CCCCCCCCC
>Mature Secondary Structure
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNG
CCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECC
TKYVGCWKENMMHGEGVLLWASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSG
CEEEEEEHHCCCCCCEEEEEECCCEEECCCCCCCCCCCEEEECCCCCCEEEEEEHHCCCC
QGIYTWSDGDVYTGDWISDMRHGHGVYVCNHGDKYIGQWVNDLRHGKGMYIEANGEVFMG
CEEEEECCCCEEECHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCEEEC
EYKEDERIE
CCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA