Definition Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession NC_012563
Length 4,155,278

The map label for this gene is 226948836

Identifier: 226948836

GI number: 226948836

Start: 1772188

End: 1772757

Strand: Reverse

Name: 226948836

Synonym: CLM_1738

Alternate gene names: NA

Gene position: 1772757-1772188 (Counterclockwise)

Preceding gene: 226948839

Following gene: 226948815

Centisome position: 42.66
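Note: the gene length and centisome position can be re-derived from the coordinates above. The sketch below assumes that the centisome position is 100 x (gene position / genome length), taken at the first coordinate of the "Gene position" field (1772757 for this reverse-strand gene). The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_155_278
START, END = 1_772_188, 1_772_757


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))        # 570
print(centisome(END, GENOME_LENGTH))  # 42.66
```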

GC content: 33.51

Gene sequence:

>570_bases
ATGAAAAAATTAAAAATAGAAGATCCTAAAAGTCATAAATTAAATGTTATATGTGTTAAATCTACATTAGATAAATCTGT
TGTTTCTAATGAACGTCATCATAGCCATAGTGGAGTATACGAGGGAGAGAGAAAAGCTGGAAAAATGCATGGTTTTGGTA
CATATACATACACTAATGGAACTAAATATGTAGGTTGTTGGAAAGAAAATATGATGCATGGTGAAGGTGTTTTACTTTGG
GCTTCTGGGGAAAAATATACTGGCAGTTGGCAAAATGATGAAAAACATGGATATGGCATATATACTTGGCCCGATGGCGA
AAGTTATGTTGGATATTGGGAACATGATTTAAAATCCGGTCAGGGCATTTATACTTGGTCTGATGGTGATGTTTATACTG
GTGATTGGATTTCTGATATGCGTCATGGACATGGCGTTTATATTTGTAACCATGGAGATAAATATATTGGTCAATGGGTA
AATGATTTAAGACATGGAAAAGGTATGTATATTGAAGCTAATGGAGAAGTTTTTATGGGTGAATATAAAGAAGATGAGAG
GATTGAATAA
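
Note: the GC content field can be recomputed from a gene sequence such as the one above. This is a minimal sketch; the helper names are hypothetical, and the short input is a toy sequence rather than the full 570-base record. Because the gene lies on the reverse strand, a reverse-complement helper is included; GC content is the same on either strand.

```python
# Complement lookup for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


toy = "ATGC"
print(gc_content(toy))                      # 50.0
print(reverse_complement(toy))              # GCAT
print(gc_content(reverse_complement(toy)))  # 50.0
```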

Upstream 100 bases:

>100_bases
AATTATTAAATGAAATAAAAAATCATTTGTGTACAATGAATTATTTTCCAATAAAATTAGATAAATATTTACTAATACTT
AGAAAGGTAAGGTATATAAT

Downstream 100 bases:

>100_bases
AAACTAAAATAAGATACCTCCAAACTATAACAGTTTAGAGGTATCTTTGTATTTAACAATAGGAATTGTGATTATCTTAA
AATATTTAGCCTAGAATAAA

Product: MORN repeat-containing protein

Products: NA

Alternate protein names: MORN Repeat Protein; Phosphatidylinositol-4-Phosphate 5-Kinase; MORN Domain-Containing Protein; MORN Repeat Family Protein; Peptidase; MORN Repeat-Containing Protein; Cytoplasmic Protein; MORN Motif-Containing Protein; PEGA Domain-Containing Protein; Signal Peptide

Number of amino acids: Translated: 189; Mature: 189

Protein sequence:

>189_residues
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW
ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYICNHGDKYIGQWV
NDLRHGKGMYIEANGEVFMGEYKEDERIE
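
Note: the protein sequence follows from the gene sequence by codon translation. The gene sequence in this record is already given in coding orientation (it begins with ATG and ends with the stop codon TAA), so 570 bases give 190 codons, of which 189 encode residues. The sketch below uses the standard codon assignments and only the first ten codons of the gene; the function and variable names are illustrative.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
print(translate("ATGAAAAAATTAAAAATAGAAGATCCTAAA"))  # MKKLKIEDPK
```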

Sequences:

>Translated_189_residues
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW
ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYICNHGDKYIGQWV
NDLRHGKGMYIEANGEVFMGEYKEDERIE
>Mature_189_residues
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW
ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYICNHGDKYIGQWV
NDLRHGKGMYIEANGEVFMGEYKEDERIE

Specific function: Unknown

COG id: COG4642

COG function: function code S; Uncharacterized protein conserved in bacteria

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI18254456, Length=139, Percent_Identity=39.568345323741, Blast_Score=105, Evalue=2e-23,
Organism=Homo sapiens, GI13376267, Length=147, Percent_Identity=37.4149659863946, Blast_Score=95, Evalue=4e-20,
Organism=Homo sapiens, GI40316935, Length=134, Percent_Identity=35.8208955223881, Blast_Score=95, Evalue=4e-20,
Organism=Homo sapiens, GI157502187, Length=151, Percent_Identity=34.4370860927152, Blast_Score=87, Evalue=1e-17,
Organism=Homo sapiens, GI153792461, Length=151, Percent_Identity=34.4370860927152, Blast_Score=87, Evalue=1e-17,
Organism=Homo sapiens, GI299523224, Length=161, Percent_Identity=31.6770186335404, Blast_Score=80, Evalue=1e-15,
Organism=Homo sapiens, GI33359215, Length=161, Percent_Identity=31.6770186335404, Blast_Score=80, Evalue=1e-15,
Organism=Homo sapiens, GI21704281, Length=140, Percent_Identity=32.1428571428571, Blast_Score=75, Evalue=3e-14,
Organism=Homo sapiens, GI270265786, Length=185, Percent_Identity=29.1891891891892, Blast_Score=69, Evalue=3e-12,
Organism=Homo sapiens, GI225690502, Length=139, Percent_Identity=28.7769784172662, Blast_Score=68, Evalue=6e-12,
Organism=Homo sapiens, GI225690500, Length=139, Percent_Identity=28.7769784172662, Blast_Score=68, Evalue=6e-12,
Organism=Homo sapiens, GI21735575, Length=144, Percent_Identity=32.6388888888889, Blast_Score=67, Evalue=1e-11,
Organism=Homo sapiens, GI149999376, Length=105, Percent_Identity=32.3809523809524, Blast_Score=65, Evalue=2e-11,
Organism=Homo sapiens, GI30520314, Length=105, Percent_Identity=32.3809523809524, Blast_Score=65, Evalue=2e-11,
Organism=Homo sapiens, GI21704283, Length=148, Percent_Identity=30.4054054054054, Blast_Score=65, Evalue=3e-11,
Organism=Caenorhabditis elegans, GI25149766, Length=137, Percent_Identity=33.5766423357664, Blast_Score=69, Evalue=1e-12,
Organism=Drosophila melanogaster, GI24654677, Length=142, Percent_Identity=33.8028169014084, Blast_Score=84, Evalue=7e-17,
Organism=Drosophila melanogaster, GI19921216, Length=138, Percent_Identity=35.5072463768116, Blast_Score=83, Evalue=9e-17,
Organism=Drosophila melanogaster, GI24762835, Length=148, Percent_Identity=35.1351351351351, Blast_Score=71, Evalue=5e-13,
Organism=Drosophila melanogaster, GI24583069, Length=148, Percent_Identity=29.7297297297297, Blast_Score=68, Evalue=5e-12,
Organism=Drosophila melanogaster, GI24583071, Length=148, Percent_Identity=29.7297297297297, Blast_Score=68, Evalue=5e-12,
Organism=Drosophila melanogaster, GI21358361, Length=151, Percent_Identity=27.8145695364238, Blast_Score=65, Evalue=2e-11,
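
Note: each homologue line above is a comma-separated list of key=value fields plus a bare GI identifier, with a trailing comma. A minimal parsing sketch follows; the function name and the choice of output types are assumptions, not part of the record.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    hit = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            # The GI number appears without a key, e.g. "GI18254456".
            hit["GI"] = field[2:]
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Homo sapiens, GI18254456, Length=139, "
        "Percent_Identity=39.568345323741, Blast_Score=105, Evalue=2e-23,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

Sorting a list of such dictionaries by "Evalue" or "Blast_Score" then ranks the hits across organisms.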

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 21730; Mature: 21730

Theoretical pI: Translated: 6.51; Mature: 6.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6 %Cys     (Translated Protein)
3.7 %Met     (Translated Protein)
5.3 %Cys+Met (Translated Protein)
1.6 %Cys     (Mature Protein)
3.7 %Met     (Mature Protein)
5.3 %Cys+Met (Mature Protein)
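
Note: the Cys/Met percentages above can be recomputed by counting residues in the protein sequence. The sketch below copies the 189-residue sequence from this record; the helper name is hypothetical.

```python
# Protein sequence copied from the record (189 residues).
PROTEIN = (
    "MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNGTKYVGCWKENMMHGEGVLLW"
    "ASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSGQGIYTWSDGDVYTGDWISDMRHGHGVYICNHGDKYIGQWV"
    "NDLRHGKGMYIEANGEVFMGEYKEDERIE"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues)
    return round(100 * hits / len(seq), 1)


print(residue_percent(PROTEIN, "C"))   # 1.6
print(residue_percent(PROTEIN, "M"))   # 3.7
print(residue_percent(PROTEIN, "CM"))  # 5.3
```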

Secondary structure:

>Translated Secondary Structure
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNG
CCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECC
TKYVGCWKENMMHGEGVLLWASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSG
CEEEEEEHHCCCCCCEEEEEECCCEEECCCCCCCCCCCEEEECCCCCCEEEEEEHHCCCC
QGIYTWSDGDVYTGDWISDMRHGHGVYICNHGDKYIGQWVNDLRHGKGMYIEANGEVFMG
CEEEEECCCCEEECHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCEEEC
EYKEDERIE
CCCCCCCCC
>Mature Secondary Structure
MKKLKIEDPKSHKLNVICVKSTLDKSVVSNERHHSHSGVYEGERKAGKMHGFGTYTYTNG
CCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECC
TKYVGCWKENMMHGEGVLLWASGEKYTGSWQNDEKHGYGIYTWPDGESYVGYWEHDLKSG
CEEEEEEHHCCCCCCEEEEEECCCEEECCCCCCCCCCCEEEECCCCCCEEEEEEHHCCCC
QGIYTWSDGDVYTGDWISDMRHGHGVYICNHGDKYIGQWVNDLRHGKGMYIEANGEVFMG
CEEEEECCCCEEECHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCEEEC
EYKEDERIE
CCCCCCCCC
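
Note: the secondary structure blocks above interleave residue lines with state lines, where H, E and C are assumed here to mean helix, strand and coil. The sketch below joins such a block back into one residue string and one state string and reports the share of each state; the function names are illustrative and the input is a toy block, not the full prediction.

```python
from collections import Counter


def split_interleaved(lines: list[str]) -> tuple[str, str]:
    """Join alternating residue/state lines into two aligned strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state lines are not aligned")
    return residues, states


def state_percent(states: str) -> dict[str, float]:
    """Percentage of positions assigned to each state."""
    counts = Counter(states)
    return {s: 100 * n / len(states) for s, n in counts.items()}


toy_block = ["MKK", "CCE", "LK", "HH"]
residues, states = split_interleaved(toy_block)
print(residues, states)       # MKKLK CCEHH
print(state_percent(states))  # {'C': 40.0, 'E': 20.0, 'H': 40.0}
```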

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA