Definition: Clostridium botulinum B1 str. Okra, complete genome.
Accession: NC_010516
Length: 3,958,233

The map label for this gene is yjeF [H]

Identifier: 170755570

GI number: 170755570

Start: 3710333

End: 3711835

Strand: Reverse

Name: yjeF [H]

Synonym: CLD_1108

Alternate gene names: 170755570

Gene position: 3711835-3710333 (Counterclockwise)

Preceding gene: 170756152

Following gene: 170755044

Centisome position: 93.78

GC content: 28.61
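
The Start, End, and Centisome position fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the centisome position is the coordinate of the gene's first base expressed as a percentage of the genome length, which for a reverse-strand gene is the higher coordinate.

```python
# Values copied from the Length, Start, and End fields of this record.
GENOME_LENGTH = 3_958_233
START, END = 3_710_333, 3_711_835


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene spanning start..end inclusive (1-based)."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# Reverse strand: the gene's first base is taken to be the higher coordinate.
position_percent = round(centisome(END, GENOME_LENGTH), 2)
print(length, position_percent)
```

Under these assumptions the result agrees with the 1503-base gene sequence and the centisome position of 93.78 listed in this record.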

Gene sequence:

>1503_bases
ATGAGAATAACTTCTTCGGAAAATTTTAGAAAGATGGACAATTACTGCATAGAAAATATAGGAATACCAAGTATTGTGCT
AATGGAAAATGCAGCATTAAAGATTGTATCTAATATAGATTTACAACTAAATAATAGATTTGTTATAGTTTGTGGAAAAG
GTAACAATGGGGGAGATGGATTAGCTGTAGCAAGACATTTACATTGTTTAAATAAAGAAGTAGAAGTATTTATAATTGAG
AAGAGTAAAGATGGAACTAAGGATTTTAAAATAAATTACAATATATTAAAAAACATGAATTTAAATATTAAAACTATAAG
AGATTATGAAGATTTAGATTATTTAAGAGAAAGTATAATGAAAAGTGATATGGTTCTAGATGCTATTTTTGGCATAGGCC
TTAGTAGAAAAATAGAGGGAATATATAAGGATACTATATCTGTAATAAATGAAAATAGCAAAAGTACATTGGCTATAGAT
GTGCCTTCTGGATTAAATGCTAATACGGGTGAAATAGAAGGGGTTTGTATAGAGGCTAATACCACAGTTTCTTTTGAAAT
GTATAAAGAGGGGTTTTTAACTTATTATGGGGATAAATATTTAGGAAATATCATAATTGAAAGTATAGGAATTCCTAGGG
AAGTTTTGGATTTGTTTTCTAATGATCTTTATATTATTGATAAGTATATGTTTAAAAATAATCTTAAAGGAAGAAATAAA
TACGCCCATAAAGGTGATTTTGGTAAGGCATTAATTATAGCTGGAAGTAAAGGATTTTCAGGAGCGGCTTACCTGTGCAC
AGAAGCAGTGGTTAAAAGTGGAACAGGACTTGTAACTTTAGCTACATCTAATGATATTCAAAATATATTAAGCTCTAAAT
TAGAGGAAGCTATGACTATAAGTTATGAGGATTCTAAAGATGTTAAAAATATTATGGTAAAAAGTAGTTGTATAGCAATA
GGTCCAGGCATGGGTAAAAACAATAATACAGAGGAACTGTTAAGAAAAATAATAAGGGATTATAATAGAACCATGGTTAT
AGATGCTGATGGTATAAATGTTTTAGAAAATAATTTGGATATAATAAAAAAAGCAAGAGGAGAAATAGTTTTAACTCCAC
ATTTAGGGGAATTCTCAAGAATAACAGGTTATGACATAGATTATATAAAAGAAAATAGATTAAAATTAGCTAAGGAATTT
GCTAAAGAAAATAAAATTATATTACTCTTAAAAGGATATAATACCATAATTACAAATGGTGAAGAAGTATTTGTAAATTC
TACAGGAAATAGTGCTATGGCATCTGGAGGTATGGGAGATTGTCTAACAGGAATAATAACATCTTTTATAGCTCAGGGAT
ATGATCCTTTAGAGGCTACTTGTCTTGCGGCATATTTACATGGATATTGTGGTGAGAAATTATCTTCAAAAATGTTTTGT
GTAAATGCTACTCATGTGCTAGATTATATACCTTTTGCTATAAAGGAATTACAATACACATAG
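
The GC content field can in principle be recomputed from the gene sequence above. A minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, and the demo input is only the first wrapped line of the sequence, so its value will differ from the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First wrapped line of the gene sequence, used as a small demo input.
first_line = (
    "ATGAGAATAACTTCTTCGGAAAATTTTAGAAAGATGGACAATTACTGCATAGAAAATATAGGAATACCAAGTATTGTGCT"
)
print(round(gc_content(first_line), 2))
```

Passing the full 1503-base sequence, with or without its line breaks, gives the whole-gene percentage.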

Upstream 100 bases:

>100_bases
AAGATATAATAAGAAAAATATCCCAATCCTATAAAATTCATTTGAGCATATCCCATGAAAAAGAGTATGCTATAGCTTAT
GCTTTATTGGAGGTATTTAT

Downstream 100 bases:

>100_bases
CTAAATTAACATTCTATTCAGTTTAGAAATTTTATAATAGTATAGCTGAAGCATATAATTATGTAAAAGAATATAGCTAA
TTTGATAAGGCTTCTCTTAA

Product: carbohydrate kinase family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 500; Mature: 500

Protein sequence:

>500_residues
MRITSSENFRKMDNYCIENIGIPSIVLMENAALKIVSNIDLQLNNRFVIVCGKGNNGGDGLAVARHLHCLNKEVEVFIIE
KSKDGTKDFKINYNILKNMNLNIKTIRDYEDLDYLRESIMKSDMVLDAIFGIGLSRKIEGIYKDTISVINENSKSTLAID
VPSGLNANTGEIEGVCIEANTTVSFEMYKEGFLTYYGDKYLGNIIIESIGIPREVLDLFSNDLYIIDKYMFKNNLKGRNK
YAHKGDFGKALIIAGSKGFSGAAYLCTEAVVKSGTGLVTLATSNDIQNILSSKLEEAMTISYEDSKDVKNIMVKSSCIAI
GPGMGKNNNTEELLRKIIRDYNRTMVIDADGINVLENNLDIIKKARGEIVLTPHLGEFSRITGYDIDYIKENRLKLAKEF
AKENKIILLLKGYNTIITNGEEVFVNSTGNSAMASGGMGDCLTGIITSFIAQGYDPLEATCLAAYLHGYCGEKLSSKMFC
VNATHVLDYIPFAIKELQYT
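
The 1503-base gene sequence corresponds to 501 codons: 500 residues plus one stop codon. The sketch below shows how the protein sequence relates to the gene sequence. It uses the standard codon assignments (bacterial translation table 11 uses the same assignments and differs only in permitted start codons); `translate` and the table-building code are illustrative, not the method used to produce this record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAGAATAACTTCTTCGGAAAATTTTAGA"))  # first 30 bases: MRITSSENFR
print(translate("GAATTACAATACACATAG"))              # last 18 bases: ELQYT
```

The two fragments reproduce the first ten and last five residues of the protein sequence above; the final codon, TAG, is the stop.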

Sequences:

>Translated_500_residues
MRITSSENFRKMDNYCIENIGIPSIVLMENAALKIVSNIDLQLNNRFVIVCGKGNNGGDGLAVARHLHCLNKEVEVFIIE
KSKDGTKDFKINYNILKNMNLNIKTIRDYEDLDYLRESIMKSDMVLDAIFGIGLSRKIEGIYKDTISVINENSKSTLAID
VPSGLNANTGEIEGVCIEANTTVSFEMYKEGFLTYYGDKYLGNIIIESIGIPREVLDLFSNDLYIIDKYMFKNNLKGRNK
YAHKGDFGKALIIAGSKGFSGAAYLCTEAVVKSGTGLVTLATSNDIQNILSSKLEEAMTISYEDSKDVKNIMVKSSCIAI
GPGMGKNNNTEELLRKIIRDYNRTMVIDADGINVLENNLDIIKKARGEIVLTPHLGEFSRITGYDIDYIKENRLKLAKEF
AKENKIILLLKGYNTIITNGEEVFVNSTGNSAMASGGMGDCLTGIITSFIAQGYDPLEATCLAAYLHGYCGEKLSSKMFC
VNATHVLDYIPFAIKELQYT
>Mature_500_residues
MRITSSENFRKMDNYCIENIGIPSIVLMENAALKIVSNIDLQLNNRFVIVCGKGNNGGDGLAVARHLHCLNKEVEVFIIE
KSKDGTKDFKINYNILKNMNLNIKTIRDYEDLDYLRESIMKSDMVLDAIFGIGLSRKIEGIYKDTISVINENSKSTLAID
VPSGLNANTGEIEGVCIEANTTVSFEMYKEGFLTYYGDKYLGNIIIESIGIPREVLDLFSNDLYIIDKYMFKNNLKGRNK
YAHKGDFGKALIIAGSKGFSGAAYLCTEAVVKSGTGLVTLATSNDIQNILSSKLEEAMTISYEDSKDVKNIMVKSSCIAI
GPGMGKNNNTEELLRKIIRDYNRTMVIDADGINVLENNLDIIKKARGEIVLTPHLGEFSRITGYDIDYIKENRLKLAKEF
AKENKIILLLKGYNTIITNGEEVFVNSTGNSAMASGGMGDCLTGIITSFIAQGYDPLEATCLAAYLHGYCGEKLSSKMFC
VNATHVLDYIPFAIKELQYT

Specific function: Unknown

COG id: COG0063

COG function: function code G; Predicted sugar kinase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 YjeF N-terminal domain [H]

Homologues:

Organism=Homo sapiens, GI119709830, Length=229, Percent_Identity=24.8908296943231, Blast_Score=75, Evalue=1e-13,
Organism=Escherichia coli, GI1790609, Length=439, Percent_Identity=30.2961275626424, Blast_Score=204, Evalue=7e-54,
Organism=Caenorhabditis elegans, GI17554656, Length=281, Percent_Identity=24.1992882562278, Blast_Score=86, Evalue=7e-17,
Organism=Saccharomyces cerevisiae, GI6322698, Length=236, Percent_Identity=24.1525423728814, Blast_Score=67, Evalue=7e-12,
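
The homologue lines above follow a regular comma-separated `key=value` layout, so they can be parsed and ranked mechanically. A minimal sketch, assuming every line has the same fields as the four shown here; `parse_homologue` is a hypothetical helper name.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # the trailing comma leaves an empty field
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare 'GI<number>' token
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        record[key] = cast(record[key])
    return record


lines = [
    "Organism=Homo sapiens, GI119709830, Length=229, Percent_Identity=24.8908296943231, Blast_Score=75, Evalue=1e-13,",
    "Organism=Escherichia coli, GI1790609, Length=439, Percent_Identity=30.2961275626424, Blast_Score=204, Evalue=7e-54,",
    "Organism=Caenorhabditis elegans, GI17554656, Length=281, Percent_Identity=24.1992882562278, Blast_Score=86, Evalue=7e-17,",
    "Organism=Saccharomyces cerevisiae, GI6322698, Length=236, Percent_Identity=24.1525423728814, Blast_Score=67, Evalue=7e-12,",
]
# Rank hits from most to least significant (smallest E-value first).
hits = sorted((parse_homologue(line) for line in lines), key=lambda h: h["Evalue"])
best = hits[0]
print(best["Organism"], best["Blast_Score"], best["Evalue"])
```

Sorted this way, the Escherichia coli entry (E-value 7e-54, score 204) ranks first among the four listed homologues.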

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017953
- InterPro:   IPR000631
- InterPro:   IPR004443 [H]

Pfam domain/function: PF01256 Carb_kinase; PF03853 YjeF_N [H]

EC number: NA

Molecular weight: Translated: 55413; Mature: 55413

Theoretical pI: Translated: 5.59; Mature: 5.59

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)
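
The Cys/Met percentages above are simple residue counts divided by protein length. A minimal sketch, assuming the values are rounded to one decimal place; `residue_percent` is a hypothetical helper, and the demo input is only the first 20 residues of the protein sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(sequence.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(sequence), 1)


# First 20 residues of the protein sequence, used as a small demo input.
demo = "MRITSSENFRKMDNYCIENI"
print(residue_percent(demo, "C"))   # 5.0  (1 Cys in 20 residues)
print(residue_percent(demo, "M"))   # 10.0 (2 Met in 20 residues)
print(residue_percent(demo, "CM"))  # 15.0
```

For the full 500-residue protein, the reported 2.0 % Cys and 3.0 % Met correspond to 10 cysteines and 15 methionines.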

Secondary structure:

>Translated Secondary Structure
MRITSSENFRKMDNYCIENIGIPSIVLMENAALKIVSNIDLQLNNRFVIVCGKGNNGGDG
CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCEEEEEECCCEEECCEEEEEEECCCCCCCH
LAVARHLHCLNKEVEVFIIEKSKDGTKDFKINYNILKNMNLNIKTIRDYEDLDYLRESIM
HHHHHHHHHCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEECCHHHHHHHHHHHH
KSDMVLDAIFGIGLSRKIEGIYKDTISVINENSKSTLAIDVPSGLNANTGEIEGVCIEAN
HHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCEEEEEEECC
TTVSFEMYKEGFLTYYGDKYLGNIIIESIGIPREVLDLFSNDLYIIDKYMFKNNLKGRNK
CEEEEEEHHCCCEEEECCHHHHHHHHHHCCCCHHHHHHHCCCEEEEEEHHHHCCCCCCCC
YAHKGDFGKALIIAGSKGFSGAAYLCTEAVVKSGTGLVTLATSNDIQNILSSKLEEAMTI
CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHEEE
SYEDSKDVKNIMVKSSCIAIGPGMGKNNNTEELLRKIIRDYNRTMVIDADGINVLENNLD
EECCCCHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHCCHH
IIKKARGEIVLTPHLGEFSRITGYDIDYIKENRLKLAKEFAKENKIILLLKGYNTIITNG
HHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCEEEECC
EEVFVNSTGNSAMASGGMGDCLTGIITSFIAQGYDPLEATCLAAYLHGYCGEKLSSKMFC
CEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCEEE
VNATHVLDYIPFAIKELQYT
EEHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MRITSSENFRKMDNYCIENIGIPSIVLMENAALKIVSNIDLQLNNRFVIVCGKGNNGGDG
CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCEEEEEECCCEEECCEEEEEEECCCCCCCH
LAVARHLHCLNKEVEVFIIEKSKDGTKDFKINYNILKNMNLNIKTIRDYEDLDYLRESIM
HHHHHHHHHCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEECCHHHHHHHHHHHH
KSDMVLDAIFGIGLSRKIEGIYKDTISVINENSKSTLAIDVPSGLNANTGEIEGVCIEAN
HHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCEEEEEEECC
TTVSFEMYKEGFLTYYGDKYLGNIIIESIGIPREVLDLFSNDLYIIDKYMFKNNLKGRNK
CEEEEEEHHCCCEEEECCHHHHHHHHHHCCCCHHHHHHHCCCEEEEEEHHHHCCCCCCCC
YAHKGDFGKALIIAGSKGFSGAAYLCTEAVVKSGTGLVTLATSNDIQNILSSKLEEAMTI
CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHEEE
SYEDSKDVKNIMVKSSCIAIGPGMGKNNNTEELLRKIIRDYNRTMVIDADGINVLENNLD
EECCCCHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHCCHH
IIKKARGEIVLTPHLGEFSRITGYDIDYIKENRLKLAKEFAKENKIILLLKGYNTIITNG
HHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCEEEECC
EEVFVNSTGNSAMASGGMGDCLTGIITSFIAQGYDPLEATCLAAYLHGYCGEKLSSKMFC
CEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCEEE
VNATHVLDYIPFAIKELQYT
EEHHHHHHHHHHHHHHHCCC
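
The secondary structure blocks above alternate residue lines with state lines. The sketch below pairs them up and summarises the state composition. It assumes the usual convention that H is helix, E is strand, and C is coil, which this record does not state; the function names are hypothetical, and the demo input is only the final pair of lines.

```python
from collections import Counter


def pair_blocks(lines: list[str]) -> tuple[str, str]:
    """Join alternating residue/state lines into one sequence and one state string."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("sequence and structure lengths differ")
    return residues, states


def state_percentages(states: str) -> dict[str, float]:
    """Percentage of positions assigned to each state letter."""
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in counts.items()}


# Final residue/state pair from the block above, as a small demo input.
demo_lines = [
    "VNATHVLDYIPFAIKELQYT",
    "EEHHHHHHHHHHHHHHHCCC",
]
residues, states = pair_blocks(demo_lines)
print(state_percentages(states))
```

Feeding all line pairs of one block (Translated or Mature) gives the composition for the whole protein.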

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7610040; 9278503; 7511774 [H]