Definition: Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession: NC_012563
Length: 4,155,278

The map label for this gene is yugS [H]

Identifier: 226950339

GI number: 226950339

Start: 3338023

End: 3339348

Strand: Direct

Name: yugS [H]

Synonym: CLM_3305

Alternate gene names: 226950339

Gene position: 3338023-3339348 (Clockwise)

Preceding gene: 226950330

Following gene: 226950349

Centisome position: 80.33
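
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; that reading reproduces the listed 80.33, but the record itself does not define the term. The function names are illustrative, not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def centisome_position(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    # Values taken from the Start, End and Length fields of this record.
    print(gene_length(3338023, 3339348))
    print(centisome_position(3338023, 4155278))
```

With the values in this record, the gene length comes out at 1326 bases, matching the header of the gene sequence.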

GC content: 24.96

Gene sequence:

>1326_bases
ATGAATAATATTGGTATGCAGTTATTTTTAATATTAATTTTAGTAATTATAAATGCATTTTTTTCCTCTGCAGAGATGGC
AATAATTTCTTTAAATAAAAATAGATTAAATACTATAATAGATGATGCTGAGGGGGAGAATTCTTTTTCTCGCAGGACAA
AAAAAGCAAAAATTTTATTAAATCTTTTGAAAGAACCTAGTAAATTTTTAGCTACCATCCAAGTTGGAATAACCTTAGCA
GGATTTTTGGCTAGTGCTTCTGCTGCTACAAGTATATCAAAATATATTGAGATATTTTTTAAGCGCTTAAATATTCCCAA
AAGTAGTAGTATAGCTTTGTTTTTAACTACTCTTTTATTATCTTATTTAACACTAGTTTTTGGAGAGTTACTACCAAAGA
GAATAGCTCTAAACAATTCAGAAAAAATAGCGTTATTCTCAATTAAACCAATAATTATTTTTATGAAAATCTCCCTCCCT
TTTGTAAATATATTAACCTACTCCACTAATTTTTTATTAAAAGTTCTAGGGATTGACTATGAGAACATAGAAGAAAAAAT
ATCAGAAGAAGAAATAAAAAAAATGATCGATTTAGGTGAAGAAACGGGTGTTTTCAATTCTACAGAAAAAGAAATGATAA
ATAGCATTTTTGACTTTGACAATACTCTAGCTAAAGAAATAATGACACCTAGAACTAGTGTATTTACAATGGATATAAAT
GATTCACCTGAAAATATTATTAATAATATGTTAGAAGAACGGTATTCTAGAGTTCCTATATATGAAGATGATATTGATAA
TATTATAGGAATTCTACATATAAAAGATATACTTTCTATTATAAATAAAGAAAATATAAAAAAAGAGGATTTAATAAATA
TATTAAGAATACCTTATTTTATACCAGAAACAAAAGCTATTGATTTTTTATTTAAAGAAATGCAAACAAGTAAAAATTAT
ATTGCCATACTTATCGATGAATATGGAGGATTTTCTGGTATAGTTACCATGGAAGATTTAATAGAAGAAGTTATGGGAAA
CATATTCGATGAGTATGATGAAGATCATACTGAAGAAATAATAAAAATAGATGCTAATACTTTCTTATTAGATGCTTCTA
TAACTATAGATGACTTAAATGAAAAACTTAATTTAGAGCTACCCTCAGAAAATTTTGATACTCTAGGTGGTTTTATATTA
GACATAACTGGAACTATCCCAAAATGTAATGTAAATTCTGAGATACAGTATAATAATTTAATATTTAAAATAGAAAAGGT
ATATAATAATAGAATAGAAAAAATAAAATTATATATAAGTGAGTAA
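
The GC content field can be recomputed from the sequence above. This is a minimal sketch, assuming the value is the percentage of G and C among all bases of the gene sequence, rounded to two decimals; `gc_content` is a hypothetical helper name. Under that assumption, 24.96 over 1326 bases would correspond to 331 G or C bases.

```python
def gc_content(fasta_text: str) -> float:
    """Percent G+C in a FASTA-style block, ignoring '>' header lines and whitespace."""
    bases = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    ).upper()
    if not bases:
        raise ValueError("no sequence data found")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


if __name__ == "__main__":
    # Small made-up example; paste the 1326-base block above to check the record.
    print(gc_content(">example\nATGC\nAATT"))
```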

Upstream 100 bases:

>100_bases
TATTGCTACTATATATGCTTCAATTTTACATTTTTTTCTTATTATTCTTATTATAAAAAATTTAATAAAAGATACACTAT
ATAGGAAGGAGTGAATTTTT

Downstream 100 bases:

>100_bases
AAAAAGTAGAACCTCTATTTAGAGGTTCTATAAAATTAAAGTACATTATGGTCTTTCTTTTAATTTTATTAAAATTAGAA
AAAGTTGAAAAATTGTGTGA

Product: transporter, HlyC/CorC family

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 441; Mature: 441
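
The residue count follows from the gene length: 1326 bases form 442 codons, and the last codon of the gene sequence (TAA) is a stop codon, which leaves 441 residues. A short sketch of that check, assuming the standard stop codons TAA, TAG and TGA; the function name is illustrative only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds: str) -> int:
    """Number of residues encoded by a coding sequence, excluding a trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if not cds or len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a non-zero multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[-1] in STOP_CODONS:
        codons = codons[:-1]  # the stop codon does not encode a residue
    return len(codons)


if __name__ == "__main__":
    # Synthetic 1326-base sequence with the same layout: start, 440 codons, stop.
    print(expected_protein_length("ATG" + "AAA" * 440 + "TAA"))
```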

Protein sequence:

>441_residues
MNNIGMQLFLILILVIINAFFSSAEMAIISLNKNRLNTIIDDAEGENSFSRRTKKAKILLNLLKEPSKFLATIQVGITLA
GFLASASAATSISKYIEIFFKRLNIPKSSSIALFLTTLLLSYLTLVFGELLPKRIALNNSEKIALFSIKPIIIFMKISLP
FVNILTYSTNFLLKVLGIDYENIEEKISEEEIKKMIDLGEETGVFNSTEKEMINSIFDFDNTLAKEIMTPRTSVFTMDIN
DSPENIINNMLEERYSRVPIYEDDIDNIIGILHIKDILSIINKENIKKEDLINILRIPYFIPETKAIDFLFKEMQTSKNY
IAILIDEYGGFSGIVTMEDLIEEVMGNIFDEYDEDHTEEIIKIDANTFLLDASITIDDLNEKLNLELPSENFDTLGGFIL
DITGTIPKCNVNSEIQYNNLIFKIEKVYNNRIEKIKLYISE

Sequences:

>Translated_441_residues
MNNIGMQLFLILILVIINAFFSSAEMAIISLNKNRLNTIIDDAEGENSFSRRTKKAKILLNLLKEPSKFLATIQVGITLA
GFLASASAATSISKYIEIFFKRLNIPKSSSIALFLTTLLLSYLTLVFGELLPKRIALNNSEKIALFSIKPIIIFMKISLP
FVNILTYSTNFLLKVLGIDYENIEEKISEEEIKKMIDLGEETGVFNSTEKEMINSIFDFDNTLAKEIMTPRTSVFTMDIN
DSPENIINNMLEERYSRVPIYEDDIDNIIGILHIKDILSIINKENIKKEDLINILRIPYFIPETKAIDFLFKEMQTSKNY
IAILIDEYGGFSGIVTMEDLIEEVMGNIFDEYDEDHTEEIIKIDANTFLLDASITIDDLNEKLNLELPSENFDTLGGFIL
DITGTIPKCNVNSEIQYNNLIFKIEKVYNNRIEKIKLYISE
>Mature_441_residues
MNNIGMQLFLILILVIINAFFSSAEMAIISLNKNRLNTIIDDAEGENSFSRRTKKAKILLNLLKEPSKFLATIQVGITLA
GFLASASAATSISKYIEIFFKRLNIPKSSSIALFLTTLLLSYLTLVFGELLPKRIALNNSEKIALFSIKPIIIFMKISLP
FVNILTYSTNFLLKVLGIDYENIEEKISEEEIKKMIDLGEETGVFNSTEKEMINSIFDFDNTLAKEIMTPRTSVFTMDIN
DSPENIINNMLEERYSRVPIYEDDIDNIIGILHIKDILSIINKENIKKEDLINILRIPYFIPETKAIDFLFKEMQTSKNY
IAILIDEYGGFSGIVTMEDLIEEVMGNIFDEYDEDHTEEIIKIDANTFLLDASITIDDLNEKLNLELPSENFDTLGGFIL
DITGTIPKCNVNSEIQYNNLIFKIEKVYNNRIEKIKLYISE

Specific function: Unknown

COG id: COG1253

COG function: function code R; Hemolysins and related proteins containing CBS domains

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 CBS domains [H]

Homologues:

Organism=Homo sapiens, GI310128564, Length=388, Percent_Identity=31.1855670103093, Blast_Score=205, Evalue=7e-53
Organism=Homo sapiens, GI94681046, Length=240, Percent_Identity=22.9166666666667, Blast_Score=67, Evalue=3e-11
Organism=Homo sapiens, GI40068055, Length=364, Percent_Identity=21.978021978022, Blast_Score=66, Evalue=6e-11
Organism=Homo sapiens, GI40068053, Length=364, Percent_Identity=21.978021978022, Blast_Score=66, Evalue=7e-11
Organism=Escherichia coli, GI145693175, Length=426, Percent_Identity=26.5258215962441, Blast_Score=169, Evalue=5e-43
Organism=Escherichia coli, GI1790664, Length=432, Percent_Identity=24.537037037037, Blast_Score=160, Evalue=2e-40
Organism=Escherichia coli, GI1786879, Length=232, Percent_Identity=29.7413793103448, Blast_Score=118, Evalue=8e-28
Organism=Escherichia coli, GI87082033, Length=276, Percent_Identity=25, Blast_Score=108, Evalue=5e-25
Organism=Escherichia coli, GI1788119, Length=237, Percent_Identity=25.3164556962025, Blast_Score=77, Evalue=2e-15
Organism=Saccharomyces cerevisiae, GI6324512, Length=247, Percent_Identity=25.9109311740891, Blast_Score=71, Evalue=3e-13

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR016169
- InterPro:   IPR000644
- InterPro:   IPR002550
- InterPro:   IPR005170 [H]

Pfam domain/function: PF00571 CBS; PF03471 CorC_HlyC; PF01595 DUF21 [H]

EC number: NA

Molecular weight: Translated: 50300; Mature: 50300

Theoretical pI: Translated: 4.36; Mature: 4.36

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
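
The percentages above are consistent with 1 Cys and 12 Met among the 441 residues of the protein sequence (1/441 is about 0.2 %, 12/441 about 2.7 %, 13/441 about 2.9 %). A minimal sketch of the calculation, assuming one-letter amino acid codes and rounding to one decimal; `residue_percent` is a hypothetical helper.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` whose one-letter code is in `residues`."""
    sequence = "".join(sequence.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in sequence if aa in residues.upper())
    return round(100.0 * hits / len(sequence), 1)


if __name__ == "__main__":
    # Synthetic 441-residue stand-in with the same Cys and Met counts as this protein.
    stand_in = "C" + "M" * 12 + "A" * 428
    print(residue_percent(stand_in, "C"))
    print(residue_percent(stand_in, "M"))
    print(residue_percent(stand_in, "CM"))
```

Passing the 441-residue sequence from the Protein sequence field instead of the stand-in string gives the check against the real data.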

Secondary structure:

>Translated Secondary Structure
MNNIGMQLFLILILVIINAFFSSAEMAIISLNKNRLNTIIDDAEGENSFSRRTKKAKILL
CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHCCCCCCCHHHHHHHHHHHHH
NLLKEPSKFLATIQVGITLAGFLASASAATSISKYIEIFFKRLNIPKSSSIALFLTTLLL
HHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH
SYLTLVFGELLPKRIALNNSEKIALFSIKPIIIFMKISLPFVNILTYSTNFLLKVLGIDY
HHHHHHHHHHHHHHHCCCCCCEEEEEEECEEEEEEEECCCHHHHHHHHHHHHHHHHCCCH
ENIEEKISEEEIKKMIDLGEETGVFNSTEKEMINSIFDFDNTLAKEIMTPRTSVFTMDIN
HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHCCCCCEEEEEECC
DSPENIINNMLEERYSRVPIYEDDIDNIIGILHIKDILSIINKENIKKEDLINILRIPYF
CCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCC
IPETKAIDFLFKEMQTSKNYIAILIDEYGGFSGIVTMEDLIEEVMGNIFDEYDEDHTEEI
CCCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCEEEHHHHHHHHHHHHHHHCCCCCHHHE
IKIDANTFLLDASITIDDLNEKLNLELPSENFDTLGGFILDITGTIPKCNVNSEIQYNNL
EEECCCEEEEEEEEEHCCCCCEEEEECCCCCCHHHCCEEEEEECCCCCCCCCCCEEECHH
IFKIEKVYNNRIEKIKLYISE
HEEHHHHHCCCEEEEEEEECC
>Mature Secondary Structure
MNNIGMQLFLILILVIINAFFSSAEMAIISLNKNRLNTIIDDAEGENSFSRRTKKAKILL
CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHCCCCCCCHHHHHHHHHHHHH
NLLKEPSKFLATIQVGITLAGFLASASAATSISKYIEIFFKRLNIPKSSSIALFLTTLLL
HHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH
SYLTLVFGELLPKRIALNNSEKIALFSIKPIIIFMKISLPFVNILTYSTNFLLKVLGIDY
HHHHHHHHHHHHHHHCCCCCCEEEEEEECEEEEEEEECCCHHHHHHHHHHHHHHHHCCCH
ENIEEKISEEEIKKMIDLGEETGVFNSTEKEMINSIFDFDNTLAKEIMTPRTSVFTMDIN
HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHCCCCCEEEEEECC
DSPENIINNMLEERYSRVPIYEDDIDNIIGILHIKDILSIINKENIKKEDLINILRIPYF
CCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCC
IPETKAIDFLFKEMQTSKNYIAILIDEYGGFSGIVTMEDLIEEVMGNIFDEYDEDHTEEI
CCCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCEEEHHHHHHHHHHHHHHHCCCCCHHHE
IKIDANTFLLDASITIDDLNEKLNLELPSENFDTLGGFILDITGTIPKCNVNSEIQYNNL
EEECCCEEEEEEEEEHCCCCCEEEEECCCCCCHHHCCEEEEEECCCCCCCCCCCEEECHH
IFKIEKVYNNRIEKIKLYISE
HEEHHHHHCCCEEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9274030; 9384377 [H]