Definition: Clostridium botulinum A str. ATCC 3502, complete genome.
Accession: NC_009495
Length: 3,886,916

The map label for this gene is ydeE [H]

Identifier: 148378714

GI number: 148378714

Start: 822679

End: 823533

Strand: Reverse

Name: ydeE [H]

Synonym: CBO0714

Alternate gene names: 148378714

Gene position: 823533-822679 (Counterclockwise)
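
The coordinates above agree with the 855-base gene sequence listed in this record. A minimal Python sketch of that check follows; it assumes 1-based, inclusive coordinates (the record does not state its convention), and the helper names are illustrative, not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    lo, hi = sorted((start, end))
    return hi - lo + 1


def five_prime_end(start: int, end: int, strand: str) -> int:
    """Coordinate where transcription begins: the higher coordinate on the
    reverse strand, the lower coordinate on the forward strand."""
    lo, hi = sorted((start, end))
    return hi if strand.lower() == "reverse" else lo


# Values taken from the Start, End and Strand fields of this record.
print(gene_length(822679, 823533))                # 855
print(five_prime_end(822679, 823533, "Reverse"))  # 823533
```

The second value matches the first coordinate in the "Gene position" field, which lists the reverse-strand gene from its 5' end to its 3' end.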

Preceding gene: 148378715

Following gene: 148378713

Centisome position: 21.19
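
The value 21.19 is reproduced when the centisome position is taken as the gene's 5' coordinate (823533, because the gene lies on the reverse strand) expressed as a percentage of the genome length. This is inferred from the numbers in this record, not from a documented definition, so the sketch below should be read as an assumption.

```python
GENOME_LENGTH = 3_886_916  # from the Length field of this record


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


# Assumed convention: use the 5' end of the gene (823533 on the reverse strand).
print(round(centisome(823533), 2))  # 21.19
```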

GC content: 35.2
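
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below shows that calculation on a short toy string; it assumes the field above was computed this way over the gene sequence, which the record does not state, and the function name is illustrative.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input for illustration only, not taken from this record.
print(gc_content("ATGC"))  # 50.0
```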

Gene sequence:

>855_bases
ATGGAGTGGATAGAACGATTAAATAGTGCTGTTAATTATATCGAAAAGAATATAAAAGAAACTATCGATTTGGAAGAAGT
ATCAAAGATTGCATGTTGCTCAACTTATCATTTTCAAAGGATGTTTGCCTATATAGCAGATATACCCTTATCAGAGTACA
TCCGCCGTAGGAGAATGTCATTAGCAGCTGTTGATTTACAGAGTAGCAACGAAAAAGTGATAGATATTTCTCTAAAATAT
GGATATGATTCACCCACAGCATTTAACAGAGCTTTTAAAAGTGTACATGGTATAGCACCATCTCGGGCGAAAGAAGAAGG
TACAATATTAAAAGCATTTCCTCCTATCAGCTTCAAAATAACAATAAAAGGAGATAGTGAAATGAATTACAGAATTGAAA
AGAAAGAATCATTTAGAATTGTAGGTGTTTCAGAACCATTAGAAAAAGAAATTGAAAAAAACTTTCAAATTGTACCGAAA
ATGTGGAACACAGCTGTAATGAATGGAACAATACCAAGACTTGCTTCCATTATGGAGGGAATGCCTATGGGTATGCTCGG
AGTAAGCTCCTGTAATGAACTAGATAATTGGAGATACTATATTGCAGTTGCAAGTAATCAACCAATAGGGAATGGCCTAG
AAGAATACATTGTCCCTAGCTCCCTATGGGCAATATTTTCAGGAAAAGGAACTGCTAAATCTATGCAGGAGCTAGAAAAA
AGAATCCTAACTGAATGGCTTCCAACTTCAGGATATGAATATGGAAATGCACCAGATATTGAGGTGTATTTAAAGGCAGA
CCCAGAGGATACTGAATATGAAGTATGGATTCCGGTTTTAAAAAAGGAGAATTAA

Upstream 100 bases:

>100_bases
TTTGTAATAAAGATTATTTACATGTTCTCTCTTTATCTGGCAGAAAAAGAGAGGTGTAATTATCTGAAATAGAGTATTTT
TATTGTTAGGAGATGAAGAT

Downstream 100 bases:

>100_bases
GTATAGTACATTTGGAAGATGTAGTTGTTGGTACAAGCATTCCATATAATTATGATAAATCGAAATTTTAGGTGGTGTTG
ATATGGCAATGTGGAATCCG

Product: AraC family transcription regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 284; Mature: 284

Protein sequence:

>284_residues
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY
GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK
MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
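
The 855-base gene sequence corresponds to 285 codons: 284 residues plus a terminal TAA stop codon. The sketch below translates the first and last codons of the gene sequence to illustrate this. It assumes the standard codon assignments; the table layout and function name are illustrative choices, not the method used to build this record.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGGAGTGGATAGAACGATTAAATAGTGCT"))  # MEWIERLNSA
print(translate("AAAAAGGAGAATTAA"))                 # KKEN
```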

Sequences:

>Translated_284_residues
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY
GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK
MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
>Mature_284_residues
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY
GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK
MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN

Specific function: Binds to the right arm of the replication origin oriC of the E. coli chromosome. Rob binding may influence the formation of the nucleoprotein structure required for oriC function in the initiation of replication. [C]

COG id: COG2207

COG function: function code K; AraC-type DNA-binding domain-containing proteins

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1790857, Length=285, Percent_Identity=23.1578947368421, Blast_Score=76, Evalue=3e-15,
Organism=Escherichia coli, GI87081928, Length=98, Percent_Identity=32.6530612244898, Blast_Score=68, Evalue=7e-13,
Organism=Escherichia coli, GI1790497, Length=95, Percent_Identity=31.5789473684211, Blast_Score=66, Evalue=3e-12,
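
Each homologue line is a comma-separated list of key=value fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch follows, assuming every line follows the layout of the three shown above; the function name and the `GI` dictionary key are illustrative.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for field in line.strip().strip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, _, value = field.partition("=")
            record[key] = value
        elif field.startswith("GI"):
            # The GI number has no '=' separator in this record.
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1790857, Length=285, "
        "Percent_Identity=23.1578947368421, Blast_Score=76, Evalue=3e-15,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Rounding the percent identity on output keeps the stored value intact while avoiding the spurious precision of the raw field.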

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010499
- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR018062
- InterPro:   IPR020449
- InterPro:   IPR018060
- InterPro:   IPR011256 [H]

Pfam domain/function: PF06445 AraC_E_bind; PF00165 HTH_AraC [H]

EC number: NA

Molecular weight: Translated: 32428; Mature: 32428

Theoretical pI: Translated: 5.01; Mature: 5.01

Prosite motif: PS00041 HTH_ARAC_FAMILY_1; PS01124 HTH_ARAC_FAMILY_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
4.9 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.9 %Met     (Mature Protein)
4.9 %Cys+Met (Mature Protein)
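
These percentages can be reproduced by counting residues in the 284-residue protein sequence given in this record (3 Cys and 11 Met). A short sketch, with an illustrative function name:

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY"
    "GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK"
    "MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK"
    "RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 1.1
print(residue_percent(PROTEIN, "M"))   # 3.9
print(residue_percent(PROTEIN, "CM"))  # 4.9
```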

Secondary structure:

>Translated Secondary Structure
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMS
CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAAVDLQSSNEKVIDISLKYGYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKI
HEEEEECCCCCEEEEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCEEEE
TIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPKMWNTAVMNGTIPRLASIMEG
EEECCCCCCEEEECCCCEEEEECCCHHHHHHHCCCEEEHHHHHHHHHCCCHHHHHHHHCC
MPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
CCCCEEECCCCCCCCCCEEEEEEECCCCCCCCHHHHHCCHHHHHEECCCCCHHHHHHHHH
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
HHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEEEEECCC
>Mature Secondary Structure
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMS
CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAAVDLQSSNEKVIDISLKYGYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKI
HEEEEECCCCCEEEEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCEEEE
TIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPKMWNTAVMNGTIPRLASIMEG
EEECCCCCCEEEECCCCEEEEECCCHHHHHHHCCCEEEHHHHHHHHHCCCHHHHHHHHCC
MPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
CCCCEEECCCCCCCCCCEEEEEEECCCCCCCCHHHHHCCHHHHHEECCCCCHHHHHHHHH
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
HHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEEEEECCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]