Definition | Clostridium botulinum A str. ATCC 3502, complete genome. |
---|---|
Accession | NC_009495 |
Length | 3,886,916 |
The map label for this gene is ydeE [H]
Identifier: 148378714
GI number: 148378714
Start: 822679
End: 823533
Strand: Reverse
Name: ydeE [H]
Synonym: CBO0714
Alternate gene names: 148378714
Gene position: 823533-822679 (Counterclockwise)
Preceding gene: 148378715
Following gene: 148378713
Centisome position: 21.19
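The coordinate fields above are related by simple arithmetic, sketched here. The function names are illustrative, not part of the database. The sketch assumes that the centisome position is the gene's 5' coordinate (823533 on the reverse strand) expressed as a percentage of the genome length; the record does not state this, but the assumption reproduces the listed value.

```python
GENOME_LENGTH = 3886916  # Length field of NC_009495


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its length."""
    return 100.0 * position / genome_length


length = gene_length(822679, 823533)
# Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
centisome_pos = round(centisome(823533, GENOME_LENGTH), 2)
print(length, centisome_pos)
```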
GC content: 35.2
Gene sequence:
>855_bases ATGGAGTGGATAGAACGATTAAATAGTGCTGTTAATTATATCGAAAAGAATATAAAAGAAACTATCGATTTGGAAGAAGT ATCAAAGATTGCATGTTGCTCAACTTATCATTTTCAAAGGATGTTTGCCTATATAGCAGATATACCCTTATCAGAGTACA TCCGCCGTAGGAGAATGTCATTAGCAGCTGTTGATTTACAGAGTAGCAACGAAAAAGTGATAGATATTTCTCTAAAATAT GGATATGATTCACCCACAGCATTTAACAGAGCTTTTAAAAGTGTACATGGTATAGCACCATCTCGGGCGAAAGAAGAAGG TACAATATTAAAAGCATTTCCTCCTATCAGCTTCAAAATAACAATAAAAGGAGATAGTGAAATGAATTACAGAATTGAAA AGAAAGAATCATTTAGAATTGTAGGTGTTTCAGAACCATTAGAAAAAGAAATTGAAAAAAACTTTCAAATTGTACCGAAA ATGTGGAACACAGCTGTAATGAATGGAACAATACCAAGACTTGCTTCCATTATGGAGGGAATGCCTATGGGTATGCTCGG AGTAAGCTCCTGTAATGAACTAGATAATTGGAGATACTATATTGCAGTTGCAAGTAATCAACCAATAGGGAATGGCCTAG AAGAATACATTGTCCCTAGCTCCCTATGGGCAATATTTTCAGGAAAAGGAACTGCTAAATCTATGCAGGAGCTAGAAAAA AGAATCCTAACTGAATGGCTTCCAACTTCAGGATATGAATATGGAAATGCACCAGATATTGAGGTGTATTTAAAGGCAGA CCCAGAGGATACTGAATATGAAGTATGGATTCCGGTTTTAAAAAAGGAGAATTAA
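A GC content figure like the one listed above can be recomputed from the gene sequence. The following is a minimal sketch; `gc_content` is a hypothetical helper name, and only the first 30 bases of the gene are used as input here, so the printed value is not the whole-gene figure.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between chunks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
prefix = "ATGGAGTGGATAGAACGATTAAATAGTGCT"
print(round(gc_content(prefix), 1))
```

Passing the full 855-base sequence, spaces included, gives the value for the whole gene.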
Upstream 100 bases:
>100_bases TTTGTAATAAAGATTATTTACATGTTCTCTCTTTATCTGGCAGAAAAAGAGAGGTGTAATTATCTGAAATAGAGTATTTT TATTGTTAGGAGATGAAGAT
Downstream 100 bases:
>100_bases GTATAGTACATTTGGAAGATGTAGTTGTTGGTACAAGCATTCCATATAATTATGATAAATCGAAATTTTAGGTGGTGTTG ATATGGCAATGTGGAATCCG
Product: AraC family transcription regulator
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 284; Mature: 284
Protein sequence:
>284_residues MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
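The protein length follows from the gene length: 855 bases are 285 codons, and the final codon (TAA) is a stop, leaving 284 residues. The sketch below translates a coding sequence with the standard codon assignments, which is an assumption about how the record's translation was produced; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last three codons of the gene sequence in this record.
print(translate("ATGGAGTGGATAGAACGATTAAATAGTGCT"))
print(translate("GAGAATTAA"))
```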
Sequences:
>Translated_284_residues MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
>Mature_284_residues MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
Specific function: Binds to the right arm of the replication origin oriC of the E. coli chromosome. Rob binding may influence the formation of the nucleoprotein structure, required for oriC function in the initiation of replication. [C]
COG id: COG2207
COG function: function code K; AraC-type DNA-binding domain-containing proteins
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1790857, Length=285, Percent_Identity=23.1578947368421, Blast_Score=76, Evalue=3e-15
Organism=Escherichia coli, GI87081928, Length=98, Percent_Identity=32.6530612244898, Blast_Score=68, Evalue=7e-13
Organism=Escherichia coli, GI1790497, Length=95, Percent_Identity=31.5789473684211, Blast_Score=66, Evalue=3e-12
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR010499
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060
- InterPro: IPR011256 [H]
Pfam domain/function: PF06445 AraC_E_bind; PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 32428; Mature: 32428
Theoretical pI: Translated: 5.01; Mature: 5.01
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
3.9 %Met (Translated Protein)
4.9 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
3.9 %Met (Mature Protein)
4.9 %Cys+Met (Mature Protein)
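The Cys/Met percentages can be checked against the protein sequence in this record by counting residues. The sketch below assumes that the percentage is the residue count divided by the sequence length; `residue_percent` is a hypothetical helper name.

```python
# Protein sequence copied from this record (284 residues).
PROTEIN = (
    "MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMSLAAVDLQSSNEKVIDISLKY"
    "GYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKITIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPK"
    "MWNTAVMNGTIPRLASIMEGMPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK"
    "RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = seq.upper()
    hits = sum(seq.count(r) for r in residues)
    return 100.0 * hits / len(seq)


cys = round(residue_percent(PROTEIN, "C"), 1)
met = round(residue_percent(PROTEIN, "M"), 1)
cys_met = round(residue_percent(PROTEIN, "CM"), 1)
print(cys, met, cys_met)
```

Because the translated and mature sequences in this record are identical, the same figures apply to both.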
Secondary structure:
>Translated Secondary Structure
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMS
CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAAVDLQSSNEKVIDISLKYGYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKI
HEEEEECCCCCEEEEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCEEEE
TIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPKMWNTAVMNGTIPRLASIMEG
EEECCCCCCEEEECCCCEEEEECCCHHHHHHHCCCEEEHHHHHHHHHCCCHHHHHHHHCC
MPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
CCCCEEECCCCCCCCCCEEEEEEECCCCCCCCHHHHHCCHHHHHEECCCCCHHHHHHHHH
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
HHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEEEEECCC
>Mature Secondary Structure
MEWIERLNSAVNYIEKNIKETIDLEEVSKIACCSTYHFQRMFAYIADIPLSEYIRRRRMS
CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LAAVDLQSSNEKVIDISLKYGYDSPTAFNRAFKSVHGIAPSRAKEEGTILKAFPPISFKI
HEEEEECCCCCEEEEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCEEEE
TIKGDSEMNYRIEKKESFRIVGVSEPLEKEIEKNFQIVPKMWNTAVMNGTIPRLASIMEG
EEECCCCCCEEEECCCCEEEEECCCHHHHHHHCCCEEEHHHHHHHHHCCCHHHHHHHHCC
MPMGMLGVSSCNELDNWRYYIAVASNQPIGNGLEEYIVPSSLWAIFSGKGTAKSMQELEK
CCCCEEECCCCCCCCCCEEEEEEECCCCCCCCHHHHHCCHHHHHEECCCCCHHHHHHHHH
RILTEWLPTSGYEYGNAPDIEVYLKADPEDTEYEVWIPVLKKEN
HHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCEEEEEEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9384377 [H]