Definition | Bacillus cereus E33L, complete genome. |
---|---|
Accession | NC_006274 |
Length | 5,300,915 |
The map label for this gene is araC [H]
Identifier: 52142914
GI number: 52142914
Start: 2458006
End: 2459109
Strand: Direct
Name: araC [H]
Synonym: BCZK2326
Alternate gene names: 52142914
Gene position: 2458006-2459109 (Clockwise)
Preceding gene: 52142916
Following gene: 52142911
Centisome position: 46.37
GC content: 29.53
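The coordinate fields above are mutually consistent, and the short sketch below shows how. It assumes inclusive 1-based coordinates, that the centisome value is the start coordinate expressed as a percentage of the genome length, and that the last codon of the gene is a stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_300_915
START, END = 2_458_006, 2_459_109

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)

def residue_count(n_bases: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    return n_bases // 3 - 1

print(gene_length(START, END))          # bases in the gene
print(centisome(START, GENOME_LENGTH))  # centisome position
print(residue_count(gene_length(START, END)))
```

Under these assumptions the results agree with the 1104 bases, the centisome position of 46.37, and the 367 residues listed in this record.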
Gene sequence:
>1104_bases GTGAAGGAAATGAATGAATATATACAGCAGATGATTGACTGGATTGAGGTCAATTTAAAAAAGGAATTTTCATTAGATGA ATTGTCCCGTTATATGGGGTATTCTCCTTATTATTGTTCTTTTAAATTTCACCAAGTAACGGGTTTCAGTATTAGACGCT ATGTTCTTCTTAGAAGGTTATATTTATCTATAGAAGATTTAAAGAATGGTAGGAAGATAATAGATATCGCATTGGATTAC AATTATTCTTCGCAAGAGGCCTATAGTAGAGCTTTCAAGAATGTTTTTGGAATGAATCCAAGAGAATTTCAACTTAACCA ATTGCCTATTCAATCATTTGTTAAACTCAATATAAATAAGGAAGGAGAGTTTAATATGAATATTTCTAGAAAAATAGAGG TTGAGCAATTACGAAATGCGAAGAGTGAGCTGTTTGATAAAGATGTATTAAACATATTGAATGGTCAAATGATGTATGAA GAATTTAAAAATGAAAAGCTAATGGGTGATTCTGATTACGCACCATTTAATGAAGCGATGTGTGTAAACCGAGTTACTAC ACTAGTTTTTAATGAAGAATTTATTAAGACAAGGGCAGCAGGACATAACAGTTCAGTAGAAAGTTACACAAAAAAGGTTA TAGATCCATTAAAGAAACTTTTTAAGAAAGAGTATAAATGTATTGTTTTATGGTTTGGTGAAGATATGTTTTGTCAAATG AACTTACTTACAATACTTTCATACCTTGAACAGTCGTTTTATGAGGGGAAGGTATACTTAAATAGCTTTAGAGAAGATGA ATTTAAAGTAAATCAAATTGAACTTGAATTAGGAAATTATTCTTCGGTATACAATGAAGTATTAGTAAATCATAAAAAGA CTTTCTATAAGGTACCACCTGTAATGTATCAGGCTATAGATCTATATTTAAAAATGCTAAAAGAAGATAATACTGTAATG AAATTTCTCTCTAAAAATAAAGATTTATCAACTCAAGAATTATTAACAAAGTTGTTTCAGCTATTTCCAACAATCGGATA TGGGGATTCCCAGTACATAGAGCTGATTAATAAAATAAAGAAGAAAGCTACACCAAAAATATAG
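The GC content field can be recomputed from the gene sequence above. The sketch below is a minimal version that assumes GC content means the percentage of G and C among the A, C, G and T bases, with whitespace and any other characters ignored; `gc_percent` is an illustrative name, and the exact rounding used by the database is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces, newlines, N, etc.
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return round(100 * gc / len(bases), 2)

# Toy input; paste the sequence text (without the ">1104_bases" header) to check the record.
print(gc_percent("GGCC AATT"))
```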
Upstream 100 bases:
>100_bases ACTAATACTCATATTTAGATTTTACAGAATGGCTCATGTTATAAAACACAAGAAATATCAAAAGAATGAAATGCACTTCC CTTATACTTTTATATAAGGA
Downstream 100 bases:
>100_bases GTGCAGCTTTCTTTTTACAATTGTTCAAGTAACTCTAATTCTCCCTCTACAAATAACATTAGTTCTTTAATTATTTCGCC AGAAAAATCAAAACGAATTC
Product: AraC family transcriptional regulator
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 367; Mature: 367
Protein sequence:
>367_residues MKEMNEYIQQMIDWIEVNLKKEFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYVLLRRLYLSIEDLKNGRKIIDIALDY NYSSQEAYSRAFKNVFGMNPREFQLNQLPIQSFVKLNINKEGEFNMNISRKIEVEQLRNAKSELFDKDVLNILNGQMMYE EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFNEEFIKTRAAGHNSSVESYTKKVIDPLKKLFKKEYKCIVLWFGEDMFCQM NLLTILSYLEQSFYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM KFLSKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIKKKATPKI
Sequences:
>Translated_367_residues MKEMNEYIQQMIDWIEVNLKKEFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYVLLRRLYLSIEDLKNGRKIIDIALDY NYSSQEAYSRAFKNVFGMNPREFQLNQLPIQSFVKLNINKEGEFNMNISRKIEVEQLRNAKSELFDKDVLNILNGQMMYE EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFNEEFIKTRAAGHNSSVESYTKKVIDPLKKLFKKEYKCIVLWFGEDMFCQM NLLTILSYLEQSFYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM KFLSKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIKKKATPKI
>Mature_367_residues MKEMNEYIQQMIDWIEVNLKKEFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYVLLRRLYLSIEDLKNGRKIIDIALDY NYSSQEAYSRAFKNVFGMNPREFQLNQLPIQSFVKLNINKEGEFNMNISRKIEVEQLRNAKSELFDKDVLNILNGQMMYE EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFNEEFIKTRAAGHNSSVESYTKKVIDPLKKLFKKEYKCIVLWFGEDMFCQM NLLTILSYLEQSFYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM KFLSKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIKKKATPKI
Specific function: Binds to the right arm of the replication origin oriC of the chromosome. Rob binding may influence the formation of the nucleoprotein structure, required for oriC function in the initiation of replication [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1790857, Length=133, Percent_Identity=33.0827067669173, Blast_Score=74, Evalue=2e-14
Organism=Escherichia coli, GI1790497, Length=98, Percent_Identity=33.6734693877551, Blast_Score=66, Evalue=4e-12
Organism=Escherichia coli, GI87081928, Length=106, Percent_Identity=31.1320754716981, Blast_Score=63, Evalue=3e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR010499
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060
- InterPro: IPR011256 [H]
Pfam domain/function: PF06445 AraC_E_bind; PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 43591; Mature: 43591
Theoretical pI: Translated: 8.46; Mature: 8.46
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
4.1 %Met (Translated Protein)
5.2 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
4.1 %Met (Mature Protein)
5.2 %Cys+Met (Mature Protein)
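The Cys/Met percentages can be checked by counting residues in the protein sequence. The following is a minimal sketch, assuming each percentage is the residue count divided by the total sequence length and rounded to one decimal place; `residue_percent` is an illustrative name, not something taken from the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()  # remove the spaces between 80-residue chunks
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100 * count / len(protein), 1)

# Toy input; pass the 367-residue sequence from this record to reproduce the fields.
toy = "MKCM"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

For a 367-residue protein, the reported 1.1 % Cys and 4.1 % Met are consistent with 4 cysteines and 15 methionines.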
Secondary structure:
>Translated Secondary Structure
MKEMNEYIQQMIDWIEVNLKKEFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYVLLRRL
CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHHHHHH
YLSIEDLKNGRKIIDIALDYNYSSQEAYSRAFKNVFGMNPREFQLNQLPIQSFVKLNINK
HHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEHHHCCHHHHHEEECCC
EGEFNMNISRKIEVEQLRNAKSELFDKDVLNILNGQMMYEEFKNEKLMGDSDYAPFNEAM
CCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCHHHH
CVNRVTTLVFNEEFIKTRAAGHNSSVESYTKKVIDPLKKLFKKEYKCIVLWFGEDMFCQM
HHHHHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHH
NLLTILSYLEQSFYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPP
HHHHHHHHHHHHHHCCHHHHHCCCCCCEEEEEEEEEECCHHHHHHHHHHHCCHHHEECCH
VMYQAIDLYLKMLKEDNTVMKFLSKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIK
HHHHHHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH
KKATPKI
HHCCCCC
>Mature Secondary Structure
MKEMNEYIQQMIDWIEVNLKKEFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYVLLRRL
CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHHHHHH
YLSIEDLKNGRKIIDIALDYNYSSQEAYSRAFKNVFGMNPREFQLNQLPIQSFVKLNINK
HHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEHHHCCHHHHHEEECCC
EGEFNMNISRKIEVEQLRNAKSELFDKDVLNILNGQMMYEEFKNEKLMGDSDYAPFNEAM
CCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCHHHH
CVNRVTTLVFNEEFIKTRAAGHNSSVESYTKKVIDPLKKLFKKEYKCIVLWFGEDMFCQM
HHHHHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHH
NLLTILSYLEQSFYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPP
HHHHHHHHHHHHHHCCHHHHHCCCCCCEEEEEEEEEECCHHHHHHHHHHHCCHHHEECCH
VMYQAIDLYLKMLKEDNTVMKFLSKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIK
HHHHHHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH
KKATPKI
HHCCCCC
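The predicted secondary structure above alternates 60-residue sequence rows with rows of state codes. A rough summary of the prediction can be obtained by tallying those codes, as sketched below. This assumes the conventional reading of H as helix, E as strand and C as coil, which the record itself does not spell out, and `ss_composition` is an illustrative name. Only the state rows should be passed in, because the amino-acid rows also contain the letters H, E and C.

```python
from collections import Counter

def ss_composition(states: str) -> dict[str, float]:
    """Percentage of H, E and C codes in a secondary-structure state string."""
    codes = [c for c in states.upper() if c in "HEC"]  # ignores spaces and newlines
    if not codes:
        raise ValueError("no H, E or C state codes found")
    counts = Counter(codes)
    return {s: round(100 * counts[s] / len(codes), 1) for s in "HEC"}

# Toy state string; concatenate the state rows of the record for a real summary.
print(ss_composition("HHHHEECCCC"))
```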
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]