Definition | Bacillus thuringiensis str. Al Hakam chromosome, complete genome. |
---|---|
Accession | NC_008600 |
Length | 5,257,091 |
Click here to switch to the map view.
The map label for this gene is araC [H]
Identifier: 118477976
GI number: 118477976
Start: 2493085
End: 2494188
Strand: Direct
Name: araC [H]
Synonym: BALH_2323
Alternate gene names: 118477976
Gene position: 2493085-2494188 (Clockwise)
Preceding gene: 118477974
Following gene: 118477978
Centisome position: 47.42
GC content: 29.62
Gene sequence:
>1104_bases GTGAAGGAAATGAATGAACATATACAGCAGATGATTGATTGGATTGAGCGCAATTTGAAAAAGGGATTTTCATTAGATGA ATTATCTCGTTATATGGGGTATTCTCCTTATTATTGCTCTTTTAAATTTCACCAAGTAACGGGTTTCAGTATTAGACGCT ATATTCTTCTTAGAAGGTTATACTTATCTATAGAAGATTTAAAGAATGGTAGGAAGATAATTGAGATCGCATTGGATTAC GATTACTCTTCCCAAGAAGCTTATAGTAGAGCTTTCAAAAATGTTTTTGGAATGAATCCAAAAGAATACCAACGTAACAA CATGCCTATTCAATCATTTGTTAAACTAAATTTAAATAAAGAAGGGGCGTTTAATATGAATATTTCTAGAAAATTAGAGG TTGAACAGTTACGAAATAGGAAGAGTGAGCTATTTGATAAAGAAGTACTTAACATATTAAATGGTCAAGTTATGTATGAA GAATTTAAAAACGAAAAGTTAATGGGTGATTCTGATTACGCACCATTTAATGAAGCGATGTGTGTAAACCGAGTTACTAC ACTAGTTTTTGATGAAGAATTTATTAAGACAAGGGCAGCAGGACATAACAGTTCAGTAGAAAGTTACACAAAAAAGGTTA TAGATCCATTAAAGAAACTTTTTACGAAAGAGTATAAATGTATTGTTTTATGGTTTGGTGAAGATATGTTTTGTCAAATG AACTTACTTACAATACTTTCATACCTTGAACAGTCTCGTTATGAGGGGAAGGTATACTTAAATAGCTTCAGAGAAGATGA ATTTAAAGTAAATCAAATTGAACTTGAATTAGGAAATTATTCTTCGGTATACAATGAAGTATTAGTAAATCATAAAAAGA CTTTCTATAAGGTACCACCTGTAATGTATCAGGCTATAGATCTATATTTAAAAATGCTAAAAGAAGATAATACTGTAATG AAATTTATCTCTAAAAATAAAGATTTATCAACTCAAGAATTATTAACAAAGTTGTTTCAACTATTTCCAACAATCGGATA TGGTGATTCTCAGTATATAGAACTGATTAATAAAATAGAGAAGAAAGCTACACCCAAAATATAG
Upstream 100 bases:
>100_bases ACTAATACTCATATATAGATTTTACAGAATGGCTCATGTTATAAAACACAAGAAATATCAAAAGAATGAAATGCACTTCT CTTATACTTTTATATAAGGA
Downstream 100 bases:
>100_bases GTGCAGCTTTCTTCCTATAATTGTTCAAGTAACTCTAATTCTCCCTCTACAAATAACATTAGTTCTTTAATTATTTCGCC AGAAAAATCAAAACGAATTC
Product: AraC family transcriptional regulator
Products: NA
Alternate protein names: ORFR [H]
Number of amino acids: Translated: 367; Mature: 367
Protein sequence:
>367_residues MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRLYLSIEDLKNGRKIIEIALDY DYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNKEGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYE EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM KFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIEKKATPKI
Sequences:
>Translated_367_residues MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRLYLSIEDLKNGRKIIEIALDY DYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNKEGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYE EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM KFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIEKKATPKI >Mature_367_residues MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRLYLSIEDLKNGRKIIEIALDY DYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNKEGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYE EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM KFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIEKKATPKI
Specific function: Binds To The Right Arm Of The Replication Origin Oric Of The E.Coli Chromosome. Rob Binding May Influence The Formation Of The Nucleoprotein Structure, Required For Oric Function In The Initiation Of Replication. [C]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1790857, Length=100, Percent_Identity=38, Blast_Score=73, Evalue=3e-14, Organism=Escherichia coli, GI87081928, Length=106, Percent_Identity=34.9056603773585, Blast_Score=67, Evalue=2e-12, Organism=Escherichia coli, GI1790497, Length=98, Percent_Identity=34.6938775510204, Blast_Score=66, Evalue=3e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057 - InterPro: IPR012287 - InterPro: IPR018062 - InterPro: IPR018060 [H]
Pfam domain/function: PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 43607; Mature: 43607
Theoretical pI: Translated: 8.66; Mature: 8.66
Prosite motif: PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein) 4.1 %Met (Translated Protein) 5.2 %Cys+Met (Translated Protein) 1.1 %Cys (Mature Protein) 4.1 %Met (Mature Protein) 5.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRL CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHHHHHH YLSIEDLKNGRKIIEIALDYDYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNK HHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHEECCCC EGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYEEFKNEKLMGDSDYAPFNEAM CCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCHHHH CVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHH NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPP HHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEEEECCHHHHHHHHHHHCCHHHEECCH VMYQAIDLYLKMLKEDNTVMKFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIE HHHHHHHHHHHHHHCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH KKATPKI HHCCCCC >Mature Secondary Structure MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRL CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHHHHHH YLSIEDLKNGRKIIEIALDYDYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNK HHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHEECCCC EGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYEEFKNEKLMGDSDYAPFNEAM CCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCHHHH CVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHH NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPP HHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEEEECCHHHHHHHHHHHCCHHHEECCH VMYQAIDLYLKMLKEDNTVMKFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIE HHHHHHHHHHHHHHCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH KKATPKI HHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 6094471; 6094472 [H]