Definition Bacillus thuringiensis str. Al Hakam chromosome, complete genome.
Accession NC_008600
Length 5,257,091

Click here to switch to the map view.

The map label for this gene is araC [H]

Identifier: 118477976

GI number: 118477976

Start: 2493085

End: 2494188

Strand: Direct

Name: araC [H]

Synonym: BALH_2323

Alternate gene names: 118477976

Gene position: 2493085-2494188 (Clockwise)

Preceding gene: 118477974

Following gene: 118477978

Centisome position: 47.42

GC content: 29.62

Gene sequence:

>1104_bases
GTGAAGGAAATGAATGAACATATACAGCAGATGATTGATTGGATTGAGCGCAATTTGAAAAAGGGATTTTCATTAGATGA
ATTATCTCGTTATATGGGGTATTCTCCTTATTATTGCTCTTTTAAATTTCACCAAGTAACGGGTTTCAGTATTAGACGCT
ATATTCTTCTTAGAAGGTTATACTTATCTATAGAAGATTTAAAGAATGGTAGGAAGATAATTGAGATCGCATTGGATTAC
GATTACTCTTCCCAAGAAGCTTATAGTAGAGCTTTCAAAAATGTTTTTGGAATGAATCCAAAAGAATACCAACGTAACAA
CATGCCTATTCAATCATTTGTTAAACTAAATTTAAATAAAGAAGGGGCGTTTAATATGAATATTTCTAGAAAATTAGAGG
TTGAACAGTTACGAAATAGGAAGAGTGAGCTATTTGATAAAGAAGTACTTAACATATTAAATGGTCAAGTTATGTATGAA
GAATTTAAAAACGAAAAGTTAATGGGTGATTCTGATTACGCACCATTTAATGAAGCGATGTGTGTAAACCGAGTTACTAC
ACTAGTTTTTGATGAAGAATTTATTAAGACAAGGGCAGCAGGACATAACAGTTCAGTAGAAAGTTACACAAAAAAGGTTA
TAGATCCATTAAAGAAACTTTTTACGAAAGAGTATAAATGTATTGTTTTATGGTTTGGTGAAGATATGTTTTGTCAAATG
AACTTACTTACAATACTTTCATACCTTGAACAGTCTCGTTATGAGGGGAAGGTATACTTAAATAGCTTCAGAGAAGATGA
ATTTAAAGTAAATCAAATTGAACTTGAATTAGGAAATTATTCTTCGGTATACAATGAAGTATTAGTAAATCATAAAAAGA
CTTTCTATAAGGTACCACCTGTAATGTATCAGGCTATAGATCTATATTTAAAAATGCTAAAAGAAGATAATACTGTAATG
AAATTTATCTCTAAAAATAAAGATTTATCAACTCAAGAATTATTAACAAAGTTGTTTCAACTATTTCCAACAATCGGATA
TGGTGATTCTCAGTATATAGAACTGATTAATAAAATAGAGAAGAAAGCTACACCCAAAATATAG

Upstream 100 bases:

>100_bases
ACTAATACTCATATATAGATTTTACAGAATGGCTCATGTTATAAAACACAAGAAATATCAAAAGAATGAAATGCACTTCT
CTTATACTTTTATATAAGGA

Downstream 100 bases:

>100_bases
GTGCAGCTTTCTTCCTATAATTGTTCAAGTAACTCTAATTCTCCCTCTACAAATAACATTAGTTCTTTAATTATTTCGCC
AGAAAAATCAAAACGAATTC

Product: AraC family transcriptional regulator

Products: NA

Alternate protein names: ORFR [H]

Number of amino acids: Translated: 367; Mature: 367

Protein sequence:

>367_residues
MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRLYLSIEDLKNGRKIIEIALDY
DYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNKEGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYE
EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM
NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM
KFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIEKKATPKI

Sequences:

>Translated_367_residues
MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRLYLSIEDLKNGRKIIEIALDY
DYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNKEGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYE
EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM
NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM
KFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIEKKATPKI
>Mature_367_residues
MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRLYLSIEDLKNGRKIIEIALDY
DYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNKEGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYE
EFKNEKLMGDSDYAPFNEAMCVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM
NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPPVMYQAIDLYLKMLKEDNTVM
KFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIEKKATPKI

Specific function: Binds To The Right Arm Of The Replication Origin Oric Of The E.Coli Chromosome. Rob Binding May Influence The Formation Of The Nucleoprotein Structure, Required For Oric Function In The Initiation Of Replication. [C]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1790857, Length=100, Percent_Identity=38, Blast_Score=73, Evalue=3e-14,
Organism=Escherichia coli, GI87081928, Length=106, Percent_Identity=34.9056603773585, Blast_Score=67, Evalue=2e-12,
Organism=Escherichia coli, GI1790497, Length=98, Percent_Identity=34.6938775510204, Blast_Score=66, Evalue=3e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR018062
- InterPro:   IPR018060 [H]

Pfam domain/function: PF00165 HTH_AraC [H]

EC number: NA

Molecular weight: Translated: 43607; Mature: 43607

Theoretical pI: Translated: 8.66; Mature: 8.66

Prosite motif: PS01124 HTH_ARAC_FAMILY_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
4.1 %Met     (Translated Protein)
5.2 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
4.1 %Met     (Mature Protein)
5.2 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRL
CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHHHHHH
YLSIEDLKNGRKIIEIALDYDYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNK
HHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHEECCCC
EGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYEEFKNEKLMGDSDYAPFNEAM
CCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCHHHH
CVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM
HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHH
NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPP
HHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEEEECCHHHHHHHHHHHCCHHHEECCH
VMYQAIDLYLKMLKEDNTVMKFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIE
HHHHHHHHHHHHHHCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH
KKATPKI
HHCCCCC
>Mature Secondary Structure
MKEMNEHIQQMIDWIERNLKKGFSLDELSRYMGYSPYYCSFKFHQVTGFSIRRYILLRRL
CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHHHHHH
YLSIEDLKNGRKIIEIALDYDYSSQEAYSRAFKNVFGMNPKEYQRNNMPIQSFVKLNLNK
HHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHEECCCC
EGAFNMNISRKLEVEQLRNRKSELFDKEVLNILNGQVMYEEFKNEKLMGDSDYAPFNEAM
CCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCHHHH
CVNRVTTLVFDEEFIKTRAAGHNSSVESYTKKVIDPLKKLFTKEYKCIVLWFGEDMFCQM
HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHH
NLLTILSYLEQSRYEGKVYLNSFREDEFKVNQIELELGNYSSVYNEVLVNHKKTFYKVPP
HHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEEEECCHHHHHHHHHHHCCHHHEECCH
VMYQAIDLYLKMLKEDNTVMKFISKNKDLSTQELLTKLFQLFPTIGYGDSQYIELINKIE
HHHHHHHHHHHHHHCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH
KKATPKI
HHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 6094471; 6094472 [H]