Definition Bacillus licheniformis ATCC 14580, complete genome.
Accession NC_006322
Length 4,222,645

Click here to switch to the map view.

The map label for this gene is rob [H]

Identifier: 52784480

GI number: 52784480

Start: 680440

End: 681318

Strand: Reverse

Name: rob [H]

Synonym: BLi00678

Alternate gene names: 52784480

Gene position: 681318-680440 (Counterclockwise)

Preceding gene: 52784481

Following gene: 52784476

Centisome position: 16.13

GC content: 43.23

Gene sequence:

>879_bases
ATGAATATGAATGATTACGAAAGAATTCAAAACGCGGTTCAGTTTATCGAAGAACACCTTCAAAACGATTTAAACATCAC
GGATATTGCGGCTAAATCGTGTTTCTCCCCATTTCATTTTCAAAGGCTTTTTCAGGCGATCTCCGGTTTTTCCGTTTATC
AATACATTCGCAACCGGCGTCTGTCTGAAGCTGCGCTTCTGCTTGAGAAAACAGACCGTTCCATTCTCGACATTGCAATC
AATTTCGGCTATGGCTCACAAGAAGCTTTCAGCAGGGCTTTCTCACAGTACTTCGGCATCACTCCGGCCAAGTATCGCAA
AGCGAAGGTCAAGCTTGATTTTCAATCTAAAATTAATTTCCTCGAATATAAAGAAAGGATGACTGGAGATATGAACATCC
CAAAGCCGCACATCACTCATTTAAACGACATACATATGATCGGATATGAGTACCGGACAAATTTAAACGATGAAAAGTAT
TTCGAAGATATCCCGAAATTCTACAACGATTTCGGCAGAAATGGATACTTTATGCAAATTCCTCAAAAAACGGATCCGAA
TATGTGTTACGGGCTGTCATGCCGCTTCCAGGATGACGGAGGTTTCTCATTTATTATCGGGGAAGCCGTTCGCGAAACAG
CAGCCGGGGAAGTGCCGGAGCCGCTTATTTATACGAAAATTCCCGGCGGCAAATACGCCGTATTTCATGTAAACGGATCG
ACTGAATCAGTGCAGAACACAAGAAGATACATCTATGGATCATGGCTGATGAATACCAATTACGAGAGAACAGAAGGACC
GGACTTTGAAGTGACTGATGTGTGCCGTTCCGTACCGCCGAACGAGATGAAGATGACCATCTATATCCCGCTTCTGTAA

Upstream 100 bases:

>100_bases
AAAAGCCTCCGCATTTGGAGGCTTTTAAAAAAGCACAAATCATCAAAAAAACCATCGCTTGATCCTGTACAATAGGATCA
TCAAATCGTAAAGGCTGTGG

Downstream 100 bases:

>100_bases
AAACGCCGGGGAACCTTCTGACCACAGGTTCCCCGAATCCTTTTATATAGATTTCAGCGCCTCGGCGGGCGTTTGATCAC
CCGAAACCAAATCGAACGAA

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 292; Mature: 292

Protein sequence:

>292_residues
MNMNDYERIQNAVQFIEEHLQNDLNITDIAAKSCFSPFHFQRLFQAISGFSVYQYIRNRRLSEAALLLEKTDRSILDIAI
NFGYGSQEAFSRAFSQYFGITPAKYRKAKVKLDFQSKINFLEYKERMTGDMNIPKPHITHLNDIHMIGYEYRTNLNDEKY
FEDIPKFYNDFGRNGYFMQIPQKTDPNMCYGLSCRFQDDGGFSFIIGEAVRETAAGEVPEPLIYTKIPGGKYAVFHVNGS
TESVQNTRRYIYGSWLMNTNYERTEGPDFEVTDVCRSVPPNEMKMTIYIPLL

Sequences:

>Translated_292_residues
MNMNDYERIQNAVQFIEEHLQNDLNITDIAAKSCFSPFHFQRLFQAISGFSVYQYIRNRRLSEAALLLEKTDRSILDIAI
NFGYGSQEAFSRAFSQYFGITPAKYRKAKVKLDFQSKINFLEYKERMTGDMNIPKPHITHLNDIHMIGYEYRTNLNDEKY
FEDIPKFYNDFGRNGYFMQIPQKTDPNMCYGLSCRFQDDGGFSFIIGEAVRETAAGEVPEPLIYTKIPGGKYAVFHVNGS
TESVQNTRRYIYGSWLMNTNYERTEGPDFEVTDVCRSVPPNEMKMTIYIPLL
>Mature_292_residues
MNMNDYERIQNAVQFIEEHLQNDLNITDIAAKSCFSPFHFQRLFQAISGFSVYQYIRNRRLSEAALLLEKTDRSILDIAI
NFGYGSQEAFSRAFSQYFGITPAKYRKAKVKLDFQSKINFLEYKERMTGDMNIPKPHITHLNDIHMIGYEYRTNLNDEKY
FEDIPKFYNDFGRNGYFMQIPQKTDPNMCYGLSCRFQDDGGFSFIIGEAVRETAAGEVPEPLIYTKIPGGKYAVFHVNGS
TESVQNTRRYIYGSWLMNTNYERTEGPDFEVTDVCRSVPPNEMKMTIYIPLL

Specific function: Binds to the right arm of the replication origin oriC of the chromosome. Rob binding may influence the formation of the nucleoprotein structure, required for oriC function in the initiation of replication [H]

COG id: COG2207

COG function: function code K; AraC-type DNA-binding domain-containing proteins

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1790857, Length=297, Percent_Identity=29.2929292929293, Blast_Score=114, Evalue=1e-26,
Organism=Escherichia coli, GI1790497, Length=104, Percent_Identity=42.3076923076923, Blast_Score=91, Evalue=9e-20,
Organism=Escherichia coli, GI87081928, Length=98, Percent_Identity=35.7142857142857, Blast_Score=82, Evalue=3e-17,
Organism=Escherichia coli, GI1786251, Length=102, Percent_Identity=33.3333333333333, Blast_Score=69, Evalue=5e-13,
Organism=Escherichia coli, GI1790559, Length=99, Percent_Identity=33.3333333333333, Blast_Score=65, Evalue=7e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010499
- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR018062
- InterPro:   IPR020449
- InterPro:   IPR018060
- InterPro:   IPR011256 [H]

Pfam domain/function: PF06445 AraC_E_bind; PF00165 HTH_AraC [H]

EC number: NA

Molecular weight: Translated: 33985; Mature: 33985

Theoretical pI: Translated: 6.26; Mature: 6.26

Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.8 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNMNDYERIQNAVQFIEEHLQNDLNITDIAAKSCFSPFHFQRLFQAISGFSVYQYIRNRR
CCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHCC
LSEAALLLEKTDRSILDIAINFGYGSQEAFSRAFSQYFGITPAKYRKAKVKLDFQSKINF
HHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCHHHHCEEEEEEEHHHHCHH
LEYKERMTGDMNIPKPHITHLNDIHMIGYEYRTNLNDEKYFEDIPKFYNDFGRNGYFMQI
HHHHHHHCCCCCCCCCCCCCCCCEEEEEEHEECCCCHHHHHHHHHHHHHHHCCCCEEEEC
PQKTDPNMCYGLSCRFQDDGGFSFIIGEAVRETAAGEVPEPLIYTKIPGGKYAVFHVNGS
CCCCCCCCEEEEEEEEECCCCEEEEEHHHHHHHHCCCCCCCEEEEECCCCEEEEEEECCC
TESVQNTRRYIYGSWLMNTNYERTEGPDFEVTDVCRSVPPNEMKMTIYIPLL
HHHHHHHHHEEEEEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEEEEECC
>Mature Secondary Structure
MNMNDYERIQNAVQFIEEHLQNDLNITDIAAKSCFSPFHFQRLFQAISGFSVYQYIRNRR
CCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHCC
LSEAALLLEKTDRSILDIAINFGYGSQEAFSRAFSQYFGITPAKYRKAKVKLDFQSKINF
HHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCHHHHCEEEEEEEHHHHCHH
LEYKERMTGDMNIPKPHITHLNDIHMIGYEYRTNLNDEKYFEDIPKFYNDFGRNGYFMQI
HHHHHHHCCCCCCCCCCCCCCCCEEEEEEHEECCCCHHHHHHHHHHHHHHHCCCCEEEEC
PQKTDPNMCYGLSCRFQDDGGFSFIIGEAVRETAAGEVPEPLIYTKIPGGKYAVFHVNGS
CCCCCCCCEEEEEEEEECCCCEEEEEHHHHHHHHCCCCCCCEEEEECCCCEEEEEEECCC
TESVQNTRRYIYGSWLMNTNYERTEGPDFEVTDVCRSVPPNEMKMTIYIPLL
HHHHHHHHHEEEEEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]