Definition Mesorhizobium sp. BNC1, complete genome.
Accession NC_008254
Length 4,412,446


The map label for this gene is cytR [H]

Identifier: 110632623

GI number: 110632623

Start: 302402

End: 303436

Strand: Reverse

Name: cytR [H]

Synonym: Meso_0262

Alternate gene names: 110632623

Gene position: 303436-302402 (Counterclockwise)

Preceding gene: 110632624

Following gene: 110632622

Centisome position: 6.88
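The gene length and centisome position can be re-derived from the coordinates listed above. The sketch below is illustrative only: the function names are hypothetical, and it assumes the centisome value is the gene's first coordinate on the reverse strand (303436) expressed as a percentage of the genome length, an assumption that happens to reproduce the reported 6.88.

```python
# Hypothetical helpers; not part of the database's own tooling.

def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 4_412_446          # from the "Length" field
START, END = 302_402, 303_436      # from the "Start" and "End" fields

print(gene_length(START, END))                   # 1035
# Assumption: for a reverse-strand gene the higher coordinate is used.
print(round(centisome(END, GENOME_LENGTH), 2))   # 6.88
```

The 1035-base length matches the header of the gene sequence block, and 1035 / 3 = 345 codons, that is 344 residues plus the stop codon.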

GC content: 62.51

Gene sequence:

>1035_bases
ATGAGCCGAGCCCGCATCAAGGATATCGCAGCGGAACTCGGCCTTTCGCCAGCCACGGTCTCCCGCGCCCTGAGCCATTC
GCCGCTGGTGGCGGAGCCCACACGCAGCCGCGTGCGGGAGGCGGCGCTGCGCCTCAACTACCGGCCCAATGTGAGCGCAC
GCAACTTGCGCACCCAGCGCTCCATGGCCGTGCTGATGGTGGTGCGCGACATCGGCAACCCCTTCTATCTCGACATCGTC
AAGGGAGTGGAGGCCGCTGCCCGCGAGGCCGGATATGTCGTGCTGATGGGCAATACGGAAAACAACCCCGAACGCGAGAT
CGAATATTTCGACATGCTGCGCGATGGTCACGCCGACGGCATGATTCTGATCACGGGCAAGCTGCCGGGAGGAGAAAGCC
TCTTTCAGCAATTGAAGGATCTGCCGGTGGTCGTCGCGCTGGAGCCCATAGAGAAAAGCGGGTTTCCACATGTGATGATC
GATAATACGGGTGCCGCGAGCGAAGCGGTGAAGCATCTCATCTCACTCGGCCACAGGCGCATCGCGCATATTGCCGGCCC
GGTTCCCGAGCCCATGGCAACGCGTCGCCGAGAGGGCTACCGGCAAGCGATAAAGGCAGCGGGGCTGGAGCTGAGGCCGG
AATATGAGGGTATCGGCGATTACCTGCTTTCCTCGGGCGAACGGATCTGCCGGCAGTTCTTTTCGCTCCCCGAGCCGCCC
AGCGCCATTTTCGCGGCAAATGACGAGATGGCCTTCGGCGCCATCAATGAATTGAGGCGTATGGCCCTTTGCGTGCCGAA
TGACGTTTCCGTCGTCGGCTTCGACGACATTTTCCTGAGCCAGGCCTTTTATCCGCCGCTCACGACGGTGAGCCAGCCGC
GCGCGGCGATCGGCCGCGAGGCCATGCGCCTGCTGCTCGAAGTGATGCATGGCGAAGGTCACACCGGAGAGACAATCATG
ATGCCGACGACATTTCAGACACGGGCTTCCACAGCGCCGGCACGCAGCCGCAGGCCTCACCGAAGATCAGCATGA
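The GC content field above can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the short test string is just the first twelve bases of the sequence block.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as FASTA line breaks) is ignored and case is folded.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First twelve bases of the gene sequence, used only as a small example.
print(round(gc_content("ATGAGCCGAGCC"), 2))
```

To check the reported value, paste the full 1035-base sequence (line breaks included) into the call.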

Upstream 100 bases:

>100_bases
TCACCACCATAAATATCGACGAATCCGGCCGGCAGATGATCTTCGGCGCGACGCTGCTCGTGCTGATGCTCTTCTATGGC
CGCGGGCGGGCCATGCGCGC

Downstream 100 bases:

>100_bases
AGGAGGAGAAAATGAACGGCCTTCCGAAGGAAAAGCTCAGGATCGGACTTGTCGGCTCCGGCTTCATCGCACAGTTTCAC
CTGCGCTCGATGCTAGGCGT

Product: LacI family transcription regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 344; Mature: 343

Protein sequence:

>344_residues
MSRARIKDIAAELGLSPATVSRALSHSPLVAEPTRSRVREAALRLNYRPNVSARNLRTQRSMAVLMVVRDIGNPFYLDIV
KGVEAAAREAGYVVLMGNTENNPEREIEYFDMLRDGHADGMILITGKLPGGESLFQQLKDLPVVVALEPIEKSGFPHVMI
DNTGAASEAVKHLISLGHRRIAHIAGPVPEPMATRRREGYRQAIKAAGLELRPEYEGIGDYLLSSGERICRQFFSLPEPP
SAIFAANDEMAFGAINELRRMALCVPNDVSVVGFDDIFLSQAFYPPLTTVSQPRAAIGREAMRLLLEVMHGEGHTGETIM
MPTTFQTRASTAPARSRRPHRRSA
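The protein sequence above corresponds to a codon-by-codon translation of the gene sequence, which is already given in coding (5' to 3') orientation even though the gene lies on the reverse strand. A minimal sketch of that translation follows, assuming the standard genetic code; `translate` and `CODON_TABLE` are hypothetical names, not part of any tool referenced by this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the gene sequence.
print(translate("ATGAGCCGAGCCCGCATCAAGGAT"))
```

Applied to the first eight codons of the gene this yields `MSRARIKD`, the start of the translated protein; the final codon of the gene, `TGA`, is the stop.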

Sequences:

>Translated_344_residues
MSRARIKDIAAELGLSPATVSRALSHSPLVAEPTRSRVREAALRLNYRPNVSARNLRTQRSMAVLMVVRDIGNPFYLDIV
KGVEAAAREAGYVVLMGNTENNPEREIEYFDMLRDGHADGMILITGKLPGGESLFQQLKDLPVVVALEPIEKSGFPHVMI
DNTGAASEAVKHLISLGHRRIAHIAGPVPEPMATRRREGYRQAIKAAGLELRPEYEGIGDYLLSSGERICRQFFSLPEPP
SAIFAANDEMAFGAINELRRMALCVPNDVSVVGFDDIFLSQAFYPPLTTVSQPRAAIGREAMRLLLEVMHGEGHTGETIM
MPTTFQTRASTAPARSRRPHRRSA
>Mature_343_residues
SRARIKDIAAELGLSPATVSRALSHSPLVAEPTRSRVREAALRLNYRPNVSARNLRTQRSMAVLMVVRDIGNPFYLDIVK
GVEAAAREAGYVVLMGNTENNPEREIEYFDMLRDGHADGMILITGKLPGGESLFQQLKDLPVVVALEPIEKSGFPHVMID
NTGAASEAVKHLISLGHRRIAHIAGPVPEPMATRRREGYRQAIKAAGLELRPEYEGIGDYLLSSGERICRQFFSLPEPPS
AIFAANDEMAFGAINELRRMALCVPNDVSVVGFDDIFLSQAFYPPLTTVSQPRAAIGREAMRLLLEVMHGEGHTGETIMM
PTTFQTRASTAPARSRRPHRRSA

Specific function: This protein negatively controls transcription initiation of genes such as deoCABD, udp, and cdd, which encode catabolic enzymes, and nupC, nupG, and tsx, which encode transport and pore-forming proteins. Binds cytidine and adenosine as effectors [H]

COG id: COG1609

COG function: function code K; Transcriptional regulators

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lacI-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1790369, Length=328, Percent_Identity=39.6341463414634, Blast_Score=251, Evalue=5e-68
Organism=Escherichia coli, GI1790194, Length=338, Percent_Identity=34.0236686390533, Blast_Score=183, Evalue=2e-47
Organism=Escherichia coli, GI1787948, Length=306, Percent_Identity=31.0457516339869, Blast_Score=169, Evalue=3e-43
Organism=Escherichia coli, GI1788474, Length=304, Percent_Identity=31.9078947368421, Blast_Score=147, Evalue=1e-36
Organism=Escherichia coli, GI1789202, Length=292, Percent_Identity=32.5342465753425, Blast_Score=142, Evalue=3e-35
Organism=Escherichia coli, GI1789068, Length=312, Percent_Identity=27.8846153846154, Blast_Score=125, Evalue=5e-30
Organism=Escherichia coli, GI1787580, Length=319, Percent_Identity=28.2131661442006, Blast_Score=120, Evalue=1e-28
Organism=Escherichia coli, GI1786540, Length=336, Percent_Identity=26.4880952380952, Blast_Score=114, Evalue=1e-26
Organism=Escherichia coli, GI48994940, Length=314, Percent_Identity=27.7070063694268, Blast_Score=109, Evalue=3e-25
Organism=Escherichia coli, GI1789456, Length=338, Percent_Identity=27.5147928994083, Blast_Score=102, Evalue=5e-23
Organism=Escherichia coli, GI1790715, Length=318, Percent_Identity=26.7295597484277, Blast_Score=99, Evalue=3e-22
Organism=Escherichia coli, GI1787906, Length=341, Percent_Identity=26.099706744868, Blast_Score=84, Evalue=9e-18
Organism=Escherichia coli, GI1790689, Length=334, Percent_Identity=25.4491017964072, Blast_Score=84, Evalue=1e-17
Organism=Escherichia coli, GI1786268, Length=308, Percent_Identity=23.0519480519481, Blast_Score=75, Evalue=4e-15
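
The homologue lines above follow a simple comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. The sketch below shows one way such a line might be parsed for downstream filtering; `parse_homologue` is a hypothetical helper, and the field handling is an assumption based only on the lines in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    # A trailing comma, if present, is tolerated and ignored.
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            record[key] = value
        elif part.startswith("GI"):
            record["GI"] = part[2:]      # bare token such as "GI1790369"
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790369, Length=328, "
    "Percent_Identity=39.6341463414634, Blast_Score=251, Evalue=5e-68"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```

With the hits parsed, a filter such as `[h for h in hits if h["Evalue"] < 1e-30]` would keep only the strongest matches.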

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000843
- InterPro:   IPR010982
- InterPro:   IPR001761 [H]

Pfam domain/function: PF00356 LacI; PF00532 Peripla_BP_1 [H]

EC number: NA

Molecular weight: Translated: 37877; Mature: 37746

Theoretical pI: Translated: 9.03; Mature: 9.03

Prosite motif: PS50932 HTH_LACI_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
4.1 %Met     (Translated Protein)
4.7 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
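
The percentages above can be reproduced by counting residues. The sketch below assumes, as the two sequence blocks in this record suggest, that the mature protein is the translated protein without its initiator methionine; `residue_percent` is a hypothetical helper name, and the sequence is copied from the protein sequence block of this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    return 100.0 * sum(protein.count(r) for r in residues) / len(protein)


# Translated protein, copied from the protein sequence block of this record.
TRANSLATED = (
    "MSRARIKDIAAELGLSPATVSRALSHSPLVAEPTRSRVREAALRLNYRPNVSARNLRTQRSMAVLMVVRDIGNPFYLDIV"
    "KGVEAAAREAGYVVLMGNTENNPEREIEYFDMLRDGHADGMILITGKLPGGESLFQQLKDLPVVVALEPIEKSGFPHVMI"
    "DNTGAASEAVKHLISLGHRRIAHIAGPVPEPMATRRREGYRQAIKAAGLELRPEYEGIGDYLLSSGERICRQFFSLPEPP"
    "SAIFAANDEMAFGAINELRRMALCVPNDVSVVGFDDIFLSQAFYPPLTTVSQPRAAIGREAMRLLLEVMHGEGHTGETIM"
    "MPTTFQTRASTAPARSRRPHRRSA"
)
# Assumption: the mature form lacks only the initiator methionine.
MATURE = TRANSLATED[1:]

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        round(residue_percent(seq, "C"), 1),
        round(residue_percent(seq, "M"), 1),
        round(residue_percent(seq, "CM"), 1),
    )
```

Dropping the single N-terminal methionine lowers the Met count by one, which accounts for the difference between the translated and mature Met percentages.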

Secondary structure:

>Translated Secondary Structure
MSRARIKDIAAELGLSPATVSRALSHSPLVAEPTRSRVREAALRLNYRPNVSARNLRTQR
CCCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHH
SMAVLMVVRDIGNPFYLDIVKGVEAAAREAGYVVLMGNTENNPEREIEYFDMLRDGHADG
HHHHHHHHHHCCCCEEHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHCCCCCC
MILITGKLPGGESLFQQLKDLPVVVALEPIEKSGFPHVMIDNTGAASEAVKHLISLGHRR
EEEEEECCCCHHHHHHHHHCCCEEEEEECHHHCCCCEEEEECCCCHHHHHHHHHHHHHHH
IAHIAGPVPEPMATRRREGYRQAIKAAGLELRPEYEGIGDYLLSSGERICRQFFSLPEPP
HHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCHHHHHHHHHCCCCCC
SAIFAANDEMAFGAINELRRMALCVPNDVSVVGFDDIFLSQAFYPPLTTVSQPRAAIGRE
CEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHCCCCCCCCCCCHHHHHHH
AMRLLLEVMHGEGHTGETIMMPTTFQTRASTAPARSRRPHRRSA
HHHHHHHHHHCCCCCCCEEEECCCCHHHCCCCCHHHCCCCCCCC
>Mature Secondary Structure 
SRARIKDIAAELGLSPATVSRALSHSPLVAEPTRSRVREAALRLNYRPNVSARNLRTQR
CCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHH
SMAVLMVVRDIGNPFYLDIVKGVEAAAREAGYVVLMGNTENNPEREIEYFDMLRDGHADG
HHHHHHHHHHCCCCEEHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHCCCCCC
MILITGKLPGGESLFQQLKDLPVVVALEPIEKSGFPHVMIDNTGAASEAVKHLISLGHRR
EEEEEECCCCHHHHHHHHHCCCEEEEEECHHHCCCCEEEEECCCCHHHHHHHHHHHHHHH
IAHIAGPVPEPMATRRREGYRQAIKAAGLELRPEYEGIGDYLLSSGERICRQFFSLPEPP
HHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCHHHHHHHHHCCCCCC
SAIFAANDEMAFGAINELRRMALCVPNDVSVVGFDDIFLSQAFYPPLTTVSQPRAAIGRE
CEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHCCCCCCCCCCCHHHHHHH
AMRLLLEVMHGEGHTGETIMMPTTFQTRASTAPARSRRPHRRSA
HHHHHHHHHHCCCCCCCEEEECCCCHHHCCCCCHHHCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]