Definition Azorhizobium caulinodans ORS 571, complete genome.
Accession NC_009937
Length 5,369,772

The map label for this gene is acoR [H]

Identifier: 158423535

GI number: 158423535

Start: 2189730

End: 2191709

Strand: Direct

Name: acoR [H]

Synonym: AZC_1911

Alternate gene names: 158423535

Gene position: 2189730-2191709 (Clockwise)

Preceding gene: 158423529

Following gene: 158423543

Centisome position: 40.78
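
The centisome value appears to be the gene start expressed as a percentage of the genome length. A minimal sketch under that assumption (the function name `centisome` is illustrative and does not come from the source database):

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Values taken from this record: Start 2189730, Length 5,369,772.
position = centisome(2189730, 5369772)
print(f"{position:.2f}")
```

With the Start and Length values listed in this record, the result rounds to the 40.78 reported above.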

GC content: 68.18
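
GC content is conventionally the percentage of G and C bases in a nucleotide sequence. The sketch below shows one way to recompute it from the gene sequence listed in this record; the helper name `gc_content` is hypothetical, and the choice to count every character (including any ambiguity codes) in the denominator is an assumption.

```python
def gc_content(sequence: str) -> float:
    """Percentage of characters in the sequence that are G or C."""
    seq = "".join(sequence.split()).upper()  # drop line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# Toy input; paste the 1980-base gene sequence to check the reported value.
print(gc_content("ATGC"))
```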

Gene sequence:

>1980_bases
ATGCGCGAGGAACAGGCCGCTCATATCGATGAGCTGGTGCGGGCTGCCAGTGGCCTTGGCACGCGGCGCGACAGCATCAT
CGAGGAATCCTGGCGGCGGTGCGTCAACCAGCACAATCTCGATCCGGTGGTGCTGCGGGACCCCTGCATCCTTCCTCACC
AGCGGCTGCGCGAGCATCAGGAGGCGATGGACGAATTCCTCCATACCGCCCGCTTCGGCGTGGAGACGCTCTACCGGGAG
GTGGCGGGGCTCGGCTATGTGCTGCTGCTGACCGACGCCAAGGGCGTGACCGTCGATTTCATCGGCGATCCCACCTTCGA
CAACAATCTCATGCGGGCGGGCCTCTATCTGGGCGCCGACTGGAATGAGCCCCATGCCGGCACCTGCGCGGTGGGCACCT
GCATCGCTACAGGCGAGGCGCTGGTGGTCCATCAGACGGACCATTTCGATGCCACCCACATTCCCCTGACCTGCACCGCC
GCGCCCATTTATGACGCCCGCGGCTCGCTGGCGGCGGTGCTCGACATCTCGGCGCTGCGCTCGCCCGAGCCCAAGGAGAG
CCAGCATCTCGCGCTGCAACTGGTGAAGAGCTTTGCGCGCAAGATCGAGAGCGCCCATCTGCTGAACCGCTTCCGCCGCG
AATGGATTCTCAAGCTCGCGCCATCGCCGGAATTCGCTGATGTGGACCCGGCCTATGTGATCGCGGTCGATGGGGCGGGG
CGCATTATCGGCTTCAACAATGAGGCGCGGCGCCTCCTGATCCGCGAGCTGGAGCGCCAGCCCGCCGGGCTGGACACGCG
GACCATCGCCGGGCGCCAGCTCTCCGACTTCTTCGAGCTGGAGGTGGACGATCTGCCCCGGCTCGGCCATGCCCGACCGG
TCACGCAGCGGATGGTCCGCTTCCGTGCGAGCGGGCTGCCGCTGTTCGCCCAGAGCCTCGCCCCGCCGGCGCGGGTCTCC
GCCCCGGCCGTGCCGGCCGCGCCGCCGGACCTGCCCAAGCCGCTGCAGGCGGTGTTCCATGATGATCCCGCCATGGCGCA
GGTGGTGGGGCGGGCCGCCAAGCTCGTGAACACCCAGATGAGCCTTCTCATCTGCGGCGAGACCGGCACCGGCAAGGAGC
ATCTGGCCAAGGCCATTCATGCCGCCAGCGGCCGGGCCGCCAAACCCTTCGTGCCGGTCAATTGCGCGGCCCTGCCCGAG
ACGCTGATCGAGGGCGAACTGTTCGGCTATGAGGCCGGCGCCTTCACGGGCGCTGCCGCCAAGGGCAAGCGGGGGCTCGT
CAGCGAGGCGGATGGTGGCACATTGTTCCTCGATGAGATCGGCGACATGCCTTTGTCTTCCCAGACGCGCCTGCTGCGCG
TTCTGGCGGAGCGGGAGGTGACGCCGCTCGGGCGGTCTCGTCCCATCCCGGTCAACATTCGCGTCATTGCCGCGACGCAC
CGCGATCTGGTGACCGAGGTGAAGGGGGGTAGGTTCCGCGAGGATCTCTACTTCCGCCTCAACGGAGCGATCCTGACGCT
GCCCCCGCTGCGCCACAGGACCGATTTCGACTGGCTGGCCGACCGTATCCTTGCCGAGCGATGCCGTAGCCTGTCGCGGC
GTATCGAGCTCTCCGCCGACGCACGGGAGGCGCTGCATGCTCATTCCTGGCCGGGCAACATGCGGGAGCTGCTGAACGCC
CTCGACTATGCGCTGGCCGTTGCCTGCGGCCCCCTGATCGGCCGCGAGGACCTGCCGGATGGCATCCGCGCGGGCTGGCA
GGGGCGGGATCTCGCTCCCGCGCATATTATGCAGGGCGTAGATGTGGCCGCTTCGTCCGAGCGGACCCGGCTTCTTGCCG
ATCTTCGCCGGCATGACTGGAACATCTCGGCGGTCGCCCGCGCCCTTGGGGTGGACCGCACCACGATCCACCGCCGCATG
CGACGCCTGCAAATTGCCTCCCTGCGCGAGCGGAGCACCGGCCTGGCGGACGGGGATTGA

Upstream 100 bases:

>100_bases
GTGCAACGATGCGCGACGCTCCGTCAGGCTGCGCGTCACGCTTGCGATGGGGGGCGTTCGGGGCGACACTTGCGCAAAAG
AACGATCTGGGAGGATCGTC

Downstream 100 bases:

>100_bases
GCGGGTCGGTCCATCAGGCCACCGGTCCCAATCCTTCCGCAATACGCCCCGCCAGGAGGTCGAGCCCGAGGACCGCGATG
AGCGATACTCCGGCAAGCGG

Product: transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 659; Mature: 659
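
The residue count can be cross-checked against the gene coordinates: an inclusive coordinate range gives the gene length in bases, and dividing by three and dropping the terminal stop codon gives the number of amino acids. A minimal sketch, assuming 1-based inclusive coordinates and a single stop codon at the end of the gene (both function names are illustrative):

```python
def coding_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def residues_from_cds(n_bases: int) -> int:
    """Amino acids encoded by a coding sequence that ends in one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bases // 3 - 1  # the stop codon encodes no residue


n_bases = coding_length(2189730, 2191709)  # Start and End from this record
print(n_bases, residues_from_cds(n_bases))
```

For this record the check yields 1980 bases and 659 residues, matching the `>1980_bases` and `>659_residues` headers.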

Protein sequence:

>659_residues
MREEQAAHIDELVRAASGLGTRRDSIIEESWRRCVNQHNLDPVVLRDPCILPHQRLREHQEAMDEFLHTARFGVETLYRE
VAGLGYVLLLTDAKGVTVDFIGDPTFDNNLMRAGLYLGADWNEPHAGTCAVGTCIATGEALVVHQTDHFDATHIPLTCTA
APIYDARGSLAAVLDISALRSPEPKESQHLALQLVKSFARKIESAHLLNRFRREWILKLAPSPEFADVDPAYVIAVDGAG
RIIGFNNEARRLLIRELERQPAGLDTRTIAGRQLSDFFELEVDDLPRLGHARPVTQRMVRFRASGLPLFAQSLAPPARVS
APAVPAAPPDLPKPLQAVFHDDPAMAQVVGRAAKLVNTQMSLLICGETGTGKEHLAKAIHAASGRAAKPFVPVNCAALPE
TLIEGELFGYEAGAFTGAAAKGKRGLVSEADGGTLFLDEIGDMPLSSQTRLLRVLAEREVTPLGRSRPIPVNIRVIAATH
RDLVTEVKGGRFREDLYFRLNGAILTLPPLRHRTDFDWLADRILAERCRSLSRRIELSADAREALHAHSWPGNMRELLNA
LDYALAVACGPLIGREDLPDGIRAGWQGRDLAPAHIMQGVDVAASSERTRLLADLRRHDWNISAVARALGVDRTTIHRRM
RRLQIASLRERSTGLADGD

Specific function: Required for sigma-54-dependent transcription of acoXABC [H]

COG id: COG3284

COG function: function code QK; Transcriptional activator of acetoin/glycerol metabolism

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1789233, Length=326, Percent_Identity=40.7975460122699, Blast_Score=234, Evalue=2e-62,
Organism=Escherichia coli, GI1788550, Length=249, Percent_Identity=51.004016064257, Blast_Score=231, Evalue=1e-61,
Organism=Escherichia coli, GI1790437, Length=328, Percent_Identity=42.0731707317073, Blast_Score=226, Evalue=2e-60,
Organism=Escherichia coli, GI1788905, Length=315, Percent_Identity=41.9047619047619, Blast_Score=225, Evalue=6e-60,
Organism=Escherichia coli, GI87082117, Length=324, Percent_Identity=41.9753086419753, Blast_Score=214, Evalue=1e-56,
Organism=Escherichia coli, GI1790299, Length=335, Percent_Identity=40.2985074626866, Blast_Score=210, Evalue=3e-55,
Organism=Escherichia coli, GI1786524, Length=326, Percent_Identity=42.638036809816, Blast_Score=207, Evalue=2e-54,
Organism=Escherichia coli, GI87082152, Length=312, Percent_Identity=41.6666666666667, Blast_Score=198, Evalue=8e-52,
Organism=Escherichia coli, GI1789087, Length=307, Percent_Identity=40.3908794788274, Blast_Score=194, Evalue=2e-50,
Organism=Escherichia coli, GI1787583, Length=305, Percent_Identity=39.0163934426229, Blast_Score=175, Evalue=7e-45,
Organism=Escherichia coli, GI87081872, Length=222, Percent_Identity=42.7927927927928, Blast_Score=168, Evalue=1e-42,
Organism=Escherichia coli, GI87081858, Length=645, Percent_Identity=25.1162790697674, Blast_Score=157, Evalue=3e-39,
Organism=Escherichia coli, GI1789828, Length=260, Percent_Identity=33.8461538461538, Blast_Score=124, Evalue=2e-29,
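
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal sketch of how such a line could be parsed into a typed dictionary follows; the function name `parse_hit` and the `GI` dictionary key are assumptions made for illustration, not part of the source database.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary."""
    converters = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": int,
        "Evalue": float,
    }
    hit = {}
    # A trailing comma leaves an empty field, which is skipped here.
    for field in (f.strip() for f in line.split(",")):
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = converters.get(key, str)(value)
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # keep the identifier as text
    return hit


line = ("Organism=Escherichia coli, GI1789233, Length=326, "
        "Percent_Identity=40.7975460122699, Blast_Score=234, Evalue=2e-62,")
print(parse_hit(line))
```

Keeping the GI identifier as a string avoids treating an accession-like value as a quantity.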

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR002197
- InterPro:   IPR002078 [H]

Pfam domain/function: PF02954 HTH_8; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 72394; Mature: 72394

Theoretical pI: Translated: 7.23; Mature: 7.23

Prosite motif: PS00675 SIGMA54_INTERACT_1 ; PS00676 SIGMA54_INTERACT_2 ; PS00688 SIGMA54_INTERACT_3 ; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
1.5 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
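
The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation, assuming one-letter amino acid codes (the helper name `residue_percent` is hypothetical):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()  # drop line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return 100.0 * hits / len(seq)


# Toy input; paste the 659-residue protein sequence to check the table above.
toy = "MCAA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```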

Secondary structure:

>Translated Secondary Structure
MREEQAAHIDELVRAASGLGTRRDSIIEESWRRCVNQHNLDPVVLRDPCILPHQRLREHQ
CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHH
EAMDEFLHTARFGVETLYREVAGLGYVLLLTDAKGVTVDFIGDPTFDNNLMRAGLYLGAD
HHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCEEEECCCCCCCCCHHHHCEEECCC
WNEPHAGTCAVGTCIATGEALVVHQTDHFDATHIPLTCTAAPIYDARGSLAAVLDISALR
CCCCCCCCHHHHHHHCCCCEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEHEEEHHHHC
SPEPKESQHLALQLVKSFARKIESAHLLNRFRREWILKLAPSPEFADVDPAYVIAVDGAG
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCEEEEEECCCC
RIIGFNNEARRLLIRELERQPAGLDTRTIAGRQLSDFFELEVDDLPRLGHARPVTQRMVR
EEEECCHHHHHHHHHHHHCCCCCCCCHHCCCCHHHHHHHCCHHCCCCCCCCCHHHHHHHH
FRASGLPLFAQSLAPPARVSAPAVPAAPPDLPKPLQAVFHDDPAMAQVVGRAAKLVNTQM
HHHCCCCHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCE
SLLICGETGTGKEHLAKAIHAASGRAAKPFVPVNCAALPETLIEGELFGYEAGAFTGAAA
EEEEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCC
KGKRGLVSEADGGTLFLDEIGDMPLSSQTRLLRVLAEREVTPLGRSRPIPVNIRVIAATH
CCCCCCCCCCCCCEEEHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECH
RDLVTEVKGGRFREDLYFRLNGAILTLPPLRHRTDFDWLADRILAERCRSLSRRIELSAD
HHHHHHHCCCCCCEEEEEEECCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHEECCCH
AREALHAHSWPGNMRELLNALDYALAVACGPLIGREDLPDGIRAGWQGRDLAPAHIMQGV
HHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCCCCHHHHHCCC
DVAASSERTRLLADLRRHDWNISAVARALGVDRTTIHRRMRRLQIASLRERSTGLADGD
CHHCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1378052 [H]