Definition | Azorhizobium caulinodans ORS 571, complete genome. |
---|---|
Accession | NC_009937 |
Length | 5,369,772 |
The map label for this gene is lysR [H]
Identifier: 158423977
GI number: 158423977
Start: 2679172
End: 2680194
Strand: Reverse
Name: lysR [H]
Synonym: AZC_2353
Alternate gene names: 158423977
Gene position: 2680194-2679172 (Counterclockwise)
Preceding gene: 158423978
Following gene: 158423976
Centisome position: 49.91
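The coordinate fields above are enough to reproduce the gene length and, apparently, the centisome position. The following is a minimal Python sketch, not the database's own method. It assumes 1-based inclusive coordinates, and it assumes the centisome is the gene's 5' coordinate (the End value here, because the gene lies on the reverse strand) expressed as a percentage of the genome length. The record does not state the formula, and the function names are illustrative.

```python
GENOME_LENGTH = 5_369_772          # from the Length field
START, END = 2_679_172, 2_680_194  # from the Start and End fields


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))        # 1023
print(centisome(END, GENOME_LENGTH))  # 49.91
```

Under the same assumptions, using the Start coordinate instead gives 49.89, so the listed value of 49.91 is consistent with the End coordinate being used for a reverse-strand gene.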
GC content: 65.4
Gene sequence:
>1023_bases GTGGATGCCTTGCGATGGCGGGCACGCGGGCCTAGCCTTGCGCCCGCTGCCGTGCATCGATCCGGGGGACAAGGGCTGGC GATGAGGTCCGATACCGTTCGTCTGCCCCCTCTCAATGCCCTGCGGGTGTTCCACGCGGTCATGCGTCACGGCAGTTTCC GCAGCGCCGCGGACGAACTGCTTGTCTCCCCGCAGGCTGTGGGCCAGCAGATCAAGCTTCTGGAAGACACCCTCGCGGTG CCTCTGTTCGATCGACGGGGACGGGCGATCGAACCCACTGAAGAGGCCATTCTCCTCTCCCATTACGTGCAGAGTGGCTT CGACGAGTTCCGCGAGGGGGTCAGGCGCATCTGCAAGGTGGGCCATCGCAACCGCATCAACCTGAATGCCAGCCCCTATT TCGCGACGCGCTATCTCGTGGACCGGCTCGATCGATTCCGCGATCGTCTACCCGGTGCGGATATCCGGCTGAAGACGATG GTGGAGCTCCCGGACTTTTCAGCCGACGAGGTGGATGCGGCCATTCAGTGGGGCTTCGGCCAGTGGCGGGATTATGAATC GACGCTGCTGGTCCAGGATCCGAAGGTGATCTGCTGCTCACCGGCGCGGGCCTCGGCCCTGCGCAGTCCGCAGGACCTCA GGACCGCACCGCTGCTCCATCTGGTGCTGGCAACGAACCTCTGGCCGCGCGTGTTGCGCCATCTCGGCGTGGATCCGGGC GAGGTGCAGAAGGAAATCCAGTTCCACGACGCCGCCAGCATGCGCCGGGCGACGCTCTCCGGGCTCGGGATCGGCCTCAT TTCCGTGCTCGATGCCCAGGAAGACCTCAAGGCGGGTCGACTCGTCGCGCCCTTCGGCCTTGATGCAATGGCGGGCATGG ACCCGGCCGATGTGCCGGGCTTCTATCTCGTGCTGCCGCGCTCCAATCGGCGGCTCAAGAGCGTCGCGGCGTTCTGCGAA TGGATCCTGTCGGAAGACTGGTCCCGCATGGAGCCGGATGCCCCTTTTCCGGCGGCGACTTAG
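The GC content field is presumably the percentage of G and C bases in the gene sequence. The record does not say how it was computed, so the sketch below is only one plausible way to reproduce such a figure in Python. The function name is illustrative, and whitespace is stripped because the sequence is stored with embedded spaces.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return round(100 * gc_count / len(bases), 1)


# First 20 bases of the gene sequence, with a space as in the record.
print(gc_content("GTGGATGCCT TGCGATGGCG"))  # 65.0
```

Passing the full 1023-base sequence to `gc_content` should give a value comparable to the listed 65.4, assuming the same definition was used.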
Upstream 100 bases:
>100_bases GAACGGGAGCCCGCCCGGCTTAAGGCGTTCTGTCCGGTCGGCTCTCGCCTCACCGGGAGGCCTGAGACGCGCCCCTGCAC GGAGGCGGGGGCACCTTCCC
Downstream 100 bases:
>100_bases GGTGACCGCTCGGAGATAAGCAGAGGATCATTGGCGAGAGAAGCGGCAGGTCAGCAGCTTCCTTTATGCCTGTCGCGGCA ACGCCTAAGGTGCTCGAGGG
Product: transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 340; Mature: 340
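The residue count follows from the gene length: 1023 bases form 341 codons, and the last codon (TAG) is a stop codon that is not translated, leaving 340 residues. A small Python sketch of that arithmetic, with an illustrative function name and the assumption that the coding sequence includes exactly one terminal stop codon:

```python
def protein_length(cds_length: int) -> int:
    """Residues encoded by a coding sequence that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


print(protein_length(1023))  # 340
```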
Protein sequence:
>340_residues MDALRWRARGPSLAPAAVHRSGGQGLAMRSDTVRLPPLNALRVFHAVMRHGSFRSAADELLVSPQAVGQQIKLLEDTLAV PLFDRRGRAIEPTEEAILLSHYVQSGFDEFREGVRRICKVGHRNRINLNASPYFATRYLVDRLDRFRDRLPGADIRLKTM VELPDFSADEVDAAIQWGFGQWRDYESTLLVQDPKVICCSPARASALRSPQDLRTAPLLHLVLATNLWPRVLRHLGVDPG EVQKEIQFHDAASMRRATLSGLGIGLISVLDAQEDLKAGRLVAPFGLDAMAGMDPADVPGFYLVLPRSNRRLKSVAAFCE WILSEDWSRMEPDAPFPAAT
Sequences:
>Translated_340_residues MDALRWRARGPSLAPAAVHRSGGQGLAMRSDTVRLPPLNALRVFHAVMRHGSFRSAADELLVSPQAVGQQIKLLEDTLAV PLFDRRGRAIEPTEEAILLSHYVQSGFDEFREGVRRICKVGHRNRINLNASPYFATRYLVDRLDRFRDRLPGADIRLKTM VELPDFSADEVDAAIQWGFGQWRDYESTLLVQDPKVICCSPARASALRSPQDLRTAPLLHLVLATNLWPRVLRHLGVDPG EVQKEIQFHDAASMRRATLSGLGIGLISVLDAQEDLKAGRLVAPFGLDAMAGMDPADVPGFYLVLPRSNRRLKSVAAFCE WILSEDWSRMEPDAPFPAAT
>Mature_340_residues MDALRWRARGPSLAPAAVHRSGGQGLAMRSDTVRLPPLNALRVFHAVMRHGSFRSAADELLVSPQAVGQQIKLLEDTLAV PLFDRRGRAIEPTEEAILLSHYVQSGFDEFREGVRRICKVGHRNRINLNASPYFATRYLVDRLDRFRDRLPGADIRLKTM VELPDFSADEVDAAIQWGFGQWRDYESTLLVQDPKVICCSPARASALRSPQDLRTAPLLHLVLATNLWPRVLRHLGVDPG EVQKEIQFHDAASMRRATLSGLGIGLISVLDAQEDLKAGRLVAPFGLDAMAGMDPADVPGFYLVLPRSNRRLKSVAAFCE WILSEDWSRMEPDAPFPAAT
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=301, Percent_Identity=36.2126245847176, Blast_Score=167, Evalue=7e-43
Organism=Escherichia coli, GI1786448, Length=289, Percent_Identity=31.1418685121107, Blast_Score=129, Evalue=3e-31
Organism=Escherichia coli, GI157672245, Length=143, Percent_Identity=33.5664335664336, Blast_Score=72, Evalue=4e-14
Organism=Escherichia coli, GI1788706, Length=295, Percent_Identity=25.7627118644068, Blast_Score=72, Evalue=5e-14
Organism=Escherichia coli, GI87081978, Length=302, Percent_Identity=25.1655629139073, Blast_Score=65, Evalue=7e-12
Organism=Escherichia coli, GI1787128, Length=297, Percent_Identity=25.5892255892256, Blast_Score=64, Evalue=1e-11
Organism=Escherichia coli, GI1786401, Length=249, Percent_Identity=26.1044176706827, Blast_Score=63, Evalue=3e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 37847; Mature: 37847
Theoretical pI: Translated: 8.34; Mature: 8.34
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein)
2.4 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.2 %Cys (Mature Protein)
2.4 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
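The Cys/Met percentages can be checked directly against the 340-residue protein sequence, which contains 4 cysteines and 8 methionines. The Python sketch below is illustrative rather than the database's own procedure: the helper name is made up, and rounding to one decimal place is an assumption based on how the values are printed.

```python
# Protein sequence copied from the "Protein sequence" field, spaces removed.
PROTEIN = (
    "MDALRWRARGPSLAPAAVHRSGGQGLAMRSDTVRLPPLNALRVFHAVMRHGSFRSAADELLVSPQAVGQQIKLLEDTLAV"
    "PLFDRRGRAIEPTEEAILLSHYVQSGFDEFREGVRRICKVGHRNRINLNASPYFATRYLVDRLDRFRDRLPGADIRLKTM"
    "VELPDFSADEVDAAIQWGFGQWRDYESTLLVQDPKVICCSPARASALRSPQDLRTAPLLHLVLATNLWPRVLRHLGVDPG"
    "EVQKEIQFHDAASMRRATLSGLGIGLISVLDAQEDLKAGRLVAPFGLDAMAGMDPADVPGFYLVLPRSNRRLKSVAAFCE"
    "WILSEDWSRMEPDAPFPAAT"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(residue) for residue in residues)
    return round(100 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 1.2
print(residue_percent(PROTEIN, "M"))   # 2.4
print(residue_percent(PROTEIN, "CM"))  # 3.5
```

Because the translated and mature sequences are identical in this record, the same values apply to both.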
Secondary structure:
>Translated Secondary Structure
MDALRWRARGPSLAPAAVHRSGGQGLAMRSDTVRLPPLNALRVFHAVMRHGSFRSAADEL
CCCCCCCCCCCCCCHHHHHCCCCCCEEECCCCEECCCHHHHHHHHHHHHCCCHHHHHHHH
LVSPQAVGQQIKLLEDTLAVPLFDRRGRAIEPTEEAILLSHYVQSGFDEFREGVRRICKV
HCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC
GHRNRINLNASPYFATRYLVDRLDRFRDRLPGADIRLKTMVELPDFSADEVDAAIQWGFG
CCCCEEECCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCHHHHHHHHCCCC
QWRDYESTLLVQDPKVICCSPARASALRSPQDLRTAPLLHLVLATNLWPRVLRHLGVDPG
CCCCCCCEEEEECCCEEEECCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
EVQKEIQFHDAASMRRATLSGLGIGLISVLDAQEDLKAGRLVAPFGLDAMAGMDPADVPG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCEECCCCCHHHCCCCCCCCCC
FYLVLPRSNRRLKSVAAFCEWILSEDWSRMEPDAPFPAAT
EEEEECCCCCHHHHHHHHHHHHHHCCHHHCCCCCCCCCCC
>Mature Secondary Structure
MDALRWRARGPSLAPAAVHRSGGQGLAMRSDTVRLPPLNALRVFHAVMRHGSFRSAADEL
CCCCCCCCCCCCCCHHHHHCCCCCCEEECCCCEECCCHHHHHHHHHHHHCCCHHHHHHHH
LVSPQAVGQQIKLLEDTLAVPLFDRRGRAIEPTEEAILLSHYVQSGFDEFREGVRRICKV
HCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC
GHRNRINLNASPYFATRYLVDRLDRFRDRLPGADIRLKTMVELPDFSADEVDAAIQWGFG
CCCCEEECCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCHHHHHHHHCCCC
QWRDYESTLLVQDPKVICCSPARASALRSPQDLRTAPLLHLVLATNLWPRVLRHLGVDPG
CCCCCCCEEEEECCCEEEECCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
EVQKEIQFHDAASMRRATLSGLGIGLISVLDAQEDLKAGRLVAPFGLDAMAGMDPADVPG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCEECCCCCHHHCCCCCCCCCC
FYLVLPRSNRRLKSVAAFCEWILSEDWSRMEPDAPFPAAT
EEEEECCCCCHHHHHHHHHHHHHHCCHHHCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]