Definition Rhizobium leguminosarum bv. trifolii WSM2304 chromosome, complete genome.
Accession NC_011369
Length 4,537,948

The map label for this gene is gcvA [H]

Identifier: 209550444

GI number: 209550444

Start: 2917386

End: 2918282

Strand: Direct

Name: gcvA [H]

Synonym: Rleg2_2866

Alternate gene names: 209550444

Gene position: 2917386-2918282 (Clockwise)

Preceding gene: 209550442

Following gene: 209550447

Centisome position: 64.29
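
The coordinate fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not define either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the replicon length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    # Values taken from the Start, End and Length fields of this record.
    print(gene_length(2917386, 2918282))   # 897
    print(centisome(2917386, 4537948))     # 64.29
```

Under these assumptions the computed values agree with the 897-base gene sequence and the centisome position reported in this record.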

GC content: 62.76

Gene sequence:

>897_bases
ATGAAACTCTCGAAACAATTTCCGCTGAATGCGCTGCGCGTCTTCGAAGCCGCGGCACGGCTCGGAAGTTTCACCAAGGC
GGGCGACGAACTCGGGATGACGCAGACGGCCGTCAGCTACCAGATCAAGCTGCTGGAGGAGAATGTCGGCGAGCCGCTCT
TCCTGCGCCGTCCCCGTCAGATCGCCTTGACCGAGACCGGCGAGCGGCTGGCGCCGAAGGTGACTGAAGCTTTCGCCATG
CTGCACGACGCGATGGCGACGGCGCGCGACAGCGTCGAAGGCACACTCGCCATCAGCTCGACCCATACGTTCGCCTCGAA
ATGGCTGGCGCCGCGCCTCGGCTCTTTCCATTTGAAACATCCGGCGATTGCGGTGCGCTTTCAGGCGAGTTCCGACATCA
TCGATTTTACCCGTGAACAGATCGATGTCGGGATCCGCTGGGGCGACGGCAACTGGCCGGGTCTGACCCTGCACCGGTTG
ATGGGGCTGGAGTTCACCCCGATGCTCAGCCCGAAGCTGGCGGAATCAGCCGGCGGGATCAAGGAACCGCGCGATCTCCT
GAAGTTGGATCTCTTCGACGCCGGCGACATCTGGTGGAAACAATGGTTCGAGGCAGCCGGCGTCACCGACACGGATCTCG
ATCGCCGCCCGCGCAACCAGCTCGGCTCCCAGGCGGTGGAAGCCGACGCGGCAATCGCCGGCCATGGCGTTGCCATCCTC
CATCCGGCCTTCTACGCGGCCGAGATCGCCCTCGGCCGGCTGTATCAACCTTTCGAACTGACCCGCAGCGACGGCAAGGC
CTATTGGCTCGTCTATCCGGAAAACCGTCGCAACGTGCCGAAGATCAAAGCTTTCCGCAACTGGATCCTCGACGAGATGA
AGGCAGCCGGCAATTAA
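
The GC content field can be recomputed from a FASTA-style sequence such as the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole gene; `gc_content` is a hypothetical helper, not part of any tool behind this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; line breaks and spaces are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence in this record, used only as a demo input.
    first_line = (
        "ATGAAACTCTCGAAACAATTTCCGCTGAATGCGCTGCGCGTCTTCGAAGCCGCGGCACGG"
        "CTCGGAAGTTTCACCAAGGC"
    )
    print(f"{gc_content(first_line):.2f}")
```

For reference, a value of 62.76 over 897 bases corresponds to 563 G or C bases, since 100 * 563 / 897 rounds to 62.76.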

Upstream 100 bases:

>100_bases
ACATCACCTCGATAGATTGATGGTATGAGTTGCAGTTTGCCGCCGAAAATGCTGCAATTCCAATCGAACGACCGTCTGCC
TGCATCAAGAGAACTTATCC

Downstream 100 bases:

>100_bases
CGCATGTCGCCCGGAAGTGTGCAGCGGTTTCCCAGGCAAAGCGCGAAGCGCTTTTGCCGCAACGACATGCATCAAAACAA
AAAGCTAAAGCGCGTCGCAT

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 298; Mature: 298

Protein sequence:

>298_residues
MKLSKQFPLNALRVFEAAARLGSFTKAGDELGMTQTAVSYQIKLLEENVGEPLFLRRPRQIALTETGERLAPKVTEAFAM
LHDAMATARDSVEGTLAISSTHTFASKWLAPRLGSFHLKHPAIAVRFQASSDIIDFTREQIDVGIRWGDGNWPGLTLHRL
MGLEFTPMLSPKLAESAGGIKEPRDLLKLDLFDAGDIWWKQWFEAAGVTDTDLDRRPRNQLGSQAVEADAAIAGHGVAIL
HPAFYAAEIALGRLYQPFELTRSDGKAYWLVYPENRRNVPKIKAFRNWILDEMKAAGN
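
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. The sketch below assumes the standard codon assignments (the record does not state which translation table was used) and stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Ambiguous bases (for example N) are not handled and raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAAACTCTCGAAACAA"))  # first six codons of the gene: MKLSKQ
    print(translate("GGCAATTAA"))           # last three codons of the gene: GN
```

The 897-base gene holds 299 codons, the last being the TAA stop, which matches the 298 residues reported here.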

Sequences:

>Translated_298_residues
MKLSKQFPLNALRVFEAAARLGSFTKAGDELGMTQTAVSYQIKLLEENVGEPLFLRRPRQIALTETGERLAPKVTEAFAM
LHDAMATARDSVEGTLAISSTHTFASKWLAPRLGSFHLKHPAIAVRFQASSDIIDFTREQIDVGIRWGDGNWPGLTLHRL
MGLEFTPMLSPKLAESAGGIKEPRDLLKLDLFDAGDIWWKQWFEAAGVTDTDLDRRPRNQLGSQAVEADAAIAGHGVAIL
HPAFYAAEIALGRLYQPFELTRSDGKAYWLVYPENRRNVPKIKAFRNWILDEMKAAGN
>Mature_298_residues
MKLSKQFPLNALRVFEAAARLGSFTKAGDELGMTQTAVSYQIKLLEENVGEPLFLRRPRQIALTETGERLAPKVTEAFAM
LHDAMATARDSVEGTLAISSTHTFASKWLAPRLGSFHLKHPAIAVRFQASSDIIDFTREQIDVGIRWGDGNWPGLTLHRL
MGLEFTPMLSPKLAESAGGIKEPRDLLKLDLFDAGDIWWKQWFEAAGVTDTDLDRRPRNQLGSQAVEADAAIAGHGVAIL
HPAFYAAEIALGRLYQPFELTRSDGKAYWLVYPENRRNVPKIKAFRNWILDEMKAAGN

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=285, Percent_Identity=40, Blast_Score=184, Evalue=6e-48
Organism=Escherichia coli, GI1786448, Length=302, Percent_Identity=33.112582781457, Blast_Score=145, Evalue=3e-36
Organism=Escherichia coli, GI1788706, Length=306, Percent_Identity=28.1045751633987, Blast_Score=115, Evalue=3e-27
Organism=Escherichia coli, GI157672245, Length=166, Percent_Identity=34.9397590361446, Blast_Score=84, Evalue=1e-17
Organism=Escherichia coli, GI87081978, Length=184, Percent_Identity=30.9782608695652, Blast_Score=84, Evalue=2e-17
Organism=Escherichia coli, GI145693193, Length=302, Percent_Identity=24.5033112582781, Blast_Score=82, Evalue=5e-17
Organism=Escherichia coli, GI1787128, Length=292, Percent_Identity=26.3698630136986, Blast_Score=80, Evalue=1e-16
Organism=Escherichia coli, GI1786401, Length=298, Percent_Identity=25.503355704698, Blast_Score=76, Evalue=2e-15
Organism=Escherichia coli, GI1787589, Length=276, Percent_Identity=26.0869565217391, Blast_Score=70, Evalue=2e-13
Organism=Escherichia coli, GI1789440, Length=141, Percent_Identity=29.0780141843972, Blast_Score=67, Evalue=1e-12
Organism=Escherichia coli, GI1786713, Length=64, Percent_Identity=50, Blast_Score=62, Evalue=6e-11
Organism=Escherichia coli, GI1788748, Length=123, Percent_Identity=28.4552845528455, Blast_Score=61, Evalue=7e-11
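
Homologue lines in this layout can be turned into typed records for sorting or filtering. The parser below is a minimal sketch that assumes fields are separated by a comma and a space, that the bare GI token is the only field without an equals sign, and that an optional trailing comma may be present; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        else:
            # The GI identifier is the only bare token in this layout (assumption).
            record["GI"] = field
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Evalue"] = float(record["Evalue"])
    return record


if __name__ == "__main__":
    best = parse_homologue(
        "Organism=Escherichia coli, GI1789173, Length=285, "
        "Percent_Identity=40, Blast_Score=184, Evalue=6e-48,"
    )
    print(best["GI"], best["Blast_Score"], best["Evalue"])
```

Parsed records can then be ranked, for example with `sorted(records, key=lambda r: r["Evalue"])`.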

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 33200; Mature: 33200

Theoretical pI: Translated: 7.95; Mature: 7.95

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
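
The Cys/Met percentages can be derived directly from a one-letter protein sequence. This is a minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place; `cys_met_content` is an illustrative name.

```python
def cys_met_content(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


if __name__ == "__main__":
    # First ten residues of the protein in this record, used only as a demo input.
    print(cys_met_content("MKLSKQFPLN"))  # (0.0, 10.0, 10.0)
```

For the full 298-residue protein, a reported 2.3 % Met is consistent with 7 methionines, since 100 * 7 / 298 rounds to 2.3.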

Secondary structure:

>Translated Secondary Structure
MKLSKQFPLNALRVFEAAARLGSFTKAGDELGMTQTAVSYQIKLLEENVGEPLFLRRPRQ
CCCCCCCCHHHHHHHHHHHHHCCHHHCCHHCCCHHHHHHHHEEEHHHCCCCCEEECCCCE
IALTETGERLAPKVTEAFAMLHDAMATARDSVEGTLAISSTHTFASKWLAPRLGSFHLKH
EEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCCEEECC
PAIAVRFQASSDIIDFTREQIDVGIRWGDGNWPGLTLHRLMGLEFTPMLSPKLAESAGGI
CEEEEEEECCCCHHHHHHHHEEEEEEECCCCCCCHHHHHHHCCCCCCCCCCHHHHHCCCC
KEPRDLLKLDLFDAGDIWWKQWFEAAGVTDTDLDRRPRNQLGSQAVEADAAIAGHGVAIL
CCCHHHHEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEE
HPAFYAAEIALGRLYQPFELTRSDGKAYWLVYPENRRNVPKIKAFRNWILDEMKAAGN
HHHHHHHHHHHHHCCCCHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKLSKQFPLNALRVFEAAARLGSFTKAGDELGMTQTAVSYQIKLLEENVGEPLFLRRPRQ
CCCCCCCCHHHHHHHHHHHHHCCHHHCCHHCCCHHHHHHHHEEEHHHCCCCCEEECCCCE
IALTETGERLAPKVTEAFAMLHDAMATARDSVEGTLAISSTHTFASKWLAPRLGSFHLKH
EEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCCEEECC
PAIAVRFQASSDIIDFTREQIDVGIRWGDGNWPGLTLHRLMGLEFTPMLSPKLAESAGGI
CEEEEEEECCCCHHHHHHHHEEEEEEECCCCCCCHHHHHHHCCCCCCCCCCHHHHHCCCC
KEPRDLLKLDLFDAGDIWWKQWFEAAGVTDTDLDRRPRNQLGSQAVEADAAIAGHGVAIL
CCCHHHHEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEE
HPAFYAAEIALGRLYQPFELTRSDGKAYWLVYPENRRNVPKIKAFRNWILDEMKAAGN
HHHHHHHHHHHHHCCCCHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC
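
A predicted secondary structure string like the ones above can be summarized as the share of each state. The sketch assumes the usual three-state alphabet (H for helix, E for strand, C for coil); the record does not spell out the legend, and `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of helix (H), strand (E) and coil (C) states in a prediction."""
    ss = "".join(ss.split()).upper()
    if not ss:
        raise ValueError("empty prediction")
    counts = Counter(ss)
    return {state: 100.0 * counts.get(state, 0) / len(ss) for state in "HEC"}


if __name__ == "__main__":
    # Last prediction line of this record, covering the final 58 residues.
    tail = "HHHHHHHHHHHHHCCCCHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC"
    for state, pct in ss_composition(tail).items():
        print(f"{state}: {pct:.1f}%")
```

A prediction containing substantial fractions of both H and E states fits the "Alpha Beta" structure class listed in this record.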

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]