Definition: Sinorhizobium medicae WSM419 chromosome, complete genome.
Accession: NC_009636
Length: 3,781,904 bp

The map label for this gene is gcvA [H]

Identifier: 150395796

GI number: 150395796

Start: 621208

End: 622143

Strand: Direct

Name: gcvA [H]

Synonym: Smed_0572

Alternate gene names: 150395796

Gene position: 621208-622143 (Clockwise)
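
The coordinates above are inclusive, so the gene length and the residue count reported elsewhere in this record can be checked with a few lines of Python. This is a minimal sketch: the function names are illustrative and not part of the database, and the residue count assumes that the final codon is a stop codon.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def residue_count(length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(621208, 622143)
print(length, residue_count(length))
```

With the coordinates of this entry the sketch gives 936 bases and 311 residues, which agrees with the sequence headers in the record.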

Preceding gene: 150395795

Following gene: 150395798

Centisome position: 16.43
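
The record does not state how the centisome position is derived. A plausible reading, taken here as an assumption, is the gene start expressed as a percentage of the chromosome length (3,781,904 bases for this replicon), which reproduces the listed value. The function name is illustrative.

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the replicon length (assumed formula)."""
    return round(100.0 * start / genome_length, 2)


print(centisome(621208, 3781904))
```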

GC content: 66.13
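
GC content is conventionally the percentage of G and C bases in the sequence. The sketch below shows that calculation under this assumption; `gc_content` is an illustrative helper, not a function of the database. A value of 66.13 over 936 bases corresponds to 619 G or C bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Toy input; the gene sequence can be passed in the same way,
# including its line breaks.
print(gc_content("ATGC"))
```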

Gene sequence:

>936_bases
ATGAAGGAGTTGAGCGCGGTTCATCTGAACGGGTTGCGTGCGACCGAAGCGGTCGGCCGCCTCGGCTCGCTCGCCGCTGC
GGCGGAGGAGCTTGGCGTCACGCCGGGTGCTGTCAGCCAGCAGATCGCCAAGACGGAGGCGCAACTCGGGCGCACGCTCT
TCGAACGTACGCCGCGCGGCCTTGTGGTTTCGGATAGCGGGCGGGCGCTTCTGACGCGATTGTCGAGCGCCTTCGGGGAA
CTCGCAGAAGCGGTCGCACAGGCACGCCGCCGCGATGAATCGGTTTTGACCGTGTCGGTGGCTCCGGTCTTTGCAGCTCG
CTGGCTCGTCTACCGGCTGGACCGCTTCGCCGAGCACAATCCGGACGTCCGGCTGCGGATCGACGCGACCACGACGCTCG
CCAATCTCGAAACCTCCGATGTCGACCTCGGTATCCGCGTCGGAGCAGGCCGCTGGCCGGGCGTGCGGTCGGAGCTGCTG
CTTGAGCAGGAGGTTTTTCCGGTCTGCTCGCCGGCGCTCGCTGCCGGGCTGCACAAGCCCGCCGATATATTGAAACTGCC
GGCAGTGATCGATGCGCACGCAATGTTTTCCTGGGAGCTGTGGCTGGACGCGGCCGGCGTCTCCGGTGCGGCCATGACGG
TGCGCCACACCTTCAACGATGCTTCCCTCGCTCTCGATGCAGCCATTGCCGGGCAGGGGGTGATGCTCGCCTGGCAGACG
CTTGCCGGCTACGCGCTGCTGAGGGGGTCTCTGGTGGCACCTTTCGGCATTCGCGCCAAAACCGGCTTCGGGCACTACTT
CGTTACGTCCGCCTCACGCCGAGAAAGCAAGGGCGCGCTGGCTTTCAAGCGCTGGGTGCGCGAGGAGGTCGAGGAAGGAA
TGCGCCAGCTCGCTTCTATTCCGGCTCCTTCAGCCGCTCCTCCATCATCGCCTTGA

Upstream 100 bases:

>100_bases
CCCGGCAGCGTCTAACTCCGGGCCTCGCCAGCCAGGAGCATCCTCCCCTTCGCCGTAGTTTACAATTGAGATTGCCACTG
CGGCGGTTTAGAAAAGCTAT

Downstream 100 bases:

>100_bases
AAGCGGGACGGGACTGGCAGCGCGCCAGCCACGATTTCACCTGAGGATGGCTGTCGAAAAGGGTCGTCTGGCTCATCGCG
TAACGGAAAACTTCAGCGAG

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 311; Mature: 311

Protein sequence:

>311_residues
MKELSAVHLNGLRATEAVGRLGSLAAAAEELGVTPGAVSQQIAKTEAQLGRTLFERTPRGLVVSDSGRALLTRLSSAFGE
LAEAVAQARRRDESVLTVSVAPVFAARWLVYRLDRFAEHNPDVRLRIDATTTLANLETSDVDLGIRVGAGRWPGVRSELL
LEQEVFPVCSPALAAGLHKPADILKLPAVIDAHAMFSWELWLDAAGVSGAAMTVRHTFNDASLALDAAIAGQGVMLAWQT
LAGYALLRGSLVAPFGIRAKTGFGHYFVTSASRRESKGALAFKRWVREEVEEGMRQLASIPAPSAAPPSSP
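
The protein sequence is the conceptual translation of the gene sequence. The sketch below rebuilds the standard codon table from its usual compact string form and translates until the first stop codon. It is an illustration only: the names are hypothetical, and it assumes an ATG start codon, so it does not handle the alternative start codons allowed in bacterial genes.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGGAGTTGAGCGCGGTTCATCTGAAC"))
```

The first ten codons translate to MKELSAVHLN, matching the start of the protein sequence, and the last seven codons (GCT CCT CCA TCA TCG CCT TGA) give APPSSP followed by the stop.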

Sequences:

>Translated_311_residues
MKELSAVHLNGLRATEAVGRLGSLAAAAEELGVTPGAVSQQIAKTEAQLGRTLFERTPRGLVVSDSGRALLTRLSSAFGE
LAEAVAQARRRDESVLTVSVAPVFAARWLVYRLDRFAEHNPDVRLRIDATTTLANLETSDVDLGIRVGAGRWPGVRSELL
LEQEVFPVCSPALAAGLHKPADILKLPAVIDAHAMFSWELWLDAAGVSGAAMTVRHTFNDASLALDAAIAGQGVMLAWQT
LAGYALLRGSLVAPFGIRAKTGFGHYFVTSASRRESKGALAFKRWVREEVEEGMRQLASIPAPSAAPPSSP
>Mature_311_residues
MKELSAVHLNGLRATEAVGRLGSLAAAAEELGVTPGAVSQQIAKTEAQLGRTLFERTPRGLVVSDSGRALLTRLSSAFGE
LAEAVAQARRRDESVLTVSVAPVFAARWLVYRLDRFAEHNPDVRLRIDATTTLANLETSDVDLGIRVGAGRWPGVRSELL
LEQEVFPVCSPALAAGLHKPADILKLPAVIDAHAMFSWELWLDAAGVSGAAMTVRHTFNDASLALDAAIAGQGVMLAWQT
LAGYALLRGSLVAPFGIRAKTGFGHYFVTSASRRESKGALAFKRWVREEVEEGMRQLASIPAPSAAPPSSP

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=284, Percent_Identity=36.2676056338028, Blast_Score=155, Evalue=3e-39,
Organism=Escherichia coli, GI1786448, Length=295, Percent_Identity=33.5593220338983, Blast_Score=154, Evalue=7e-39,
Organism=Escherichia coli, GI1788706, Length=300, Percent_Identity=27.6666666666667, Blast_Score=112, Evalue=3e-26,
Organism=Escherichia coli, GI87081978, Length=237, Percent_Identity=27.0042194092827, Blast_Score=71, Evalue=7e-14,
Organism=Escherichia coli, GI1786401, Length=241, Percent_Identity=25.7261410788382, Blast_Score=69, Evalue=4e-13,
Organism=Escherichia coli, GI1787128, Length=253, Percent_Identity=24.1106719367589, Blast_Score=61, Evalue=8e-11,
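
Each homologue line is a comma-separated list of `key=value` fields, plus a bare GI identifier and a trailing comma. A small parser along the following lines could turn one line into a typed record; the function name and the choice of a plain dictionary are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI identifier has no key in the source line.
            record["GI"] = field[2:]
    # Convert the numeric fields to numbers.
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1789173, Length=284, "
        "Percent_Identity=36.2676056338028, Blast_Score=155, Evalue=3e-39,")
print(parse_homologue(line))
```

Once parsed, the hits can be sorted or filtered numerically, for example by E-value or percent identity.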

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 33140; Mature: 33140

Theoretical pI: Translated: 8.51; Mature: 8.51

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)
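
The Cys/Met percentages can be reproduced by counting residues in the protein sequence of this record. The sketch below assumes the values are simple residue frequencies rounded to one decimal place; `residue_percent` is an illustrative name.

```python
# Protein sequence copied from this record (311 residues).
PROTEIN = (
    "MKELSAVHLNGLRATEAVGRLGSLAAAAEELGVTPGAVSQQIAKTEAQLGRTLFERTPRGLVVSDSGRALLTRLSSAFGE"
    "LAEAVAQARRRDESVLTVSVAPVFAARWLVYRLDRFAEHNPDVRLRIDATTTLANLETSDVDLGIRVGAGRWPGVRSELL"
    "LEQEVFPVCSPALAAGLHKPADILKLPAVIDAHAMFSWELWLDAAGVSGAAMTVRHTFNDASLALDAAIAGQGVMLAWQT"
    "LAGYALLRGSLVAPFGIRAKTGFGHYFVTSASRRESKGALAFKRWVREEVEEGMRQLASIPAPSAAPPSSP"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


print(residue_percent(PROTEIN, "C"))   # Cys
print(residue_percent(PROTEIN, "M"))   # Met
print(residue_percent(PROTEIN, "CM"))  # Cys + Met
```

The sequence contains one cysteine and five methionines, which gives the listed 0.3, 1.6 and 1.9 percent.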

Secondary structure:

>Translated Secondary Structure
MKELSAVHLNGLRATEAVGRLGSLAAAAEELGVTPGAVSQQIAKTEAQLGRTLFERTPRG
CCCCCEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCC
LVVSDSGRALLTRLSSAFGELAEAVAQARRRDESVLTVSVAPVFAARWLVYRLDRFAEHN
EEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCC
PDVRLRIDATTTLANLETSDVDLGIRVGAGRWPGVRSELLLEQEVFPVCSPALAAGLHKP
CCEEEEEECCEEEECCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCHHHHHCCCCC
ADILKLPAVIDAHAMFSWELWLDAAGVSGAAMTVRHTFNDASLALDAAIAGQGVMLAWQT
HHHHHCCHHHHHHHHEEEEEEEECCCCCCCEEEEEEECCCHHHHHHHHHCCCCHHHHHHH
LAGYALLRGSLVAPFGIRAKTGFGHYFVTSASRRESKGALAFKRWVREEVEEGMRQLASI
HHHHHHHHCCCCCCCCCEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC
PAPSAAPPSSP
CCCCCCCCCCC
>Mature Secondary Structure
MKELSAVHLNGLRATEAVGRLGSLAAAAEELGVTPGAVSQQIAKTEAQLGRTLFERTPRG
CCCCCEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCC
LVVSDSGRALLTRLSSAFGELAEAVAQARRRDESVLTVSVAPVFAARWLVYRLDRFAEHN
EEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCC
PDVRLRIDATTTLANLETSDVDLGIRVGAGRWPGVRSELLLEQEVFPVCSPALAAGLHKP
CCEEEEEECCEEEECCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCHHHHHCCCCC
ADILKLPAVIDAHAMFSWELWLDAAGVSGAAMTVRHTFNDASLALDAAIAGQGVMLAWQT
HHHHHCCHHHHHHHHEEEEEEEECCCCCCCEEEEEEECCCHHHHHHHHHCCCCHHHHHHH
LAGYALLRGSLVAPFGIRAKTGFGHYFVTSASRRESKGALAFKRWVREEVEEGMRQLASI
HHHHHHHHCCCCCCCCCEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC
PAPSAAPPSSP
CCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]