Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is gcvA [H]

Identifier: 15889362

GI number: 15889362

Start: 2037092

End: 2037994

Strand: Reverse

Name: gcvA [H]

Synonym: Atu2078

Alternate gene names: 15889362

Gene position: 2037994-2037092 (Counterclockwise)

Preceding gene: 15889365

Following gene: 15889361

Centisome position: 71.72
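Note on the coordinate fields: the record does not state how the centisome value is derived. The sketch below assumes it is the gene's start coordinate expressed as a percentage of the chromosome length, and that coordinates are 1-based and inclusive. Under those assumptions it reproduces the listed length (903 bases) and centisome position (71.72). The function names are illustrative only.

```python
GENOME_LENGTH = 2_841_580  # chromosome length from the record header
GENE_START = 2_037_092     # "Start" field (lower coordinate)
GENE_END = 2_037_994       # "End" field (higher coordinate)


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))
    # On the reverse strand the gene begins at the higher coordinate,
    # which is why GENE_END is used here.
    print(round(centisome(GENE_END, GENOME_LENGTH), 2))
```

Using the lower coordinate (2037092) instead gives about 71.69, so the listed value appears to be tied to the first base of the gene on the reverse strand.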

GC content: 62.35

Gene sequence:

>903_bases
ATGGCGACGGAACTTCCATCTCTGAAGGGGCTGCAGGCCTTCGAGGCCGCAGCGCGTTATCGCAGCGTCACGCTTGCTTC
CAACGAGCTGAACGTCACCCCTGGTGCGGTCAGCCTGCAGATCCGCGAGCTGGAGGCGCGCCTTGGCGTGCAGCTGTTCT
TTCGCAAACCGCGCAGCATCCAGCTGACGCGCGAGGGAGAGCGTTATTACGGCGCGCTCCGCACCGCCTTCCGGATGATG
CGCGAAGCGACGGCGGAACTGACGGCACGCTCCGAGATCACTGTTCTGACGCTTAGCTGCACGCCGACTTTCGCCGTGCA
ATGGCTGATGCCGCGATTGCCTTCCTTCCAGCAGCAACACCCGCATGTGGATGTCCGTATCAGCGTGACCAACCGGCTGG
TGGATTTTTCGCGGGACGATGTCGATCTGGCGGTGCGGCATGGTTTCGGGCGTTATGAAGGGCTGGAGAGCATCCGTTTC
ATCGATGACAGCACCCTGCCGGTCTGCGCACCGCAACTTCTGGAAAAATACGGGCCGCTTCGGGAGGCGGGAGACTTGAA
ATCCGTGCCGCTGCTGCATGATGAAAACCGCAACGAGTGGCGGCGCTGGCTGGAGGCGGCGGGCGCATCCGAGGTGGACG
CTTCGGGCGGCACGGTCTTCATCGACAGCAATGGCGCGCTGGATGCGGCCAAGGCTGGGCACGGCATTGCGCTGACGCGG
CGTTCGCTTGTTTCCCGCGAACTTGTGGAGGGCGCATTGATAGCGCCCTTCGGCACGGACATGGCCAGCACGCTCGCCTA
TTTCCTGGTTTATCCACGGCGCATGCTGGATAATCCCGATCTCGTGACGCTCATCGACTGGATGCTTTCGCAAGCGGGTT
CCACTGAGGCTGGTTCTCTTTGA
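
Note on the GC content field: a minimal sketch of how a GC percentage can be computed from a nucleotide sequence such as the one above. The record does not say how its value was calculated, so treating it as (G + C) / total bases is an assumption, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks (as in FASTA-style blocks) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First six bases of the gene sequence, used only as a small example.
    print(round(gc_content("ATGGCG"), 2))
```

Passing the full 903-base sequence (without the `>903_bases` header line) should give a value comparable to the GC content field.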

Upstream 100 bases:

>100_bases
GCGTTCCGCAGCTTTCATGTCCTGTTGCGTCTATTTACTCCCTGTTTTAAGATGTTTTCAAACGATTGTTTCTCGTTGGA
TAATTGAGTTGAGCTAAACC

Downstream 100 bases:

>100_bases
CGCCAATCCGGTCTATGCCGGAAGACAACAACGGGAATCACTCTTGAAAACTATCGCCATCGATTTCGAGACCGCCAATG
AGGAGCGGGGCAGCGCCTGC

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 300; Mature: 299

Protein sequence:

>300_residues
MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMM
REATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRF
IDDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR
RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
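
Note on the relation between the gene and protein sequences: the gene sequence is given in the coding orientation (it starts with ATG and ends with TGA), so it can be translated directly without reverse-complementing. The sketch below builds the standard codon table and translates up to the first stop codon. Using the standard codon assignments is an assumption; the record does not name the translation table.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 bases of the gene sequence.
    print(translate("ATGGCGACGGAACTTCCATCTCTGAAG"))  # MATELPSLK
```

The first nine codons give MATELPSLK, matching the start of the protein sequence above.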

Sequences:

>Translated_300_residues
MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMM
REATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRF
IDDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR
RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
>Mature_299_residues
ATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMMR
EATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRFI
DDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTRR
SLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
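
Note on the two sequences: the mature sequence differs from the translated one only by the absence of the N-terminal methionine (300 versus 299 residues). A minimal sketch of that step follows; `remove_initiator_met` is a hypothetical helper, and whether the methionine is actually removed in vivo is not something this record establishes.

```python
def remove_initiator_met(protein: str) -> str:
    """Drop a leading methionine, if present, to model the mature form."""
    return protein[1:] if protein.startswith("M") else protein


if __name__ == "__main__":
    print(remove_initiator_met("MATELPSLK"))  # ATELPSLK
```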

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=296, Percent_Identity=38.1756756756757, Blast_Score=209, Evalue=3e-55,
Organism=Escherichia coli, GI1786448, Length=287, Percent_Identity=34.1463414634146, Blast_Score=153, Evalue=2e-38,
Organism=Escherichia coli, GI1788706, Length=297, Percent_Identity=31.6498316498317, Blast_Score=152, Evalue=2e-38,
Organism=Escherichia coli, GI145693193, Length=275, Percent_Identity=29.8181818181818, Blast_Score=87, Evalue=2e-18,
Organism=Escherichia coli, GI1786401, Length=247, Percent_Identity=23.4817813765182, Blast_Score=75, Evalue=7e-15,
Organism=Escherichia coli, GI1787128, Length=289, Percent_Identity=24.9134948096886, Blast_Score=74, Evalue=8e-15,
Organism=Escherichia coli, GI87081978, Length=263, Percent_Identity=25.8555133079848, Blast_Score=73, Evalue=2e-14,
Organism=Escherichia coli, GI1787589, Length=281, Percent_Identity=26.3345195729537, Blast_Score=70, Evalue=2e-13,
Organism=Escherichia coli, GI157672245, Length=220, Percent_Identity=28.6363636363636, Blast_Score=63, Evalue=3e-11,
Organism=Escherichia coli, GI1789639, Length=251, Percent_Identity=26.2948207171315, Blast_Score=62, Evalue=7e-11,
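
Note on the homologue lines: each entry is a comma-separated list of `Key=Value` pairs, with the GI number given as a bare `GI…` token. The sketch below shows one way to parse such a line into typed fields. The function name and the choice of numeric types are assumptions made for illustration.

```python
# Fields that should be converted from text to numbers.
CONVERTERS = {
    "Length": int,
    "Percent_Identity": float,
    "Blast_Score": int,
    "Evalue": float,
}


def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of fields."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            record[key] = CONVERTERS.get(key, str)(value)
        elif token.startswith("GI"):
            # The GI number has no key; store the digits under "GI".
            record["GI"] = token[2:]
    return record


if __name__ == "__main__":
    line = ("Organism=Escherichia coli, GI1789173, Length=296, "
            "Percent_Identity=38.1756756756757, Blast_Score=209, Evalue=3e-55,")
    hit = parse_homologue(line)
    print(hit["GI"], round(hit["Percent_Identity"], 2), hit["Evalue"])
```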

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 33268; Mature: 33137
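
Note on the molecular weight field: the translated and mature values differ by 131, which is consistent with the loss of one methionine residue (average residue mass about 131.19). The record does not say how the weights were computed. The sketch below assumes the common approach of summing average residue masses and adding one water molecule; the mass table and function name are supplied for illustration.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of a peptide in daltons."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


if __name__ == "__main__":
    # Removing an N-terminal methionine lowers the weight by one Met residue.
    print(round(molecular_weight("MA") - molecular_weight("A"), 2))
```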

Theoretical pI: Translated: 6.06; Mature: 6.06

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)
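
Note on the Cys/Met content: the percentages above can be reproduced by counting residues in the translated and mature sequences from this record. The sketch below does that; rounding to one decimal place is an assumption based on how the values are displayed, and `residue_percent` is a hypothetical helper.

```python
TRANSLATED = (
    "MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMM"
    "REATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRF"
    "IDDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR"
    "RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL"
)
MATURE = TRANSLATED[1:]  # translated sequence without the leading Met


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
        print(label,
              residue_percent(seq, "C"),
              residue_percent(seq, "M"),
              residue_percent(seq, "CM"))
```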

Secondary structure:

>Translated Secondary Structure
MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSI
CCCCCCCCCCHHHHHHHHHHHEEEEECCCCCCCCCEEEEEHHHHHHHHCCCEEECCCCEE
QLTREGERYYGALRTAFRMMREATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQH
EEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCCCHHHC
PHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRFIDDSTLPVCAPQLLEKYGPL
CCEEEEEEEEHHHHHCCCCCHHHHHHCCCCHHHCCCCEEEECCCCCCCHHHHHHHHCCCC
REAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR
CCCCCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCHHHCCCCHHHHH
RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
HHHHHHHHHCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure 
ATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSI
CCCCCCCCCHHHHHHHHHHHEEEEECCCCCCCCCEEEEEHHHHHHHHCCCEEECCCCEE
QLTREGERYYGALRTAFRMMREATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQH
EEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCCCHHHC
PHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRFIDDSTLPVCAPQLLEKYGPL
CCEEEEEEEEHHHHHCCCCCHHHHHHCCCCHHHCCCCEEEECCCCCCCHHHHHHHHCCCC
REAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR
CCCCCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCHHHCCCCHHHHH
RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
HHHHHHHHHCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCC
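
Note on the secondary structure strings: each residue line is followed by a state line of the same length. The record does not define the letters; the sketch below assumes the conventional three-state code (H = helix, E = strand, C = coil) and summarizes a state string as percentages. The function name is illustrative.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    cleaned = "".join(states.split()).upper()
    if not cleaned:
        raise ValueError("empty state string")
    counts = Counter(cleaned)
    return {state: 100.0 * counts.get(state, 0) / len(cleaned)
            for state in "HEC"}


if __name__ == "__main__":
    # Small made-up string; concatenate the state lines above for real use.
    print(ss_composition("HHEC"))
```

Concatenating the state lines of the translated block (every second line after the header) gives the overall helix, strand, and coil fractions behind the "Alpha Beta" structure class.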

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]