Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is gcvA [H]
Identifier: 15889589
GI number: 15889589
Start: 2290087
End: 2291016
Strand: Direct
Name: gcvA [H]
Synonym: Atu2312
Alternate gene names: 15889589
Gene position: 2290087-2291016 (Clockwise)
Preceding gene: 15889588
Following gene: 159185155
Centisome position: 80.59
GC content: 60.11
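The derived values above (gene length, centisome position, GC content) can be recomputed from the coordinates and sequence in this record. The following is a minimal sketch, not the database's own method: the function names are hypothetical, and it assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length, which is consistent with the reported 80.59.

```python
# Values copied from this record.
GENOME_LENGTH = 2_841_580  # chromosome length (bp)
GENE_START = 2_290_087
GENE_END = 2_291_016


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)
```

With the coordinates above, `gene_length(GENE_START, GENE_END)` gives 930, matching the `>930_bases` header, and `round(centisome(GENE_START, GENOME_LENGTH), 2)` gives 80.59. Passing the gene sequence (with spaces removed) to `gc_percent` should reproduce the GC content field.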
Gene sequence:
>930_bases ATGAAGATGTCTCGCCAGTTCCCTCTGAATGCGCTGCGCGTCTTCGATGCCGCCGCCAGGCACCTGAGTTTCACCAAGGC GGGTGAAGAGCTGGGCATGACGCAGACCGCGGTGAGTTACCAGGTGAAGCTCTTGGAGGAGAATATCGGTGAGCCGCTCT TCATCCGCAAAGCGCGCCAGGTTCAGCTGACCGAGGCCGGGCAGAAACTGGCGCCTAAAGTGGCGGAAGCCTTCCACAGC CTGCGGGAGGCCGTCGACAATGTGCGCGACACATCCGACACCACCCTGACCATCCATTCCACCGCCACTTTCGCATCGCG CTGGCTGTCGCGGCATCTCGGCGCTTTCCAGCTTGAACATCCCTCGATCGCGGTGCGGCTCGATACATCAAGCGCATTGA TCGATTTTTCCCAGTCGGACTGCGATGTCGCCATTCGCTGGAGCCGCGACGATGGCAAGGGGCTGGCCTATCATCAATTG CTGCGCGGCGTTTATACGCCGATGCTGCATCCGAGCCTCGCCGAGAGCATCGGCGGATTGCACAGGCCGGAAGATCTGCT GCGCCTGCGCATCATCGATCCCGGCGACATATGGTGGAGCCAGTGGTTTCGTGAGGTGGGCGTCGAAAATCCGGGCCTTG ACCGTTATCCACGCAGCCGCCTGAATGTGCAGGCCTTCGAAGCCGCCGCGGCAATCGCCAATCAGGGGGTGGCCATGCTC ACCCCGGAGCTTTATGCCGACGAGATCGCGCTCGGCCGCCTTTACCAGCCATTCGAGCACCTCAGCAGCGAAGGCAAGAA CTACTGGCTGGTCTATCCGGAAAACCGCAGGAACATTCGCAAGATAAAACTGTTTCGCGACTGGATATTGAAGCGCATAG AGGAAAGCCGGCCGCAGCAGCAGCCTCAATATCCCATATCAGGCGCATAA
Upstream 100 bases:
>100_bases AAATGTCAGAAGCTCAGTTGATGAATTGGGCTAAAAATGCTCTGGTTTCGGGAGAAAAACCAATGAAACCTTTGGCGACT AGCATAAAGAGAACTTATCC
Downstream 100 bases:
>100_bases AAGCAGCGCGGCGTATCCTCGTCCGGAAACAGACAGAGATGATATTCACCATCTTCGGACCGCCTTGCCTCGCTCTGCGA CCAGCTGAAGACATGGCGCC
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 309; Mature: 309
Protein sequence:
>309_residues MKMSRQFPLNALRVFDAAARHLSFTKAGEELGMTQTAVSYQVKLLEENIGEPLFIRKARQVQLTEAGQKLAPKVAEAFHS LREAVDNVRDTSDTTLTIHSTATFASRWLSRHLGAFQLEHPSIAVRLDTSSALIDFSQSDCDVAIRWSRDDGKGLAYHQL LRGVYTPMLHPSLAESIGGLHRPEDLLRLRIIDPGDIWWSQWFREVGVENPGLDRYPRSRLNVQAFEAAAAIANQGVAML TPELYADEIALGRLYQPFEHLSSEGKNYWLVYPENRRNIRKIKLFRDWILKRIEESRPQQQPQYPISGA
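The protein sequence follows from the 930-base gene sequence: 310 codons, of which the last is a stop codon, leaving 309 residues. A minimal sketch of that translation is shown here, assuming the standard codon assignments (which bacterial translation table 11 shares); the `translate` helper is hypothetical and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used for line wrapping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)
```

For example, the first 33 bases of the gene sequence, `ATGAAGATGTCTCGCCAGTTCCCTCTGAATGCG`, translate to `MKMSRQFPLNA`, the first eleven residues of the protein sequence above.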
Sequences:
>Translated_309_residues MKMSRQFPLNALRVFDAAARHLSFTKAGEELGMTQTAVSYQVKLLEENIGEPLFIRKARQVQLTEAGQKLAPKVAEAFHS LREAVDNVRDTSDTTLTIHSTATFASRWLSRHLGAFQLEHPSIAVRLDTSSALIDFSQSDCDVAIRWSRDDGKGLAYHQL LRGVYTPMLHPSLAESIGGLHRPEDLLRLRIIDPGDIWWSQWFREVGVENPGLDRYPRSRLNVQAFEAAAAIANQGVAML TPELYADEIALGRLYQPFEHLSSEGKNYWLVYPENRRNIRKIKLFRDWILKRIEESRPQQQPQYPISGA
>Mature_309_residues MKMSRQFPLNALRVFDAAARHLSFTKAGEELGMTQTAVSYQVKLLEENIGEPLFIRKARQVQLTEAGQKLAPKVAEAFHS LREAVDNVRDTSDTTLTIHSTATFASRWLSRHLGAFQLEHPSIAVRLDTSSALIDFSQSDCDVAIRWSRDDGKGLAYHQL LRGVYTPMLHPSLAESIGGLHRPEDLLRLRIIDPGDIWWSQWFREVGVENPGLDRYPRSRLNVQAFEAAAAIANQGVAML TPELYADEIALGRLYQPFEHLSSEGKNYWLVYPENRRNIRKIKLFRDWILKRIEESRPQQQPQYPISGA
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=293, Percent_Identity=36.518771331058, Blast_Score=159, Evalue=3e-40
Organism=Escherichia coli, GI1786448, Length=302, Percent_Identity=31.7880794701987, Blast_Score=130, Evalue=9e-32
Organism=Escherichia coli, GI1788706, Length=300, Percent_Identity=26.6666666666667, Blast_Score=109, Evalue=3e-25
Organism=Escherichia coli, GI1787128, Length=305, Percent_Identity=27.5409836065574, Blast_Score=92, Evalue=4e-20
Organism=Escherichia coli, GI157672245, Length=170, Percent_Identity=32.3529411764706, Blast_Score=71, Evalue=9e-14
Organism=Escherichia coli, GI1790208, Length=215, Percent_Identity=26.046511627907, Blast_Score=68, Evalue=8e-13
Organism=Escherichia coli, GI87081978, Length=155, Percent_Identity=29.0322580645161, Blast_Score=68, Evalue=9e-13
Organism=Escherichia coli, GI145693105, Length=161, Percent_Identity=27.9503105590062, Blast_Score=66, Evalue=3e-12
Organism=Escherichia coli, GI1786508, Length=168, Percent_Identity=26.1904761904762, Blast_Score=65, Evalue=7e-12
Organism=Escherichia coli, GI1786401, Length=138, Percent_Identity=29.7101449275362, Blast_Score=64, Evalue=2e-11
Organism=Escherichia coli, GI1788748, Length=65, Percent_Identity=47.6923076923077, Blast_Score=62, Evalue=6e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 35192; Mature: 35192
Theoretical pI: Translated: 8.43; Mature: 8.43
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
1.9 %Cys+Met (Mature Protein)
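The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation, with a hypothetical helper name and assuming values are rounded to one decimal place as in the record:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    protein = "".join(protein.split()).upper()  # ignore line-wrap spaces
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * hits / len(protein), 1)
```

The 309-residue sequence in this record contains 1 Cys and 5 Met, so `residue_percent(seq, "C")`, `residue_percent(seq, "M")`, and `residue_percent(seq, "CM")` give 0.3, 1.6, and 1.9 respectively. Because the translated and mature sequences are identical here, both sets of figures agree.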
Secondary structure:
>Translated Secondary Structure
MKMSRQFPLNALRVFDAAARHLSFTKAGEELGMTQTAVSYQVKLLEENIGEPLFIRKARQ
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEHHHHH
VQLTEAGQKLAPKVAEAFHSLREAVDNVRDTSDTTLTIHSTATFASRWLSRHLGAFQLEH
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHCCCCEEECC
PSIAVRLDTSSALIDFSQSDCDVAIRWSRDDGKGLAYHQLLRGVYTPMLHPSLAESIGGL
CEEEEEECCCCCEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCHHHHHHHCCC
HRPEDLLRLRIIDPGDIWWSQWFREVGVENPGLDRYPRSRLNVQAFEAAAAIANQGVAML
CCCCHHEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCHHHCCHHHHHHHHHHHHCCEEEE
TPELYADEIALGRLYQPFEHLSSEGKNYWLVYPENRRNIRKIKLFRDWILKRIEESRPQQ
CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCC
QPQYPISGA
CCCCCCCCC
>Mature Secondary Structure
MKMSRQFPLNALRVFDAAARHLSFTKAGEELGMTQTAVSYQVKLLEENIGEPLFIRKARQ
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEHHHHH
VQLTEAGQKLAPKVAEAFHSLREAVDNVRDTSDTTLTIHSTATFASRWLSRHLGAFQLEH
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHCCCCEEECC
PSIAVRLDTSSALIDFSQSDCDVAIRWSRDDGKGLAYHQLLRGVYTPMLHPSLAESIGGL
CEEEEEECCCCCEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCHHHHHHHCCC
HRPEDLLRLRIIDPGDIWWSQWFREVGVENPGLDRYPRSRLNVQAFEAAAAIANQGVAML
CCCCHHEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCHHHCCHHHHHHHHHHHHCCEEEE
TPELYADEIALGRLYQPFEHLSSEGKNYWLVYPENRRNIRKIKLFRDWILKRIEESRPQQ
CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCC
QPQYPISGA
CCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]