Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is gcvA [H]
Identifier: 15889362
GI number: 15889362
Start: 2037092
End: 2037994
Strand: Reverse
Name: gcvA [H]
Synonym: Atu2078
Alternate gene names: 15889362
Gene position: 2037994-2037092 (Counterclockwise)
Preceding gene: 15889365
Following gene: 15889361
Centisome position: 71.72
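The coordinate fields above are related by simple arithmetic, sketched here as a consistency check. The record does not say how the centisome position is derived; the sketch assumes it is the coordinate of the gene's first base (on the reverse strand, the higher coordinate, 2037994) expressed as a percentage of the chromosome length. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_841_580   # Length
START = 2_037_092           # Start (lower coordinate)
END = 2_037_994             # End (higher coordinate)


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * position / genome_length


print(gene_length(START, END))                      # matches the 903_bases header
print(round(centisome(END, GENOME_LENGTH), 2))      # reproduces the listed 71.72
print(round(centisome(START, GENOME_LENGTH), 2))    # the lower coordinate gives 71.69
```

Under this assumption only the higher coordinate reproduces the listed value, which fits a gene read counterclockwise from 2037994 to 2037092.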
GC content: 62.35
Gene sequence:
>903_bases ATGGCGACGGAACTTCCATCTCTGAAGGGGCTGCAGGCCTTCGAGGCCGCAGCGCGTTATCGCAGCGTCACGCTTGCTTC CAACGAGCTGAACGTCACCCCTGGTGCGGTCAGCCTGCAGATCCGCGAGCTGGAGGCGCGCCTTGGCGTGCAGCTGTTCT TTCGCAAACCGCGCAGCATCCAGCTGACGCGCGAGGGAGAGCGTTATTACGGCGCGCTCCGCACCGCCTTCCGGATGATG CGCGAAGCGACGGCGGAACTGACGGCACGCTCCGAGATCACTGTTCTGACGCTTAGCTGCACGCCGACTTTCGCCGTGCA ATGGCTGATGCCGCGATTGCCTTCCTTCCAGCAGCAACACCCGCATGTGGATGTCCGTATCAGCGTGACCAACCGGCTGG TGGATTTTTCGCGGGACGATGTCGATCTGGCGGTGCGGCATGGTTTCGGGCGTTATGAAGGGCTGGAGAGCATCCGTTTC ATCGATGACAGCACCCTGCCGGTCTGCGCACCGCAACTTCTGGAAAAATACGGGCCGCTTCGGGAGGCGGGAGACTTGAA ATCCGTGCCGCTGCTGCATGATGAAAACCGCAACGAGTGGCGGCGCTGGCTGGAGGCGGCGGGCGCATCCGAGGTGGACG CTTCGGGCGGCACGGTCTTCATCGACAGCAATGGCGCGCTGGATGCGGCCAAGGCTGGGCACGGCATTGCGCTGACGCGG CGTTCGCTTGTTTCCCGCGAACTTGTGGAGGGCGCATTGATAGCGCCCTTCGGCACGGACATGGCCAGCACGCTCGCCTA TTTCCTGGTTTATCCACGGCGCATGCTGGATAATCCCGATCTCGTGACGCTCATCGACTGGATGCTTTCGCAAGCGGGTT CCACTGAGGCTGGTTCTCTTTGA
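As a hedged illustration of how a GC content figure such as the one above can be checked, the sketch below computes the percentage of G and C bases in a sequence string, ignoring the spaces that separate the 80-base chunks. The helper name `gc_content` is hypothetical. Assuming the listed 62.35 was computed over the 903-base gene, it corresponds to 563 G or C bases; that figure is an arithmetic inference, not a count stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between chunks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# Small demonstration on the first 30 bases of the gene sequence.
print(round(gc_content("ATGGCGACGG AACTTCCATC TCTGAAGGGG"), 2))

# Arithmetic behind the listed value, assuming it covers all 903 bases.
print(round(100.0 * 563 / 903, 2))
```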
Upstream 100 bases:
>100_bases GCGTTCCGCAGCTTTCATGTCCTGTTGCGTCTATTTACTCCCTGTTTTAAGATGTTTTCAAACGATTGTTTCTCGTTGGA TAATTGAGTTGAGCTAAACC
Downstream 100 bases:
>100_bases CGCCAATCCGGTCTATGCCGGAAGACAACAACGGGAATCACTCTTGAAAACTATCGCCATCGATTTCGAGACCGCCAATG AGGAGCGGGGCAGCGCCTGC
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 300; Mature: 299
Protein sequence:
>300_residues MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMM REATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRF IDDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
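The relationship between the 903-base gene and the 300-residue product can be sketched as follows: 903 bases are 301 codons, the last of which is the TGA stop codon, and the mature form of 299 residues is consistent with removal of the initiator methionine. The code below is a minimal translation routine using the standard genetic code; the function name `translate` and the compact table layout are choices made for this sketch, not something taken from the record.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )


# First ten and last four codons of the gene sequence listed above.
print(translate("ATGGCGACGGAACTTCCATCTCTGAAGGGG"))  # start of the protein
print(translate("GGTTCTCTTTGA"))                    # end of the protein plus stop
```

The gene sequence in this record is already written on the coding strand (it begins with ATG), so no reverse complement is needed before translation even though the gene lies on the reverse strand.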
Sequences:
>Translated_300_residues MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMM REATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRF IDDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
>Mature_299_residues ATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMMR EATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRFI DDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTRR SLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=296, Percent_Identity=38.1756756756757, Blast_Score=209, Evalue=3e-55
Organism=Escherichia coli, GI1786448, Length=287, Percent_Identity=34.1463414634146, Blast_Score=153, Evalue=2e-38
Organism=Escherichia coli, GI1788706, Length=297, Percent_Identity=31.6498316498317, Blast_Score=152, Evalue=2e-38
Organism=Escherichia coli, GI145693193, Length=275, Percent_Identity=29.8181818181818, Blast_Score=87, Evalue=2e-18
Organism=Escherichia coli, GI1786401, Length=247, Percent_Identity=23.4817813765182, Blast_Score=75, Evalue=7e-15
Organism=Escherichia coli, GI1787128, Length=289, Percent_Identity=24.9134948096886, Blast_Score=74, Evalue=8e-15
Organism=Escherichia coli, GI87081978, Length=263, Percent_Identity=25.8555133079848, Blast_Score=73, Evalue=2e-14
Organism=Escherichia coli, GI1787589, Length=281, Percent_Identity=26.3345195729537, Blast_Score=70, Evalue=2e-13
Organism=Escherichia coli, GI157672245, Length=220, Percent_Identity=28.6363636363636, Blast_Score=63, Evalue=3e-11
Organism=Escherichia coli, GI1789639, Length=251, Percent_Identity=26.2948207171315, Blast_Score=62, Evalue=7e-11
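The homologue entries follow a fixed `key=value` pattern, so they can be read into structured records with a regular expression. The sketch below is one possible parser, shown on the first two entries; the function name `parse_homologues` and the dictionary keys are hypothetical, and the pattern assumes every entry carries the same six fields in the same order as in this record.

```python
import re

# One hit: Organism=<name>, GI<digits>, Length=, Percent_Identity=, Blast_Score=, Evalue=
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"(?P<gi>GI\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[0-9.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per hit, with numeric fields converted."""
    hits = []
    for m in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Escherichia coli, GI1789173, Length=296, "
    "Percent_Identity=38.1756756756757, Blast_Score=209, Evalue=3e-55, "
    "Organism=Escherichia coli, GI1786448, Length=287, "
    "Percent_Identity=34.1463414634146, Blast_Score=153, Evalue=2e-38,"
)
for hit in parse_homologues(sample):
    print(hit["gi"], round(hit["identity"], 1), hit["evalue"])
```

Because the pattern matches each entry independently, it works whether the entries are separated by commas or by line breaks.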
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 33268; Mature: 33137
Theoretical pI: Translated: 6.06; Mature: 6.06
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.7 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
2.7 %Cys+Met (Mature Protein)
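The percentages above can be reproduced by counting residues in the protein sequence listed earlier in this record. The sketch below does so for the translated form and for the mature form, which is taken here to be the translated sequence without its first residue (as the Mature_299_residues entry shows). The helper name `percent` is illustrative.

```python
# Translated protein sequence copied from this record (300 residues).
TRANSLATED = (
    "MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSIQLTREGERYYGALRTAFRMM"
    "REATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQHPHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRF"
    "IDDSTLPVCAPQLLEKYGPLREAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR"
    "RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL"
)
MATURE = TRANSLATED[1:]  # initiator methionine removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

The sequence contains 2 cysteines and 7 methionines (6 in the mature form), which is where the listed values come from.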
Secondary structure:
>Translated Secondary Structure
MATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSI
CCCCCCCCCCHHHHHHHHHHHEEEEECCCCCCCCCEEEEEHHHHHHHHCCCEEECCCCEE
QLTREGERYYGALRTAFRMMREATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQH
EEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCCCHHHC
PHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRFIDDSTLPVCAPQLLEKYGPL
CCEEEEEEEEHHHHHCCCCCHHHHHHCCCCHHHCCCCEEEECCCCCCCHHHHHHHHCCCC
REAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR
CCCCCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCHHHCCCCHHHHH
RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
HHHHHHHHHCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
ATELPSLKGLQAFEAAARYRSVTLASNELNVTPGAVSLQIRELEARLGVQLFFRKPRSI
CCCCCCCCCHHHHHHHHHHHEEEEECCCCCCCCCEEEEEHHHHHHHHCCCEEECCCCEE
QLTREGERYYGALRTAFRMMREATAELTARSEITVLTLSCTPTFAVQWLMPRLPSFQQQH
EEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCCCHHHC
PHVDVRISVTNRLVDFSRDDVDLAVRHGFGRYEGLESIRFIDDSTLPVCAPQLLEKYGPL
CCEEEEEEEEHHHHHCCCCCHHHHHHCCCCHHHCCCCEEEECCCCCCCHHHHHHHHCCCC
REAGDLKSVPLLHDENRNEWRRWLEAAGASEVDASGGTVFIDSNGALDAAKAGHGIALTR
CCCCCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCHHHCCCCHHHHH
RSLVSRELVEGALIAPFGTDMASTLAYFLVYPRRMLDNPDLVTLIDWMLSQAGSTEAGSL
HHHHHHHHHCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCC
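In the secondary structure entries, rows of residues alternate with rows of per-residue state letters of the same length. The sketch below separates the two and lists the segments of one state. The record does not define the letters; the code assumes the conventional three-state reading (H for helix, E for strand, C for coil). The function names and the short demonstration input, an excerpt of the first 20 positions of the translated entry, are illustrative only.

```python
from itertools import groupby


def deinterleave(rows: str) -> tuple[str, str]:
    """Split alternating residue/state rows into two joined strings."""
    tokens = rows.split()
    if len(tokens) % 2 != 0:
        raise ValueError("expected an even number of rows")
    residues = "".join(tokens[0::2])
    states = "".join(tokens[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state rows differ in length")
    return residues, states


def segments(states: str, state: str) -> list[tuple[int, int]]:
    """1-based inclusive (start, end) of each run of the given state letter."""
    found, pos = [], 1
    for letter, run in groupby(states):
        n = len(list(run))
        if letter == state:
            found.append((pos, pos + n - 1))
        pos += n
    return found


# Excerpt only: first 20 residues and their predicted states.
residues, states = deinterleave("MATELPSLKG CCCCCCCCCC LQAFEAAARY HHHHHHHHHH")
print(residues)
print(segments(states, "H"))
```

Passing the full text of an entry (without its header line) to `deinterleave` yields the complete sequence and prediction as two aligned strings.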
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]