Definition | Acidovorax citrulli AAC00-1 chromosome, complete genome. |
---|---|
Accession | NC_008752 |
Length | 5,352,772 |
Click here to switch to the map view.
The map label for this gene is gcvA [H]
Identifier: 120611095
GI number: 120611095
Start: 2646146
End: 2647114
Strand: Reverse
Name: gcvA [H]
Synonym: Aave_2424
Alternate gene names: 120611095
Gene position: 2647114-2646146 (Counterclockwise)
Preceding gene: 120611103
Following gene: 120611093
Centisome position: 49.45
GC content: 69.56
Gene sequence:
>969_bases ATGGATTCACGCCTGTCCAATTCGGTTCTGGCCTGGCTGCGCTGTTTCGATGCGGCCGCACGGCAGGGCAGCTTCACACG CGCCGCCGCGGAGCTGTGCATCACCCAGGGCGCGGTGAGCCAGAAGGTGAAGCAACTGGAACGCTGGCTGGGACGTCCCC TGTTCCTGCGCACCCCGCGGTTACTGGTGCCCACCCCGGAGGGCAAATGGCTGGCCGTGGTGCTGCGCGAAAGCTTCGAG GCCATCGAGGGCACGCTCGCGCAGATGCGGCGCTCCACCCCGGCCCACGCTGCAGCCACGCTGAGCTGCTCACCTTCGTT CGCCATGCAGTGGCTTACGCCACGGCTGGGCGAATTCTTCCGCCGGCACCCCGACACCGGGCTGCGCGTGTTCGGAGAGT TCCACCGCATCGACCGCACCCGCATGGTGCGCGACGGGGTCGAGGCGGCCGTCCGCTTCGACCCGGAGGAGTACACCGAC CTGGACGCCACCGGGTTCCTCGATGAATGGCTGGTCCCCGTGGCCAGCCCGGCTTTCGTAGCGGCCCACCCGGACCTGCG GGATGCGCCGGCCAGGCTGCGGCCGGAATGGCTGCTGCACGACGGCAGCGCCTGGGAAGGAGCCGACACCTTCGAGGAGT GGCAGCACTGGTTCGCCGCGCAGGGTGCGCCATGCCCGGACTGGGGAGGCGGCCCCCAGTTCAACCTGTCCCAGCTGGCG GTGGGAGCAGCCATCACCGGCCAGGGCATCGCCATGGGTCGTGCCGCGCTCGTGCTCGAAGACGTGGCGGCCGGCCGGCT CGTACCGCTGTGTTCCTGGAGCACGCTCTCCCGGGCGCGCTACGCCTTCGTGAGTTCGCCCCAGGCGGGGCCTGCCATGC TGCGCGTGCGGGACTGGCTGGTGGAGGAAGGGCAGCGCTTCAAAGAGGCGCGTGCGCAGGTGCTGCCACCATTGTTAATC TGCATCTAG
Upstream 100 bases:
>100_bases AGTTCGGTCATGGAACGGTAAAGAAAGCGTTGCACGCATTCTTGTGCCGGCCCATGGGCCGTGCAATGCATTTATATTGA CGACAGCATTAGTTTCATTG
Downstream 100 bases:
>100_bases AGCGTGTTATGAACTTCCCGCCGCAGCCAGCACCCGTGCGGCTGCAGGCAACATGAGCACGGCAAAGACCAGGAAATGCA GCCCGCCGAGCACTTCGGGT
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 322; Mature: 322
Protein sequence:
>322_residues MDSRLSNSVLAWLRCFDAAARQGSFTRAAAELCITQGAVSQKVKQLERWLGRPLFLRTPRLLVPTPEGKWLAVVLRESFE AIEGTLAQMRRSTPAHAAATLSCSPSFAMQWLTPRLGEFFRRHPDTGLRVFGEFHRIDRTRMVRDGVEAAVRFDPEEYTD LDATGFLDEWLVPVASPAFVAAHPDLRDAPARLRPEWLLHDGSAWEGADTFEEWQHWFAAQGAPCPDWGGGPQFNLSQLA VGAAITGQGIAMGRAALVLEDVAAGRLVPLCSWSTLSRARYAFVSSPQAGPAMLRVRDWLVEEGQRFKEARAQVLPPLLI CI
Sequences:
>Translated_322_residues MDSRLSNSVLAWLRCFDAAARQGSFTRAAAELCITQGAVSQKVKQLERWLGRPLFLRTPRLLVPTPEGKWLAVVLRESFE AIEGTLAQMRRSTPAHAAATLSCSPSFAMQWLTPRLGEFFRRHPDTGLRVFGEFHRIDRTRMVRDGVEAAVRFDPEEYTD LDATGFLDEWLVPVASPAFVAAHPDLRDAPARLRPEWLLHDGSAWEGADTFEEWQHWFAAQGAPCPDWGGGPQFNLSQLA VGAAITGQGIAMGRAALVLEDVAAGRLVPLCSWSTLSRARYAFVSSPQAGPAMLRVRDWLVEEGQRFKEARAQVLPPLLI CI >Mature_322_residues MDSRLSNSVLAWLRCFDAAARQGSFTRAAAELCITQGAVSQKVKQLERWLGRPLFLRTPRLLVPTPEGKWLAVVLRESFE AIEGTLAQMRRSTPAHAAATLSCSPSFAMQWLTPRLGEFFRRHPDTGLRVFGEFHRIDRTRMVRDGVEAAVRFDPEEYTD LDATGFLDEWLVPVASPAFVAAHPDLRDAPARLRPEWLLHDGSAWEGADTFEEWQHWFAAQGAPCPDWGGGPQFNLSQLA VGAAITGQGIAMGRAALVLEDVAAGRLVPLCSWSTLSRARYAFVSSPQAGPAMLRVRDWLVEEGQRFKEARAQVLPPLLI CI
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Bind to three sites upstream of the gcv promoter [H]
COG id: COG0583
COG function: function code K; Transcriptional regulator
Gene ontology:
Cell location: Cytoplasmic [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=276, Percent_Identity=35.8695652173913, Blast_Score=140, Evalue=9e-35, Organism=Escherichia coli, GI1788706, Length=279, Percent_Identity=33.3333333333333, Blast_Score=140, Evalue=2e-34, Organism=Escherichia coli, GI1786448, Length=305, Percent_Identity=29.1803278688525, Blast_Score=108, Evalue=5e-25, Organism=Escherichia coli, GI157672245, Length=235, Percent_Identity=27.6595744680851, Blast_Score=76, Evalue=3e-15, Organism=Escherichia coli, GI1787589, Length=165, Percent_Identity=31.5151515151515, Blast_Score=67, Evalue=2e-12, Organism=Escherichia coli, GI1789283, Length=301, Percent_Identity=27.906976744186, Blast_Score=65, Evalue=4e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847 - InterPro: IPR005119 - InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 35718; Mature: 35718
Theoretical pI: Translated: 7.09; Mature: 7.09
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 3.7 %Cys+Met (Translated Protein) 1.9 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 3.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDSRLSNSVLAWLRCFDAAARQGSFTRAAAELCITQGAVSQKVKQLERWLGRPLFLRTPR CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCE LLVPTPEGKWLAVVLRESFEAIEGTLAQMRRSTPAHAAATLSCSPSFAMQWLTPRLGEFF EEEECCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCHHEEEEECCHHHHHHHHHHHHHHHH RRHPDTGLRVFGEFHRIDRTRMVRDGVEAAVRFDPEEYTDLDATGFLDEWLVPVASPAFV HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEECCHHHCCCCCCCCHHHHHHHCCCCCEE AAHPDLRDAPARLRPEWLLHDGSAWEGADTFEEWQHWFAAQGAPCPDWGGGPQFNLSQLA EECCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHH VGAAITGQGIAMGRAALVLEDVAAGRLVPLCSWSTLSRARYAFVSSPQAGPAMLRVRDWL HHHHCCCCCHHHHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHCCCCCCHHHHHHHHHH VEEGQRFKEARAQVLPPLLICI HHHHHHHHHHHHHHCCHHHHCC >Mature Secondary Structure MDSRLSNSVLAWLRCFDAAARQGSFTRAAAELCITQGAVSQKVKQLERWLGRPLFLRTPR CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCE LLVPTPEGKWLAVVLRESFEAIEGTLAQMRRSTPAHAAATLSCSPSFAMQWLTPRLGEFF EEEECCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCHHEEEEECCHHHHHHHHHHHHHHHH RRHPDTGLRVFGEFHRIDRTRMVRDGVEAAVRFDPEEYTDLDATGFLDEWLVPVASPAFV HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEECCHHHCCCCCCCCHHHHHHHCCCCCEE AAHPDLRDAPARLRPEWLLHDGSAWEGADTFEEWQHWFAAQGAPCPDWGGGPQFNLSQLA EECCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHH VGAAITGQGIAMGRAALVLEDVAAGRLVPLCSWSTLSRARYAFVSSPQAGPAMLRVRDWL HHHHCCCCCHHHHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHCCCCCCHHHHHHHHHH VEEGQRFKEARAQVLPPLLICI HHHHHHHHHHHHHHCCHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]