Definition | Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 chromosome, complete genome. |
---|---|
Accession | NC_003197 |
Length | 4,857,432 |
Click here to switch to the map view.
The map label for this gene is gcvA [H]
Identifier: 16766286
GI number: 16766286
Start: 3134272
End: 3135189
Strand: Reverse
Name: gcvA [H]
Synonym: STM2982
Alternate gene names: 16766286
Gene position: 3135189-3134272 (Counterclockwise)
Preceding gene: 16766287
Following gene: 16766285
Centisome position: 64.54
GC content: 50.98
Gene sequence:
>918_bases ATGTCAAAACGATTACCTCCTTTAAATGCGTTACGCGTTTTTGATGCTGCCGCGCGTCATTTGAGCTTTACGCGAGCAGC GGAAGAACTTTTTGTGACGCAGGCCGCAGTAAGCCACCAAATTAAGTCACTTGAGGATTTTTTGGGGCTAAAACTGTTCC GCCGCCGCAATCGTTCGCTGCTGCTGACGGAAGAAGGCCAAAGCTATTTTCTCGACATTAAAGAGATATTTTCCCAGCTC ACGGAAGCAACGCGTAAGCTCCAGGCGCGCAGTGCCAAAGGGGCGCTGACGGTGAGTTTACCGCCCAGTTTTGCCATCCA GTGGCTGGTGCCCAGGCTCTCCAGCTTTAACTCAGCTTATCCGGGAATTGACGTTCGGATACAAGCCGTTGACCGTCAGG AAGATAAACTGGCAGACGATGTCGATGTCGCGATTTTTTATGGCCGGGGAAACTGGCCGGGCTTGCGCGTCGAAAAATTA TACGCAGAATATTTACTGCCCGTCTGTTCTCCTTTATTACTCACAGGCGAAAAACCGTTAAAAACGCCGGAAGATCTGGC GAAACACACGCTATTGCACGACGCCTCCCGGCGTGACTGGCAGGCCTATACTCGTCAGTTAGGGCTCAACCATATTAACG TTCAGCAAGGGCCGATATTTAGCCACAGCGCGATGGTGCTACAGGCCGCTATTCACGGACAGGGGATCGCGCTGGCGAAT AATGTTATGGCGCAGTCGGAAATTGAAGCCGGGCGACTGGTGTGTCCATTTAACGATGTTCTGGTGAGTAAGAATGCTTT TTATCTGGTTTGTCATGACAGCCAGGCAGAACTGGGTAAAATAGCCGCCTTCCGTCAGTGGATTTTGGCGAAGGCGGCGA CGGAGCAAGAAAAATTCCGTTTTCGTTACGAACAATAA
Upstream 100 bases:
>100_bases AACCTCAACGGACAATTTATAATGTCTCAGATTAAAAAAACTAATAGGTTACATGTTGTTACCTATTTGTTAAATTCATT CGACATTAAGTCCAGAGGCC
Downstream 100 bases:
>100_bases TTAACTTAGGGTATGACCATGACCAGCCGTTTTATGCTGATTGTCGCCGCTATCAGCGGTTTTATTTACGTTGCTCTGGG CGCGTTCGGGGCGCATGTTT
Product: DNA-binding transcriptional activator GcvA
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 305; Mature: 304
Protein sequence:
>305_residues MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQL TEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKL YAEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQAYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGIALAN NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAATEQEKFRFRYEQ
Sequences:
>Translated_305_residues MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQL TEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKL YAEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQAYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGIALAN NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAATEQEKFRFRYEQ >Mature_304_residues SKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQLT EATRKLQARSAKGALTVSLPPSFAIQWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKLY AEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQAYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGIALANN VMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAATEQEKFRFRYEQ
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Bind to three sites upstream of the gcv promoter [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=305, Percent_Identity=98.3606557377049, Blast_Score=618, Evalue=1e-178, Organism=Escherichia coli, GI1786448, Length=286, Percent_Identity=34.965034965035, Blast_Score=149, Evalue=3e-37, Organism=Escherichia coli, GI1788706, Length=294, Percent_Identity=30.6122448979592, Blast_Score=141, Evalue=6e-35, Organism=Escherichia coli, GI145693193, Length=296, Percent_Identity=28.0405405405405, Blast_Score=107, Evalue=1e-24, Organism=Escherichia coli, GI1786401, Length=284, Percent_Identity=28.8732394366197, Blast_Score=97, Evalue=2e-21, Organism=Escherichia coli, GI157672245, Length=212, Percent_Identity=31.1320754716981, Blast_Score=87, Evalue=2e-18, Organism=Escherichia coli, GI87081978, Length=257, Percent_Identity=31.9066147859922, Blast_Score=84, Evalue=1e-17, Organism=Escherichia coli, GI1789639, Length=250, Percent_Identity=26.4, Blast_Score=71, Evalue=1e-13, Organism=Escherichia coli, GI1787128, Length=305, Percent_Identity=23.2786885245902, Blast_Score=67, Evalue=1e-12, Organism=Escherichia coli, GI1788887, Length=176, Percent_Identity=34.0909090909091, Blast_Score=65, Evalue=5e-12, Organism=Escherichia coli, GI1789440, Length=266, Percent_Identity=26.6917293233083, Blast_Score=65, Evalue=6e-12, Organism=Escherichia coli, GI145693105, Length=143, Percent_Identity=29.3706293706294, Blast_Score=65, Evalue=6e-12, Organism=Escherichia coli, GI1790262, Length=176, Percent_Identity=28.9772727272727, Blast_Score=64, Evalue=8e-12, Organism=Escherichia coli, GI1787879, Length=130, Percent_Identity=32.3076923076923, Blast_Score=64, Evalue=2e-11,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847 - InterPro: IPR005119 - InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 34391; Mature: 34260
Theoretical pI: Translated: 9.30; Mature: 9.30
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein) 1.0 %Met (Translated Protein) 2.0 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 0.7 %Met (Mature Protein) 1.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSL CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE LLTEEGQSYFLDIKEIFSQLTEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSSFNSAY EEEECCCCEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCC PGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKLYAEYLLPVCSPLLLTGEKPL CCCEEEEEEECCCHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCHHEECCCCCC KTPEDLAKHTLLHDASRRDWQAYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGIALAN CCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHHCCCCEEHHH NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAATEQEKFR HHHHHHHCCCCCEECCCHHHEECCCEEEEEEECCCHHHHHHHHHHHHHHHHHHCHHHHHH FRYEQ CCCCC >Mature Secondary Structure SKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSL CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE LLTEEGQSYFLDIKEIFSQLTEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSSFNSAY EEEECCCCEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCC PGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKLYAEYLLPVCSPLLLTGEKPL CCCEEEEEEECCCHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCHHEECCCCCC KTPEDLAKHTLLHDASRRDWQAYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGIALAN CCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHHCCCCEEHHH NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAATEQEKFR HHHHHHHCCCCCEECCCHHHEECCCEEEEEEECCCHHHHHHHHHHHHHHHHHHCHHHHHH FRYEQ CCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]