| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
The map label for this gene is gcvA
Identifier: 209400092
GI number: 209400092
Start: 3768447
End: 3769364
Strand: Reverse
Name: gcvA
Synonym: ECH74115_4072
Alternate gene names: 209400092
Gene position: 3769364-3768447 (Counterclockwise)
Preceding gene: 209400704
Following gene: 209398748
Centisome position: 67.65
GC content: 49.02
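The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the formulas (inclusive length, position as a percentage of genome length, G+C as a percentage of bases) are assumptions about how such values are conventionally derived, not a description of how this database computes them. Under those assumptions, the listed centisome value of 67.65 is reproduced from the higher coordinate (3769364, the 5' end of a reverse-strand gene), and 49.02% of 918 bases corresponds to 450 G or C bases.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_572_075
START, END = 3_768_447, 3_769_364  # gene lies on the reverse strand


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (assumed formula)."""
    return round(100 * position / genome_length, 2)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string; spaces are ignored."""
    seq = seq.upper().replace(" ", "")
    return round(100 * (seq.count("G") + seq.count("C")) / len(seq), 2)


print(gene_length(START, END))         # 918, matching ">918_bases"
print(centisome(END, GENOME_LENGTH))   # 67.65
print(gc_percent("ATGC"))              # 50.0 (toy input, not the gene)
```

The length of 918 bases also agrees with the protein length: 305 codons plus one stop codon is 306 codons, or 918 bases.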
Gene sequence:
>918_bases ATGTCTAAACGATTACCACCGCTAAATGCCTTACGAGTTTTTGATGCCGCAGCACGCCATTTAAGTTTCACTCGCGCAGC AGAAGAGCTTTTTGTGACCCAGGCCGCAGTAAGTCATCAAATCAAGTCTCTTGAGGATTTTTTGGGGCTAAAACTGTTCC GCCGCCGTAATCGTTCACTCCTGCTGACCGAGGAAGGGCAAAGCTATTTCCTCGATATCAAAGAGATATTTTCGCAATTA ACCGAAGCGACGCGTAAACTCCAGGCCCGTAGCGCCAAGGGGGCGTTGACGGTCAGTTTACTCCCCAGTTTCGCCATTCA TTGGTTGGTTCCGCGACTTTCCAGCTTTAATTCAGCTTATCCGGGAATTGACGTTCGAATCCAGGCGGTTGATCGTCAGG AAGATAAGCTGGCGGATGATGTTGATGTGGCGATATTTTATGGTCGGGGCAACTGGCCGGGGCTACGGGTGGAAAAACTG TACGCCGAATATTTATTGCCGGTGTGTTCGCCGCTACTGCTGACTGGCGAAAAACCCTTGAAGACCCCGGAAGATCTGGC TAAACATACGTTATTACATGATGCGTCACGCCGTGACTGGCAGACATATACTCGTCAATTGGGGTTAAATCATATCAACG TTCAGCAAGGGCCAATTTTTAGCCATAGCGCCATGGTGCTGCAAGCGGCTATTCACGGGCAGGGAGTGGCGCTGGCAAAT AACGTGATGGCGCAATCTGAAATTGAAGCCGGACGTCTTGTTTGCCCGTTTAATGATGTTCTGGTCAGTAAAAACGCTTT TTATCTGGTTTGTCATGACAGCCAGGCAGAACTGGGTAAAATAGCCGCCTTTCGCCAATGGATCCTGGCGAAAGCCGCTG CTGAACAAGAAAAATTCCGCTTTCGTTATGAACAATAA
Upstream 100 bases:
>100_bases AGCTCAACGGACAATTTATAATGGCTCAGATTAAAAAAACTAATAGGTTACACAGTGTGATCTAATTGTTAAATTCATTT AACATCAAAGTTTAAAAGCC
Downstream 100 bases:
>100_bases TTTACGTAGGGTACGACCATGACCAGCCGTTTTATGCTGATTTTCGCCGCCATTAGCGGCTTCATTTTTGTGGCACTGGG CGCTTTTGGCGCGCATGTGT
Product: DNA-binding transcriptional activator GcvA
Products: NA
Alternate protein names: Gcv operon activator
Number of amino acids: Translated: 305; Mature: 304
Protein sequence:
>305_residues MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQL TEATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKL YAEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALAN NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFRFRYEQ
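As a quick consistency check between the gene sequence and the protein sequence, the two ends of the coding sequence can be translated by hand. The following is a minimal sketch that assumes the standard genetic code (built here from the usual TCAG-ordered table) and a hypothetical helper named `translate`; it is not the tool that produced this record. The two input strings are the first and last 18 bases of the gene sequence listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# standard table is appropriate for reading these codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 0; '*' marks a stop codon."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGTCTAAACGATTACCA"))  # MSKRLP  (start of the protein)
print(translate("TTTCGTTATGAACAATAA"))  # FRYEQ*  (end of the protein plus stop)
```

Both results agree with the listed protein, which begins MSKRLP and ends FRYEQ.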
Sequences:
>Translated_305_residues MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQL TEATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKL YAEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALAN NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFRFRYEQ
>Mature_304_residues SKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQLT EATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKLY AEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALANN VMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFRFRYEQ
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter.
COG id: COG0583
COG function: function code K; Transcriptional regulator
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Escherichia coli | GI1789173 | 305 | 100.00 | 627 | 0.0 |
| Escherichia coli | GI1786448 | 286 | 34.97 | 146 | 1e-36 |
| Escherichia coli | GI1788706 | 288 | 31.25 | 140 | 9e-35 |
| Escherichia coli | GI145693193 | 296 | 27.36 | 102 | 3e-23 |
| Escherichia coli | GI1786401 | 284 | 28.87 | 99 | 2e-22 |
| Escherichia coli | GI157672245 | 212 | 31.13 | 88 | 7e-19 |
| Escherichia coli | GI87081978 | 257 | 31.13 | 81 | 1e-16 |
| Escherichia coli | GI1789639 | 250 | 26.00 | 70 | 2e-13 |
| Escherichia coli | GI145693105 | 143 | 29.37 | 68 | 7e-13 |
| Escherichia coli | GI1788887 | 176 | 34.09 | 67 | 1e-12 |
| Escherichia coli | GI1787128 | 306 | 24.18 | 66 | 3e-12 |
| Escherichia coli | GI1789440 | 266 | 26.32 | 64 | 2e-11 |
| Escherichia coli | GI1787879 | 130 | 32.31 | 63 | 2e-11 |
| Escherichia coli | GI1790262 | 176 | 28.41 | 62 | 4e-11 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): GCVA_ECO57 (P0A9F8)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: D91087
- PIR: F85932
- RefSeq: NP_289363.1
- RefSeq: NP_311695.1
- ProteinModelPortal: P0A9F8
- SMR: P0A9F8
- EnsemblBacteria: EBESCT00000028054
- EnsemblBacteria: EBESCT00000055715
- GeneID: 916524
- GeneID: 958267
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z4125
- KEGG: ecs:ECs3668
- GeneTree: EBGT00070000031706
- HOGENOM: HBG685425
- OMA: FNSAYPE
- ProtClustDB: PRK11139
- BioCyc: ECOL83334:ECS3668-MONOMER
- GO: GO:0005737
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991
- Gene3D: G3DSA:1.10.10.10
- PRINTS: PR00039
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate
EC number: NA
Molecular weight: Translated: 34402; Mature: 34271
Theoretical pI: Translated: 9.30; Mature: 9.30
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 1.0 %Cys (Translated Protein)
- 1.0 %Met (Translated Protein)
- 2.0 %Cys+Met (Translated Protein)
- 1.0 %Cys (Mature Protein)
- 0.7 %Met (Mature Protein)
- 1.6 %Cys+Met (Mature Protein)
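The Cys/Met percentages can be reproduced by counting residues in the protein sequence listed earlier in this record. The sketch below assumes that the mature protein is the translated protein with the initiator methionine removed (consistent with the 305 versus 304 residue counts) and that each percentage is computed from raw counts and rounded to one decimal. The helper name `cys_met_percent` is hypothetical.

```python
# Translated protein sequence copied from the record (305 residues).
TRANSLATED = (
    "MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQL"
    "TEATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKL"
    "YAEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALAN"
    "NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFRFRYEQ"
)
# Assumption: the mature form simply lacks the N-terminal methionine.
MATURE = TRANSLATED[1:]


def cys_met_percent(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100 * cys / n, 1),
        round(100 * met / n, 1),
        round(100 * (cys + met) / n, 1),
    )


print(cys_met_percent(TRANSLATED))  # (1.0, 1.0, 2.0)
print(cys_met_percent(MATURE))      # (1.0, 0.7, 1.6)
```

The combined figure is computed from the summed counts rather than by adding the two rounded percentages, which is why the mature protein shows 1.6 rather than 1.7.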
Secondary structure:
>Translated Secondary Structure
MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSL
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
LLTEEGQSYFLDIKEIFSQLTEATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAY
EEEECCCCEEEEHHHHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHHHHHHHCCCCC
PGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKLYAEYLLPVCSPLLLTGEKPL
CCCEEEEEEECCCHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCHHHCCCCCCC
KTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALAN
CCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHHCCCCCHHHH
NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFR
HHHHHHHCCCCCEECCCHHHEECCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHH
FRYEQ
CCCCC
>Mature Secondary Structure
SKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSL
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
LLTEEGQSYFLDIKEIFSQLTEATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAY
EEEECCCCEEEEHHHHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHHHHHHHCCCCC
PGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKLYAEYLLPVCSPLLLTGEKPL
CCCEEEEEEECCCHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCHHHCCCCCCC
KTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALAN
CCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHHCCCCCHHHH
NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFR
HHHHHHHCCCCCEECCCHHHEECCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHH
FRYEQ
CCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796