| Definition | Yersinia pestis CO92 chromosome, complete genome. |
|---|---|
| Accession | NC_003143 |
| Length | 4,653,728 |
The map label for this gene is gcvA [H]
Identifier: 218928197
GI number: 218928197
Start: 1169923
End: 1170840
Strand: Direct
Name: gcvA [H]
Synonym: YPO1029
Alternate gene names: 218928197
Gene position: 1169923-1170840 (Clockwise)
Preceding gene: 218928194
Following gene: 218928198
Centisome position: 25.14
GC content: 49.13
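The coordinate-derived fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the record. The function names are hypothetical, and it assumes 1-based inclusive coordinates and that the centisome position is the start coordinate expressed as a percentage of the chromosome length (4,653,728). Both assumptions reproduce the values listed here (918 bases, 25.14). The GC helper counts G and C over the sequence length and ignores whitespace.

```python
CHROMOSOME_LENGTH = 4_653_728  # "Length" field of NC_003143
GENE_START = 1_169_923         # "Start" field
GENE_END = 1_170_840           # "End" field


def gene_length(start, end):
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start, chromosome_length):
    """Start position as a percentage of the chromosome (assumed definition)."""
    return round(100.0 * start / chromosome_length, 2)


def gc_content(seq):
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


print(gene_length(GENE_START, GENE_END))         # 918
print(centisome(GENE_START, CHROMOSOME_LENGTH))  # 25.14
```

Passing the gene sequence listed in this record to `gc_content` would be the way to check the GC content field.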
Gene sequence:
>918_bases ATGTCAAAACGATTACCACCCCTGAATGCCTTACGGGCTTTTGATGCCGCCGCCCGTCACCTCAGTTTCACTAAAGCAGC CGAGGAGTTATTTGTCACTCAAGCCGCTGTTAGCCACCAGATCAAATCGCTGGAGGATTTTCTCGGGCTGAAATTGTTCC GCCGACGTAATCGCTCTTTGTTGTTGACCGAAGAAGGGCAGAGCTATTACCTCGATATCAAAGAGATTTTCACGTCTATT AATGAAGCGACTCGTAAGTTGCAAGCGCGCAGTGCGAAGGGAGCGTTAACCGTCAGTTTGCCGCCCAGTTTTGCCATTCA ATGGCTAGTTCCTCGTTTGTCTGGGTTTAATGCGGCTTATCCAGGCATTGATGTGAGGATCCAAGCGGTGGATCGGGAAG AGGATAAGCTCGCTGATGATGTGGATGTCGCGATATTCTACGGCCGCGGTAATTGGAGTGGGTTGCGTACAGAGCGCTTG TATGCTGAATTCCTATTACCCGTCTGTGCACCTAGCTTACTAACTGGTGAGAATGGATTAAAAGTACCATCAGATCTGGC TAATCACACCTTGTTGCATGATACTTCACGCCGTGACTGGTTAGCGTATACCCGCCAACTGGGGGTACCGCAGATTAATG TGCAGCAAGGCCCGATATTTAGCCACAGTGCCATGGTGGTTCAGGCTGCGGTTCATGGTCAAGGGATTGCTCTGGTGAAT AATGTCATGGCCCAGTCTGAGATTGAAGCGGGCCGATTGGTTTGCCCGTTTAACGATGTATTGGTGAGTAAGAATGCTTT TTATTTGGTTTGTCATGACAGTCAGGCAGAACTGGGTAAAATAGCCGCCTTCCGTCAGTGGATACTGGCAAGAGCTGCCA GCGAGCAGGAGAAATTACGTTTTCGTTATGAAAACTGA
Upstream 100 bases:
>100_bases ATCCTCAATGGACAATTTATAATGGCTCGGATTATAAAAACTAATAAGTAAACAAAGGGTTTCATCTGATGATGGGCCGT TATAAAAAAGTGTCCAATTC
Downstream 100 bases:
>100_bases TCAATCATAATGTTTGGTTAATTGTAATGCTTGGTTAATCACTGCACTTGATTGGTTACAGTATCTGCGCAGTAATGACA ACCCATTCATGATGGTAACC
Product: DNA-binding transcriptional activator GcvA
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 305; Mature: 304
Protein sequence:
>305_residues MSKRLPPLNALRAFDAAARHLSFTKAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYYLDIKEIFTSI NEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSGFNAAYPGIDVRIQAVDREEDKLADDVDVAIFYGRGNWSGLRTERL YAEFLLPVCAPSLLTGENGLKVPSDLANHTLLHDTSRRDWLAYTRQLGVPQINVQQGPIFSHSAMVVQAAVHGQGIALVN NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILARAASEQEKLRFRYEN
Sequences:
>Translated_305_residues MSKRLPPLNALRAFDAAARHLSFTKAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYYLDIKEIFTSI NEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSGFNAAYPGIDVRIQAVDREEDKLADDVDVAIFYGRGNWSGLRTERL YAEFLLPVCAPSLLTGENGLKVPSDLANHTLLHDTSRRDWLAYTRQLGVPQINVQQGPIFSHSAMVVQAAVHGQGIALVN NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILARAASEQEKLRFRYEN
>Mature_304_residues SKRLPPLNALRAFDAAARHLSFTKAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYYLDIKEIFTSIN EATRKLQARSAKGALTVSLPPSFAIQWLVPRLSGFNAAYPGIDVRIQAVDREEDKLADDVDVAIFYGRGNWSGLRTERLY AEFLLPVCAPSLLTGENGLKVPSDLANHTLLHDTSRRDWLAYTRQLGVPQINVQQGPIFSHSAMVVQAAVHGQGIALVNN VMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILARAASEQEKLRFRYEN
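In this record the mature sequence differs from the translated one only by the missing N-terminal methionine (305 versus 304 residues). A minimal sketch of that relationship follows. The function name is hypothetical, and the rule it applies (drop a leading M, otherwise leave the sequence unchanged) is an assumption inferred from the two sequences shown here, not a general model of protein maturation.

```python
def mature_from_translated(translated):
    """Drop the initiator methionine, if present (assumed rule for this record)."""
    seq = "".join(translated.split()).upper()
    return seq[1:] if seq.startswith("M") else seq


# First residues of the translated sequence from this record.
prefix = "MSKRLPPLNALRAF"
print(mature_from_translated(prefix))  # SKRLPPLNALRAF
```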
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=305, Percent_Identity=87.8688524590164, Blast_Score=561, Evalue=1e-161
Organism=Escherichia coli, GI1786448, Length=286, Percent_Identity=34.6153846153846, Blast_Score=145, Evalue=4e-36
Organism=Escherichia coli, GI1788706, Length=288, Percent_Identity=30.9027777777778, Blast_Score=140, Evalue=1e-34
Organism=Escherichia coli, GI145693193, Length=296, Percent_Identity=27.027027027027, Blast_Score=103, Evalue=1e-23
Organism=Escherichia coli, GI1786401, Length=274, Percent_Identity=27.7372262773723, Blast_Score=84, Evalue=1e-17
Organism=Escherichia coli, GI1789639, Length=250, Percent_Identity=28.4, Blast_Score=81, Evalue=1e-16
Organism=Escherichia coli, GI157672245, Length=162, Percent_Identity=33.3333333333333, Blast_Score=80, Evalue=1e-16
Organism=Escherichia coli, GI87081978, Length=261, Percent_Identity=28.735632183908, Blast_Score=77, Evalue=2e-15
Organism=Escherichia coli, GI1787128, Length=304, Percent_Identity=23.6842105263158, Blast_Score=72, Evalue=6e-14
Organism=Escherichia coli, GI145693105, Length=122, Percent_Identity=31.1475409836066, Blast_Score=65, Evalue=6e-12
Organism=Escherichia coli, GI1790262, Length=257, Percent_Identity=25.6809338521401, Blast_Score=62, Evalue=6e-11
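The homologue entries follow a regular comma-separated `key=value` layout, so they can be loaded into records for sorting or filtering. The parser below is a sketch under that assumption. `parse_homologues` is a hypothetical helper, not a tool associated with this database, and it treats each `Organism=` token as the start of a new hit.

```python
def parse_homologues(text):
    """Parse 'Organism=..., GI..., Length=..., ...' entries into dicts."""
    hits = []
    for chunk in text.split("Organism=")[1:]:
        fields = [f.strip() for f in chunk.split(",") if f.strip()]
        hit = {"Organism": fields[0]}
        for field in fields[1:]:
            if "=" in field:
                key, value = field.split("=", 1)
                hit[key] = value
            elif field.startswith("GI"):
                hit["GI"] = field[2:]  # bare GI number without the prefix
        # Convert the numeric fields used in this record.
        hit["Length"] = int(hit["Length"])
        hit["Percent_Identity"] = float(hit["Percent_Identity"])
        hit["Blast_Score"] = int(hit["Blast_Score"])
        hit["Evalue"] = float(hit["Evalue"])
        hits.append(hit)
    return hits


# The two best hits from this record.
sample = (
    "Organism=Escherichia coli, GI1789173, Length=305, "
    "Percent_Identity=87.8688524590164, Blast_Score=561, Evalue=1e-161, "
    "Organism=Escherichia coli, GI1786448, Length=286, "
    "Percent_Identity=34.6153846153846, Blast_Score=145, Evalue=4e-36,"
)
for hit in parse_homologues(sample):
    print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```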
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 34082; Mature: 33951
Theoretical pI: Translated: 8.78; Mature: 8.78
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
1.0 %Met (Translated Protein)
2.0 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
0.7 %Met (Mature Protein)
1.6 %Cys+Met (Mature Protein)
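These percentages are residue counts divided by sequence length. The protein sequence in this record contains 3 Cys and 3 Met residues, one of the Met being the initiator that is absent from the mature form. The sketch below reproduces the listed values with a hypothetical helper, `residue_percent`. To stay short it uses a synthetic stand-in sequence with the same length and Cys/Met counts rather than the real sequence, and rounding to one decimal place is an assumption about how the record was generated.

```python
def residue_percent(seq, residues):
    """Percentage of the given residues in seq, rounded to one decimal."""
    seq = "".join(seq.split()).upper()
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


# Synthetic stand-in: 305 residues with 3 Cys and 3 Met (initiator first).
translated = "M" + "C" * 3 + "M" * 2 + "A" * 299
mature = translated[1:]  # initiator Met removed, 304 residues

print(residue_percent(translated, "C"), residue_percent(translated, "M"),
      residue_percent(translated, "CM"))  # 1.0 1.0 2.0
print(residue_percent(mature, "C"), residue_percent(mature, "M"),
      residue_percent(mature, "CM"))      # 1.0 0.7 1.6
```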
Secondary structure:
>Translated Secondary Structure
MSKRLPPLNALRAFDAAARHLSFTKAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSL
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
LLTEEGQSYYLDIKEIFTSINEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSGFNAAY
EEEECCCEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHCCCCCCC
PGIDVRIQAVDREEDKLADDVDVAIFYGRGNWSGLRTERLYAEFLLPVCAPSLLTGENGL
CCCEEEEEEECCCHHHCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCHHEECCCCC
KVPSDLANHTLLHDTSRRDWLAYTRQLGVPQINVQQGPIFSHSAMVVQAAVHGQGIALVN
CCCHHHHCCEEECCCCCHHHHHHHHHCCCCEEECCCCCCCCCCHHHEEHHHCCCCHHHHH
NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILARAASEQEKLR
HHHHHHHCCCCCEECCCHHHEECCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCHHHHE
FRYEN
EECCC
>Mature Secondary Structure
SKRLPPLNALRAFDAAARHLSFTKAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSL
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
LLTEEGQSYYLDIKEIFTSINEATRKLQARSAKGALTVSLPPSFAIQWLVPRLSGFNAAY
EEEECCCEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHCCCCCCC
PGIDVRIQAVDREEDKLADDVDVAIFYGRGNWSGLRTERLYAEFLLPVCAPSLLTGENGL
CCCEEEEEEECCCHHHCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCHHEECCCCC
KVPSDLANHTLLHDTSRRDWLAYTRQLGVPQINVQQGPIFSHSAMVVQAAVHGQGIALVN
CCCHHHHCCEEECCCCCHHHHHHHHHCCCCEEECCCCCCCCCCHHHEEHHHCCCCHHHHH
NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILARAASEQEKLR
HHHHHHHCCCCCEECCCHHHEECCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCHHHHE
FRYEN
EECCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]