Definition | Ralstonia eutropha JMP134 chromosome 1, complete sequence. |
---|---|
Accession | NC_007347 |
Length | 3,806,533 |
The map label for this gene is gcvA [H]
Identifier: 73542316
GI number: 73542316
Start: 2886293
End: 2887243
Strand: Direct
Name: gcvA [H]
Synonym: Reut_A2631
Alternate gene names: 73542316
Gene position: 2886293-2887243 (Clockwise)
Preceding gene: 73542314
Following gene: 73542317
Centisome position: 75.82
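The gene length and centisome position can be recomputed from the coordinates and the chromosome length given in this record. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the function names are illustrative and not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / chromosome_length


# Values taken from this record.
length = gene_length(2886293, 2887243)
position = centisome(2886293, 3806533)
print(length, round(position, 2))
```

Under these assumptions the result agrees with the 951-base gene sequence and the centisome value listed in this record.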
GC content: 66.88
Gene sequence:
>951_bases ATGGCCCGCAAACCTATCGATCGCCCTGTCCACATCCCGCCGTTGCAAGCGCTGCGTGCGCTGGAAGCTGCCGCGCGCCA TCGCAGCTTCTCGCGTGCCGCAGAAGAGCTTGCACTGACCCACAGCGCCATCAGTCATCACATGCGGGCGCTCGAGGACA AGCTCGGCACCAAGCTATTTCAGCGTACCGGCAGCCAGATGGCGCCGACCAGTGCCGGAGCAAGGCTCGCCGAGCAAATC CGTGCCGCGCTGGACGATATTGAAAGCGCCGTGCGCGAAGCCAGCAGCAGCACGACCACACCGGTCGTGCGACTGCAGGT CAGCGTCATGGCGGACCTGGCCAACGCCTGGCTGATTCGCCGCTTGCCGAGCCTGCACGCCCAGGTGCCGTCGCTGGACC TGCACCTGCGGCTGCATGCCGAGATCACGCCGCCCGACCCGTACAGCGTCGACGTCGGCATCTGGCACCAGCGCATCGAT CTTCCCGGCTTCGAATGCCACAACCTGATCGAAGACCATGTTATCGCCGTGGCCAGCCCGGCGCTCCTGGCGCGCTACCC CGGCTTCACGCCAGCCGACGTGCCTCGCATGCCGATGCTGCGCTTTGCCCTGCGGCCCTGGCGCGACTGGCTCGAAGCCG CCGGCCTGCCAGATGCCGAGCCCGAACGCGGCCCGATCTTCCAGGACGCCGGGCTGATGCTGCAGTCAGCCGTGGCCGGC CTGGGCGTAGCCACCGCCAGGGCGCAACTGGCCCACGACTACCTTGAGAGCGGCCAGTTGGTGCAGGTCGGTTCTACCCG CATTCCGTCCAGCCTGCATTACTGGGTCACCTGGCGCGAGGGCAATCCTCGCGAAAAAGCGATCCAGCAGTTCCATGCCT GGCTGCAGGAACAGGTGCGCCGCGAAACGCCGACCGCGGAGCCGACGACGACAGACGACGCCATGAAGTAA
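The GC content reported for this gene is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation, assuming the sequence is supplied as a plain string in which whitespace between blocks should be ignored (the function name is hypothetical):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces or line breaks that separate sequence blocks.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input, not the gene above.
print(gc_content("ATGC GGCC"))
```

Pasting the 951-base sequence (without its `>951_bases` header) into `gc_content` is one way to check the listed value.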
Upstream 100 bases:
>100_bases CGGGTAGCAGCAGAGGTCGTCATGCCGCCATTGTCCTCACCAGCATGGTGCAACGCCAACGAAAGTTACGGTCGGCAATT GTGAGTAGAATTTGACGGCC
Downstream 100 bases:
>100_bases CAGACATGAAATGTTTTCGATGTCCAAAGTTGGCAATACGAATCATTTTCGTTTGATTCAGAAGAATCCATGGCTTAGCG TCATGCAGGGAAGCGTTGCC
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 316; Mature: 315
Protein sequence:
>316_residues MARKPIDRPVHIPPLQALRALEAAARHRSFSRAAEELALTHSAISHHMRALEDKLGTKLFQRTGSQMAPTSAGARLAEQI RAALDDIESAVREASSSTTTPVVRLQVSVMADLANAWLIRRLPSLHAQVPSLDLHLRLHAEITPPDPYSVDVGIWHQRID LPGFECHNLIEDHVIAVASPALLARYPGFTPADVPRMPMLRFALRPWRDWLEAAGLPDAEPERGPIFQDAGLMLQSAVAG LGVATARAQLAHDYLESGQLVQVGSTRIPSSLHYWVTWREGNPREKAIQQFHAWLQEQVRRETPTAEPTTTDDAMK
Sequences:
>Translated_316_residues MARKPIDRPVHIPPLQALRALEAAARHRSFSRAAEELALTHSAISHHMRALEDKLGTKLFQRTGSQMAPTSAGARLAEQI RAALDDIESAVREASSSTTTPVVRLQVSVMADLANAWLIRRLPSLHAQVPSLDLHLRLHAEITPPDPYSVDVGIWHQRID LPGFECHNLIEDHVIAVASPALLARYPGFTPADVPRMPMLRFALRPWRDWLEAAGLPDAEPERGPIFQDAGLMLQSAVAG LGVATARAQLAHDYLESGQLVQVGSTRIPSSLHYWVTWREGNPREKAIQQFHAWLQEQVRRETPTAEPTTTDDAMK
>Mature_315_residues ARKPIDRPVHIPPLQALRALEAAARHRSFSRAAEELALTHSAISHHMRALEDKLGTKLFQRTGSQMAPTSAGARLAEQIR AALDDIESAVREASSSTTTPVVRLQVSVMADLANAWLIRRLPSLHAQVPSLDLHLRLHAEITPPDPYSVDVGIWHQRIDL PGFECHNLIEDHVIAVASPALLARYPGFTPADVPRMPMLRFALRPWRDWLEAAGLPDAEPERGPIFQDAGLMLQSAVAGL GVATARAQLAHDYLESGQLVQVGSTRIPSSLHYWVTWREGNPREKAIQQFHAWLQEQVRRETPTAEPTTTDDAMK
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=288, Percent_Identity=32.9861111111111, Blast_Score=149, Evalue=3e-37
Organism=Escherichia coli, GI1786448, Length=294, Percent_Identity=28.9115646258503, Blast_Score=100, Evalue=9e-23
Organism=Escherichia coli, GI1788706, Length=310, Percent_Identity=26.4516129032258, Blast_Score=99, Evalue=5e-22
Organism=Escherichia coli, GI1787128, Length=280, Percent_Identity=27.5, Blast_Score=79, Evalue=3e-16
Organism=Escherichia coli, GI1786401, Length=278, Percent_Identity=23.7410071942446, Blast_Score=64, Evalue=1e-11
Organism=Escherichia coli, GI1787589, Length=137, Percent_Identity=29.1970802919708, Blast_Score=63, Evalue=3e-11
Organism=Escherichia coli, GI1790399, Length=122, Percent_Identity=31.1475409836066, Blast_Score=62, Evalue=4e-11
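The homologue entries follow a fixed `key=value` layout, so they can be read into structured records. The sketch below is one possible parser, assuming every entry carries the same six fields in the same order as shown in this record; the pattern and function name are illustrative, not a format defined by the database.

```python
import re

# One homologue entry: Organism, GI, Length, Percent_Identity, Blast_Score, Evalue.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": match.group("organism").strip(),
            "gi": match.group("gi"),
            "length": int(match.group("length")),
            "percent_identity": float(match.group("identity")),
            "blast_score": int(match.group("score")),
            "evalue": float(match.group("evalue")),
        })
    return hits


example = (
    "Organism=Escherichia coli, GI1789173, Length=288, "
    "Percent_Identity=32.9861111111111, Blast_Score=149, Evalue=3e-37, "
    "Organism=Escherichia coli, GI1786448, Length=294, "
    "Percent_Identity=28.9115646258503, Blast_Score=100, Evalue=9e-23,"
)
hits = parse_homologues(example)
print(len(hits), hits[0]["gi"], hits[0]["blast_score"])
```

Because the pattern uses `finditer`, it works whether the entries are separated by commas on one line or placed on separate lines.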
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 35037; Mature: 34906
Theoretical pI: Translated: 7.06; Mature: 7.06
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
2.5 %Cys+Met (Mature Protein)
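The Cys/Met percentages are residue counts divided by protein length. A minimal sketch, assuming the mature protein is the translated protein with the initiator Met removed (as the Translated/Mature sequences in this record suggest); the function name and the toy sequence are hypothetical.

```python
def cys_met_percentages(protein: str) -> dict:
    """Percentage of Cys (C), Met (M), and their sum in a protein sequence."""
    # Remove the spaces or line breaks that separate sequence blocks.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    n_cys = seq.count("C")
    n_met = seq.count("M")
    return {
        "Cys": 100.0 * n_cys / len(seq),
        "Met": 100.0 * n_met / len(seq),
        "Cys+Met": 100.0 * (n_cys + n_met) / len(seq),
    }


# Toy sequence: 10 residues with one Cys and two Met.
translated = "MACAAAAMAA"
mature = translated[1:]  # assumed: mature form lacks the initiator Met
print(cys_met_percentages(translated))
print(cys_met_percentages(mature))
```

Removing the initiator Met lowers the Met count by one and the length by one, which is why the mature Met percentage is lower than the translated one.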
Secondary structure:
>Translated Secondary Structure
MARKPIDRPVHIPPLQALRALEAAARHRSFSRAAEELALTHSAISHHMRALEDKLGTKLF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QRTGSQMAPTSAGARLAEQIRAALDDIESAVREASSSTTTPVVRLQVSVMADLANAWLIR
HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH
RLPSLHAQVPSLDLHLRLHAEITPPDPYSVDVGIWHQRIDLPGFECHNLIEDHVIAVASP
HHHHHHHCCCCCEEEEEEEEEECCCCCCEEEHHHHHHHCCCCCHHHHHHHHHHHHHHCCH
ALLARYPGFTPADVPRMPMLRFALRPWRDWLEAAGLPDAEPERGPIFQDAGLMLQSAVAG
HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH
LGVATARAQLAHDYLESGQLVQVGSTRIPSSLHYWVTWREGNPREKAIQQFHAWLQEQVR
HHHHHHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHH
RETPTAEPTTTDDAMK
HCCCCCCCCCCHHCCC
>Mature Secondary Structure
ARKPIDRPVHIPPLQALRALEAAARHRSFSRAAEELALTHSAISHHMRALEDKLGTKLF
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QRTGSQMAPTSAGARLAEQIRAALDDIESAVREASSSTTTPVVRLQVSVMADLANAWLIR
HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH
RLPSLHAQVPSLDLHLRLHAEITPPDPYSVDVGIWHQRIDLPGFECHNLIEDHVIAVASP
HHHHHHHCCCCCEEEEEEEEEECCCCCCEEEHHHHHHHCCCCCHHHHHHHHHHHHHHCCH
ALLARYPGFTPADVPRMPMLRFALRPWRDWLEAAGLPDAEPERGPIFQDAGLMLQSAVAG
HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH
LGVATARAQLAHDYLESGQLVQVGSTRIPSSLHYWVTWREGNPREKAIQQFHAWLQEQVR
HHHHHHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHH
RETPTAEPTTTDDAMK
HCCCCCCCCCCHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]