| Definition | Ruegeria sp. TM1040, complete genome. |
|---|---|
| Accession | NC_008044 |
| Length | 3,200,938 |
The map label for this gene is gcvA [H]
Identifier: 99080685
GI number: 99080685
Start: 894888
End: 895775
Strand: Reverse
Name: gcvA [H]
Synonym: TM1040_0844
Alternate gene names: 99080685
Gene position: 895775-894888 (Counterclockwise)
Preceding gene: 99080689
Following gene: 99080681
Centisome position: 27.98
GC content: 62.95
Gene sequence:
>888_bases ATGTATCAGGAATTGCCCCCTCTGCCATGGCTGCGCGCTTTTGATGCCAGCGCCCGGCTTGGAAACTTCACCCTCGCCGC CCAGGAGCTTGGGCTGACGCCCTCTGCAGTCAGCTACCAGGTGCGTGGACTTGAGGCGACGCTCGGCCACAAGCTCTTTG TGAGAAAGCAAAAATCGCTATTCCTGACACGGCTGGGACAGGCTTACCTGCCCGTGGTGAGCAAGGCATTTGGCGATCTG GAGGCCACCACGTCGAACCTCTTTGGCAAGAGTGCTGCGCAAGAAATCACGCTCAGGTGCCTGAGTTCGCTCAACATGCT GTGGCTTGCGCCGCGGCTTTCGCGTTTTCAGGCGCAGCACCCGGACTGCCGCCTGCGGGTGCTCTCGGCGCTTTGGGGTG AGACCCCAGAGGCGCTGCAGATCGACATCGACATTCGCTATGGCAATGGCAGTTGGTCAGATGGCGAGGTGACGCGTCTG ATGCATCACGAGGTGCTCGCTGTCTGTGCGCCTGCGCTGCATCCCGGCCCCGATCCGCAGGCGATTGCGTCCTGCCCGCT GATTGAGGTGATTGGCGTCACCGACACATGGCATCACTTCTTTGCCCTGCATGGGCTCGCACCTCCCTCTTCTGGGCCGG CGTTGAAGGTCGACCAATCGCTGATCGCGCTTGAGCTCGCCACGCGTGGCATGGGCGTTGCGCTGATCGCCCGTGTCTTT GCCCAGCGCTATCTGGAGAGCGGTGCCTTGGTGGAGGCGACGGACCTCGCCCTGCCCGCGCGCGAGGGGCACTACATCGT GCAGCCGGAGGCCCGCAATCACTTTCGCCCGGAAGTCGCGGCGCTCAAAGACTGGCTCTTGCAGGAGGCCGCGCAAACAG GCTGTTGA
Upstream 100 bases:
>100_bases ATTCATCGCGCCAGCGTTACAGCTCCGGCTCCAATCCCCATAACGAAAAATAATTCGCCGTAACCTTAAAAATATTCGCA GATTGGTGTAGAGCAAAACC
Downstream 100 bases:
>100_bases TCTCCCCCGGCGCAGAGGCGCACCGGGAGAGATCTGGCTTCATCAATTTGTCGCAACACTCAGCGGATAGTCCACGTTGA GCGAAAGCGCCGGGATCTGT
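The coordinate fields above (Start, End, Strand, Centisome position, GC content) can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates, and it assumes the centisome position is the first transcribed base expressed as a percentage of the genome length, which is consistent with the values in this record but is not stated by the database. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_200_938
START, END = 894_888, 895_775
STRAND = "Reverse"


def gene_length(start, end):
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide string."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


length = gene_length(START, END)
# Assumption: a reverse-strand gene begins at its End coordinate.
first_base = END if STRAND == "Reverse" else START

print(length)                                          # 888
print((length - 3) // 3)                               # 295 codons before the stop
print(round(centisome(first_base, GENOME_LENGTH), 2))  # 27.98
```

Passing the full gene sequence to `gc_content` would give the figure to compare with the GC content field.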
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 295; Mature: 295
Protein sequence:
>295_residues MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSLFLTRLGQAYLPVVSKAFGDL EATTSNLFGKSAAQEITLRCLSSLNMLWLAPRLSRFQAQHPDCRLRVLSALWGETPEALQIDIDIRYGNGSWSDGEVTRL MHHEVLAVCAPALHPGPDPQAIASCPLIEVIGVTDTWHHFFALHGLAPPSSGPALKVDQSLIALELATRGMGVALIARVF AQRYLESGALVEATDLALPAREGHYIVQPEARNHFRPEVAALKDWLLQEAAQTGC
Sequences:
>Translated_295_residues MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSLFLTRLGQAYLPVVSKAFGDL EATTSNLFGKSAAQEITLRCLSSLNMLWLAPRLSRFQAQHPDCRLRVLSALWGETPEALQIDIDIRYGNGSWSDGEVTRL MHHEVLAVCAPALHPGPDPQAIASCPLIEVIGVTDTWHHFFALHGLAPPSSGPALKVDQSLIALELATRGMGVALIARVF AQRYLESGALVEATDLALPAREGHYIVQPEARNHFRPEVAALKDWLLQEAAQTGC
>Mature_295_residues MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSLFLTRLGQAYLPVVSKAFGDL EATTSNLFGKSAAQEITLRCLSSLNMLWLAPRLSRFQAQHPDCRLRVLSALWGETPEALQIDIDIRYGNGSWSDGEVTRL MHHEVLAVCAPALHPGPDPQAIASCPLIEVIGVTDTWHHFFALHGLAPPSSGPALKVDQSLIALELATRGMGVALIARVF AQRYLESGALVEATDLALPAREGHYIVQPEARNHFRPEVAALKDWLLQEAAQTGC
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=295, Percent_Identity=34.9152542372881, Blast_Score=155, Evalue=3e-39
Organism=Escherichia coli, GI1786448, Length=288, Percent_Identity=32.2916666666667, Blast_Score=130, Evalue=1e-31
Organism=Escherichia coli, GI1788706, Length=295, Percent_Identity=27.1186440677966, Blast_Score=98, Evalue=5e-22
Organism=Escherichia coli, GI1786508, Length=134, Percent_Identity=30.5970149253731, Blast_Score=67, Evalue=9e-13
Organism=Escherichia coli, GI1790262, Length=258, Percent_Identity=27.1317829457364, Blast_Score=65, Evalue=6e-12
Organism=Escherichia coli, GI87082024, Length=271, Percent_Identity=29.1512915129151, Blast_Score=62, Evalue=5e-11
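The homologue entries above use a flat `key=value` layout in which the GI identifier is the only field without a key. A minimal parsing sketch follows, assuming every entry starts with `Organism=` and that fields are separated by commas or line breaks. The `Hit` class and `parse_homologues` function are hypothetical names introduced for illustration, not part of any database tooling.

```python
import re
from dataclasses import dataclass


@dataclass
class Hit:
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


def parse_homologues(text):
    """Parse 'Organism=..., GI..., Length=...' entries into Hit objects."""
    records = []
    for part in re.split(r"[,\n]", text):
        part = part.strip()
        if not part:
            continue  # tolerate trailing commas and blank lines
        if part.startswith("Organism="):
            records.append({})  # each entry is assumed to begin here
        if not records:
            raise ValueError("entry does not start with Organism=")
        if "=" in part:
            key, value = part.split("=", 1)
            records[-1][key] = value
        else:
            records[-1]["GI"] = part  # bare token such as GI1789173
    return [
        Hit(r["Organism"], r["GI"], int(r["Length"]),
            float(r["Percent_Identity"]), int(r["Blast_Score"]),
            float(r["Evalue"]))
        for r in records
    ]


# First two entries copied from the record above.
SAMPLE = (
    "Organism=Escherichia coli, GI1789173, Length=295, "
    "Percent_Identity=34.9152542372881, Blast_Score=155, Evalue=3e-39, "
    "Organism=Escherichia coli, GI1786448, Length=288, "
    "Percent_Identity=32.2916666666667, Blast_Score=130, Evalue=1e-31,"
)
hits = parse_homologues(SAMPLE)
best = max(hits, key=lambda h: h.blast_score)
print(best.gi, best.blast_score)  # GI1789173 155
```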
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 32274; Mature: 32274
Theoretical pI: Translated: 6.85; Mature: 6.85
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.7 %Cys (Translated Protein)
1.4 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
1.7 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
3.1 %Cys+Met (Mature Protein)
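The Cys/Met percentages above can be reproduced from the 295-residue protein sequence given in this record. The sketch below assumes the figures are simple residue counts divided by sequence length and rounded to one decimal place. `residue_percent` is a hypothetical helper name.

```python
# Protein sequence copied from this record (translated and mature forms are identical).
PROTEIN = (
    "MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSL"
    "FLTRLGQAYLPVVSKAFGDLEATTSNLFGKSAAQEITLRCLSSLNMLWLAPRLSRFQAQH"
    "PDCRLRVLSALWGETPEALQIDIDIRYGNGSWSDGEVTRLMHHEVLAVCAPALHPGPDPQ"
    "AIASCPLIEVIGVTDTWHHFFALHGLAPPSSGPALKVDQSLIALELATRGMGVALIARVF"
    "AQRYLESGALVEATDLALPAREGHYIVQPEARNHFRPEVAALKDWLLQEAAQTGC"
)


def residue_percent(seq, residues):
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


print(len(PROTEIN))                   # 295
print(residue_percent(PROTEIN, "C"))  # 1.7
print(residue_percent(PROTEIN, "M"))  # 1.4
print(residue_percent(PROTEIN, "CM")) # 3.1
```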
Secondary structure:
>Translated Secondary Structure
MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSL
CCCCCCCCCHHHHCCCCCCCCCEEEEHHHHCCCCCHHEEEEECHHHHHCCHHHHHHHHHH
FLTRLGQAYLPVVSKAFGDLEATTSNLFGKSAAQEITLRCLSSLNMLWLAPRLSRFQAQH
HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC
PDCRLRVLSALWGETPEALQIDIDIRYGNGSWSDGEVTRLMHHEVLAVCAPALHPGPDPQ
CCHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHH
AIASCPLIEVIGVTDTWHHFFALHGLAPPSSGPALKVDQSLIALELATRGMGVALIARVF
HHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCEEECHHHHHHHHHHCCCHHHHHHHHH
AQRYLESGALVEATDLALPAREGHYIVQPEARNHFRPEVAALKDWLLQEAAQTGC
HHHHHHCCCEEEEHHCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSL
CCCCCCCCCHHHHCCCCCCCCCEEEEHHHHCCCCCHHEEEEECHHHHHCCHHHHHHHHHH
FLTRLGQAYLPVVSKAFGDLEATTSNLFGKSAAQEITLRCLSSLNMLWLAPRLSRFQAQH
HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC
PDCRLRVLSALWGETPEALQIDIDIRYGNGSWSDGEVTRLMHHEVLAVCAPALHPGPDPQ
CCHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHH
AIASCPLIEVIGVTDTWHHFFALHGLAPPSSGPALKVDQSLIALELATRGMGVALIARVF
HHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCEEECHHHHHHHHHHCCCHHHHHHHHH
AQRYLESGALVEATDLALPAREGHYIVQPEARNHFRPEVAALKDWLLQEAAQTGC
HHHHHHCCCEEEEHHCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHCCC
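The secondary structure prediction above pairs each 60-residue block of sequence with a same-length string of state codes. The sketch below summarizes the first block only, as a sample, by counting states and listing contiguous segments. It assumes the common convention that H is helix, E is strand, and C is coil, which the record itself does not define. The function names are hypothetical.

```python
from collections import Counter
from itertools import groupby

# First 60-residue block and its predicted states, copied from the record above.
SEQ = "MYQELPPLPWLRAFDASARLGNFTLAAQELGLTPSAVSYQVRGLEATLGHKLFVRKQKSL"
SS = "CCCCCCCCCHHHHCCCCCCCCCEEEEHHHHCCCCCHHEEEEECHHHHHCCHHHHHHHHHH"


def state_counts(states):
    """Count how many residues fall in each predicted state."""
    return Counter(states)


def segments(states):
    """Run-length encode states as (state, start, end), 1-based inclusive."""
    result = []
    position = 1
    for state, run in groupby(states):
        run_length = len(list(run))
        result.append((state, position, position + run_length - 1))
        position += run_length
    return result


assert len(SEQ) == len(SS)  # each residue has exactly one state
counts = state_counts(SS)
print(counts["H"], counts["E"], counts["C"])  # 25 9 26
print(segments(SS)[:3])  # [('C', 1, 9), ('H', 10, 13), ('C', 14, 22)]
```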
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]