Definition | Sphingopyxis alaskensis RB2256, complete genome. |
---|---|
Accession | NC_008048 |
Length | 3,345,170 |
The map label for this gene is gcvA [H]
Identifier: 103485861
GI number: 103485861
Start: 377378
End: 378283
Strand: Direct
Name: gcvA [H]
Synonym: Sala_0366
Alternate gene names: 103485861
Gene position: 377378-378283 (Clockwise)
Preceding gene: 103485859
Following gene: 103485862
Centisome position: 11.28
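The centisome value appears to be the gene start expressed as a percentage of the genome length. A minimal sketch under that assumption, using the Start and Length fields of this record (the function name is illustrative and not part of the record):

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return the gene start as a percentage of the genome length.

    Assumption: centisome = 100 * start / genome length.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Values taken from this record: Start 377378, Length 3,345,170.
print(round(centisome_position(377378, 3345170), 2))  # 11.28
```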
GC content: 71.52
Gene sequence:
>906_bases ATGACTCAACCTGATTCATCTATAGCGCAGCTGCGCCGCCTGCCGCCGCTCGCCGCCCTGCGCGCGTTCGAAGCCGCCGC GCGCCATGTCAGCTTTCGGCAGGCGGCGGAGGAGCTGGGGGTGACGCCGACGGCGATCAGCCACCAGGTCCGGCTGCTGG AGGACAGCCTGGGATTCGCGCTGTTCGTGCGGCGCGCGCGCGGCGTCGTGCTGACCGACGCGGGGCGGCGGCTGTTCCCG ACCCTGCGCGACGGCTTCGATGCGTTCGAGCGCGCGATCTGGGAATTGTCGCCGCGCCCGCGCCGCGTCGCGGTGACGTT GAGCGCGACGACGCTGTTCACCGTGCGCCGCATCCTGCCCGCGATCGGCCGGTTCCGCGAACGTTTCCCGGACTATGACC TGCGCCTCCATGCCTCCGACGATCCGGTCGACCTGATCGCCGGGGAGGCCGATGTCGCGGTCCGTTATGGATCGGGCGCC TTCCGCGGGTTGGTCGCCACGCCCCTGCTGGCCGAGCGCTTCGGCGTCCTATGCAGCCCCAAGATGGCGATTGCGCGGCC GAACGATCTGCGCGGCGCCACCTTGCTGCACATCGAATGGCGGCGCGCGGGCAAGGCGCCCGACTGGCGCCGGTGGGCGC GGCTTGCGGGGATCGAGGGGCTGGCGGTCGATCGGGGACCGCGCTTCACCGAGGATGATCATGCGCTGCAAGCCGCGGCG GCGGGCAGCGGCGTGGTCTTGTCGGGGCTGGCGCTCGCGCAGCCGACGATCGACGCCGGGCTGCTCGTCCACCCGTTCGG CCCGGTGATCGAGGGCGAAGCCTATCATGTCGTCACGACACCCGACAACCGCGACGCGCCGGCGGTGTGTGCGGTCTGCG ACTGGCTGCGCGAGGACGTCGTTTAG
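The GC content field can be recomputed from a nucleotide string such as the gene sequence. The sketch below assumes GC content is the percentage of G and C bases over the full sequence; the function name is hypothetical. Under that assumption, 71.52% of 906 bases would correspond to 648 G or C bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the bases of a nucleotide string.

    Whitespace is ignored because the record wraps sequences with spaces.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not the gene itself: 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```

To check the record, pass the full 906-base string from the Gene sequence field and round the result to two decimals.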
Upstream 100 bases:
>100_bases ATTCAATTCAAACATGGTGAAATCCTTTTCATCCATAGGAGAGGTGCTATCATGAATAATATGGATTATTTCAGTAGGTC GAACACGAATCCGTCATCTT
Downstream 100 bases:
>100_bases CGCGCAATCGGCCTGCGCTTGCGCGGAACGCTATTAAGGTCTAAACCGCCGCCGGATGGCCGCCCCGCGAGGGGCGGCCC TTATTATTTCCGCTTTTGGA
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 301; Mature: 300
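The two residue counts follow from the 906-base gene length. A small sketch, assuming the coding sequence includes one terminal stop codon (the gene sequence ends in TAG) and that the mature form differs only by removal of the initiator methionine; the helper name is illustrative:

```python
def residue_counts(cds_length: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumptions: the length includes one stop codon, and the mature
    protein lacks only the initiator methionine.
    """
    if cds_length % 3 != 0 or cds_length < 6:
        raise ValueError("cds_length must be a multiple of 3 and at least 6")
    translated = cds_length // 3 - 1  # drop the stop codon
    mature = translated - 1           # drop the initiator Met
    return translated, mature


print(residue_counts(906))  # (301, 300)
```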
Protein sequence:
>301_residues MTQPDSSIAQLRRLPPLAALRAFEAAARHVSFRQAAEELGVTPTAISHQVRLLEDSLGFALFVRRARGVVLTDAGRRLFP TLRDGFDAFERAIWELSPRPRRVAVTLSATTLFTVRRILPAIGRFRERFPDYDLRLHASDDPVDLIAGEADVAVRYGSGA FRGLVATPLLAERFGVLCSPKMAIARPNDLRGATLLHIEWRRAGKAPDWRRWARLAGIEGLAVDRGPRFTEDDHALQAAA AGSGVVLSGLALAQPTIDAGLLVHPFGPVIEGEAYHVVTTPDNRDAPAVCAVCDWLREDVV
Sequences:
>Translated_301_residues MTQPDSSIAQLRRLPPLAALRAFEAAARHVSFRQAAEELGVTPTAISHQVRLLEDSLGFALFVRRARGVVLTDAGRRLFP TLRDGFDAFERAIWELSPRPRRVAVTLSATTLFTVRRILPAIGRFRERFPDYDLRLHASDDPVDLIAGEADVAVRYGSGA FRGLVATPLLAERFGVLCSPKMAIARPNDLRGATLLHIEWRRAGKAPDWRRWARLAGIEGLAVDRGPRFTEDDHALQAAA AGSGVVLSGLALAQPTIDAGLLVHPFGPVIEGEAYHVVTTPDNRDAPAVCAVCDWLREDVV
>Mature_300_residues TQPDSSIAQLRRLPPLAALRAFEAAARHVSFRQAAEELGVTPTAISHQVRLLEDSLGFALFVRRARGVVLTDAGRRLFPT LRDGFDAFERAIWELSPRPRRVAVTLSATTLFTVRRILPAIGRFRERFPDYDLRLHASDDPVDLIAGEADVAVRYGSGAF RGLVATPLLAERFGVLCSPKMAIARPNDLRGATLLHIEWRRAGKAPDWRRWARLAGIEGLAVDRGPRFTEDDHALQAAAA GSGVVLSGLALAQPTIDAGLLVHPFGPVIEGEAYHVVTTPDNRDAPAVCAVCDWLREDVV
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=291, Percent_Identity=38.1443298969072, Blast_Score=190, Evalue=1e-49
Organism=Escherichia coli, GI1788706, Length=292, Percent_Identity=32.1917808219178, Blast_Score=140, Evalue=1e-34
Organism=Escherichia coli, GI1786448, Length=293, Percent_Identity=31.740614334471, Blast_Score=114, Evalue=9e-27
Organism=Escherichia coli, GI1786401, Length=164, Percent_Identity=33.5365853658537, Blast_Score=79, Evalue=4e-16
Organism=Escherichia coli, GI145693105, Length=170, Percent_Identity=29.4117647058824, Blast_Score=76, Evalue=3e-15
Organism=Escherichia coli, GI1787128, Length=261, Percent_Identity=26.8199233716475, Blast_Score=74, Evalue=1e-14
Organism=Escherichia coli, GI157672245, Length=175, Percent_Identity=33.7142857142857, Blast_Score=68, Evalue=7e-13
Organism=Escherichia coli, GI87081978, Length=178, Percent_Identity=30.3370786516854, Blast_Score=68, Evalue=9e-13
Organism=Escherichia coli, GI1788748, Length=185, Percent_Identity=30.2702702702703, Blast_Score=65, Evalue=4e-12
Organism=Escherichia coli, GI1787589, Length=152, Percent_Identity=30.9210526315789, Blast_Score=63, Evalue=2e-11
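The homologue entries follow a regular key=value layout, so they can be turned into structured records for sorting or filtering. The parser below is a sketch that assumes every hit has the six fields in the order shown in this record; the pattern and the `parse_hits` name are illustrative, not part of any database API.

```python
import re

# One hit: Organism, GI identifier, Length, Percent_Identity, Blast_Score, Evalue.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"(?P<gi>GI\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> list[dict]:
    """Extract homologue hits from text in the layout used by this record."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": match["organism"].strip(),
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": int(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return hits


# First two hits copied from the Homologues field.
sample = (
    "Organism=Escherichia coli, GI1789173, Length=291, "
    "Percent_Identity=38.1443298969072, Blast_Score=190, Evalue=1e-49, "
    "Organism=Escherichia coli, GI1788706, Length=292, "
    "Percent_Identity=32.1917808219178, Blast_Score=140, Evalue=1e-34,"
)
for hit in parse_hits(sample):
    print(hit["gi"], hit["score"], hit["evalue"])
```

Because the pattern tolerates any whitespace between fields, it accepts hits separated by commas on one line as well as one hit per line.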
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 32902; Mature: 32771
Theoretical pI: Translated: 8.52; Mature: 8.52
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
0.7 %Met (Translated Protein)
1.7 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
0.3 %Met (Mature Protein)
1.3 %Cys+Met (Mature Protein)
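These percentages are consistent with 3 Cys and 2 Met in the 301-residue translated sequence, and 3 Cys and 1 Met in the 300-residue mature sequence. A minimal sketch of the calculation, assuming each value is a residue count divided by sequence length and rounded to one decimal (the function name is hypothetical):

```python
def residue_percent(seq: str, residue: str) -> float:
    """Percentage of one residue in a protein sequence, to one decimal.

    Whitespace is ignored because the record wraps sequences with spaces.
    """
    residues = "".join(seq.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    return round(100.0 * residues.count(residue.upper()) / len(residues), 1)


# Toy input, not the protein itself: 1 Cys in 4 residues.
print(residue_percent("MKCA", "C"))  # 25.0

# Count-based check against the translated protein: 3 Cys, 2 Met, 301 residues.
print(round(100 * 3 / 301, 1), round(100 * 2 / 301, 1))  # 1.0 0.7
```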
Secondary structure:
>Translated Secondary Structure
MTQPDSSIAQLRRLPPLAALRAFEAAARHVSFRQAAEELGVTPTAISHQVRLLEDSLGFA
CCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH
LFVRRARGVVLTDAGRRLFPTLRDGFDAFERAIWELSPRPRRVAVTLSATTLFTVRRILP
HHHHHHCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEHHHHHHHHHHHH
AIGRFRERFPDYDLRLHASDDPVDLIAGEADVAVRYGSGAFRGLVATPLLAERFGVLCSP
HHHHHHHHCCCCCEEEECCCCCCEEEECCCCEEEEECCCHHHHHHHHHHHHHHHCCCCCC
KMAIARPNDLRGATLLHIEWRRAGKAPDWRRWARLAGIEGLAVDRGPRFTEDDHALQAAA
CEEECCCCCCCCEEEEEEEHHCCCCCCCHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHH
AGSGVVLSGLALAQPTIDAGLLVHPFGPVIEGEAYHVVTTPDNRDAPAVCAVCDWLREDV
CCCCCEECCHHHHCCCCCCCEEEECCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHC
V
C
>Mature Secondary Structure
TQPDSSIAQLRRLPPLAALRAFEAAARHVSFRQAAEELGVTPTAISHQVRLLEDSLGFA
CCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH
LFVRRARGVVLTDAGRRLFPTLRDGFDAFERAIWELSPRPRRVAVTLSATTLFTVRRILP
HHHHHHCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEHHHHHHHHHHHH
AIGRFRERFPDYDLRLHASDDPVDLIAGEADVAVRYGSGAFRGLVATPLLAERFGVLCSP
HHHHHHHHCCCCCEEEECCCCCCEEEECCCCEEEEECCCHHHHHHHHHHHHHHHCCCCCC
KMAIARPNDLRGATLLHIEWRRAGKAPDWRRWARLAGIEGLAVDRGPRFTEDDHALQAAA
CEEECCCCCCCCEEEEEEEHHCCCCCCCHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHH
AGSGVVLSGLALAQPTIDAGLLVHPFGPVIEGEAYHVVTTPDNRDAPAVCAVCDWLREDV
CCCCCEECCHHHHCCCCCCCEEEECCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHC
V
C
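The predicted state string can be summarised as per-state percentages. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not state explicitly; the function name is illustrative.

```python
from collections import Counter


def ss_composition(states: str) -> dict[str, float]:
    """Percentage of H, E and C states in a secondary-structure string.

    Assumption: H = helix, E = strand, C = coil. Pass only the state
    characters, not the amino-acid sequence they are paired with.
    """
    cleaned = "".join(states.split()).upper()
    if not cleaned:
        raise ValueError("empty state string")
    counts = Counter(cleaned)
    return {s: round(100.0 * counts.get(s, 0) / len(cleaned), 1) for s in "HEC"}


# Toy input, not the full prediction: 4 H, 2 E and 4 C out of 10 states.
print(ss_composition("CCHHHHEECC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```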
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]