Definition Sphingopyxis alaskensis RB2256, complete genome.
Accession NC_008048
Length 3,345,170

The map label for this gene is gcvA [H]

Identifier: 103488200

GI number: 103488200

Start: 2869767

End: 2870699

Strand: Reverse

Name: gcvA [H]

Synonym: Sala_2723

Alternate gene names: 103488200

Gene position: 2870699-2869767 (Counterclockwise)

Preceding gene: 103488202

Following gene: 103488199

Centisome position: 85.82
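
Note: the coordinates above can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not the database's own procedure. The function names are hypothetical, and it assumes that the centisome position is the gene's first transcribed base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. That assumption reproduces the recorded value of 85.82.

```python
# Values copied from this record.
GENOME_LENGTH = 3_345_170
START, END = 2_869_767, 2_870_699


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


# Assumption: for a reverse-strand gene the 5' end is the End coordinate.
print(gene_length(START, END))        # 933
print(centisome(END, GENOME_LENGTH))  # 85.82
```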

GC content: 66.13
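
Note: GC content is conventionally the percentage of G and C bases in a sequence. The sketch below shows that calculation under this assumption. The function name is hypothetical and the record does not state how its own figure was derived. Passing the full 933-base gene sequence from this record should give a value comparable to the one listed above.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input for illustration only (not taken from the record).
print(gc_content("ATGC"))  # 50.0
```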

Gene sequence:

>933_bases
ATGATTGATGGCATGAACCGGATTCCGCCCCTCGCCGCCGTTCGCAGCTTTGAAGCCGCCGGGCGTTTGCAGAACTTCTC
GCGCGCGGCCGAAGAGCTGGGCATGACCCAGGCGGCGATCAGCTATCAGATTCGTCAATTGGAGGACCGGCTCGGCCGCG
CGCTGTTCGTCCGCGAAAAGGGGCGCGTGCGCCTGTCCGAAACGGGCCAGCGGCTGCTCCCGGCGATCAGCAATGCCTTC
GCCACGATGAGCGACGCTTTTGCCGCGCTGGGCAGCGACGAGGCCGATGTGCTCACGATCAACGCGGTGACCAGCTTCGG
CGGCACATGGCTCAGCGCGCGGATCGGCGGCTTCCAGCTCCTTTATCCCGAACTGGCGGTGCGCATGTCGATGGGTAACG
ATCTGATCGACTTCAACGCCTCGAATGTCGATGTCGCGATCCGCATGGGGCGCGGTCAATGGCCGGGGCTGCGCAGCGAC
TTTCTGATGCGCCAGCATGTTGCGCCGCTTGCTTCGCCCGCCTTTGTCGAAAAACATCGCATTCGCGAGCCCGCCGACCT
GCTTTGCGTCGAGCGGCTCGCGCCGAACGACAGCTGGTGGGCCGACTGGTTTGCCGCCGCCGGGGTCGCGACGCCGCTCG
CGCCGTCGCGGCGCGGCATCGAACTCGACAGCCAGCTGCAGGAAGCGAGCGCGGTGCAGGCGGGGTTCGGCGTCGCGATG
ATGACCCCGCTTTTCTGGCAGGCCGAGATCGCCGCGGGGCGGATGGTCCAGCCGTTCGATACGCTCCATATATCGGAGTC
GGCGATGTGGCTCGTCCACCGCGAGAACCGGGTCGGCGTGCGCAAGATCGAACGCTTCCGCGAATGGCTGCATGCCGAAC
TCGCCAAGGATCGCCACCGGTACCCCGATTTGCTGTGGCAGCCGCCGACTTGA

Upstream 100 bases:

>100_bases
TCCATCATCGTCTTTCTCCTGTCGGTTCGATGCGTGGCAGATGCGCCTTTCTGCGCGCGACCGCCAACTAAAGCTTTGCC
ATCAATCATAAGGTCAGCTT

Downstream 100 bases:

>100_bases
TCTTGGGCGATGATGTATATACATTATGTATATACAGGAGCGAGCGATGACCAAGCGAAATCAGGCAAATGAGGAAAGGC
GGCTGTTCGCTTTGAACCAC

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 310; Mature: 310

Protein sequence:

>310_residues
MIDGMNRIPPLAAVRSFEAAGRLQNFSRAAEELGMTQAAISYQIRQLEDRLGRALFVREKGRVRLSETGQRLLPAISNAF
ATMSDAFAALGSDEADVLTINAVTSFGGTWLSARIGGFQLLYPELAVRMSMGNDLIDFNASNVDVAIRMGRGQWPGLRSD
FLMRQHVAPLASPAFVEKHRIREPADLLCVERLAPNDSWWADWFAAAGVATPLAPSRRGIELDSQLQEASAVQAGFGVAM
MTPLFWQAEIAAGRMVQPFDTLHISESAMWLVHRENRVGVRKIERFREWLHAELAKDRHRYPDLLWQPPT
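
Note: the 933-base gene sequence corresponds to 311 codons, that is 310 residues plus one stop codon, which matches the protein length given above. The sketch below translates the first and last codons of the gene sequence with the standard codon table to illustrate the correspondence. It is a minimal example with hypothetical names, and it assumes the gene sequence is already given in coding orientation, as it appears to be in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence in this record.
print(translate("ATGATTGATGGCATGAACCGGATTCCGCCC"))  # MIDGMNRIPP
print(translate("TTGCTGTGGCAGCCGCCGACTTGA"))        # LLWQPPT
print(933 // 3 - 1)                                  # 310
```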

Sequences:

>Translated_310_residues
MIDGMNRIPPLAAVRSFEAAGRLQNFSRAAEELGMTQAAISYQIRQLEDRLGRALFVREKGRVRLSETGQRLLPAISNAF
ATMSDAFAALGSDEADVLTINAVTSFGGTWLSARIGGFQLLYPELAVRMSMGNDLIDFNASNVDVAIRMGRGQWPGLRSD
FLMRQHVAPLASPAFVEKHRIREPADLLCVERLAPNDSWWADWFAAAGVATPLAPSRRGIELDSQLQEASAVQAGFGVAM
MTPLFWQAEIAAGRMVQPFDTLHISESAMWLVHRENRVGVRKIERFREWLHAELAKDRHRYPDLLWQPPT
>Mature_310_residues
MIDGMNRIPPLAAVRSFEAAGRLQNFSRAAEELGMTQAAISYQIRQLEDRLGRALFVREKGRVRLSETGQRLLPAISNAF
ATMSDAFAALGSDEADVLTINAVTSFGGTWLSARIGGFQLLYPELAVRMSMGNDLIDFNASNVDVAIRMGRGQWPGLRSD
FLMRQHVAPLASPAFVEKHRIREPADLLCVERLAPNDSWWADWFAAAGVATPLAPSRRGIELDSQLQEASAVQAGFGVAM
MTPLFWQAEIAAGRMVQPFDTLHISESAMWLVHRENRVGVRKIERFREWLHAELAKDRHRYPDLLWQPPT

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=302, Percent_Identity=35.0993377483444, Blast_Score=164, Evalue=5e-42
Organism=Escherichia coli, GI1786448, Length=289, Percent_Identity=30.4498269896194, Blast_Score=127, Evalue=1e-30
Organism=Escherichia coli, GI1788706, Length=287, Percent_Identity=28.9198606271777, Blast_Score=127, Evalue=1e-30
Organism=Escherichia coli, GI1786401, Length=263, Percent_Identity=25.8555133079848, Blast_Score=73, Evalue=3e-14
Organism=Escherichia coli, GI87081978, Length=178, Percent_Identity=29.2134831460674, Blast_Score=72, Evalue=5e-14
Organism=Escherichia coli, GI145693193, Length=297, Percent_Identity=21.8855218855219, Blast_Score=70, Evalue=2e-13
Organism=Escherichia coli, GI1787128, Length=252, Percent_Identity=26.984126984127, Blast_Score=67, Evalue=2e-12
Organism=Escherichia coli, GI48994962, Length=69, Percent_Identity=42.0289855072464, Blast_Score=65, Evalue=6e-12
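
Note: each homologue line is a comma-separated list of key=value fields, plus a bare GI identifier with no key. The sketch below shows one possible way to parse such a line into a dictionary with typed values. The function name and the choice to store the GI number under a "GI" key are illustrative assumptions, not part of the record format.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed values."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier, e.g. GI1789173
    casts = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": int,
        "Evalue": float,
    }
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


best_hit = parse_homologue(
    "Organism=Escherichia coli, GI1789173, Length=302, "
    "Percent_Identity=35.0993377483444, Blast_Score=164, Evalue=5e-42,"
)
print(best_hit["GI"], best_hit["Length"], best_hit["Blast_Score"])  # 1789173 302 164
```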

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 34598; Mature: 34598

Theoretical pI: Translated: 7.17; Mature: 7.17

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
3.9 %Met     (Mature Protein)
4.2 %Cys+Met (Mature Protein)
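
Note: the percentages above follow from residue counts in the 310-residue protein sequence (1 Cys and 12 Met). The sketch below recomputes them. The function name is hypothetical, and rounding to one decimal place is an assumption chosen to match the figures shown.

```python
# Protein sequence copied from this record (310 residues).
PROTEIN = (
    "MIDGMNRIPPLAAVRSFEAAGRLQNFSRAAEELGMTQAAISYQIRQLEDRLGRALFVREKGRVRLSETGQRLLPAISNAF"
    "ATMSDAFAALGSDEADVLTINAVTSFGGTWLSARIGGFQLLYPELAVRMSMGNDLIDFNASNVDVAIRMGRGQWPGLRSD"
    "FLMRQHVAPLASPAFVEKHRIREPADLLCVERLAPNDSWWADWFAAAGVATPLAPSRRGIELDSQLQEASAVQAGFGVAM"
    "MTPLFWQAEIAAGRMVQPFDTLHISESAMWLVHRENRVGVRKIERFREWLHAELAKDRHRYPDLLWQPPT"
)


def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met, each rounded to one decimal."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


print(len(PROTEIN))              # 310
print(cys_met_content(PROTEIN))  # {'Cys': 0.3, 'Met': 3.9, 'Cys+Met': 4.2}
```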

Secondary structure:

>Translated Secondary Structure
MIDGMNRIPPLAAVRSFEAAGRLQNFSRAAEELGMTQAAISYQIRQLEDRLGRALFVREK
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCEEEEECC
GRVRLSETGQRLLPAISNAFATMSDAFAALGSDEADVLTINAVTSFGGTWLSARIGGFQL
CCEEHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEHHHCCCHHHHHHHCCEEE
LYPELAVRMSMGNDLIDFNASNVDVAIRMGRGQWPGLRSDFLMRQHVAPLASPAFVEKHR
EHHHHHHHHHCCCCEEECCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHCCCHHHHHHHH
IREPADLLCVERLAPNDSWWADWFAAAGVATPLAPSRRGIELDSQLQEASAVQAGFGVAM
CCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHH
MTPLFWQAEIAAGRMVQPFDTLHISESAMWLVHRENRVGVRKIERFREWLHAELAKDRHR
HHHHHHHHHHHCCCCCCCHHHEEECCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHC
YPDLLWQPPT
CCCCCCCCCC
>Mature Secondary Structure
MIDGMNRIPPLAAVRSFEAAGRLQNFSRAAEELGMTQAAISYQIRQLEDRLGRALFVREK
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCEEEEECC
GRVRLSETGQRLLPAISNAFATMSDAFAALGSDEADVLTINAVTSFGGTWLSARIGGFQL
CCEEHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEHHHCCCHHHHHHHCCEEE
LYPELAVRMSMGNDLIDFNASNVDVAIRMGRGQWPGLRSDFLMRQHVAPLASPAFVEKHR
EHHHHHHHHHCCCCEEECCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHCCCHHHHHHHH
IREPADLLCVERLAPNDSWWADWFAAAGVATPLAPSRRGIELDSQLQEASAVQAGFGVAM
CCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHH
MTPLFWQAEIAAGRMVQPFDTLHISESAMWLVHRENRVGVRKIERFREWLHAELAKDRHR
HHHHHHHHHHHCCCCCCCHHHEEECCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHC
YPDLLWQPPT
CCCCCCCCCC
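
Note: in the secondary structure listing, each sequence line is followed by a line of per-residue states. The sketch below collapses such a state string into contiguous segments with 1-based residue positions. It assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out. The function name is hypothetical, and the input shown is a toy string rather than data from the record.

```python
from itertools import groupby


def segments(states: str) -> list:
    """Collapse a per-residue state string into (state, start, end) runs.

    Positions are 1-based and inclusive.
    """
    runs = []
    position = 1
    for state, run in groupby(states):
        length = len(list(run))
        runs.append((state, position, position + length - 1))
        position += length
    return runs


# Toy state string for illustration only.
print(segments("CCHHHE"))  # [('C', 1, 2), ('H', 3, 5), ('E', 6, 6)]
```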

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]