Definition: Mesorhizobium loti MAFF303099 chromosome, complete genome.
Accession: NC_002678
Length: 7,036,071

The map label for this gene is gcvA [H]

Identifier: 13473584

GI number: 13473584

Start: 3378581

End: 3379513

Strand: Reverse

Name: gcvA [H]

Synonym: mll4232

Alternate gene names: 13473584

Gene position: 3379513-3378581 (Counterclockwise)

Preceding gene: 13473585

Following gene: 13473577

Centisome position: 48.03
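
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes that the centisome position is the gene's 5' coordinate (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length; the record does not state this definition, but it is consistent with the listed value. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 7_036_071
START, END = 3_378_581, 3_379_513  # inclusive coordinates, reverse strand


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive coordinates."""
    return end - start + 1


def centisome(coordinate: int, genome_length: int) -> float:
    """Position of a coordinate as a percentage of the genome length."""
    return 100.0 * coordinate / genome_length


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the 5' end is the higher coordinate.
five_prime = END

print(length)                                             # 933
print(length // 3 - 1)                                    # 310 codons before the stop
print(round(centisome(five_prime, GENOME_LENGTH), 2))     # 48.03
```

Using the lower coordinate instead gives 48.02, which is why the higher coordinate is assumed here.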

GC content: 65.7

Gene sequence:

>933_bases
ATGCCCCGCCTCCTGCCCGGAACGCGCGCGCTGAGGACCTTCGAGGCGGCGGCGCGTCACCTCAATTTCACCCGCGCCGC
CGACGAGCTTGGGCTGACGCCGGCCGCGGTCAGCCACCAGGTCAAGGAGATCGAAGATCAGCTCGACCTGGTGCTGTTCA
CGCGCACCAGCCGCACCATGCGGCTGACGGAAGCGGGAAACGTGCTTCACGAGGCGTCGATCGAAGCGCTCGACCTGCTC
AACCGGGCGGTGTCGCGCGCCCGCAAGATGACGCGCGGCACGGCGCTCCTGAAAGTGACGCTCGACGCGCAGTTCGCGAC
GAAATGGCTGATGCGGCGCATCGACGATTTCCGTCGTCAGCGGCCAGGCATCGAGTTGCGTTTCGACATTACCTACGATG
TCCGGGATTTCGAGCGCGACGACGTCGATATCGGCATCCGGTTCGGCACCGGCAGATATGCCGGCCTTTGCGCGCACCGG
CTGTTCGACAACATCATCATCCCGGTGTGCAGCCCGGCGCTGCTGGCTTCAGGGCCGCCGCTCAACGAACCGCGCGATCT
CTTCCGGCACACGCTCGCGCATATCGACTGGTCGCGGCAAGGCGTCACCTGGCCGAATTGGAGCATGTGGATGCAGGCGG
CCGGTGTCGACGATTTCGACGACAGCCGCACCCTCGTCTTCGGCTCCTCGACGGATGCCACGCAGGCGGCCCTCGACGGC
AATGCCGTGGCTCTGGCCGACTTCGCGATGGTGGCCAACGATTTGTCGCAAGGGCGCCTCGTGCGCCCCTTCGAACTCGG
CATCAAGGTCGCGCCGGAGTTCGCCTATTTCCTGGTCTATCCGGAAACCGCGAAGGACGACGCCCGCATCACGGCGTTTC
GTGAGTGGCTGCTGGAGGAAGCGGCAAAGACGCACGGCACGGACAAGGTGTAG
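
A GC content figure such as the one listed above is normally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, not necessarily the exact procedure used to produce this record; the function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the unambiguous bases of a DNA sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 12 bases of the gene sequence above: 9 of the 12 are G or C.
print(gc_content("ATGCCCCGCCTC"))  # 75.0
```

To reproduce the whole-gene figure, join the FASTA lines of the 933-base sequence into one string and pass it to the function.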

Upstream 100 bases:

>100_bases
CCGGTTTGAAGGTATCTCGATGTCGAGATCATGCCGGATTGAAAAACAAACGGGAAACGAGTAGTTCTTTTCAGATCATC
AAGTTTTATTTGGAGATCAG

Downstream 100 bases:

>100_bases
GCTTCTGCCGGGCCGAAGATCAACGCGCCGCCTTAGCTTGCCGCGGCGCCGAACCAGCCGATCACCGCGCCGATGATCAG
CAAGACACCCAGCCAATGGC

Product: transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 310; Mature: 309

Protein sequence:

>310_residues
MPRLLPGTRALRTFEAAARHLNFTRAADELGLTPAAVSHQVKEIEDQLDLVLFTRTSRTMRLTEAGNVLHEASIEALDLL
NRAVSRARKMTRGTALLKVTLDAQFATKWLMRRIDDFRRQRPGIELRFDITYDVRDFERDDVDIGIRFGTGRYAGLCAHR
LFDNIIIPVCSPALLASGPPLNEPRDLFRHTLAHIDWSRQGVTWPNWSMWMQAAGVDDFDDSRTLVFGSSTDATQAALDG
NAVALADFAMVANDLSQGRLVRPFELGIKVAPEFAYFLVYPETAKDDARITAFREWLLEEAAKTHGTDKV
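
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments (which the bacterial genetic code shares for internal codons) and stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Standard codon table, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence until the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last four codons of the gene sequence above.
print(translate("ATGCCCCGCCTCCTG"))  # MPRLL
print(translate("GACAAGGTGTAG"))     # DKV
```

The two fragments match the start (MPRLL) and end (DKV) of the protein sequence listed above.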

Sequences:

>Translated_310_residues
MPRLLPGTRALRTFEAAARHLNFTRAADELGLTPAAVSHQVKEIEDQLDLVLFTRTSRTMRLTEAGNVLHEASIEALDLL
NRAVSRARKMTRGTALLKVTLDAQFATKWLMRRIDDFRRQRPGIELRFDITYDVRDFERDDVDIGIRFGTGRYAGLCAHR
LFDNIIIPVCSPALLASGPPLNEPRDLFRHTLAHIDWSRQGVTWPNWSMWMQAAGVDDFDDSRTLVFGSSTDATQAALDG
NAVALADFAMVANDLSQGRLVRPFELGIKVAPEFAYFLVYPETAKDDARITAFREWLLEEAAKTHGTDKV
>Mature_309_residues
PRLLPGTRALRTFEAAARHLNFTRAADELGLTPAAVSHQVKEIEDQLDLVLFTRTSRTMRLTEAGNVLHEASIEALDLLN
RAVSRARKMTRGTALLKVTLDAQFATKWLMRRIDDFRRQRPGIELRFDITYDVRDFERDDVDIGIRFGTGRYAGLCAHRL
FDNIIIPVCSPALLASGPPLNEPRDLFRHTLAHIDWSRQGVTWPNWSMWMQAAGVDDFDDSRTLVFGSSTDATQAALDGN
AVALADFAMVANDLSQGRLVRPFELGIKVAPEFAYFLVYPETAKDDARITAFREWLLEEAAKTHGTDKV

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=302, Percent_Identity=39.0728476821192, Blast_Score=196, Evalue=1e-51,
Organism=Escherichia coli, GI1786448, Length=290, Percent_Identity=31.0344827586207, Blast_Score=134, Evalue=9e-33,
Organism=Escherichia coli, GI1788706, Length=307, Percent_Identity=32.2475570032573, Blast_Score=131, Evalue=6e-32,
Organism=Escherichia coli, GI145693193, Length=289, Percent_Identity=26.2975778546713, Blast_Score=81, Evalue=1e-16,
Organism=Escherichia coli, GI87081978, Length=273, Percent_Identity=26.3736263736264, Blast_Score=78, Evalue=6e-16,
Organism=Escherichia coli, GI1786401, Length=304, Percent_Identity=25.9868421052632, Blast_Score=77, Evalue=2e-15,
Organism=Escherichia coli, GI157672245, Length=148, Percent_Identity=33.1081081081081, Blast_Score=75, Evalue=5e-15,
Organism=Escherichia coli, GI1787128, Length=290, Percent_Identity=25.8620689655172, Blast_Score=72, Evalue=6e-14,
Organism=Escherichia coli, GI1789440, Length=271, Percent_Identity=24.7232472324723, Blast_Score=70, Evalue=2e-13,
Organism=Escherichia coli, GI2367136, Length=203, Percent_Identity=31.0344827586207, Blast_Score=69, Evalue=5e-13,
Organism=Escherichia coli, GI1787589, Length=275, Percent_Identity=24.7272727272727, Blast_Score=69, Evalue=6e-13,
Organism=Escherichia coli, GI1786508, Length=121, Percent_Identity=34.7107438016529, Blast_Score=65, Evalue=5e-12,
Organism=Escherichia coli, GI1790262, Length=215, Percent_Identity=24.1860465116279, Blast_Score=63, Evalue=2e-11,
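
The homologue lines above follow a regular comma-separated layout, with `key=value` fields plus a bare `GI` identifier and a trailing comma. A minimal parsing sketch is shown below; the function name `parse_homologue` and the choice to return a plain dictionary are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of typed fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number appears without a key, e.g. "GI1789173".
            record["GI"] = field[2:]
    # Convert the numeric fields where present.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1789173, Length=302, "
        "Percent_Identity=39.0728476821192, Blast_Score=196, Evalue=1e-51,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Parsed this way, the hits can be sorted or filtered, for example by `Evalue` or `Blast_Score`.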

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 34873; Mature: 34742
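
A molecular weight of this kind is usually estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes commonly tabulated average residue masses (in daltons); these values are not taken from the record, and the tool that produced the figures above may use slightly different ones. The difference between the translated and mature values (131) matches the mass of one methionine residue.

```python
# Commonly tabulated average residue masses in daltons (assumed values).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water per chain for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Removing an N-terminal Met lowers the estimate by one Met residue mass.
with_met = molecular_weight("MPRLL")
without_met = molecular_weight("PRLL")
print(round(with_met - without_met, 2))  # 131.19
```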

Theoretical pI: Translated: 6.41; Mature: 6.41

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
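
The percentages above follow from simple residue counts. The sketch below recomputes them from the protein sequence given in this record, taking the mature protein to be the translated sequence without its initiator methionine (as the Translated and Mature sequences above indicate); the function name is illustrative.

```python
# Translated protein sequence, copied from the record above.
TRANSLATED = (
    "MPRLLPGTRALRTFEAAARHLNFTRAADELGLTPAAVSHQVKEIEDQLDLVLFTRTSRTMRLTEAGNVLHEASIEALDLL"
    "NRAVSRARKMTRGTALLKVTLDAQFATKWLMRRIDDFRRQRPGIELRFDITYDVRDFERDDVDIGIRFGTGRYAGLCAHR"
    "LFDNIIIPVCSPALLASGPPLNEPRDLFRHTLAHIDWSRQGVTWPNWSMWMQAAGVDDFDDSRTLVFGSSTDATQAALDG"
    "NAVALADFAMVANDLSQGRLVRPFELGIKVAPEFAYFLVYPETAKDDARITAFREWLLEEAAKTHGTDKV"
)
MATURE = TRANSLATED[1:]  # initiator Met removed


def cys_met_percent(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (round(100 * cys / n, 1),
            round(100 * met / n, 1),
            round(100 * (cys + met) / n, 1))


print(cys_met_percent(TRANSLATED))  # (0.6, 2.3, 2.9)
print(cys_met_percent(MATURE))      # (0.6, 1.9, 2.6)
```

The translated sequence contains 2 Cys and 7 Met residues; the mature form loses one Met, which accounts for the lower Met and Cys+Met percentages.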

Secondary structure:

>Translated Secondary Structure
MPRLLPGTRALRTFEAAARHLNFTRAADELGLTPAAVSHQVKEIEDQLDLVLFTRTSRTM
CCCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHHHCEEEEEECCCEE
RLTEAGNVLHEASIEALDLLNRAVSRARKMTRGTALLKVTLDAQFATKWLMRRIDDFRRQ
EEHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHHHHHH
RPGIELRFDITYDVRDFERDDVDIGIRFGTGRYAGLCAHRLFDNIIIPVCSPALLASGPP
CCCCEEEEEEEECCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHCCHHCCCHHHHCCCCC
LNEPRDLFRHTLAHIDWSRQGVTWPNWSMWMQAAGVDDFDDSRTLVFGSSTDATQAALDG
CCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHEECCCCCCCCCCEEEEECCCCCHHHHCCC
NAVALADFAMVANDLSQGRLVRPFELGIKVAPEFAYFLVYPETAKDDARITAFREWLLEE
CEEHHHHHHHHHHHHCCCCEECHHHCCEEECCCEEEEEEECCCCCCCHHHHHHHHHHHHH
AAKTHGTDKV
HHHHCCCCCC
>Mature Secondary Structure 
PRLLPGTRALRTFEAAARHLNFTRAADELGLTPAAVSHQVKEIEDQLDLVLFTRTSRTM
CCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHHHCEEEEEECCCEE
RLTEAGNVLHEASIEALDLLNRAVSRARKMTRGTALLKVTLDAQFATKWLMRRIDDFRRQ
EEHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHHHHHH
RPGIELRFDITYDVRDFERDDVDIGIRFGTGRYAGLCAHRLFDNIIIPVCSPALLASGPP
CCCCEEEEEEEECCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHCCHHCCCHHHHCCCCC
LNEPRDLFRHTLAHIDWSRQGVTWPNWSMWMQAAGVDDFDDSRTLVFGSSTDATQAALDG
CCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHEECCCCCCCCCCEEEEECCCCCHHHHCCC
NAVALADFAMVANDLSQGRLVRPFELGIKVAPEFAYFLVYPETAKDDARITAFREWLLEE
CEEHHHHHHHHHHHHCCCCEECHHHCCEEECCCEEEEEEECCCCCCCHHHHHHHHHHHHH
AAKTHGTDKV
HHHHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]