Definition: Mesorhizobium loti MAFF303099 chromosome, complete genome.
Accession: NC_002678
Length: 7,036,071 bp

The map label for this gene is gcvA [H]

Identifier: 13471050

GI number: 13471050

Start: 743054

End: 743971

Strand: Direct

Name: gcvA [H]

Synonym: mlr0919

Alternate gene names: 13471050

Gene position: 743054-743971 (Clockwise)

Preceding gene: 13471047

Following gene: 13471052

Centisome position: 10.56

GC content: 64.92
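
The gene length and centisome position can be reproduced from the coordinates listed above. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not define either convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 7_036_071       # Length field for NC_002678
START, END = 743_054, 743_971   # Start and End fields for gcvA

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)

print(gene_length(START, END))           # 918
print(centisome(START, GENOME_LENGTH))   # 10.56
```

Under these assumptions both values agree with the record: 918 bases and a centisome position of 10.56.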

Gene sequence:

>918_bases
ATGAGGCGTGGGCGCCTGCCGTTGACGGCATTGAGAAGTTTCGAGGCGGCGGGCCGGCACCTGAGCTTCAGCAAGGCCGC
CGAGGAGCTTTTCGTTTCCCAGGCGGCGATCAGCCGCCAGATCCGCGAGCTTGAAACTTTCCTGCGCCAGCCTTTGTTCG
AACGTCACCATCGCCGCGTCGAACTGACCGACAGCGGCCGCCGGCTGCTCGATCAGCTTGTCAGAAGTTTCGACGCCATC
GATCGGCTGCTCGGCGAATTGGTGGCAGCCCCGGCGCAGTCCGTGGTTCGTGTCAGCGTCGAGCCGTCGCTTGCCTCCGT
CTGGCTGGTGCCCCGGCTCAACCGGTTTCGCCAGTTGCGGCCCGATATCGACGTGTCCCTCGAAGTCGATGCCAGGCTGA
TCGAATTCCGCGGCGATCAACCCGAGCTTGCCTTGCGCTTCAGCGCCAACGCGACCTCCTGGCCCCGTAGCCAGGCAGAG
CGCCTTGCCTCCACAGTCGATTCACCGGTTCTGTCGCCGGCGCTGCTCGCCTCTGGCCCTCCTCTCGAGAAGCCGATCGA
CCTTGCCCGCTACACACTTTTGCATGAGGAAAATCGCCAGGGCTGGGCGCGCTGGTTCGAAGCGGCCGGCGTGCCCGCCG
ATGCCGTGCCCGCGCGGGGGCCGATGCTGGCGGATATATCGCTTTCAAAGCAGGCCGCCCTGCTCGGACATGGCGTGGCG
CTGGGCGATCTCTTGCAGATCGGCAACGAGCTCGAAACCGGCGCGTTGATCAAGCCGTTCGACATTGACGTCGCGTCCGG
CGCCTACTGGCTGGTGGCCAGGAGCTTGAAGGAGCTTTCCGAGCCGGCCGCCGCGTTCGCCGACTGGGTCAGAAGCGAAT
TTGCCGAAAGCAGACGGACACTGGAGGCAAAGGTCTAA

Upstream 100 bases:

>100_bases
CCTGGATCATAGTCGGTCTCCTTTGACGCGATTAGACCTCCACTGGCCTTGGCCTTCAAACGAGTATAATTTCCACTTCG
GATAATCCTGGGTTATGCGA

Downstream 100 bases:

>100_bases
GCCGGTTCCCGCGCCTGGGCTCCCGCTATTTTGCCTTCTTGTATTTTTCGAGAAGCTTGTCGATGTCGTTTACCCGGTGT
CGCGGGCCGCCGCGAAGTCC

Product: transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 305; Mature: 305

Protein sequence:

>305_residues
MRRGRLPLTALRSFEAAGRHLSFSKAAEELFVSQAAISRQIRELETFLRQPLFERHHRRVELTDSGRRLLDQLVRSFDAI
DRLLGELVAAPAQSVVRVSVEPSLASVWLVPRLNRFRQLRPDIDVSLEVDARLIEFRGDQPELALRFSANATSWPRSQAE
RLASTVDSPVLSPALLASGPPLEKPIDLARYTLLHEENRQGWARWFEAAGVPADAVPARGPMLADISLSKQAALLGHGVA
LGDLLQIGNELETGALIKPFDIDVASGAYWLVARSLKELSEPAAAFADWVRSEFAESRRTLEAKV
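
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough check, the sketch below translates the first 30 and the last 12 bases of the gene with the standard genetic code. It assumes translation starts at the first base and stops at the first stop codon; the `translate` helper is illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence.
print(translate("ATGAGGCGTGGGCGCCTGCCGTTGACGGCA"))  # MRRGRLPLTA
# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GCAAAGGTCTAA"))                    # AKV
```

Both fragments match the start (MRRGRLPLTA) and the end (AKV) of the 305-residue protein; the 918 bases give 306 codons, the last of which is the stop.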

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=285, Percent_Identity=38.2456140350877, Blast_Score=182, Evalue=3e-47
Organism=Escherichia coli, GI1786448, Length=299, Percent_Identity=32.1070234113712, Blast_Score=112, Evalue=3e-26
Organism=Escherichia coli, GI1788706, Length=298, Percent_Identity=30.2013422818792, Blast_Score=105, Evalue=3e-24
Organism=Escherichia coli, GI1787128, Length=315, Percent_Identity=25.7142857142857, Blast_Score=71, Evalue=1e-13
Organism=Escherichia coli, GI87081978, Length=293, Percent_Identity=24.2320819112628, Blast_Score=65, Evalue=5e-12
Organism=Escherichia coli, GI157672245, Length=121, Percent_Identity=34.7107438016529, Blast_Score=65, Evalue=8e-12
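
Each homologue line is a comma-separated list of `key=value` pairs plus a bare GI identifier. The sketch below shows one way to read such a line into a dictionary; the `parse_hit` helper and the `GI` key are hypothetical names introduced here, and the parser assumes that no field value contains a comma.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    fields = {}
    for part in line.strip().rstrip(",").split(", "):
        if "=" in part:
            key, _, value = part.partition("=")
            fields[key] = value
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare identifier, stored without its prefix
    fields["Length"] = int(fields["Length"])
    for key in ("Percent_Identity", "Blast_Score", "Evalue"):
        fields[key] = float(fields[key])
    return fields

hit = parse_hit(
    "Organism=Escherichia coli, GI1789173, Length=285, "
    "Percent_Identity=38.2456140350877, Blast_Score=182, Evalue=3e-47,"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 2), hit["Evalue"])
```

Converting the E-value to a float makes it straightforward to sort or filter the hits by significance.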

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 33836; Mature: 33836

Theoretical pI: Translated: 7.10; Mature: 7.10

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
0.7 %Met     (Translated Protein)
0.7 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
0.7 %Cys+Met (Mature Protein)
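
These percentages follow directly from residue counts in the protein sequence. A minimal sketch, assuming the values are simple counts divided by the sequence length and rounded to one decimal place (the record does not state the rounding rule; `residue_percent` is an illustrative name):

```python
PROTEIN = (
    "MRRGRLPLTALRSFEAAGRHLSFSKAAEELFVSQAAISRQIRELETFLRQPLFERHHRRVELTDSGRRLLDQLVRSFDAI"
    "DRLLGELVAAPAQSVVRVSVEPSLASVWLVPRLNRFRQLRPDIDVSLEVDARLIEFRGDQPELALRFSANATSWPRSQAE"
    "RLASTVDSPVLSPALLASGPPLEKPIDLARYTLLHEENRQGWARWFEAAGVPADAVPARGPMLADISLSKQAALLGHGVA"
    "LGDLLQIGNELETGALIKPFDIDVASGAYWLVARSLKELSEPAAAFADWVRSEFAESRRTLEAKV"
)

def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)

cys = residue_percent(PROTEIN, "C")
met = residue_percent(PROTEIN, "M")
cys_met = residue_percent(PROTEIN, "CM")
print(cys, met, cys_met)  # 0.0 0.7 0.7
```

The sequence contains no cysteine and two methionines, so 2 of 305 residues gives roughly 0.66%, reported as 0.7.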

Secondary structure:

>Translated Secondary Structure
MRRGRLPLTALRSFEAAGRHLSFSKAAEELFVSQAAISRQIRELETFLRQPLFERHHRRV
CCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE
ELTDSGRRLLDQLVRSFDAIDRLLGELVAAPAQSVVRVSVEPSLASVWLVPRLNRFRQLR
EECHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHEEEEECCCHHHHHHHHHHHHHHHCC
PDIDVSLEVDARLIEFRGDQPELALRFSANATSWPRSQAERLASTVDSPVLSPALLASGP
CCCCEEEEECCEEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCHHCCHHHHCCCC
PLEKPIDLARYTLLHEENRQGWARWFEAAGVPADAVPARGPMLADISLSKQAALLGHGVA
CCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCEEEECCCCHHHHHHHCCHH
LGDLLQIGNELETGALIKPFDIDVASGAYWLVARSLKELSEPAAAFADWVRSEFAESRRT
HHHHHHHCCCCCCCCEECCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LEAKV
HHCCC
>Mature Secondary Structure
MRRGRLPLTALRSFEAAGRHLSFSKAAEELFVSQAAISRQIRELETFLRQPLFERHHRRV
CCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE
ELTDSGRRLLDQLVRSFDAIDRLLGELVAAPAQSVVRVSVEPSLASVWLVPRLNRFRQLR
EECHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHEEEEECCCHHHHHHHHHHHHHHHCC
PDIDVSLEVDARLIEFRGDQPELALRFSANATSWPRSQAERLASTVDSPVLSPALLASGP
CCCCEEEEECCEEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCHHCCHHHHCCCC
PLEKPIDLARYTLLHEENRQGWARWFEAAGVPADAVPARGPMLADISLSKQAALLGHGVA
CCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCEEEECCCCHHHHHHHCCHH
LGDLLQIGNELETGALIKPFDIDVASGAYWLVARSLKELSEPAAAFADWVRSEFAESRRT
HHHHHHHCCCCCCCCEECCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LEAKV
HHCCC
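
The prediction strings can be summarised as the fraction of residues in each state. The sketch below assumes the common convention that H denotes helix, E denotes strand, and C denotes coil, which the record does not define; `state_percentages` is an illustrative name. It is applied here only to the first 60-residue line of the prediction.

```python
from collections import Counter

def state_percentages(prediction: str) -> dict:
    """Percentage of positions in each predicted state (one letter per residue)."""
    counts = Counter(prediction)
    total = len(prediction)
    return {state: round(100 * n / total, 1) for state, n in sorted(counts.items())}

# First line of the predicted secondary structure.
first_line = "CCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE"
print(state_percentages(first_line))
```

Concatenating all prediction lines before calling the function would give the composition for the whole protein.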

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]