Definition Shewanella halifaxensis HAW-EB4 chromosome, complete genome.
Accession NC_010334
Length 5,226,917

Click here to switch to the map view.

The map label for this gene is gcvA [H]

Identifier: 167622353

GI number: 167622353

Start: 484396

End: 485298

Strand: Direct

Name: gcvA [H]

Synonym: Shal_0413

Alternate gene names: 167622353

Gene position: 484396-485298 (Clockwise)

Preceding gene: 167622352

Following gene: 167622354

Centisome position: 9.27

GC content: 45.74

Gene sequence:

>903_bases
ATGAATACAACTCAAGTTAGTCCAAGTGCCCGCTTACCGCAGTTATCTTGGTTTATGGCATTCAAAGCGGTAGCTGATAA
ACAAAGTTTTACCGAGGCCAGCCGTGAGCTATGTCTGACTCAATCCGCCATCAGTCAACAAGTGGCTAAGCTTGAAGCTG
TGCTGCGTACCAAGCTGTTCTATCGTGGTGGGCGTCAAATTCAACTGACCGAAGATGGTCAGCGATTACTGTTACAGATC
AATCAACCGATTCAAGAGTTAATGGGTGTCGTTGATGGCTTTCATCATGATGCCGAGCATTACACGCTACATATTGAGAT
GGAGCCAGTGTTTAGCCGCCAAGTGATTAGCAAATTATTACCCAAGTTTTTGGCTAAATACCCTAAGCTATTGGTCGGGC
AGATGCTCACTACCAATCATTTCGATTTTTTACCACAAACTGAGTTGGCGATAAAGTGGGGCGACGGTGAGTGGGAAGGT
TTTGACTCGCAGTTTTTGTGCAGCTTGGATTATGTGCCTGTTTGCTCTCCTGAGTACCTAAAACGCAAACCTTTAACTGT
GCCAGCAGACTTAGCCAATGCCAACATTATCCATGACCGAGACAACTTCGATTGGAAGTATTGGCTTAATTATTATCCTG
TCGCAAAACTAGAGCTGCAAAAGTGCCATTACGCCAGTGAAAGCCAAGTGGTGATGTCGCTAGCCATGAGTGGCTTAGGC
GTTGCCGTATGTGCTTATCAGCAGATCCAGGAGCAGTTAGCCGATGGGAGTTTGGTGATGCCGTTTCCAGAGTTAAAGGT
GCGCCATCAGCGCGCATATTATATTTTAACCCGTAAGAATAAGTCGTTATCGCCACAAGCCACTAGCTTTATGCAATTTG
TCAGAGAAGCCATGCTGACTTAG

Upstream 100 bases:

>100_bases
AAGCCCGGCTGCTAAATGTTTAACCATTAGATTTAGTCATTAATAATGCAATGCCCTTGCTACACATAGTTGTAGCCGGG
GCTTATTATCGCAGGATGTT

Downstream 100 bases:

>100_bases
ATGCTACGAGGCTAGTGTTATAGTCATTATGTAAGACGTTACCTTGCTTGTTAATCTATTAACATCATATGCATCAACTA
TTTGAGCCATCAGTAAACGA

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 300; Mature: 300

Protein sequence:

>300_residues
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLFYRGGRQIQLTEDGQRLLLQI
NQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLLPKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEG
FDSQFLCSLDYVPVCSPEYLKRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT

Sequences:

>Translated_300_residues
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLFYRGGRQIQLTEDGQRLLLQI
NQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLLPKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEG
FDSQFLCSLDYVPVCSPEYLKRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
>Mature_300_residues
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLFYRGGRQIQLTEDGQRLLLQI
NQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLLPKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEG
FDSQFLCSLDYVPVCSPEYLKRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Bind to three sites upstream of the gcv promoter [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=288, Percent_Identity=30.9027777777778, Blast_Score=133, Evalue=2e-32,
Organism=Escherichia coli, GI1788706, Length=296, Percent_Identity=28.0405405405405, Blast_Score=117, Evalue=7e-28,
Organism=Escherichia coli, GI1786448, Length=250, Percent_Identity=28.4, Blast_Score=84, Evalue=1e-17,
Organism=Escherichia coli, GI1786401, Length=301, Percent_Identity=23.9202657807309, Blast_Score=70, Evalue=1e-13,
Organism=Escherichia coli, GI157672245, Length=112, Percent_Identity=36.6071428571429, Blast_Score=66, Evalue=2e-12,
Organism=Escherichia coli, GI1787128, Length=270, Percent_Identity=28.1481481481481, Blast_Score=65, Evalue=6e-12,
Organism=Escherichia coli, GI87081978, Length=268, Percent_Identity=24.6268656716418, Blast_Score=64, Evalue=1e-11,
Organism=Escherichia coli, GI1789639, Length=289, Percent_Identity=24.5674740484429, Blast_Score=62, Evalue=6e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 34404; Mature: 34404

Theoretical pI: Translated: 7.56; Mature: 7.56

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
3.3 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YRGGRQIQLTEDGQRLLLQINQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLL
HCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCHHHHHHHHHHH
PKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEGFDSQFLCSLDYVPVCSPEYL
HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCEEEEECCCCCCCHHHH
KRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
CCCCCCCCCCCCCCCEEECCCCCCEEEEECCCCCHHEEHHHHCCCCHHHHHHHHHHHCCH
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
HHHHHHHHHHHHHCCCCEECCCHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YRGGRQIQLTEDGQRLLLQINQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLL
HCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCHHHHHHHHHHH
PKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEGFDSQFLCSLDYVPVCSPEYL
HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCEEEEECCCCCCCHHHH
KRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
CCCCCCCCCCCCCCCEEECCCCCCEEEEECCCCCHHEEHHHHCCCCHHHHHHHHHHHCCH
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
HHHHHHHHHHHHHCCCCEECCCHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]