Definition: Shewanella amazonensis SB2B chromosome, complete genome.
Accession: NC_008700
Length: 4,306,142

The map label for this gene is gcvA [H]

Identifier: 119776491

GI number: 119776491

Start: 3986603

End: 3987493

Strand: Direct

Name: gcvA [H]

Synonym: Sama_3359

Alternate gene names: 119776491

Gene position: 3986603-3987493 (Clockwise)

Preceding gene: 119776489

Following gene: 119776495

Centisome position: 92.58
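
The centisome position above is consistent with the gene start expressed as a percentage of the chromosome length given at the top of this record. A minimal Python sketch of that arithmetic follows. It assumes the usual definition (start / genome length x 100) and 1-based inclusive coordinates; the function and variable names are illustrative and are not part of the database.

```python
# Values copied from this record.
GENOME_LENGTH = 4_306_142   # Length
GENE_START = 3_986_603      # Start
GENE_END = 3_987_493        # End


def centisome(start: int, genome_length: int) -> float:
    """Position of `start` as a percentage of the chromosome length."""
    return 100.0 * start / genome_length


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 92.58
print(gene_length(GENE_START, GENE_END))               # 891
```

The 891 bases correspond to 297 codons; dropping the stop codon gives the 296 residues reported for the protein.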

GC content: 59.03

Gene sequence:

>891_bases
ATGCGTAAACTCCCTCCGCTTCGTGCACTGCAGGTTTTTGAAGCTGCGGCCCGTCATCTGCACTTTTCCCGGGCCGCTGA
AGAGTTGTGCCTGACCCAAAGCGCCGTTTCACATCAGGTCAGGGGATTGGAGCAACATTTGGGGCAAACCCTGTTTGCAC
GACGGGGCCGCGAACTGGCGCTGACCCCCAAGGGAGAACAGCTTTTTCTGGCGGTGCAGTCGGCGCTGGATGGTCTGGAT
AGCCTTTGTCGCCAGTTGAATGAGGCTGAAAGCCGTGAACTTCGGCTGGCGGTGTACAGCTCCTTCGCCGTGAAGTGGTT
GATACCGAGACTGGGCGACTTTCGCCGTCAGCATCCGGGGATCAAAATTCACCTCGAGATGGTCAGCGGCGACCCGCCGT
TGTCGGATCAGGTGGCGGATATGTTTATCTGCGGTGAGCAGCATCAACGTGGGTTTTGGCAGACACTGCTGAGGCCTGAG
CGGCTTATTCCCGTGTGCAGTCCGGCATTGGCCAATGCCCTTGGCGAAGCCCTGGTCATGCCGCTGCGACTTGACAGCCT
GCCGCTGCTTTCAGTTGACGAAGCAGACATAGGACCAGACTGGGCACGCTGGGCAAAGTCACAGGCGCAAACTCTGACCC
AGGCGCAGTTGCAGAGCTACAGCCATGTACTGCTGGCGATAGAAGCGGCCATTGCCGGTCAGGGCATAGCTCTGGCATCG
GATTTCATTGTGGAAGGCGACATCGCCGCGGGAAAGCTGATGGCGCTACCCTGGCCGGCACTGGAAACCGGGTTTGGTTT
TCACTTCTGTTGCCGGGAGCGGCGTTTGAAAGAGCCTGCAATGGCCGCCTTTGCCGAATGGATCCAGCAACAAGCCGCGA
TGGCCGGATAG

Upstream 100 bases:

>100_bases
TGAGCGCATGATATAGCCCACAACTAACTAAGAAAAATGATGTTTTTTTATCAATCATGAATTAGATTCATGCTCAACCA
TCTCTATTCAGGCTTTTCGG

Downstream 100 bases:

>100_bases
CAGACATAAAAAAATCAGCCTCGTGGGCTGATTTTTTAGCGAAATCGACTGGCGTTATACCAAGAGGCCGATTTTTTCGT
ATACCTGTTTCAGGGTTACT

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 296; Mature: 296

Protein sequence:

>296_residues
MRKLPPLRALQVFEAAARHLHFSRAAEELCLTQSAVSHQVRGLEQHLGQTLFARRGRELALTPKGEQLFLAVQSALDGLD
SLCRQLNEAESRELRLAVYSSFAVKWLIPRLGDFRRQHPGIKIHLEMVSGDPPLSDQVADMFICGEQHQRGFWQTLLRPE
RLIPVCSPALANALGEALVMPLRLDSLPLLSVDEADIGPDWARWAKSQAQTLTQAQLQSYSHVLLAIEAAIAGQGIALAS
DFIVEGDIAAGKLMALPWPALETGFGFHFCCRERRLKEPAMAAFAEWIQQQAAMAG

Sequences:

>Translated_296_residues
MRKLPPLRALQVFEAAARHLHFSRAAEELCLTQSAVSHQVRGLEQHLGQTLFARRGRELALTPKGEQLFLAVQSALDGLD
SLCRQLNEAESRELRLAVYSSFAVKWLIPRLGDFRRQHPGIKIHLEMVSGDPPLSDQVADMFICGEQHQRGFWQTLLRPE
RLIPVCSPALANALGEALVMPLRLDSLPLLSVDEADIGPDWARWAKSQAQTLTQAQLQSYSHVLLAIEAAIAGQGIALAS
DFIVEGDIAAGKLMALPWPALETGFGFHFCCRERRLKEPAMAAFAEWIQQQAAMAG
>Mature_296_residues
MRKLPPLRALQVFEAAARHLHFSRAAEELCLTQSAVSHQVRGLEQHLGQTLFARRGRELALTPKGEQLFLAVQSALDGLD
SLCRQLNEAESRELRLAVYSSFAVKWLIPRLGDFRRQHPGIKIHLEMVSGDPPLSDQVADMFICGEQHQRGFWQTLLRPE
RLIPVCSPALANALGEALVMPLRLDSLPLLSVDEADIGPDWARWAKSQAQTLTQAQLQSYSHVLLAIEAAIAGQGIALAS
DFIVEGDIAAGKLMALPWPALETGFGFHFCCRERRLKEPAMAAFAEWIQQQAAMAG

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=301, Percent_Identity=37.5415282392027, Blast_Score=172, Evalue=3e-44,
Organism=Escherichia coli, GI1786448, Length=266, Percent_Identity=34.5864661654135, Blast_Score=127, Evalue=9e-31,
Organism=Escherichia coli, GI1788706, Length=297, Percent_Identity=31.986531986532, Blast_Score=121, Evalue=7e-29,
Organism=Escherichia coli, GI145693193, Length=141, Percent_Identity=37.5886524822695, Blast_Score=76, Evalue=3e-15,
Organism=Escherichia coli, GI1786401, Length=270, Percent_Identity=24.8148148148148, Blast_Score=70, Evalue=1e-13,
Organism=Escherichia coli, GI157672245, Length=186, Percent_Identity=31.1827956989247, Blast_Score=69, Evalue=5e-13,
Organism=Escherichia coli, GI1790208, Length=304, Percent_Identity=25.9868421052632, Blast_Score=68, Evalue=6e-13,
Organism=Escherichia coli, GI1787879, Length=143, Percent_Identity=35.6643356643357, Blast_Score=68, Evalue=8e-13,
Organism=Escherichia coli, GI1790262, Length=109, Percent_Identity=35.7798165137615, Blast_Score=67, Evalue=1e-12,
Organism=Escherichia coli, GI1787128, Length=137, Percent_Identity=32.1167883211679, Blast_Score=65, Evalue=5e-12,
Organism=Escherichia coli, GI1787589, Length=139, Percent_Identity=32.3741007194245, Blast_Score=65, Evalue=5e-12,
Organism=Escherichia coli, GI1788748, Length=123, Percent_Identity=38.2113821138211, Blast_Score=64, Evalue=1e-11,
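
The homologue lines above share a simple comma-separated layout, so they can be loaded into a structured form for sorting or filtering. The sketch below is one possible parser, not a tool supplied by the database. It assumes every line has the same fields as shown here, that the GI field is the only one without an "=", and that a trailing comma may be present; `parse_hit` is a hypothetical helper name.

```python
def parse_hit(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI field carries no '=' in this record.
            record["GI"] = field[2:]
    # Convert the numeric fields from text.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


# First homologue line, copied from this record.
line = ("Organism=Escherichia coli, GI1789173, Length=301, "
        "Percent_Identity=37.5415282392027, Blast_Score=172, Evalue=3e-44,")
hit = parse_hit(line)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1), hit["Evalue"])
# 1789173 301 37.5 3e-44
```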

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 32743; Mature: 32743

Theoretical pI: Translated: 6.79; Mature: 6.79

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
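
The percentages above can be reproduced by counting residues in the protein sequence given earlier in this record (6 Cys and 7 Met out of 296 residues). A minimal Python sketch, assuming the values are simple residue fractions rounded to one decimal place; `percent_content` is an illustrative name, not a database function.

```python
from collections import Counter

# Protein sequence copied from this record (296 residues).
PROTEIN = (
    "MRKLPPLRALQVFEAAARHLHFSRAAEELCLTQSAVSHQVRGLEQHLGQTLFARRGRELALTPKGEQLFLAVQSALDGLD"
    "SLCRQLNEAESRELRLAVYSSFAVKWLIPRLGDFRRQHPGIKIHLEMVSGDPPLSDQVADMFICGEQHQRGFWQTLLRPE"
    "RLIPVCSPALANALGEALVMPLRLDSLPLLSVDEADIGPDWARWAKSQAQTLTQAQLQSYSHVLLAIEAAIAGQGIALAS"
    "DFIVEGDIAAGKLMALPWPALETGFGFHFCCRERRLKEPAMAAFAEWIQQQAAMAG"
)


def percent_content(sequence: str, residues: str) -> float:
    """Percentage of `sequence` made up of the given one-letter residues."""
    counts = Counter(sequence)
    total = sum(counts[r] for r in residues)
    return round(100.0 * total / len(sequence), 1)


print(len(PROTEIN))                    # 296
print(percent_content(PROTEIN, "C"))   # 2.0
print(percent_content(PROTEIN, "M"))   # 2.4
print(percent_content(PROTEIN, "CM"))  # 4.4
```

Because the translated and mature sequences are identical in this record, the same figures apply to both.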

Secondary structure:

>Translated Secondary Structure
MRKLPPLRALQVFEAAARHLHFSRAAEELCLTQSAVSHQVRGLEQHLGQTLFARRGRELA
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEE
LTPKGEQLFLAVQSALDGLDSLCRQLNEAESRELRLAVYSSFAVKWLIPRLGDFRRQHPG
ECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
IKIHLEMVSGDPPLSDQVADMFICGEQHQRGFWQTLLRPERLIPVCSPALANALGEALVM
EEEEEEEECCCCCCCHHHHHHHHCCCHHHHHHHHHHCCHHHCCCCCCHHHHHHHHHHHHH
PLRLDSLPLLSVDEADIGPDWARWAKSQAQTLTQAQLQSYSHVLLAIEAAIAGQGIALAS
HHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHH
DFIVEGDIAAGKLMALPWPALETGFGFHFCCRERRLKEPAMAAFAEWIQQQAAMAG
HHEEECCCCCCCEEECCCCHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MRKLPPLRALQVFEAAARHLHFSRAAEELCLTQSAVSHQVRGLEQHLGQTLFARRGRELA
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEE
LTPKGEQLFLAVQSALDGLDSLCRQLNEAESRELRLAVYSSFAVKWLIPRLGDFRRQHPG
ECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
IKIHLEMVSGDPPLSDQVADMFICGEQHQRGFWQTLLRPERLIPVCSPALANALGEALVM
EEEEEEEECCCCCCCHHHHHHHHCCCHHHHHHHHHHCCHHHCCCCCCHHHHHHHHHHHHH
PLRLDSLPLLSVDEADIGPDWARWAKSQAQTLTQAQLQSYSHVLLAIEAAIAGQGIALAS
HHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHH
DFIVEGDIAAGKLMALPWPALETGFGFHFCCRERRLKEPAMAAFAEWIQQQAAMAG
HHEEECCCCCCCEEECCCCHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]