| Definition | Shewanella halifaxensis HAW-EB4 chromosome, complete genome. |
|---|---|
| Accession | NC_010334 |
| Length | 5,226,917 |
The map label for this gene is gcvA [H]
Identifier: 167622353
GI number: 167622353
Start: 484396
End: 485298
Strand: Direct
Name: gcvA [H]
Synonym: Shal_0413
Alternate gene names: 167622353
Gene position: 484396-485298 (Clockwise)
Preceding gene: 167622352
Following gene: 167622354
Centisome position: 9.27
GC content: 45.74
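The position fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state either definition, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_226_917
GENE_START = 484_396
GENE_END = 485_298


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    return round(100 * (seq.count("G") + seq.count("C")) / len(seq), 2)


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))    # 903, matching the gene sequence header
    print(centisome(GENE_START, GENOME_LENGTH))  # 9.27, matching the centisome field
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.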
Gene sequence:
>903_bases
ATGAATACAACTCAAGTTAGTCCAAGTGCCCGCTTACCGCAGTTATCTTGGTTTATGGCATTCAAAGCGGTAGCTGATAA
ACAAAGTTTTACCGAGGCCAGCCGTGAGCTATGTCTGACTCAATCCGCCATCAGTCAACAAGTGGCTAAGCTTGAAGCTG
TGCTGCGTACCAAGCTGTTCTATCGTGGTGGGCGTCAAATTCAACTGACCGAAGATGGTCAGCGATTACTGTTACAGATC
AATCAACCGATTCAAGAGTTAATGGGTGTCGTTGATGGCTTTCATCATGATGCCGAGCATTACACGCTACATATTGAGAT
GGAGCCAGTGTTTAGCCGCCAAGTGATTAGCAAATTATTACCCAAGTTTTTGGCTAAATACCCTAAGCTATTGGTCGGGC
AGATGCTCACTACCAATCATTTCGATTTTTTACCACAAACTGAGTTGGCGATAAAGTGGGGCGACGGTGAGTGGGAAGGT
TTTGACTCGCAGTTTTTGTGCAGCTTGGATTATGTGCCTGTTTGCTCTCCTGAGTACCTAAAACGCAAACCTTTAACTGT
GCCAGCAGACTTAGCCAATGCCAACATTATCCATGACCGAGACAACTTCGATTGGAAGTATTGGCTTAATTATTATCCTG
TCGCAAAACTAGAGCTGCAAAAGTGCCATTACGCCAGTGAAAGCCAAGTGGTGATGTCGCTAGCCATGAGTGGCTTAGGC
GTTGCCGTATGTGCTTATCAGCAGATCCAGGAGCAGTTAGCCGATGGGAGTTTGGTGATGCCGTTTCCAGAGTTAAAGGT
GCGCCATCAGCGCGCATATTATATTTTAACCCGTAAGAATAAGTCGTTATCGCCACAAGCCACTAGCTTTATGCAATTTG
TCAGAGAAGCCATGCTGACTTAG
Upstream 100 bases:
>100_bases
AAGCCCGGCTGCTAAATGTTTAACCATTAGATTTAGTCATTAATAATGCAATGCCCTTGCTACACATAGTTGTAGCCGGG
GCTTATTATCGCAGGATGTT
Downstream 100 bases:
>100_bases
ATGCTACGAGGCTAGTGTTATAGTCATTATGTAAGACGTTACCTTGCTTGTTAATCTATTAACATCATATGCATCAACTA
TTTGAGCCATCAGTAAACGA
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 300; Mature: 300
Protein sequence:
>300_residues
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLFYRGGRQIQLTEDGQRLLLQI
NQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLLPKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEG
FDSQFLCSLDYVPVCSPEYLKRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
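The relationship between the 903-base gene sequence and the 300-residue protein can be illustrated by translating the reading frame codon by codon. The sketch below assumes the standard codon assignments (the bacterial table uses the same amino acid assignments) and translates only the first 27 bases and the last 9 bases of the gene sequence; the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 bases and last 9 bases of the gene sequence in this record.
    print(translate("ATGAATACAACTCAAGTTAGTCCAAGT"))  # MNTTQVSPS
    print(translate("CTGACTTAG"))                    # LT
    # 903 bases = 301 codons; the final TAG stop codon is not translated.
    print(903 // 3 - 1)                              # 300
```

The two fragments reproduce the start (MNTTQVSPS) and end (LT) of the protein sequence listed above.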
Sequences:
>Translated_300_residues
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLFYRGGRQIQLTEDGQRLLLQI
NQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLLPKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEG
FDSQFLCSLDYVPVCSPEYLKRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
>Mature_300_residues
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLFYRGGRQIQLTEDGQRLLLQI
NQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLLPKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEG
FDSQFLCSLDYVPVCSPEYLKRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=288, Percent_Identity=30.9027777777778, Blast_Score=133, Evalue=2e-32
Organism=Escherichia coli, GI1788706, Length=296, Percent_Identity=28.0405405405405, Blast_Score=117, Evalue=7e-28
Organism=Escherichia coli, GI1786448, Length=250, Percent_Identity=28.4, Blast_Score=84, Evalue=1e-17
Organism=Escherichia coli, GI1786401, Length=301, Percent_Identity=23.9202657807309, Blast_Score=70, Evalue=1e-13
Organism=Escherichia coli, GI157672245, Length=112, Percent_Identity=36.6071428571429, Blast_Score=66, Evalue=2e-12
Organism=Escherichia coli, GI1787128, Length=270, Percent_Identity=28.1481481481481, Blast_Score=65, Evalue=6e-12
Organism=Escherichia coli, GI87081978, Length=268, Percent_Identity=24.6268656716418, Blast_Score=64, Evalue=1e-11
Organism=Escherichia coli, GI1789639, Length=289, Percent_Identity=24.5674740484429, Blast_Score=62, Evalue=6e-11
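Each homologue entry is a comma-separated run of fields (Organism, a GI identifier, Length, Percent_Identity, Blast_Score, Evalue). The following is a minimal parsing sketch for one such entry; the function name and the output keys are assumptions made for illustration and are not part of the database.

```python
def parse_homologue(record: str) -> dict:
    """Parse one homologue entry of the form used in this record."""
    fields = {}
    for token in record.strip().strip(",").split(","):
        token = token.strip()
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            # The GI identifier is written without '=' (e.g. GI1789173).
            fields["GI"] = token[2:]
    return {
        "organism": fields["Organism"],
        "gi": fields["GI"],
        "length": int(fields["Length"]),
        "percent_identity": float(fields["Percent_Identity"]),
        "blast_score": int(fields["Blast_Score"]),
        "evalue": float(fields["Evalue"]),
    }


if __name__ == "__main__":
    best_hit = parse_homologue(
        "Organism=Escherichia coli, GI1789173, Length=288, "
        "Percent_Identity=30.9027777777778, Blast_Score=133, Evalue=2e-32,"
    )
    # Round the identity for display; the stored value keeps full precision.
    print(best_hit["gi"], round(best_hit["percent_identity"], 1), best_hit["evalue"])
```

Parsing into typed values makes it straightforward to sort or filter the hits, for example by E-value.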
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847
- InterPro: IPR005119
- InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 34404; Mature: 34404
Theoretical pI: Translated: 7.56; Mature: 7.56
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.7 %Cys (Translated Protein)
3.3 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.7 %Cys (Mature Protein)
3.3 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
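The Cys/Met percentages can be reproduced by counting residues in the 300-residue protein sequence from this record. This is a sketch that assumes the values are simple residue counts divided by sequence length and rounded to one decimal place; the helper name `residue_percent` is illustrative.

```python
# Protein sequence copied from this record (300 residues, shown in 60-residue rows).
PROTEIN = (
    "MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLF"
    "YRGGRQIQLTEDGQRLLLQINQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLL"
    "PKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEGFDSQFLCSLDYVPVCSPEYL"
    "KRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG"
    "VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any residue in `residues`."""
    count = sum(sequence.count(r) for r in residues)
    return round(100 * count / len(sequence), 1)


if __name__ == "__main__":
    print(len(PROTEIN))                    # 300
    print(residue_percent(PROTEIN, "C"))   # 1.7  (5 Cys)
    print(residue_percent(PROTEIN, "M"))   # 3.3  (10 Met)
    print(residue_percent(PROTEIN, "CM"))  # 5.0
```

Because the translated and mature sequences are identical in this record, both sets of percentages agree.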
Secondary structure:
>Translated Secondary Structure
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YRGGRQIQLTEDGQRLLLQINQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLL
HCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCHHHHHHHHHHH
PKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEGFDSQFLCSLDYVPVCSPEYL
HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCEEEEECCCCCCCHHHH
KRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
CCCCCCCCCCCCCCCEEECCCCCCEEEEECCCCCHHEEHHHHCCCCHHHHHHHHHHHCCH
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
HHHHHHHHHHHHHCCCCEECCCHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MNTTQVSPSARLPQLSWFMAFKAVADKQSFTEASRELCLTQSAISQQVAKLEAVLRTKLF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YRGGRQIQLTEDGQRLLLQINQPIQELMGVVDGFHHDAEHYTLHIEMEPVFSRQVISKLL
HCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCHHHHHHHHHHH
PKFLAKYPKLLVGQMLTTNHFDFLPQTELAIKWGDGEWEGFDSQFLCSLDYVPVCSPEYL
HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCEEEEECCCCCCCHHHH
KRKPLTVPADLANANIIHDRDNFDWKYWLNYYPVAKLELQKCHYASESQVVMSLAMSGLG
CCCCCCCCCCCCCCCEEECCCCCCEEEEECCCCCHHEEHHHHCCCCHHHHHHHHHHHCCH
VAVCAYQQIQEQLADGSLVMPFPELKVRHQRAYYILTRKNKSLSPQATSFMQFVREAMLT
HHHHHHHHHHHHHCCCCEECCCHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]