Definition Burkholderia mallei NCTC 10247 chromosome II, complete genome.
Accession NC_009079
Length 2,352,693


The map label for this gene is gcvA [H]

Identifier: 126446071

GI number: 126446071

Start: 2018811

End: 2019707

Strand: Direct

Name: gcvA [H]

Synonym: BMA10247_A2086

Alternate gene names: 126446071

Gene position: 2018811-2019707 (Clockwise)

Preceding gene: 126446674

Following gene: 126445640

Centisome position: 85.81
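
The coordinates above are consistent with one another: End - Start + 1 gives the 897 bases reported in the sequence header, and the centisome value matches the start coordinate expressed as a percentage of the chromosome length. The record does not state how the centisome position is defined, so the formula below is an assumption, and the function names are hypothetical. A minimal Python sketch:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length.

    Assumption: this is the definition used by the record; it is not
    stated there, but it reproduces the listed value.
    """
    return 100.0 * start / chromosome_length


# Values taken from the Start, End and Length fields of this record.
length = gene_length(2018811, 2019707)
position = centisome(2018811, 2352693)
print(length, round(position, 2))
```
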

GC content: 69.9

Gene sequence:

>897_bases
ATGAAGCATCTGTATCCGAACGTCGCCGAACTGCACGCGTTCGCCAGTTCTGCGAAGCATCTGAACTTTTCGTATGCGGC
GCGCGAGCTCGGCCTCACGCCGAGCGCGGTCAGCCGGCAGATCGCGAGCCTCGAGGCGCTGCTCGGCGTGAAGCTGTTCG
TGCGCGAAGGCCGCAATCTCGCGCTCACGCGCGCGGGGCAGGTCTATCAGGCGCGCGTCGCGGGGCCCCTGCGCGAGATC
GGCAATGCGTCGCTCGAATTGCTGAGCGCGCGCGAGGACAGCAATCTGCTGACCATTGCGAGCGTGCCGACCTTCACGAC
GAAATGGCTCGTGCCGCGCCTGCCGCGCTTTCTCGAAACCGCGCCTGACATCACGCTGAGCTTCCGGCGCCACCTCGCGC
CGGGCGACCTGTTCCCGCTCGGGCTCGATGCGGCGATCCGCTACGGCGACGGCCGCTGGGAAGGCGTCCAGTGCGACTAT
CTCGACGGCCGCACGTTCGTGCCCGTATGCGCGCCCGGTTTCGCCGAGCGTCACGCCCTGCGCGAACCCGCGGACATCGC
CGCCGCGCCGCGCCTCGTGCACGAGCAGGCCGAATGCGCGTGGCTCGCGTGGGCGGACCGGCACCGCGCCACGCAGATGA
ACGCGCTCGCCGGCCCACGCTTCGAGCAGTACTCGGTGCTGATCCAGGCGGCGCAGGCAGGGCTCGGGATCGCGCTCATT
CCCGCGTTCCTGATTCGCGCGCCGCTCGCGGCCGGCACGCTCGTGCAGCCGCTCGACGCGCCCGTCGACGTCGACGAGCA
GAGCCATTACCTGTGCTACGCGCCGGAGCGGCTGCAGGCGAGTGCATCGCTGCGGCTGCTGCGCGAATGGATGCTCGCCG
AGTGCGCATGCGCATGA

Upstream 100 bases:

>100_bases
ATCGCGACGACGGCCGCCAGTGTGGGCGCGCGACATGGCGCCGTCATGAAAAAAACGTACAAACTCCGCATCGCTCTTTG
CATCGACGGAATACTTCGCG

Downstream 100 bases:

>100_bases
CGCGCATCGCGCCGCCTCGTCCGCGCCGCCGCCGCACCTGCGCTCTCGTGACGCGGCGAAACCGCGCGCCGCTCGGCTTT
TGCGCGCGGCGGACCTAGTA

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 298; Mature: 298
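
The residue count follows from the gene length: 897 bases make 299 codons, and the last codon of the gene sequence (TGA) is a stop codon, leaving 298 residues. The short Python sketch below shows that arithmetic; the function name is hypothetical, and it assumes an open reading frame with exactly one terminal stop codon.

```python
def residues_from_orf(n_bases: int) -> int:
    """Number of amino acids encoded by an ORF of n_bases bases.

    Assumes the ORF length is a multiple of three and that the final
    codon is a stop codon that is not translated.
    """
    if n_bases % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return n_bases // 3 - 1


print(residues_from_orf(897))
```
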

Protein sequence:

>298_residues
MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI
GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY
LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI
PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA

Sequences:

>Translated_298_residues
MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI
GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY
LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI
PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA
>Mature_298_residues
MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI
GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY
LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI
PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=298, Percent_Identity=32.8859060402685, Blast_Score=145, Evalue=3e-36
Organism=Escherichia coli, GI1788706, Length=277, Percent_Identity=30.6859205776173, Blast_Score=125, Evalue=4e-30
Organism=Escherichia coli, GI1786448, Length=285, Percent_Identity=31.2280701754386, Blast_Score=122, Evalue=3e-29
Organism=Escherichia coli, GI145693193, Length=298, Percent_Identity=29.8657718120805, Blast_Score=83, Evalue=2e-17
Organism=Escherichia coli, GI1786401, Length=249, Percent_Identity=28.5140562248996, Blast_Score=79, Evalue=4e-16
Organism=Escherichia coli, GI157672245, Length=198, Percent_Identity=30.8080808080808, Blast_Score=69, Evalue=4e-13
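
Each homologue entry is a comma-separated list of key=value fields plus a bare GI identifier. The Python sketch below parses one such line into a dictionary with typed values. The function name and the choice of output keys are hypothetical, and it assumes every entry follows the layout shown above.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dictionary.

    Assumes comma-separated fields of the form key=value, with the GI
    number given as a bare 'GI<digits>' token. Empty fields (for
    example after a trailing comma) are ignored.
    """
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789173, Length=298, "
    "Percent_Identity=32.8859060402685, Blast_Score=145, Evalue=3e-36"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```
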

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 32628; Mature: 32628

Theoretical pI: Translated: 7.20; Mature: 7.20

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
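
These percentages can be recomputed from the protein sequence listed above: 6 Cys and 3 Met residues out of 298. A minimal Python sketch, assuming the content is a plain residue count divided by the sequence length (the record does not state the method) and using a hypothetical function name:

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI"
    "GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY"
    "LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI"
    "PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in sequence occupied by any of the given residues."""
    count = sum(sequence.count(r) for r in residues)
    return 100.0 * count / len(sequence)


cys = residue_percent(PROTEIN, "C")
met = residue_percent(PROTEIN, "M")
both = residue_percent(PROTEIN, "CM")
print(round(cys, 1), round(met, 1), round(both, 1))
```

Because the translated and mature sequences are identical in this record, the same values apply to both.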

Secondary structure:

>Translated Secondary Structure
MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNL
CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCE
ALTRAGQVYQARVAGPLREIGNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLET
EEEECCCHHHHHHHHHHHHHCCHHEEHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHC
APDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDYLDGRTFVPVCAPGFAERHAL
CCCCCHHHHHHCCCCCCCCCCHHHEEEECCCCCCCEEEEEECCCEEEEECCCCHHHHHHC
REPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI
CCCHHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH
PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA
HHHHHHCCHHCCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNL
CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCE
ALTRAGQVYQARVAGPLREIGNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLET
EEEECCCHHHHHHHHHHHHHCCHHEEHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHC
APDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDYLDGRTFVPVCAPGFAERHAL
CCCCCHHHHHHCCCCCCCCCCHHHEEEECCCCCCCEEEEEECCCEEEEECCCCHHHHHHC
REPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI
CCCHHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH
PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA
HHHHHHCCHHCCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]