Definition Burkholderia mallei NCTC 10247 chromosome II, complete genome.
Accession NC_009079
Length 2,352,693

The map label for this gene is gcvA [H]

Identifier: 126446644

GI number: 126446644

Start: 1890860

End: 1891759

Strand: Direct

Name: gcvA [H]

Synonym: BMA10247_A1967

Alternate gene names: 126446644

Gene position: 1890860-1891759 (Clockwise)

Preceding gene: 126445748

Following gene: 126447488

Centisome position: 80.37
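
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the replicon length; the record does not state this definition, but it reproduces the listed value. The function names are illustrative only.

```python
# Values copied from the record above.
START, END = 1890860, 1891759
CHROMOSOME_LENGTH = 2352693


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


length = gene_length(START, END)
print(length)                                         # 900
print(length // 3 - 1)                                # 299 codons excluding the stop codon
print(round(centisome(START, CHROMOSOME_LENGTH), 2))  # 80.37
```

The 900-base length matches the header of the gene sequence block, and 300 codons less the terminal stop codon give the 299 translated residues reported for the protein.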

GC content: 72.44

Gene sequence:

>900_bases
ATGACGAGACCAGACCGTCTGCCGCCGATGCAGACCTTGTCCGCGTTCGAAGCGGCCGCGCGCCTCGCGAGCTTCACGGC
CGCCGCGCGCGAGCTCGGCTCGACGCAGCCGGCCGTCAGCCAGCGCGTCGTTCAGCTCGAAGAGGATCTCGGCACGCCGC
TCTTCGAGCGCGGGCGCCGCGGCGTCACGCTGACCGAGGACGGCACGCGGCTCTTCGCGGCGGTCCGCCAAAGCCTCGAC
GCGCTGCGCGCCGCGACGGCGGACATCCGCAGCCGCCGCGCGAACGGCACATTCACGCTCGTCACCGATTTCGGCTTCGC
CACCTACTGGCTGATGCCGCGCCTCGACGATCTGAAGCGCGCGATGCCCGGCGTCGACGTGCGCGTCGTCACGTCTCAGG
ACATCGATCCGCAGCGCGAGCACGCCGACGTCGCGATCCTGTTCGGCGCCGGCGACTGGCCGGGCTGCACATCGACGCGG
CTCTTTCAGGAACACGTGACGCCCGTGTGCTCGCCCGCGTTTCGCACCGCGCATGCCGATATCGCGCGGCCAGCCGACCT
GTTGCGCGCGCCGCTGCTGCACGTGCAGCCGACGCGCCCCGAGCGCTGGCTCGCGTGGCGCGATTGGTTCGACGCGCATG
GGCTCGCCGCGCCGCCCGAGCCGCACGGGCTGACGTTCAACAGCTACTCGCTCGTGATTCAAGCGGCGCTGATGAATCAG
GGTGTCGCGCTCGGCTGGGCGCCGCTCGTCGACACGCCGATCGCGGCCGGCCAGCTCGTGCGGCTCGTCGACGCGCCCGT
CGTCACGCCGCGCGGCTACTACCTCGTTCTGCCGCCCGCGCGGCCGGAGGCGCGCGCGGTGCCGCTCTTTCGCCGCTGGC
TGCTCGGCGCATGCGCATGA
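
A GC content figure such as the one listed for this gene can be recomputed from a nucleotide sequence. This is a minimal sketch, assuming GC content means the percentage of G and C among unambiguous bases; the helper name `gc_content` is hypothetical and not part of the record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace, line breaks, and ambiguity codes are ignored (an assumption).
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Last line of the gene sequence above, used only as a short example.
print(gc_content("TGCTCGGCGCATGCGCATGA"))  # 65.0
```

Passing the full 900-base sequence, with or without line breaks, gives the gene-wide figure.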

Upstream 100 bases:

>100_bases
CGGCGGCGCGACAGGCAGCCGCCGCCGGCCGGATCTCACTATTGCCGGCCGTAATGCGGCCGTGAAGGCATCGAATGCTT
ATGAGCCATCAGAGGTGCTT

Downstream 100 bases:

>100_bases
CGCATCGGGCATTGGCGCGCCGATTCCAGCAGCGCGGCGATTCGGTTCTATCCGCATCCGATCTGGCTTGATTAACTGGC
GTATCGCCTCGGACGCCCGG

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: Gcv operon activator [H]

Number of amino acids: Translated: 299; Mature: 298

Protein sequence:

>299_residues
MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLD
ALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTR
LFQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ
GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA

Sequences:

>Translated_299_residues
MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLD
ALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTR
LFQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ
GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA
>Mature_298_residues
TRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLDA
LRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTRL
FQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQG
VALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA

Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Binds to three sites upstream of the gcv promoter. [H]

COG id: COG0583

COG function: function code K; Transcriptional regulator

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=298, Percent_Identity=37.248322147651, Blast_Score=191, Evalue=4e-50,
Organism=Escherichia coli, GI1788706, Length=292, Percent_Identity=34.2465753424658, Blast_Score=153, Evalue=1e-38,
Organism=Escherichia coli, GI1786448, Length=295, Percent_Identity=30.5084745762712, Blast_Score=122, Evalue=2e-29,
Organism=Escherichia coli, GI1786401, Length=262, Percent_Identity=28.2442748091603, Blast_Score=82, Evalue=4e-17,
Organism=Escherichia coli, GI87081978, Length=269, Percent_Identity=28.996282527881, Blast_Score=79, Evalue=4e-16,
Organism=Escherichia coli, GI145693193, Length=304, Percent_Identity=25.9868421052632, Blast_Score=73, Evalue=2e-14,
Organism=Escherichia coli, GI145693105, Length=123, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=1e-12,
Organism=Escherichia coli, GI1787601, Length=126, Percent_Identity=34.1269841269841, Blast_Score=66, Evalue=3e-12,
Organism=Escherichia coli, GI157672245, Length=166, Percent_Identity=30.7228915662651, Blast_Score=65, Evalue=5e-12,
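
The homologue lines above follow a simple comma-separated layout that is easy to parse. The sketch below assumes every line has the same fields as the ones shown (a bare `GI…` token plus `key=value` pairs, with a trailing comma); `parse_homologue` is a hypothetical helper, not part of any database tool.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is the only field written without '='.
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


best_hit = parse_homologue(
    "Organism=Escherichia coli, GI1789173, Length=298, "
    "Percent_Identity=37.248322147651, Blast_Score=191, Evalue=4e-50,"
)
print(best_hit["GI"], best_hit["Blast_Score"], best_hit["Evalue"])
```

Once parsed, the hits can be sorted or filtered, for example by keeping only those with an E-value below a chosen threshold.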

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 32843; Mature: 32712

Theoretical pI: Translated: 8.50; Mature: 8.50

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.7 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
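
The percentages above can be reproduced directly from the protein sequence in this record. The sketch takes the mature protein to be the translated protein without its initiator methionine, which is what the two sequences listed earlier show; the helper name `percent` is illustrative.

```python
# Translated protein sequence, copied from the record above.
TRANSLATED = (
    "MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLD"
    "ALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTR"
    "LFQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ"
    "GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA"
)
# Mature form: the initiator methionine is removed.
MATURE = TRANSLATED[1:]


def percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    hits = sum(sequence.count(r) for r in residues)
    return 100.0 * hits / len(sequence)


for label, seq in (("Translated Protein", TRANSLATED), ("Mature Protein", MATURE)):
    print(f"{percent(seq, 'C'):.1f} %Cys     ({label})")
    print(f"{percent(seq, 'M'):.1f} %Met     ({label})")
    print(f"{percent(seq, 'CM'):.1f} %Cys+Met ({label})")
```

The sequence contains 3 cysteines and 5 methionines in 299 residues, so removing the initiator methionine lowers the Met figure from 1.7 % to 1.3 % while the Cys figure stays at 1.0 %.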

Secondary structure:

>Translated Secondary Structure
MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRR
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHCCCC
GVTLTEDGTRLFAAVRQSLDALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKR
CEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHCCCHHHHHH
AMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTRLFQEHVTPVCSPAFRTAHAD
HCCCCCEEEEECCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHCCHHCCHHHHHHHHH
IARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ
HCCCHHHHHCCCCEECCCCCHHHHHHHHHHCCCCCCCCCCCCCEEECHHHHHHHHHHHHC
GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA
CCEEEEHHHHCCCCHHHHHHHHHCCCEECCCCCEEEECCCCCCCCCCHHHHHHHHHCCC
>Mature Secondary Structure 
TRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRR
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHCCCC
GVTLTEDGTRLFAAVRQSLDALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKR
CEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHCCCHHHHHH
AMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTRLFQEHVTPVCSPAFRTAHAD
HCCCCCEEEEECCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHCCHHCCHHHHHHHHH
IARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ
HCCCHHHHHCCCCEECCCCCHHHHHHHHHHCCCCCCCCCCCCCEEECHHHHHHHHHHHHC
GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA
CCEEEEHHHHCCCCHHHHHHHHHCCCEECCCCCEEEECCCCCCCCCCHHHHHHHHHCCC
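
The per-residue prediction strings above can be summarised as a composition of states. This sketch assumes the common three-state convention (H = helix, E = strand, C = coil), which the record itself does not define; `state_fractions` is a hypothetical helper.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Percentage of each predicted state (H, E, C) in a prediction string."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state codes: {sorted(unknown)}")
    return {s: 100.0 * counts[s] / len(states) for s in "HEC"}


# First prediction line of the translated protein, used as a short example.
first_line = "CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHCCCC"
for state, pct in state_fractions(first_line).items():
    print(f"{state}: {pct:.1f} %")
```

Concatenating all prediction lines for one form of the protein before calling the function gives the composition over the whole chain.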

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]