Definition | Burkholderia mallei NCTC 10247 chromosome II, complete genome. |
---|---|
Accession | NC_009079 |
Length | 2,352,693 |
Click here to switch to the map view.
The map label for this gene is gcvA [H]
Identifier: 126446644
GI number: 126446644
Start: 1890860
End: 1891759
Strand: Direct
Name: gcvA [H]
Synonym: BMA10247_A1967
Alternate gene names: 126446644
Gene position: 1890860-1891759 (Clockwise)
Preceding gene: 126445748
Following gene: 126447488
Centisome position: 80.37
GC content: 72.44
Gene sequence:
>900_bases ATGACGAGACCAGACCGTCTGCCGCCGATGCAGACCTTGTCCGCGTTCGAAGCGGCCGCGCGCCTCGCGAGCTTCACGGC CGCCGCGCGCGAGCTCGGCTCGACGCAGCCGGCCGTCAGCCAGCGCGTCGTTCAGCTCGAAGAGGATCTCGGCACGCCGC TCTTCGAGCGCGGGCGCCGCGGCGTCACGCTGACCGAGGACGGCACGCGGCTCTTCGCGGCGGTCCGCCAAAGCCTCGAC GCGCTGCGCGCCGCGACGGCGGACATCCGCAGCCGCCGCGCGAACGGCACATTCACGCTCGTCACCGATTTCGGCTTCGC CACCTACTGGCTGATGCCGCGCCTCGACGATCTGAAGCGCGCGATGCCCGGCGTCGACGTGCGCGTCGTCACGTCTCAGG ACATCGATCCGCAGCGCGAGCACGCCGACGTCGCGATCCTGTTCGGCGCCGGCGACTGGCCGGGCTGCACATCGACGCGG CTCTTTCAGGAACACGTGACGCCCGTGTGCTCGCCCGCGTTTCGCACCGCGCATGCCGATATCGCGCGGCCAGCCGACCT GTTGCGCGCGCCGCTGCTGCACGTGCAGCCGACGCGCCCCGAGCGCTGGCTCGCGTGGCGCGATTGGTTCGACGCGCATG GGCTCGCCGCGCCGCCCGAGCCGCACGGGCTGACGTTCAACAGCTACTCGCTCGTGATTCAAGCGGCGCTGATGAATCAG GGTGTCGCGCTCGGCTGGGCGCCGCTCGTCGACACGCCGATCGCGGCCGGCCAGCTCGTGCGGCTCGTCGACGCGCCCGT CGTCACGCCGCGCGGCTACTACCTCGTTCTGCCGCCCGCGCGGCCGGAGGCGCGCGCGGTGCCGCTCTTTCGCCGCTGGC TGCTCGGCGCATGCGCATGA
Upstream 100 bases:
>100_bases CGGCGGCGCGACAGGCAGCCGCCGCCGGCCGGATCTCACTATTGCCGGCCGTAATGCGGCCGTGAAGGCATCGAATGCTT ATGAGCCATCAGAGGTGCTT
Downstream 100 bases:
>100_bases CGCATCGGGCATTGGCGCGCCGATTCCAGCAGCGCGGCGATTCGGTTCTATCCGCATCCGATCTGGCTTGATTAACTGGC GTATCGCCTCGGACGCCCGG
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 299; Mature: 298
Protein sequence:
>299_residues MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLD ALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTR LFQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA
Sequences:
>Translated_299_residues MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLD ALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTR LFQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA >Mature_298_residues TRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRRGVTLTEDGTRLFAAVRQSLDA LRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKRAMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTRL FQEHVTPVCSPAFRTAHADIARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQG VALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Bind to three sites upstream of the gcv promoter [H]
COG id: COG0583
COG function: function code K; Transcriptional regulator
Gene ontology:
Cell location: Cytoplasmic [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=298, Percent_Identity=37.248322147651, Blast_Score=191, Evalue=4e-50, Organism=Escherichia coli, GI1788706, Length=292, Percent_Identity=34.2465753424658, Blast_Score=153, Evalue=1e-38, Organism=Escherichia coli, GI1786448, Length=295, Percent_Identity=30.5084745762712, Blast_Score=122, Evalue=2e-29, Organism=Escherichia coli, GI1786401, Length=262, Percent_Identity=28.2442748091603, Blast_Score=82, Evalue=4e-17, Organism=Escherichia coli, GI87081978, Length=269, Percent_Identity=28.996282527881, Blast_Score=79, Evalue=4e-16, Organism=Escherichia coli, GI145693193, Length=304, Percent_Identity=25.9868421052632, Blast_Score=73, Evalue=2e-14, Organism=Escherichia coli, GI145693105, Length=123, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=1e-12, Organism=Escherichia coli, GI1787601, Length=126, Percent_Identity=34.1269841269841, Blast_Score=66, Evalue=3e-12, Organism=Escherichia coli, GI157672245, Length=166, Percent_Identity=30.7228915662651, Blast_Score=65, Evalue=5e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847 - InterPro: IPR005119 - InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 32843; Mature: 32712
Theoretical pI: Translated: 8.50; Mature: 8.50
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein) 1.7 %Met (Translated Protein) 2.7 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 1.3 %Met (Mature Protein) 2.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRR CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHCCCC GVTLTEDGTRLFAAVRQSLDALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKR CEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHCCCHHHHHH AMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTRLFQEHVTPVCSPAFRTAHAD HCCCCCEEEEECCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHCCHHCCHHHHHHHHH IARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ HCCCHHHHHCCCCEECCCCCHHHHHHHHHHCCCCCCCCCCCCCEEECHHHHHHHHHHHHC GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA CCEEEEHHHHCCCCHHHHHHHHHCCCEECCCCCEEEECCCCCCCCCCHHHHHHHHHCCC >Mature Secondary Structure TRPDRLPPMQTLSAFEAAARLASFTAAARELGSTQPAVSQRVVQLEEDLGTPLFERGRR CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHCCCC GVTLTEDGTRLFAAVRQSLDALRAATADIRSRRANGTFTLVTDFGFATYWLMPRLDDLKR CEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHCCCHHHHHH AMPGVDVRVVTSQDIDPQREHADVAILFGAGDWPGCTSTRLFQEHVTPVCSPAFRTAHAD HCCCCCEEEEECCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHCCHHCCHHHHHHHHH IARPADLLRAPLLHVQPTRPERWLAWRDWFDAHGLAAPPEPHGLTFNSYSLVIQAALMNQ HCCCHHHHHCCCCEECCCCCHHHHHHHHHHCCCCCCCCCCCCCEEECHHHHHHHHHHHHC GVALGWAPLVDTPIAAGQLVRLVDAPVVTPRGYYLVLPPARPEARAVPLFRRWLLGACA CCEEEEHHHHCCCCHHHHHHHHHCCCEECCCCCEEEECCCCCCCCCCHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]