Definition | Burkholderia mallei NCTC 10247 chromosome II, complete genome. |
---|---|
Accession | NC_009079 |
Length | 2,352,693 |
Click here to switch to the map view.
The map label for this gene is gcvA [H]
Identifier: 126446071
GI number: 126446071
Start: 2018811
End: 2019707
Strand: Direct
Name: gcvA [H]
Synonym: BMA10247_A2086
Alternate gene names: 126446071
Gene position: 2018811-2019707 (Clockwise)
Preceding gene: 126446674
Following gene: 126445640
Centisome position: 85.81
GC content: 69.9
Gene sequence:
>897_bases ATGAAGCATCTGTATCCGAACGTCGCCGAACTGCACGCGTTCGCCAGTTCTGCGAAGCATCTGAACTTTTCGTATGCGGC GCGCGAGCTCGGCCTCACGCCGAGCGCGGTCAGCCGGCAGATCGCGAGCCTCGAGGCGCTGCTCGGCGTGAAGCTGTTCG TGCGCGAAGGCCGCAATCTCGCGCTCACGCGCGCGGGGCAGGTCTATCAGGCGCGCGTCGCGGGGCCCCTGCGCGAGATC GGCAATGCGTCGCTCGAATTGCTGAGCGCGCGCGAGGACAGCAATCTGCTGACCATTGCGAGCGTGCCGACCTTCACGAC GAAATGGCTCGTGCCGCGCCTGCCGCGCTTTCTCGAAACCGCGCCTGACATCACGCTGAGCTTCCGGCGCCACCTCGCGC CGGGCGACCTGTTCCCGCTCGGGCTCGATGCGGCGATCCGCTACGGCGACGGCCGCTGGGAAGGCGTCCAGTGCGACTAT CTCGACGGCCGCACGTTCGTGCCCGTATGCGCGCCCGGTTTCGCCGAGCGTCACGCCCTGCGCGAACCCGCGGACATCGC CGCCGCGCCGCGCCTCGTGCACGAGCAGGCCGAATGCGCGTGGCTCGCGTGGGCGGACCGGCACCGCGCCACGCAGATGA ACGCGCTCGCCGGCCCACGCTTCGAGCAGTACTCGGTGCTGATCCAGGCGGCGCAGGCAGGGCTCGGGATCGCGCTCATT CCCGCGTTCCTGATTCGCGCGCCGCTCGCGGCCGGCACGCTCGTGCAGCCGCTCGACGCGCCCGTCGACGTCGACGAGCA GAGCCATTACCTGTGCTACGCGCCGGAGCGGCTGCAGGCGAGTGCATCGCTGCGGCTGCTGCGCGAATGGATGCTCGCCG AGTGCGCATGCGCATGA
Upstream 100 bases:
>100_bases ATCGCGACGACGGCCGCCAGTGTGGGCGCGCGACATGGCGCCGTCATGAAAAAAACGTACAAACTCCGCATCGCTCTTTG CATCGACGGAATACTTCGCG
Downstream 100 bases:
>100_bases CGCGCATCGCGCCGCCTCGTCCGCGCCGCCGCCGCACCTGCGCTCTCGTGACGCGGCGAAACCGCGCGCCGCTCGGCTTT TGCGCGCGGCGGACCTAGTA
Product: LysR family transcriptional regulator
Products: NA
Alternate protein names: Gcv operon activator [H]
Number of amino acids: Translated: 298; Mature: 298
Protein sequence:
>298_residues MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA
Sequences:
>Translated_298_residues MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA >Mature_298_residues MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNLALTRAGQVYQARVAGPLREI GNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLETAPDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDY LDGRTFVPVCAPGFAERHALREPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA
Specific function: Regulatory protein for the glycine cleavage system operon (gcv). Mediates activation of gcv by glycine and repression by purines. GcvA is negatively autoregulated. Bind to three sites upstream of the gcv promoter [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789173, Length=298, Percent_Identity=32.8859060402685, Blast_Score=145, Evalue=3e-36, Organism=Escherichia coli, GI1788706, Length=277, Percent_Identity=30.6859205776173, Blast_Score=125, Evalue=4e-30, Organism=Escherichia coli, GI1786448, Length=285, Percent_Identity=31.2280701754386, Blast_Score=122, Evalue=3e-29, Organism=Escherichia coli, GI145693193, Length=298, Percent_Identity=29.8657718120805, Blast_Score=83, Evalue=2e-17, Organism=Escherichia coli, GI1786401, Length=249, Percent_Identity=28.5140562248996, Blast_Score=79, Evalue=4e-16, Organism=Escherichia coli, GI157672245, Length=198, Percent_Identity=30.8080808080808, Blast_Score=69, Evalue=4e-13,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000847 - InterPro: IPR005119 - InterPro: IPR011991 [H]
Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]
EC number: NA
Molecular weight: Translated: 32628; Mature: 32628
Theoretical pI: Translated: 7.20; Mature: 7.20
Prosite motif: PS50931 HTH_LYSR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein) 1.0 %Met (Translated Protein) 3.0 %Cys+Met (Translated Protein) 2.0 %Cys (Mature Protein) 1.0 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNL CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCE ALTRAGQVYQARVAGPLREIGNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLET EEEECCCHHHHHHHHHHHHHCCHHEEHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHC APDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDYLDGRTFVPVCAPGFAERHAL CCCCCHHHHHHCCCCCCCCCCHHHEEEECCCCCCCEEEEEECCCEEEEECCCCHHHHHHC REPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI CCCHHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA HHHHHHCCHHCCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MKHLYPNVAELHAFASSAKHLNFSYAARELGLTPSAVSRQIASLEALLGVKLFVREGRNL CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCE ALTRAGQVYQARVAGPLREIGNASLELLSAREDSNLLTIASVPTFTTKWLVPRLPRFLET EEEECCCHHHHHHHHHHHHHCCHHEEHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHC APDITLSFRRHLAPGDLFPLGLDAAIRYGDGRWEGVQCDYLDGRTFVPVCAPGFAERHAL CCCCCHHHHHHCCCCCCCCCCHHHEEEECCCCCCCEEEEEECCCEEEEECCCCHHHHHHC REPADIAAAPRLVHEQAECAWLAWADRHRATQMNALAGPRFEQYSVLIQAAQAGLGIALI CCCHHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH PAFLIRAPLAAGTLVQPLDAPVDVDEQSHYLCYAPERLQASASLRLLREWMLAECACA HHHHHHCCHHCCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]