Definition Burkholderia thailandensis E264 chromosome I, complete sequence.
Accession NC_007651
Length 3,809,201

The map label for this gene is atoC [H]

Identifier: 83719894

GI number: 83719894

Start: 1883709

End: 1885598

Strand: Reverse

Name: atoC [H]

Synonym: BTH_I1675

Alternate gene names: 83719894

Gene position: 1885598-1883709 (Counterclockwise)

Preceding gene: 83719840

Following gene: 83721102

Centisome position: 49.5
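
The coordinates above are enough to re-derive the gene length and the centisome value. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the replicon length; the record does not state its formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, replicon_length: int) -> float:
    """Position as a percentage of replicon length (assumed definition)."""
    return 100.0 * position / replicon_length


START, END = 1883709, 1885598   # from the Start and End fields
REPLICON_LENGTH = 3809201       # from the Length field

print(gene_length(START, END))
print(round(centisome(START, REPLICON_LENGTH), 1))
```

With the values in this record the length comes out at 1890 bases, matching the header of the gene sequence block, and the percentage rounds to the listed 49.5.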

GC content: 70.32
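
A GC content figure like the one above can be recomputed from any nucleotide sequence. This is a minimal sketch, assuming the value is the percentage of G and C among the A/C/G/T bases; the function name `gc_content` is hypothetical and the record does not say how its own figure was calculated.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    seq = "".join(sequence.split()).upper()  # drop line breaks, normalise case
    bases = [b for b in seq if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not the gene itself: 6 of 8 bases are G or C.
print(gc_content("ATGC\nGGCC"))
```

Passing the lines of the gene sequence block (without the `>` header) to the same function should give a value comparable to the listed one.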

Gene sequence:

>1890_bases
ATGTCCTGGGAAATCGCGCCGACGTTCGCTCGGCCCGCCGCTGCGTTCGCGCGCGGTGCGCGCGCGTTCGCGATCGCCGC
ATCGATGTCCGGCCGAACAAGTTCGACCGCGACGCCGCGCGGCCGCCTCCAGCGACACGACGCCGCCGACGCGCCGACGT
TCCTCGCGCATCGCGCAACGCGTGTCGCGCGCATCGCCGTGCCCTGCCCCCGCCCCGGAAGAGCGCACATGCGTCGCAGC
AACACCGACACCGCACGCCGGCCGCCCGGCATCCCCGCCGTCCTCCCGCATCGGTCGCGCCAACGACGCGCGCCACGCGA
CGACGCCCGCGGCGCTCGCGCCCCGCACACCGAACATTGCACGCCGCGCGCCGCGCTCGGCGCGCCTGCCCCCGACACGT
TCCGAACAACATTCCAGCTGAGGTGCATCGTGCCCGACTTCCGCTTTCCCTCCGATTGCGCACCGCCCGGCCACGCCGGG
CAGCGCACCGCGTCCCCCGCCGACGCGGCCGCGCGCCAGCTCGTCTACCTGTCGCGCACGCCCGACACGACGCTCGTCGA
GCATCTGCGCGCGCGACGCTGGAACGTGCATGTCGCGCGCTCCGCGCACGAAGCGGCGCGGCGCGTGAAGCCGGATCAGC
CGCAGGCGGGCATCGCCGATCTCGACGGCTTCGCGCCGCGTGAGCTGCCGACGCTCGAGGCGGTGTTGCGCCAGCAGCAG
GTCGGCTGGATCGCGCTCGCCGGCGACGCGCGCATCAACGATCCCGACGTGCGCCGGCTGATTCGCCAGTACTGTTTCGA
TTACATGCCGGGCCTGCCGCCGCACGAGACGATCGATTATCTCGTGGGCCACGCCTACGGGATGGTCGCGCTGTGCGATC
TCGACCTCATGGCGGGCGCCACCGAAACCGGCGACGAAATGGTCGGCGCGTGCGACGCGATGCAGCAGCTGTTCCGGATG
ATCCGCAAGGTCGCCGCGACCGACGCGACCGTGTTCATCTCCGGCGAATCCGGCACCGGCAAGGAGCTGACCGCGCTCGC
GATTCACGAGCGCTCCGAGCGCCGCAAGGCGCCGTTCGTCGCGATCAACTGCGGCGCGATTCCGAATCATCTGCTGCAGT
CCGAACTGTTCGGCTACGAGCGCGGCGCATTCACGGGCGCGAGCCAGCGCAAGATCGGCCGCGTCGAATCGGCGGACGGC
GGCACGCTGTTTCTGGACGAAATCGGCGACATGCCGCTCGAAAGCCAGGCGAGCATGCTGCGCTTCCTGCAGGAAGGCAA
GATCGAGCGGCTCGGCGGGCACGAGTCGATCCCGGTCGACGTGAGGATCATCTCCGCGACGCATGTCGATCTCGACGCGG
CGATGCGCGAAGGCCGCTTTCGCGAAGACCTGTACCACCGGCTGTGCGTGCTGAAGCTCGAGGAGCCGCCGCTGCGCGCG
CGCGACAAGGACATCGAAATCCTCGCGCATCACATCCTGCATCGGTTCAGAAGCGACGGCGCACGCCGCATCCACGGTTT
CACGTCGTGCGCGATCGAAGCGATGTACAACTATCAGTGGCCCGGCAACGTGCGCGAGCTGATCAACCGGATTCGGCGCG
CGATCGTGATGTCGGACAGCCGGCATCTGTCGGCCGCCGATCTCGATCTCGCGCCGTTCGCCGCCCGCCAGGCGACGACG
CTCGCCGAGGCGCGCGAGCGCGCCGAGCGCCGGACGATCGAGGCGTCGCTGCTGCGGCATCGCAATCGCCTGACCGAAGC
GGCGGCGGAGCTCGGGGTGTCTCGCGCGACGCTGTATCGGCTGATGGTGTCGCACGGCCTGCGCGAGTTGTCATGGGGCA
CGCATCGCACGGGCGCGAGCGATCCGGACGACGAAGCCGAGCCCACGTGA

Upstream 100 bases:

>100_bases
AACCATTCGCCGTCGCCGCGCTAAAGCGCAGCCGTCGACGGCTCTTGACGGGGAAAGCGCATTGCGTCGTCGCGTTCGCG
GCGCGCAACGGGGGGACCAG

Downstream 100 bases:

>100_bases
AGTGCGGGCGGCGTGTGCGCGAAGCGTCGGCGCAGGGCGCATCAAGCGTCGATCGCCGCAAGGCGCGCGAGAACGCGTAT
TCCGAACCGGCGCGAAGCAC

Product: sigma-54 dependent DNA-binding transcriptional regulator

Products: NA

Alternate protein names: Ornithine decarboxylase antizyme; Ornithine/arginine decarboxylase inhibitor [H]

Number of amino acids: Translated: 629; Mature: 628
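
The two residue counts follow from the 1890-base gene sequence. The sketch below assumes that the last codon is a stop codon and that maturation removes only the initiator methionine, which is consistent with the Mature sequence in this record differing from the Translated one only by the leading M. The function name is illustrative.

```python
from typing import Tuple


def residue_counts(cds_length: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a CDS length in bases."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    translated = codons - 1   # the final codon is the stop codon
    mature = translated - 1   # assumes only the initiator Met is removed
    return translated, mature


print(residue_counts(1890))
```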

Protein sequence:

>629_residues
MSWEIAPTFARPAAAFARGARAFAIAASMSGRTSSTATPRGRLQRHDAADAPTFLAHRATRVARIAVPCPRPGRAHMRRS
NTDTARRPPGIPAVLPHRSRQRRAPRDDARGARAPHTEHCTPRAALGAPAPDTFRTTFQLRCIVPDFRFPSDCAPPGHAG
QRTASPADAAARQLVYLSRTPDTTLVEHLRARRWNVHVARSAHEAARRVKPDQPQAGIADLDGFAPRELPTLEAVLRQQQ
VGWIALAGDARINDPDVRRLIRQYCFDYMPGLPPHETIDYLVGHAYGMVALCDLDLMAGATETGDEMVGACDAMQQLFRM
IRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLLQSELFGYERGAFTGASQRKIGRVESADG
GTLFLDEIGDMPLESQASMLRFLQEGKIERLGGHESIPVDVRIISATHVDLDAAMREGRFREDLYHRLCVLKLEEPPLRA
RDKDIEILAHHILHRFRSDGARRIHGFTSCAIEAMYNYQWPGNVRELINRIRRAIVMSDSRHLSAADLDLAPFAARQATT
LAEARERAERRTIEASLLRHRNRLTEAAAELGVSRATLYRLMVSHGLRELSWGTHRTGASDPDDEAEPT

Sequences:

>Translated_629_residues
MSWEIAPTFARPAAAFARGARAFAIAASMSGRTSSTATPRGRLQRHDAADAPTFLAHRATRVARIAVPCPRPGRAHMRRS
NTDTARRPPGIPAVLPHRSRQRRAPRDDARGARAPHTEHCTPRAALGAPAPDTFRTTFQLRCIVPDFRFPSDCAPPGHAG
QRTASPADAAARQLVYLSRTPDTTLVEHLRARRWNVHVARSAHEAARRVKPDQPQAGIADLDGFAPRELPTLEAVLRQQQ
VGWIALAGDARINDPDVRRLIRQYCFDYMPGLPPHETIDYLVGHAYGMVALCDLDLMAGATETGDEMVGACDAMQQLFRM
IRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLLQSELFGYERGAFTGASQRKIGRVESADG
GTLFLDEIGDMPLESQASMLRFLQEGKIERLGGHESIPVDVRIISATHVDLDAAMREGRFREDLYHRLCVLKLEEPPLRA
RDKDIEILAHHILHRFRSDGARRIHGFTSCAIEAMYNYQWPGNVRELINRIRRAIVMSDSRHLSAADLDLAPFAARQATT
LAEARERAERRTIEASLLRHRNRLTEAAAELGVSRATLYRLMVSHGLRELSWGTHRTGASDPDDEAEPT
>Mature_628_residues
SWEIAPTFARPAAAFARGARAFAIAASMSGRTSSTATPRGRLQRHDAADAPTFLAHRATRVARIAVPCPRPGRAHMRRSN
TDTARRPPGIPAVLPHRSRQRRAPRDDARGARAPHTEHCTPRAALGAPAPDTFRTTFQLRCIVPDFRFPSDCAPPGHAGQ
RTASPADAAARQLVYLSRTPDTTLVEHLRARRWNVHVARSAHEAARRVKPDQPQAGIADLDGFAPRELPTLEAVLRQQQV
GWIALAGDARINDPDVRRLIRQYCFDYMPGLPPHETIDYLVGHAYGMVALCDLDLMAGATETGDEMVGACDAMQQLFRMI
RKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLLQSELFGYERGAFTGASQRKIGRVESADGG
TLFLDEIGDMPLESQASMLRFLQEGKIERLGGHESIPVDVRIISATHVDLDAAMREGRFREDLYHRLCVLKLEEPPLRAR
DKDIEILAHHILHRFRSDGARRIHGFTSCAIEAMYNYQWPGNVRELINRIRRAIVMSDSRHLSAADLDLAPFAARQATTL
AEARERAERRTIEASLLRHRNRLTEAAAELGVSRATLYRLMVSHGLRELSWGTHRTGASDPDDEAEPT

Specific function: Member of the two-component regulatory system AtoS/AtoC involved in the transcriptional regulation of the ato genes for acetoacetate metabolism. Also an inhibitor of polyamine biosynthesis [H]

COG id: COG2204

COG function: function code T; Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1790437, Length=429, Percent_Identity=39.3939393939394, Blast_Score=262, Evalue=6e-71,
Organism=Escherichia coli, GI1788550, Length=304, Percent_Identity=46.7105263157895, Blast_Score=259, Evalue=3e-70,
Organism=Escherichia coli, GI1788905, Length=295, Percent_Identity=44.7457627118644, Blast_Score=242, Evalue=7e-65,
Organism=Escherichia coli, GI1790299, Length=333, Percent_Identity=43.5435435435435, Blast_Score=241, Evalue=1e-64,
Organism=Escherichia coli, GI1789233, Length=321, Percent_Identity=42.3676012461059, Blast_Score=240, Evalue=2e-64,
Organism=Escherichia coli, GI87082117, Length=321, Percent_Identity=43.9252336448598, Blast_Score=239, Evalue=3e-64,
Organism=Escherichia coli, GI1789087, Length=308, Percent_Identity=42.5324675324675, Blast_Score=225, Evalue=6e-60,
Organism=Escherichia coli, GI87082152, Length=320, Percent_Identity=41.5625, Blast_Score=215, Evalue=8e-57,
Organism=Escherichia coli, GI1786524, Length=319, Percent_Identity=41.3793103448276, Blast_Score=213, Evalue=3e-56,
Organism=Escherichia coli, GI87081872, Length=323, Percent_Identity=36.5325077399381, Blast_Score=211, Evalue=1e-55,
Organism=Escherichia coli, GI1787583, Length=312, Percent_Identity=35.2564102564103, Blast_Score=175, Evalue=8e-45,
Organism=Escherichia coli, GI87081858, Length=300, Percent_Identity=32, Blast_Score=144, Evalue=2e-35,
Organism=Escherichia coli, GI1789828, Length=282, Percent_Identity=31.5602836879433, Blast_Score=119, Evalue=7e-28,
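
The homologue lines above share a simple comma-separated `key=value` layout, so they can be turned into records for sorting or filtering. This is a minimal sketch that assumes the field names shown in this record and an organism name without commas; `parse_homologue` is a hypothetical helper, not part of any database tool.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # a trailing comma leaves an empty field
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare 'GI<number>' field
    # Convert the numeric fields; keys mirror the names used in the record.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = float(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1790437, Length=429, "
        "Percent_Identity=39.3939393939394, Blast_Score=262, Evalue=6e-71,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

Parsing every line this way and sorting by `Evalue` or `Blast_Score` gives a ranked hit list.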

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR011006
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR002197
- InterPro:   IPR002078
- InterPro:   IPR001789 [H]

Pfam domain/function: PF02954 HTH_8; PF00072 Response_reg; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 69620; Mature: 69489

Theoretical pI: Translated: 9.44; Mature: 9.44

Prosite motif: PS00676 SIGMA54_INTERACT_2; PS00688 SIGMA54_INTERACT_3; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6% Cys      (Translated Protein)
2.4% Met      (Translated Protein)
4.0% Cys+Met  (Translated Protein)
1.6% Cys      (Mature Protein)
2.2% Met      (Mature Protein)
3.8% Cys+Met  (Mature Protein)
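
Percentages of this kind are residue counts divided by sequence length. The sketch below assumes exactly that definition; `cys_met_content` is a hypothetical name and the example peptide is a toy input, not the protein in this record.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()  # drop line breaks, normalise case
    if not seq:
        raise ValueError("empty sequence")
    n = len(seq)
    cys = 100.0 * seq.count("C") / n
    met = 100.0 * seq.count("M") / n
    return {"Cys": cys, "Met": met, "Cys+Met": cys + met}


# Toy peptide: 1 Cys and 2 Met in 10 residues.
print(cys_met_content("MACDEFGHIM"))
```

Removing the initiator methionine lowers the Met count by one, which is why the Mature Protein values for Met and Cys+Met are slightly lower than the Translated Protein values.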

Secondary structure:

>Translated Secondary Structure
MSWEIAPTFARPAAAFARGARAFAIAASMSGRTSSTATPRGRLQRHDAADAPTFLAHRAT
CCCCCCCCHHHHHHHHHCCCHHEEEEECCCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHH
RVARIAVPCPRPGRAHMRRSNTDTARRPPGIPAVLPHRSRQRRAPRDDARGARAPHTEHC
HHEEEECCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCC
TPRAALGAPAPDTFRTTFQLRCIVPDFRFPSDCAPPGHAGQRTASPADAAARQLVYLSRT
CCCHHCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHEEEEECC
PDTTLVEHLRARRWNVHVARSAHEAARRVKPDQPQAGIADLDGFAPRELPTLEAVLRQQQ
CCHHHHHHHHHHHCCEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHC
VGWIALAGDARINDPDVRRLIRQYCFDYMPGLPPHETIDYLVGHAYGMVALCDLDLMAGA
CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHCCC
TETGDEMVGACDAMQQLFRMIRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFV
CCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCEEHHHHHHHHHHCCCCEE
AINCGAIPNHLLQSELFGYERGAFTGASQRKIGRVESADGGTLFLDEIGDMPLESQASML
EEECCCCHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCCCEEEHHHHCCCCCHHHHHHH
RFLQEGKIERLGGHESIPVDVRIISATHVDLDAAMREGRFREDLYHRLCVLKLEEPPLRA
HHHHHCCHHHCCCCCCCCEEEEEEEEECCCHHHHHHCCCHHHHHHHHHEEEEECCCCCCC
RDKDIEILAHHILHRFRSDGARRIHGFTSCAIEAMYNYQWPGNVRELINRIRRAIVMSDS
CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCC
RHLSAADLDLAPFAARQATTLAEARERAERRTIEASLLRHRNRLTEAAAELGVSRATLYR
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHH
LMVSHGLRELSWGTHRTGASDPDDEAEPT
HHHHHCHHHHCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure 
SWEIAPTFARPAAAFARGARAFAIAASMSGRTSSTATPRGRLQRHDAADAPTFLAHRAT
CCCCCCCHHHHHHHHHCCCHHEEEEECCCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHH
RVARIAVPCPRPGRAHMRRSNTDTARRPPGIPAVLPHRSRQRRAPRDDARGARAPHTEHC
HHEEEECCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCC
TPRAALGAPAPDTFRTTFQLRCIVPDFRFPSDCAPPGHAGQRTASPADAAARQLVYLSRT
CCCHHCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHEEEEECC
PDTTLVEHLRARRWNVHVARSAHEAARRVKPDQPQAGIADLDGFAPRELPTLEAVLRQQQ
CCHHHHHHHHHHHCCEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHC
VGWIALAGDARINDPDVRRLIRQYCFDYMPGLPPHETIDYLVGHAYGMVALCDLDLMAGA
CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHCCC
TETGDEMVGACDAMQQLFRMIRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFV
CCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCEEHHHHHHHHHHCCCCEE
AINCGAIPNHLLQSELFGYERGAFTGASQRKIGRVESADGGTLFLDEIGDMPLESQASML
EEECCCCHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCCCEEEHHHHCCCCCHHHHHHH
RFLQEGKIERLGGHESIPVDVRIISATHVDLDAAMREGRFREDLYHRLCVLKLEEPPLRA
HHHHHCCHHHCCCCCCCCEEEEEEEEECCCHHHHHHCCCHHHHHHHHHEEEEECCCCCCC
RDKDIEILAHHILHRFRSDGARRIHGFTSCAIEAMYNYQWPGNVRELINRIRRAIVMSDS
CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCC
RHLSAADLDLAPFAARQATTLAEARERAERRTIEASLLRHRNRLTEAAAELGVSRATLYR
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHH
LMVSHGLRELSWGTHRTGASDPDDEAEPT
HHHHHCHHHHCCCCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9097040; 9278503; 8346225 [H]