Definition: Burkholderia mallei NCTC 10247 chromosome II, complete genome.
Accession: NC_009079
Length: 2,352,693

The map label for this gene is aer [H]

Identifier: 126445786

GI number: 126445786

Start: 738063

End: 739607

Strand: Direct

Name: aer [H]

Synonym: BMA10247_A0801

Alternate gene names: 126445786

Gene position: 738063-739607 (Clockwise)

Preceding gene: 126445781

Following gene: 126447107

Centisome position: 31.37
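
The coordinate fields above are mutually consistent, and a few lines of Python can confirm that. This is a minimal sketch: it assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the replicon length. The record does not state that definition, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


# Values copied from this record.
START, END, CHROMOSOME_LENGTH = 738063, 739607, 2352693

print(gene_length(START, END))                         # 1545
print(round(centisome(START, CHROMOSOME_LENGTH), 2))   # 31.37
```

The computed length matches the ">1545_bases" header of the gene sequence.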

GC content: 70.42
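
A GC content figure like the one above can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full gene sequence; the record does not say how its value was derived, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input; paste the full gene sequence from this record to check its value.
print(gc_content("ATGC"))  # 50.0
```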

Gene sequence:

>1545_bases
ATGCGTAACAACCAACCCGTCACCCAACACGAATTCGAGCTTCCCGACGACGCGACATTGATGTCGACGACCGATCCGCA
CGGCCGCATCACCTATGCGAACGCGACGTTCGTGCACGTCAGCGGCTTTTCGAGCGACGAGATCGTCGGCGCGCCGCACA
ACGTCGTGCGCCATCCCGACATGCCGCGCGATGCGTTCGCCGACATGTGGGCGACGTTAAAGCGCGGCGAGCCGTGGACC
GCGCTCGTCAAGAACCGCCGCAAGAACGGCGATCACTACTGGGTGCGCGCGAACGCGGTGCCGGTGATTCGCGGCGGGCA
GACGCAGGGCTACATGTCGGTGCGCACGAAGCCCGCGCGCGCCGAGACCGCCGCCGCCGACGCGCTCTATCGCGATTTTC
GCGAGGGCCGCGCGGGCAGCCGGCGCTTTCACAAGGGGCTGATCGTGCGCACCGGGCTGCTGCGTGCGTGCTCGCTGCTG
CAGACGATGTCGGTGCGCGCGCGCATCCATCTGCCGATCGTCGCGCTGACGCCGGCGATCGTCGGCGCTGCCTGGGCGGC
CGGCGTGGCGGGCGCGCCGCTCGCGCAGCTCGCGGGCGCGACGCTCGGCGGCGCGGCGCTCGCCGCGTGGTGGCTCGACG
CGCAGATCGCGCGCCCGCTGCGCACGTTGCGCCGGCAGGCGCTCGACGTCGCGACCGGGGCGAGCCGCCGGGGCGTCAAC
ATGAATCGCGTCGACGAAATCGGCATGTCGCTGCGCACGATCAATCAGCTCGGGCTGATGTTTCGCTGGCTGATCGACGA
CGTGAGCGAACAGGTCTTGACCGTGCAGCGCGCGGTCAACGAGATCGCGCAGGGCAATCACGATCTGAGCGCGCGCACCG
AGCAGGCGGCGACGAGCGTTCAGCAGACGGCCGCGTCGATGGCGCAGATGACGGCGACCGTGTCGAGCAACGCGCAGACC
GCGACGCAGGCGAACCGGCTGTCCGAATCGGCGAGCCATGCGGCGGAGCGCGGCGGCCAGGCAGTGCGCGAGGTGGTGAG
CACGATGGGCGAGATCACCGAGAGCTCGCGCCGGATCTCGGAGATCATCGGCGTGATCGACGGCATCGCGTTCCAGACCA
ACATCCTCGCGCTGAACGCGGCCGTCGAGGCCGCGCGCGCGGGCGAGCAGGGCCGCGGCTTCGCGGTGGTCGCGGGCGAG
GTGCGCGCGCTCGCGCAGCGCAGCGCGAACGCGGCGAAGGAGATCAAGGCGCTGATCGGTGCGAGCGTCGAGCGGGTCGA
ATCCGGCGCGCAGACGGTCGACTACGCGGGCAGGACGATGGGCGAGATCGTCTCGCAGGTGAAGCGCGTGTCCGATCTGA
TCGCCGAGATCAGCGCATCGACGAGCGAGCAGCGCGCGGGCGTCACGCAGGTCGACGACGCGGTCGTCCATCTCGACAGC
ATCACGCAGCAGAACGCCGCGCTCGTCGAGCAGAGCGCGGCGGCCTCGGAGAGCCTGCGGCAGCAGGCGACGCTGCTCGT
CGACGCGGTCGGCGTGTTTCGCTGA

Upstream 100 bases:

>100_bases
CTGCCGAAATAAAGGGTACAGGCCCATCGAAAGCCATTCGATTGATTCATTCGGCCGATGAAAGCAGCTAAAGGACGCAG
ATCCCATATTCGGAGCGCTC

Downstream 100 bases:

>100_bases
CCGCGGGAGGGCGGGGCGGCGATGCGCGGGCGCTGCCGCCGGCCGCATCGCGCCGCGCCGCGCTGCGCTGCGCCGGGGCG
CGCGCGATCAGCGCTTCACG

Product: methyl-accepting chemotaxis protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 514; Mature: 514
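
The residue count follows from the gene length: 1545 bases are 515 codons, and the last codon (TGA) is a stop, leaving 514 residues. The sketch below illustrates this with a small translation function. It assumes the standard codon assignments, and the two test fragments are the first 18 and last 24 bases of the gene sequence in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


print(translate("ATGCGTAACAACCAACCC"))        # MRNNQP
print(translate("GACGCGGTCGGCGTGTTTCGCTGA"))  # DAVGVFR*
print(1545 // 3 - 1)                          # 514
```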

Protein sequence:

>514_residues
MRNNQPVTQHEFELPDDATLMSTTDPHGRITYANATFVHVSGFSSDEIVGAPHNVVRHPDMPRDAFADMWATLKRGEPWT
ALVKNRRKNGDHYWVRANAVPVIRGGQTQGYMSVRTKPARAETAAADALYRDFREGRAGSRRFHKGLIVRTGLLRACSLL
QTMSVRARIHLPIVALTPAIVGAAWAAGVAGAPLAQLAGATLGGAALAAWWLDAQIARPLRTLRRQALDVATGASRRGVN
MNRVDEIGMSLRTINQLGLMFRWLIDDVSEQVLTVQRAVNEIAQGNHDLSARTEQAATSVQQTAASMAQMTATVSSNAQT
ATQANRLSESASHAAERGGQAVREVVSTMGEITESSRRISEIIGVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGE
VRALAQRSANAAKEIKALIGASVERVESGAQTVDYAGRTMGEIVSQVKRVSDLIAEISASTSEQRAGVTQVDDAVVHLDS
ITQQNAALVEQSAAASESLRQQATLLVDAVGVFR

Sequences:

>Translated_514_residues
MRNNQPVTQHEFELPDDATLMSTTDPHGRITYANATFVHVSGFSSDEIVGAPHNVVRHPDMPRDAFADMWATLKRGEPWT
ALVKNRRKNGDHYWVRANAVPVIRGGQTQGYMSVRTKPARAETAAADALYRDFREGRAGSRRFHKGLIVRTGLLRACSLL
QTMSVRARIHLPIVALTPAIVGAAWAAGVAGAPLAQLAGATLGGAALAAWWLDAQIARPLRTLRRQALDVATGASRRGVN
MNRVDEIGMSLRTINQLGLMFRWLIDDVSEQVLTVQRAVNEIAQGNHDLSARTEQAATSVQQTAASMAQMTATVSSNAQT
ATQANRLSESASHAAERGGQAVREVVSTMGEITESSRRISEIIGVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGE
VRALAQRSANAAKEIKALIGASVERVESGAQTVDYAGRTMGEIVSQVKRVSDLIAEISASTSEQRAGVTQVDDAVVHLDS
ITQQNAALVEQSAAASESLRQQATLLVDAVGVFR
>Mature_514_residues
MRNNQPVTQHEFELPDDATLMSTTDPHGRITYANATFVHVSGFSSDEIVGAPHNVVRHPDMPRDAFADMWATLKRGEPWT
ALVKNRRKNGDHYWVRANAVPVIRGGQTQGYMSVRTKPARAETAAADALYRDFREGRAGSRRFHKGLIVRTGLLRACSLL
QTMSVRARIHLPIVALTPAIVGAAWAAGVAGAPLAQLAGATLGGAALAAWWLDAQIARPLRTLRRQALDVATGASRRGVN
MNRVDEIGMSLRTINQLGLMFRWLIDDVSEQVLTVQRAVNEIAQGNHDLSARTEQAATSVQQTAASMAQMTATVSSNAQT
ATQANRLSESASHAAERGGQAVREVVSTMGEITESSRRISEIIGVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGE
VRALAQRSANAAKEIKALIGASVERVESGAQTVDYAGRTMGEIVSQVKRVSDLIAEISASTSEQRAGVTQVDDAVVHLDS
ITQQNAALVEQSAAASESLRQQATLLVDAVGVFR

Specific function: Signal transducer for aerotaxis. The aerotactic response is the accumulation of cells around air bubbles. The nature of the sensory stimulus detected by this protein is the proton motive force or cellular redox state. It uses a FAD prosthetic group as a r

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

Organism=Escherichia coli, GI1789453, Length=514, Percent_Identity=51.1673151750973, Blast_Score=493, Evalue=1e-140,
Organism=Escherichia coli, GI1788194, Length=309, Percent_Identity=50.8090614886731, Blast_Score=261, Evalue=1e-70,
Organism=Escherichia coli, GI1787690, Length=303, Percent_Identity=49.5049504950495, Blast_Score=252, Evalue=3e-68,
Organism=Escherichia coli, GI2367378, Length=309, Percent_Identity=51.4563106796116, Blast_Score=250, Evalue=2e-67,
Organism=Escherichia coli, GI1788195, Length=234, Percent_Identity=61.5384615384615, Blast_Score=248, Evalue=9e-67,
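
The homologue lines above use a comma-separated key=value layout with a bare GI token and a trailing comma. A minimal parsing sketch follows; `parse_homologue` is a hypothetical helper, and the choice of which fields to convert to numbers is an assumption based on the lines shown here.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from this record into a dictionary."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            record[key] = value
        elif token.startswith("GI"):
            # The GI identifier appears without a key, e.g. "GI1789453".
            record["GI"] = token[2:]
    # Convert the numeric fields that are present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


example = ("Organism=Escherichia coli, GI1789453, Length=514, "
           "Percent_Identity=51.1673151750973, Blast_Score=493, Evalue=1e-140,")
hit = parse_homologue(example)
print(hit["Organism"], hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```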

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004090
- InterPro:   IPR004089
- InterPro:   IPR003660
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR013655 [H]

Pfam domain/function: PF00672 HAMP; PF00015 MCPsignal; PF08447 PAS_3 [H]

EC number: NA

Molecular weight: Translated: 54959; Mature: 54959
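
A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below takes that approach as an assumption; the record does not state which method or mass table produced its value, so small differences are to be expected.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average mass in daltons of an unmodified polypeptide."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


# Toy input (glycylglycine); paste the 514-residue sequence to compare.
print(round(molecular_weight("GG"), 2))
```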

Theoretical pI: Translated: 9.71; Mature: 9.71

Prosite motif: PS50112 PAS; PS50111 CHEMOTAXIS_TRANSDUC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
2.7 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)
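
The percentages above are residue counts divided by protein length. A minimal sketch, assuming one decimal place of rounding as shown in the record; `cys_met_content` is a hypothetical helper name.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a protein sequence, rounded to 0.1."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys, met = seq.count("C"), seq.count("M")
    return {
        "Cys": round(100.0 * cys / len(seq), 1),
        "Met": round(100.0 * met / len(seq), 1),
        "Cys+Met": round(100.0 * (cys + met) / len(seq), 1),
    }


# Toy 10-residue input; paste the protein sequence from this record to compare.
print(cys_met_content("MCAAAAAAAA"))
```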

Secondary structure:

>Translated Secondary Structure
MRNNQPVTQHEFELPDDATLMSTTDPHGRITYANATFVHVSGFSSDEIVGAPHNVVRHPD
CCCCCCCCCCCCCCCCCCEEEECCCCCCEEEEECEEEEEEECCCCCCCCCCCHHHHCCCC
MPRDAFADMWATLKRGEPWTALVKNRRKNGDHYWVRANAVPVIRGGQTQGYMSVRTKPAR
CCHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCEEEECCCCCH
AETAAADALYRDFREGRAGSRRFHKGLIVRTGLLRACSLLQTMSVRARIHLPIVALTPAI
HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHH
VGAAWAAGVAGAPLAQLAGATLGGAALAAWWLDAQIARPLRTLRRQALDVATGASRRGVN
HHHHHHHCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
MNRVDEIGMSLRTINQLGLMFRWLIDDVSEQVLTVQRAVNEIAQGNHDLSARTEQAATSV
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH
QQTAASMAQMTATVSSNAQTATQANRLSESASHAAERGGQAVREVVSTMGEITESSRRIS
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH
EIIGVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRALAQRSANAAKEIKALIG
HHHHHHHHHHHHHHHEEHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHH
ASVERVESGAQTVDYAGRTMGEIVSQVKRVSDLIAEISASTSEQRAGVTQVDDAVVHLDS
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCHHHHHHHHHHH
ITQQNAALVEQSAAASESLRQQATLLVDAVGVFR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MRNNQPVTQHEFELPDDATLMSTTDPHGRITYANATFVHVSGFSSDEIVGAPHNVVRHPD
CCCCCCCCCCCCCCCCCCEEEECCCCCCEEEEECEEEEEEECCCCCCCCCCCHHHHCCCC
MPRDAFADMWATLKRGEPWTALVKNRRKNGDHYWVRANAVPVIRGGQTQGYMSVRTKPAR
CCHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCEEEECCCCCH
AETAAADALYRDFREGRAGSRRFHKGLIVRTGLLRACSLLQTMSVRARIHLPIVALTPAI
HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHH
VGAAWAAGVAGAPLAQLAGATLGGAALAAWWLDAQIARPLRTLRRQALDVATGASRRGVN
HHHHHHHCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
MNRVDEIGMSLRTINQLGLMFRWLIDDVSEQVLTVQRAVNEIAQGNHDLSARTEQAATSV
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH
QQTAASMAQMTATVSSNAQTATQANRLSESASHAAERGGQAVREVVSTMGEITESSRRIS
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH
EIIGVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRALAQRSANAAKEIKALIG
HHHHHHHHHHHHHHHEEHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHH
ASVERVESGAQTVDYAGRTMGEIVSQVKRVSDLIAEISASTSEQRAGVTQVDDAVVHLDS
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCHHHHHHHHHHH
ITQQNAALVEQSAAASESLRQQATLLVDAVGVFR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
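
The secondary structure blocks interleave residue lines with state lines of equal length. The sketch below separates the two and summarizes the state composition. It assumes the common convention that H, E and C stand for helix, strand and coil, which the record does not state; the function names are illustrative.

```python
from collections import Counter


def split_interleaved(lines: list) -> tuple:
    """Join alternating residue/state lines into two aligned strings."""
    residues = "".join(line.strip() for line in lines[0::2])
    states = "".join(line.strip() for line in lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_percentages(states: str) -> dict:
    """Percentage of positions assigned to each state letter."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in counts.items()}


# Toy input in the same layout as the blocks above.
residues, states = split_interleaved(["MRNN", "CCHH", "QPVT", "HHEE"])
print(residues, states)
print(state_percentages(states))
```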

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9278503; 9190831; 9380671 [H]