Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862

The map label for this gene is aer

Identifier: 218696776

GI number: 218696776

Start: 3566278

End: 3567798

Strand: Reverse

Name: aer

Synonym: EC55989_3486

Alternate gene names: 218696776

Gene position: 3567798-3566278 (Counterclockwise)

Preceding gene: 218696778

Following gene: 218696774

Centisome position: 69.21
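
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the centisome position is the gene's 5' coordinate (the End value for this reverse-strand gene) expressed as a percentage of the genome length. That assumption reproduces the 69.21 reported here, but the database's actual formula is not stated in this record.

```python
# Values copied from this record.
GENOME_LENGTH = 5_154_862
START, END = 3_566_278, 3_567_798


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))        # 1521, matching the ">1521_bases" header
print(centisome(END, GENOME_LENGTH))  # 69.21
```

Under the same assumption, the Start coordinate would give 69.18 rather than 69.21, which suggests the reported value is anchored at the 5' end of the gene on the reverse strand.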

GC content: 55.1
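
A GC content figure like the one above is conventionally the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and rounding to one decimal place is an assumption chosen to match how the value is displayed in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 1)


# Toy input: 2 of 4 bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

Passing the wrapped lines of the gene sequence listed below (without the `>1521_bases` header) as one string would give the figure to compare against the reported value.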

Gene sequence:

>1521_bases
ATGTCTTCTCATCCGTATGTCACCCAGCAAAATACCCCGCTGGCGGACGATACCACTCTGATGTCCACTACCGATCTGCA
AAGCTATATCACTCATGCTAATGACACTTTTGTGCAGGTGAGCGGCTTTACCTTGCAAGAGTTACAAGGGCAGCCGCACA
ACATGGTGCGTCACCCGGATATGCCAAAAGCGGCGTTTGCGGATATGTGGTTCACCCTGAAAAAAGGGGAGCCCTGGAGC
GGCATCGTGAAAAATCGCCGCAAAAATGGTGACCATTATTGGGTGCGGGCCAATGCGGTACCGATGGTGCGCGAGGGAAA
AATCAGTGGCTATATGTCGATTCGTACCCGGGCGACGGATGAAGAGATTGCGGCGGTGGAGCCGCTGTACAAAGCGCTGA
ACGCCGGACGTACCAGTAAGCGTATTCATAAAGGCCTGGTGGTGCGTAAAGGCTGGCTGGGTAAACTGCCTTCATTACCG
CTTCGCTGGCGGGCGCGTGGAGTGATGACCCTGATGTTTATCTTGCTGGCGGCCATGCTTTGGTTTGTTGCTGCCCCGGT
GGTGACGTATTTCCTCTGTGTGTTAGTGGTATTGTTGGCAAGCGCTTGTTTTGAATGGCAGATTGTGCGCCCGATAGAAA
ATGTCGCCCGTCAGGCACTGAAGGTGGCGACCGGAGAGCGTAATAGTGTTGAGCACCTGAATCGCAGCGATGAGCTGGGG
CTGACATTACGCGCGGTAGGGCAGCTTGGCCTGATGTGCCGTTGGCTAATTAACGATGTCTCAAGCCAGGTGTCCAGTGT
CAGAAATGGCAGTGAGACGCTGGCGAAAGGCACCGATGAACTGAACGAACATACCCAGCAGACAGTTGATAACGTTCAGC
AAACGGTGGCGACCATGAACCAAATGGCGGCGTCGGTGAAACAGAACTCTGCCACGGCGTCGGCTGCCGATAAACTTTCT
ATCACCGCCAGTAATGCGGCAGTGCAGGGTGGGGAGGCGATGACCACGGTGATCAAGACAATGGACGATATCGCCGACAG
TACCCAGCGCATTGGCACCATTACTTCGCTGATTAACGATATTGCGTTTCAGACCAATATTCTGGCCCTGAATGCGGCGG
TGGAAGCGGCGCGTGCCGGCGAACAGGGCAAAGGTTTTGCAGTGGTGGCAGGGGAAGTGCGTCATTTAGCCAGCCGCAGC
GCTAATGCTGCCAACGATATTCGCAAGCTGATTGATGCCAGTGCTGATAAGGTGCAATCCGGTTCGCAGCAGGTACACGC
CGCCGGACGGACGATGGAAGATATTGTGGCACAGGTGAAAAACGTCACCCAGTTGATCGCCCAGATTAGCCATTCAACGC
TGGAACAGGCCGATGGGCTTTCCAGCCTGACCCGTGCAGTGGATGAGCTTAACCTGATCACCCAGAAAAATGCCGAGCTG
GTGGAAGAGAGTGCGCAGGTGTCGGCGATGGTGAAACACCGCGCCAGCCGACTGGAAGACGCGGTGACGGTGCTGCATTA
A

Upstream 100 bases:

>100_bases
GCGATCTAAATCAAATTAATCGGTTAAAGATAACCGCAGCGGGGCCGACATAAACTCTGACAAGAAGTTAACAACCATAT
AACCTGCACAGGACGCGAAC

Downstream 100 bases:

>100_bases
TCGGTATGCCGGATCCGGCGGTTGGAGCACAATGCCTGATGCGATGCTGGCGCATCTTATCAGGCCTGCGGGGTGTGTAG
CGGGTGTAGGCCGGATAAGG

Product: fused signal transducer for aerotaxis sensory component ; methyl accepting chemotaxis component

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 506; Mature: 505
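
The two residue counts follow from the gene length. The sketch below assumes (this is not stated in the record) that the translated count excludes the terminal stop codon and that the mature form differs only by removal of the initiator methionine, which is consistent with the sequences listed here. The function name is hypothetical.

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence ends with one stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # drop the stop codon
    mature = translated - 1   # drop the initiator Met
    return translated, mature


print(residue_counts(1521))  # (506, 505)
```

Here 1521 bases make 507 codons; the last codon of the gene sequence is TAA, leaving 506 translated residues and 505 mature residues.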

Protein sequence:

>506_residues
MSSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGFTLQELQGQPHNMVRHPDMPKAAFADMWFTLKKGEPWS
GIVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATDEEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLP
LRWRARGVMTLMFILLAAMLWFVAAPVVTYFLCVLVVLLASACFEWQIVRPIENVARQALKVATGERNSVEHLNRSDELG
LTLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMNQMAASVKQNSATASAADKLS
ITASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLINDIAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRS
ANAANDIRKLIDASADKVQSGSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAEL
VEESAQVSAMVKHRASRLEDAVTVLH

Sequences:

>Translated_506_residues
MSSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGFTLQELQGQPHNMVRHPDMPKAAFADMWFTLKKGEPWS
GIVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATDEEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLP
LRWRARGVMTLMFILLAAMLWFVAAPVVTYFLCVLVVLLASACFEWQIVRPIENVARQALKVATGERNSVEHLNRSDELG
LTLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMNQMAASVKQNSATASAADKLS
ITASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLINDIAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRS
ANAANDIRKLIDASADKVQSGSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAEL
VEESAQVSAMVKHRASRLEDAVTVLH
>Mature_505_residues
SSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGFTLQELQGQPHNMVRHPDMPKAAFADMWFTLKKGEPWSG
IVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATDEEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLPL
RWRARGVMTLMFILLAAMLWFVAAPVVTYFLCVLVVLLASACFEWQIVRPIENVARQALKVATGERNSVEHLNRSDELGL
TLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMNQMAASVKQNSATASAADKLSI
TASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLINDIAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRSA
NAANDIRKLIDASADKVQSGSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAELV
EESAQVSAMVKHRASRLEDAVTVLH

Specific function: Signal transducer for aerotaxis. The aerotactic response is the accumulation of cells around air bubbles. The nature of the sensory stimulus detected by this protein is the proton motive force or cellular redox state. It uses a FAD prosthetic group as a r

COG id: COG0840

COG function: function code NT; Methyl-accepting chemotaxis protein

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

Organism=Escherichia coli, GI1789453, Length=506, Percent_Identity=99.2094861660079, Blast_Score=1033, Evalue=0.0,
Organism=Escherichia coli, GI1787690, Length=323, Percent_Identity=39.938080495356, Blast_Score=219, Evalue=3e-58,
Organism=Escherichia coli, GI2367378, Length=322, Percent_Identity=41.304347826087, Blast_Score=203, Evalue=3e-53,
Organism=Escherichia coli, GI1788195, Length=267, Percent_Identity=46.0674157303371, Blast_Score=202, Evalue=5e-53,
Organism=Escherichia coli, GI1788194, Length=328, Percent_Identity=38.109756097561, Blast_Score=200, Evalue=2e-52,
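
For downstream use, the homologue lines above can be read into structured records. The sketch below is one possible parser, not a tool provided by the database: the function name and the output dictionary layout are hypothetical, and it assumes each line is a comma-separated list of `key=value` fields plus a bare `GI<number>` token, with an optional trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            record[key.strip()] = value.strip()
        elif part.startswith("GI"):
            # The GI identifier appears without an "=" sign.
            record["GI"] = part[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787690, Length=323, "
    "Percent_Identity=39.938080495356, Blast_Score=219, Evalue=3e-58,"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```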

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004090
- InterPro:   IPR004089
- InterPro:   IPR003660
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR013655 [H]

Pfam domain/function: PF00672 HAMP; PF00015 MCPsignal; PF08447 PAS_3 [H]

EC number: NA

Molecular weight: Translated: 55132; Mature: 55000

Theoretical pI: Translated: 8.48; Mature: 8.48

Prosite motif: PS50111 CHEMOTAXIS_TRANSDUC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
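
The percentages above are residue counts divided by protein length. A minimal sketch follows, with a hypothetical function name and with rounding to one decimal place assumed from the displayed values. The demonstration uses a synthetic sequence that has the same composition as the translated protein in this record (3 Cys and 17 Met in 506 residues), not the real sequence.

```python
def cys_met_percent(seq: str) -> dict:
    """Percentage of Cys (C), Met (M), and both combined in a protein."""
    residues = "".join(seq.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n = len(residues)
    cys = residues.count("C")
    met = residues.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Synthetic stand-in: 3 Cys + 17 Met + 486 Ala = 506 residues.
translated_like = "C" * 3 + "M" * 17 + "A" * 486
print(cys_met_percent(translated_like))
# {'Cys': 0.6, 'Met': 3.4, 'Cys+Met': 4.0}
```

Removing one Met to model the mature protein (16 Met in 505 residues) gives 0.6, 3.2 and 3.8, in line with the mature values listed above.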

Secondary structure:

>Translated Secondary Structure
MSSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGFTLQELQGQPHNMVRHPD
CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCHHHHCCCCCCCCCCCC
MPKAAFADMWFTLKKGEPWSGIVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATD
CCHHHHHHHHEEEECCCCCHHHHHHHCCCCCEEEEEECCCCCEECCCCCCEEEEEECCCC
EEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLPLRWRARGVMTLMFILLAAML
HHHHHHHHHHHHHHCCCHHHHHHHCHHEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
WFVAAPVVTYFLCVLVVLLASACFEWQIVRPIENVARQALKVATGERNSVEHLNRSDELG
HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCCCCCC
LTLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH
QMAASVKQNSATASAADKLSITASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLIND
HHHHHHHHCCCHHHHHHCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRSANAANDIRKLIDASADKVQS
HHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHCCCCHHHHHHHHHHCCHHHHHC
GSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAEL
CHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHH
VEESAQVSAMVKHRASRLEDAVTVLH
HHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
SSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGFTLQELQGQPHNMVRHPD
CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCHHHHCCCCCCCCCCCC
MPKAAFADMWFTLKKGEPWSGIVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATD
CCHHHHHHHHEEEECCCCCHHHHHHHCCCCCEEEEEECCCCCEECCCCCCEEEEEECCCC
EEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLPLRWRARGVMTLMFILLAAML
HHHHHHHHHHHHHHCCCHHHHHHHCHHEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
WFVAAPVVTYFLCVLVVLLASACFEWQIVRPIENVARQALKVATGERNSVEHLNRSDELG
HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCCCCCC
LTLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH
QMAASVKQNSATASAADKLSITASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLIND
HHHHHHHHCCCHHHHHHCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRSANAANDIRKLIDASADKVQS
HHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHCCCCHHHHHHHHHHCCHHHHHC
GSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAEL
CHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHH
VEESAQVSAMVKHRASRLEDAVTVLH
HHHHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9278503; 9190831; 9380671 [H]