Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075


The map label for this gene is cpxA

Identifier: 209398760

GI number: 209398760

Start: 5006675

End: 5008048

Strand: Reverse

Name: cpxA

Synonym: ECH74115_5366

Alternate gene names: 209398760

Gene position: 5008048-5006675 (Counterclockwise)

Preceding gene: 209397017

Following gene: 209398469

Centisome position: 89.88
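
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the first base of the gene (the larger coordinate on the reverse strand) expressed as a percentage of the genome length. The function names are illustrative and are not part of the database.

```python
GENOME_LENGTH = 5_572_075  # "Length" field of NC_011353


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


# Reverse strand: the gene is assumed to begin at the larger coordinate.
first_base = 5008048
print(gene_length(5006675, 5008048), centisome(first_base))
```

Under these assumptions the sketch gives 1374 bases and a centisome position of 89.88, in line with the values listed in this record.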

GC content (%): 54.59
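
The GC content is presumably the percentage of G and C bases in the gene sequence. For a 1374-base gene, a value of 54.59 corresponds to about 750 G or C bases. A minimal sketch of that calculation, with `gc_content` as a hypothetical helper name:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (2 decimals)."""
    seq = "".join(seq.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)
```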

Gene sequence:

>1374_bases
ATGATAGGCAGCTTAACCGCGCGCATCTTCGCCATCTTCTGGCTGACGCTGGCGCTGGTGTTGATGTTGGTTTTGATGTT
ACCCAAGCTCGATTCACGCCAGATGACCGAGCTTCTGGATAGCGAACAGCGTCAGGGTCTGATGATTGAGCAGCATGTTG
AAGCGGAGCTGGCGAACGATCCGCCCAACGATTTAATGTGGTGGCGGCGTCTGTTCCGGGCGATTGATAAGTGGGCACCG
CCAGGACAGCGTTTGTTATTGGTGACCACCGAAGGCCGCGTGATCGGCGCTGAACGCAGCGAAATGCAGATCATTCGTAA
CTTTATTGGTCAGGCCGATAACGCCGATCATCCGCAGAAGAAAAAGTATGGCCGCGTGGAACTGGTCGGTCCGTTCTCCG
TGCGTGATGGCGAAGATAATTACCAACTTTATCTGATTCGTCCGGCCAGCAGTTCTCAATCCGATTTCATTAACTTACTG
TTTGACCGCCCGCTATTACTGCTGATTGTCACCATGTTGGTCAGTACGCCGCTGCTGTTGTGGTTGGCCTGGAGTCTGGC
AAAACCGGCGCGTAAGCTGAAAAACGCTGCCGATGAAGTTGCCCAGGGAAACTTACGCCAGCACCCGGAACTGGAAGCGG
GGCCACAGGAATTCCTTGCCGCAGGTGCCAGTTTTAACCAGATGGTCACCGCGCTGGAGCGCATGATGACCTCTCAGCAG
CGTCTGCTTTCTGATATCTCTCACGAGCTGCGCACCCCGCTGACGCGTCTGCAACTGGGTACGGCGTTACTGCGCCGTCG
TAGTGGTGAAAGCAAGGAACTGGAGCGTATTGAAACCGAAGCGCAACGTCTGGACAGCATGATTAACGACCTGTTGGTGA
TGTCACGTAATCAGCAAAAAAACGCGCTGGTTAGCGAGACCATCAAAGCCAATCAGTTGTGGAGTGAAGTGCTGGATAAC
GCGGCGTTCGAAGCCGAGCAAATGGGCAAGTCGTTGACAGTTAACTTCCCGCCTGGGCCGTGGCCGCTGTACGGCAACCC
GAACGCCCTGGAGAGTGCGCTGGAAAACATTGTTCGTAATGCCCTGCGTTATTCCCATACGAAGATTGAAGTGGGCTTTG
CGGTAGATAAAGACGGTATCACCATTACGGTGGACGACGATGGTCCTGGCGTTAGCCCGGAAGATCGCGAACAGATTTTC
CGTCCGTTCTATCGGACCGATGAAGCGCGCGATCGTGAATCTGGCGGTACAGGTTTGGGACTGGCGATTGTTGAAACCGC
CATTCAGCAGCATCGTGGCTGGGTGAAAGCAGAAGACAGCCCGCTGGGCGGTTTACGGCTGGTGATTTGGTTGCCGCTGT
ATAAGCGGAGTTAA
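
The gene sequence above begins with ATG, ends with the stop codon TAA, and has 1374 bases, i.e. 458 codons, or 457 residues once the stop codon is excluded. A minimal sketch of such a consistency check follows. It assumes an ATG start codon (bacterial genes may also use alternatives such as GTG or TTG, which this sketch rejects), and `check_cds` is a hypothetical name.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(seq: str) -> int:
    """Return the number of encoded residues, or raise ValueError.

    Assumes the sequence is given in coding orientation and starts with ATG.
    """
    seq = "".join(seq.split()).upper()  # remove whitespace and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    if not seq.startswith("ATG"):
        raise ValueError("sequence does not start with ATG")
    if seq[-3:] not in STOP_CODONS:
        raise ValueError("sequence does not end with a stop codon")
    return len(seq) // 3 - 1  # exclude the stop codon
```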

Upstream 100 bases:

>100_bases
TGCACATTTCCAACCTGCGTCGTAAACTGCCGGATCGTAAAGATGGTCACCCGTGGTTTAAAACCTTGCGTGGTCGCGGC
TACCTGATGGTTTCTGCTTC

Downstream 100 bases:

>100_bases
ACTCCGCATTTGTAGGCAGGATAAGGCGTTTACGCCGCATCCGGCATTTGAGCAGGATGCCTGATGCGACGCTGATAGCG
TCTTATCAGGCCTACACTCC
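
Because the gene lies on the reverse strand, the gene and flanking sequences are presumably shown in coding orientation, that is, as the reverse complement of the reference strand. This is an assumption; the record does not state it. A minimal sketch of the conversion, with `reverse_complement` as an illustrative name:

```python
# Translation table for the four standard bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the sequence."""
    return seq.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original sequence, which is a convenient sanity check when moving between strands.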

Product: two-component sensor protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 457; Mature: 457

Protein sequence:

>457_residues
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP
PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL
FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN
AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF
RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS

Sequences:

>Translated_457_residues
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP
PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL
FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN
AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF
RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
>Mature_457_residues
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP
PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL
FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN
AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF
RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS

Specific function: This protein is involved in several diverse cellular processes, such as the functioning of acetohydroxyacid synthetase I, in the biosynthesis of isoleucine and valine, the TraJ protein activation activity for tra gene expression in F plasmid, and the synt

COG id: COG0642

COG function: function code T; Signal transduction histidine kinase

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 histidine kinase domain

Homologues:

Organism=Escherichia coli, GI1790346, Length=457, Percent_Identity=100.00, Blast_Score=926, Evalue=0.0
Organism=Escherichia coli, GI1787894, Length=300, Percent_Identity=32.67, Blast_Score=143, Evalue=2e-35
Organism=Escherichia coli, GI1789808, Length=274, Percent_Identity=30.29, Blast_Score=130, Evalue=2e-31
Organism=Escherichia coli, GI1786783, Length=332, Percent_Identity=29.82, Blast_Score=123, Evalue=3e-29
Organism=Escherichia coli, GI1788393, Length=278, Percent_Identity=28.06, Blast_Score=113, Evalue=2e-26
Organism=Escherichia coli, GI1786600, Length=231, Percent_Identity=30.30, Blast_Score=109, Evalue=4e-25
Organism=Escherichia coli, GI145693157, Length=251, Percent_Identity=26.69, Blast_Score=86, Evalue=5e-18
Organism=Escherichia coli, GI1788279, Length=276, Percent_Identity=23.91, Blast_Score=82, Evalue=6e-17
Organism=Escherichia coli, GI87082128, Length=297, Percent_Identity=23.91, Blast_Score=82, Evalue=9e-17
Organism=Escherichia coli, GI1786912, Length=274, Percent_Identity=30.66, Blast_Score=81, Evalue=1e-16
Organism=Escherichia coli, GI1789149, Length=218, Percent_Identity=29.36, Blast_Score=80, Evalue=2e-16
Organism=Escherichia coli, GI1790436, Length=219, Percent_Identity=25.57, Blast_Score=77, Evalue=2e-15
Organism=Escherichia coli, GI1787374, Length=300, Percent_Identity=27.33, Blast_Score=76, Evalue=4e-15
Organism=Escherichia coli, GI1789403, Length=237, Percent_Identity=29.11, Blast_Score=75, Evalue=6e-15
Organism=Escherichia coli, GI1790551, Length=279, Percent_Identity=25.81, Blast_Score=74, Evalue=2e-14
Organism=Escherichia coli, GI1790861, Length=287, Percent_Identity=24.39, Blast_Score=72, Evalue=8e-14
Organism=Escherichia coli, GI1788713, Length=238, Percent_Identity=26.05, Blast_Score=69, Evalue=5e-13
Organism=Escherichia coli, GI48994928, Length=206, Percent_Identity=28.16, Blast_Score=69, Evalue=5e-13
Organism=Escherichia coli, GI87081816, Length=230, Percent_Identity=28.70, Blast_Score=69, Evalue=7e-13
Organism=Escherichia coli, GI1790300, Length=250, Percent_Identity=26.80, Blast_Score=65, Evalue=8e-12
Organism=Escherichia coli, GI1788549, Length=234, Percent_Identity=26.92, Blast_Score=64, Evalue=2e-11
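
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. The following is a minimal sketch of a parser for that layout; `parse_homologue` is a hypothetical helper, and the numeric types chosen for each field are assumptions.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of typed fields."""
    fields = [f.strip() for f in line.strip().rstrip(",").split(",")]
    record = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as a string
    # Convert the numeric fields; the Organism and GI values stay as text.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": float, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


example = ("Organism=Escherichia coli, GI1787894, Length=300, "
           "Percent_Identity=32.6666666666667, Blast_Score=143, Evalue=2e-35,")
print(parse_homologue(example))
```

Parsed records can then be filtered, for example by keeping only hits whose `Evalue` falls below a chosen threshold.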

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): CPXA_ECO57 (P0AE84)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   E91233
- RefSeq:   NP_290541.1
- RefSeq:   NP_312864.1
- ProteinModelPortal:   P0AE84
- SMR:   P0AE84
- EnsemblBacteria:   EBESCT00000028424
- EnsemblBacteria:   EBESCT00000058657
- GeneID:   914983
- GeneID:   960234
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z5456
- KEGG:   ecs:ECs4837
- GeneTree:   EBGT00050000008662
- HOGENOM:   HBG334875
- OMA:   LERMMTT
- ProtClustDB:   PRK09470
- BioCyc:   ECOL83334:ECS4837-MONOMER
- InterPro:   IPR003594
- InterPro:   IPR003660
- InterPro:   IPR004358
- InterPro:   IPR003661
- InterPro:   IPR005467
- InterPro:   IPR009082
- Gene3D:   G3DSA:3.30.565.10
- PRINTS:   PR00344
- SMART:   SM00304
- SMART:   SM00387
- SMART:   SM00388

Pfam domain/function: PF00672 HAMP; PF02518 HATPase_c; PF00512 HisKA; SSF55874 ATP_bd_ATPase; SSF47384 His_kin_homodim

EC number: 2.7.13.3

Molecular weight (Da): Translated: 51625; Mature: 51625

Theoretical pI: Translated: 5.57; Mature: 5.57

Prosite motif: PS50885 HAMP; PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:


Cys/Met content:

0.0 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
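
These percentages are presumably residue counts divided by the protein length. For 457 residues, 3.1 % Met corresponds to roughly 14 methionines, and 0.0 % Cys to none. A minimal sketch of the calculation, with `residue_percent` as an illustrative name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein (1 decimal)."""
    protein = "".join(protein.split()).upper()  # drop line breaks
    if not protein:
        return 0.0
    # set() guards against counting a residue twice if it is listed twice.
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)
```

For example, `residue_percent(seq, "CM")` would give the combined Cys+Met figure for a sequence `seq`.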

Secondary structure:

>Translated Secondary Structure
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELAND
CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCC
PPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQK
CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCEEECCCHHHHHHHHHHHCCCCCCCCCHH
KKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLL
HHCCCEEEECCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH
WLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHH
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQK
HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
NALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRN
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH
ALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLG
HHHHCCCEEEEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCHH
LAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
HHHHHHHHHHHCCCEECCCCCCCCEEEEEEECHHCCH
>Mature Secondary Structure
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELAND
CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCC
PPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQK
CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCEEECCCHHHHHHHHHHHCCCCCCCCCHH
KKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLL
HHCCCEEEECCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH
WLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHH
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQK
HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
NALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRN
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH
ALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLG
HHHHCCCEEEEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCHH
LAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
HHHHHHHHHHHCCCEECCCCCCCCEEEEEEECHHCCH
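
The secondary-structure strings appear to use the common three-state notation, with H for helix, E for strand and C for coil. The record does not define the letters, so this reading is an assumption. Under it, the overall composition of a prediction can be summarised as in the following sketch, where `ss_composition` is a hypothetical helper:

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a prediction string (1 decimal)."""
    states = "".join(states.split()).upper()  # join the wrapped lines
    total = len(states)
    counts = Counter(states)
    if total == 0:
        return {"H": 0.0, "E": 0.0, "C": 0.0}
    return {s: round(100.0 * counts[s] / total, 1) for s in "HEC"}
```

Only the state lines should be passed in, not the interleaved amino-acid lines.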

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796