| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
The map label for this gene is cpxA
Identifier: 209398760
GI number: 209398760
Start: 5006675
End: 5008048
Strand: Reverse
Name: cpxA
Synonym: ECH74115_5366
Alternate gene names: 209398760
Gene position: 5008048-5006675 (Counterclockwise)
Preceding gene: 209397017
Following gene: 209398469
Centisome position: 89.88
GC content: 54.59
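The coordinate fields above are related by simple arithmetic. The following is a minimal sketch in Python, not the database's documented method. It assumes that coordinates are 1-based and inclusive, that the centisome position is the gene's 5' coordinate (the larger coordinate for a reverse-strand gene) expressed as a percentage of the genome length, and that the final codon is a stop codon. The function names are hypothetical. Under those assumptions the results agree with the length, residue count, and centisome position given in this record.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_572_075
START = 5_006_675
END = 5_008_048


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)


def coding_residues(length_in_bases: int) -> int:
    """Number of encoded residues, assuming the last codon is a stop codon."""
    return length_in_bases // 3 - 1


length = gene_length(START, END)
print(length)                          # 1374
print(coding_residues(length))         # 457
print(centisome(END, GENOME_LENGTH))   # 89.88
```

Using the smaller coordinate (5006675) instead gives 89.85, so the reported 89.88 is consistent with the position being measured at the larger coordinate.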
Gene sequence:
>1374_bases ATGATAGGCAGCTTAACCGCGCGCATCTTCGCCATCTTCTGGCTGACGCTGGCGCTGGTGTTGATGTTGGTTTTGATGTT ACCCAAGCTCGATTCACGCCAGATGACCGAGCTTCTGGATAGCGAACAGCGTCAGGGTCTGATGATTGAGCAGCATGTTG AAGCGGAGCTGGCGAACGATCCGCCCAACGATTTAATGTGGTGGCGGCGTCTGTTCCGGGCGATTGATAAGTGGGCACCG CCAGGACAGCGTTTGTTATTGGTGACCACCGAAGGCCGCGTGATCGGCGCTGAACGCAGCGAAATGCAGATCATTCGTAA CTTTATTGGTCAGGCCGATAACGCCGATCATCCGCAGAAGAAAAAGTATGGCCGCGTGGAACTGGTCGGTCCGTTCTCCG TGCGTGATGGCGAAGATAATTACCAACTTTATCTGATTCGTCCGGCCAGCAGTTCTCAATCCGATTTCATTAACTTACTG TTTGACCGCCCGCTATTACTGCTGATTGTCACCATGTTGGTCAGTACGCCGCTGCTGTTGTGGTTGGCCTGGAGTCTGGC AAAACCGGCGCGTAAGCTGAAAAACGCTGCCGATGAAGTTGCCCAGGGAAACTTACGCCAGCACCCGGAACTGGAAGCGG GGCCACAGGAATTCCTTGCCGCAGGTGCCAGTTTTAACCAGATGGTCACCGCGCTGGAGCGCATGATGACCTCTCAGCAG CGTCTGCTTTCTGATATCTCTCACGAGCTGCGCACCCCGCTGACGCGTCTGCAACTGGGTACGGCGTTACTGCGCCGTCG TAGTGGTGAAAGCAAGGAACTGGAGCGTATTGAAACCGAAGCGCAACGTCTGGACAGCATGATTAACGACCTGTTGGTGA TGTCACGTAATCAGCAAAAAAACGCGCTGGTTAGCGAGACCATCAAAGCCAATCAGTTGTGGAGTGAAGTGCTGGATAAC GCGGCGTTCGAAGCCGAGCAAATGGGCAAGTCGTTGACAGTTAACTTCCCGCCTGGGCCGTGGCCGCTGTACGGCAACCC GAACGCCCTGGAGAGTGCGCTGGAAAACATTGTTCGTAATGCCCTGCGTTATTCCCATACGAAGATTGAAGTGGGCTTTG CGGTAGATAAAGACGGTATCACCATTACGGTGGACGACGATGGTCCTGGCGTTAGCCCGGAAGATCGCGAACAGATTTTC CGTCCGTTCTATCGGACCGATGAAGCGCGCGATCGTGAATCTGGCGGTACAGGTTTGGGACTGGCGATTGTTGAAACCGC CATTCAGCAGCATCGTGGCTGGGTGAAAGCAGAAGACAGCCCGCTGGGCGGTTTACGGCTGGTGATTTGGTTGCCGCTGT ATAAGCGGAGTTAA
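The GC content field can be checked against the gene sequence with a short calculation. This is a sketch under the assumption that GC content is the percentage of G and C bases over the whole sequence; the record does not state how its figure was derived, and `gc_content` is a hypothetical helper name. Whitespace is stripped first because the sequence above is broken into 80-base blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the block separators in the record) is ignored
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100 * gc_count / len(bases)


# Example call on the first 20 bases of the gene sequence above.
print(gc_content("ATGATAGGCA GCTTAACCGC"))
```

Passing the full 1374-base sequence to the same function gives a value that can be compared with the 54.59 reported in the record.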
Upstream 100 bases:
>100_bases TGCACATTTCCAACCTGCGTCGTAAACTGCCGGATCGTAAAGATGGTCACCCGTGGTTTAAAACCTTGCGTGGTCGCGGC TACCTGATGGTTTCTGCTTC
Downstream 100 bases:
>100_bases ACTCCGCATTTGTAGGCAGGATAAGGCGTTTACGCCGCATCCGGCATTTGAGCAGGATGCCTGATGCGACGCTGATAGCG TCTTATCAGGCCTACACTCC
Product: two-component sensor protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 457; Mature: 457
Protein sequence:
>457_residues MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
Sequences:
>Translated_457_residues MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
>Mature_457_residues MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
Specific function: This protein is involved in several diverse cellular processes, such as the functioning of acetohydroxyacid synthetase I, in the biosynthesis of isoleucine and valine, the TraJ protein activation activity for tra gene expression in F plasmid, and the synt
COG id: COG0642
COG function: function code T; Signal transduction histidine kinase
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 histidine kinase domain
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Escherichia coli | GI1790346 | 457 | 100 | 926 | 0.0 |
| Escherichia coli | GI1787894 | 300 | 32.6666666666667 | 143 | 2e-35 |
| Escherichia coli | GI1789808 | 274 | 30.2919708029197 | 130 | 2e-31 |
| Escherichia coli | GI1786783 | 332 | 29.8192771084337 | 123 | 3e-29 |
| Escherichia coli | GI1788393 | 278 | 28.0575539568345 | 113 | 2e-26 |
| Escherichia coli | GI1786600 | 231 | 30.3030303030303 | 109 | 4e-25 |
| Escherichia coli | GI145693157 | 251 | 26.6932270916335 | 86 | 5e-18 |
| Escherichia coli | GI1788279 | 276 | 23.9130434782609 | 82 | 6e-17 |
| Escherichia coli | GI87082128 | 297 | 23.9057239057239 | 82 | 9e-17 |
| Escherichia coli | GI1786912 | 274 | 30.6569343065693 | 81 | 1e-16 |
| Escherichia coli | GI1789149 | 218 | 29.3577981651376 | 80 | 2e-16 |
| Escherichia coli | GI1790436 | 219 | 25.5707762557078 | 77 | 2e-15 |
| Escherichia coli | GI1787374 | 300 | 27.3333333333333 | 76 | 4e-15 |
| Escherichia coli | GI1789403 | 237 | 29.1139240506329 | 75 | 6e-15 |
| Escherichia coli | GI1790551 | 279 | 25.8064516129032 | 74 | 2e-14 |
| Escherichia coli | GI1790861 | 287 | 24.390243902439 | 72 | 8e-14 |
| Escherichia coli | GI1788713 | 238 | 26.0504201680672 | 69 | 5e-13 |
| Escherichia coli | GI48994928 | 206 | 28.1553398058252 | 69 | 5e-13 |
| Escherichia coli | GI87081816 | 230 | 28.695652173913 | 69 | 7e-13 |
| Escherichia coli | GI1790300 | 250 | 26.8 | 65 | 8e-12 |
| Escherichia coli | GI1788549 | 234 | 26.9230769230769 | 64 | 2e-11 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): CPXA_ECO57 (P0AE84)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: E91233
- RefSeq: NP_290541.1
- RefSeq: NP_312864.1
- ProteinModelPortal: P0AE84
- SMR: P0AE84
- EnsemblBacteria: EBESCT00000028424
- EnsemblBacteria: EBESCT00000058657
- GeneID: 914983
- GeneID: 960234
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z5456
- KEGG: ecs:ECs4837
- GeneTree: EBGT00050000008662
- HOGENOM: HBG334875
- OMA: LERMMTT
- ProtClustDB: PRK09470
- BioCyc: ECOL83334:ECS4837-MONOMER
- InterPro: IPR003594
- InterPro: IPR003660
- InterPro: IPR004358
- InterPro: IPR003661
- InterPro: IPR005467
- InterPro: IPR009082
- Gene3D: G3DSA:3.30.565.10
- PRINTS: PR00344
- SMART: SM00304
- SMART: SM00387
- SMART: SM00388
Pfam domain/function: PF00672 HAMP; PF02518 HATPase_c; PF00512 HisKA; SSF55874 ATP_bd_ATPase; SSF47384 His_kin_homodim
EC number: 2.7.13.3
Molecular weight: Translated: 51625; Mature: 51625
Theoretical pI: Translated: 5.57; Mature: 5.57
Prosite motif: PS50885 HAMP; PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
Cys/Met content:
0.0 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
3.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELAND
CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCC
PPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQK
CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCEEECCCHHHHHHHHHHHCCCCCCCCCHH
KKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLL
HHCCCEEEECCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH
WLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHH
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQK
HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
NALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRN
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH
ALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLG
HHHHCCCEEEEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCHH
LAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
HHHHHHHHHHHCCCEECCCCCCCCEEEEEEECHHCCH
>Mature Secondary Structure
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELAND
CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCC
PPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQK
CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCEEECCCHHHHHHHHHHHCCCCCCCCCHH
KKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLL
HHCCCEEEECCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH
WLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHH
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQK
HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
NALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRN
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH
ALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLG
HHHHCCCEEEEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCHH
LAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
HHHHHHHHHHHCCCEECCCCCCCCEEEEEEECHHCCH
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796