Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is yojN [H]

Identifier: 157161699

GI number: 157161699

Start: 2354470

End: 2357142

Strand: Direct

Name: yojN [H]

Synonym: EcHS_A2356

Alternate gene names: 157161699

Gene position: 2354470-2357142 (Clockwise)

Preceding gene: 157161690

Following gene: 157161700

Centisome position: 50.7

GC content: 49.57

Gene sequence:

>2673_bases
ATGCGTCAGAAAGAGACAACGGCCACGACCCGCTTTTCACTCCTACCGGGGAGCATTACCCGCTTCTTTTTACTGTTGAT
CATTGTGTTACTGGTGACGATGGGGGTAATGGTACAAAGCGCCGTTAACGCCTGGCTGAAAGATAAAAGTTACCAGATTG
TCGACATTACCCACGCAATCCAAAAGCGCGTCGATACCTGGCGTTACGTGACCTGGCAGATCTACGACAACATTGCCGCG
ACGACCTCCCCCTCCTCCGGCGAAGGTTTACAAGAGACGCGCCTGAAACAGGATGTCTACTATCTGGAGAAACCACGCCG
CAAAACGGAAGCGTTAATCTTTGGCTCTCACGACAACTCAACGCTTGAGATGACTCAGCGGATGTCCACTTATCTGGATA
CATTGTGGGGTGCAGAAAATGTACCGTGGTCGATGTATTACCTGAATGGTCAGGATAACAGTCTGGTACTGATCTCAACC
CTGCCCCTCAAAGATCTCACCTCCGGATTTAAAGAATCGACCGTCAGTGACATTGTTGATTCACGTCGTGCAGAAATGTT
GCAACAGGCCAACGCCCTCGATGAACGCGAAAGTTTTTCTAACATGCGCCGCCTGGCCTGGCAGAACGGTCATTACTTTA
CCTTGCGTACCACGTTCAACCAGCCGGGACATCTGGCAACGGTCGTGGCTTTTGATCTGCCGATTAATGATTTGATCCCA
CCGGGTATGCCGCTGGACAGTTTCCGCCTTGAGCCAGACGCGACGGCAACGGGAGACAATGATAATGAGAAAGAAGGGAC
GGATAGCGTCAGTATCCACTTTAACAGTACGAAGATTGAAATCTCCTCGGCACTCAACTCTACCGATATGCGCCTGGTCT
GGCAGGTTCCTTATGGCACCTTATTGCTGGATACGTTGCAAAACATTCTGCTGCCACTGCTGCTGAACATCGGTTTGCTG
GCGCTGGCGTTATTTGGCTATACCACATTCCGCCATTTCTCCAGTCGCAGTACAGAAAGTCTACCCAACACGGCGGTCAA
TAACGAATTGCGCATTTTACGGGCAATCAATGAAGAGATAGTCTCACTGCTGCCGCTCGGCCTGCTGGTTCACGATCAGG
AATCGAACCGCACTGTCATAAGTAACAAAATTGCCGATCATTTGCTGCCGCATTTGAATCTGCAAAACATCACCACCATG
GCGGAACAGCATCAGGGGATTATTCAGGCGACGATCAATAACGAGCTGTATGAGATCCGCATGTTCCGCAGCCAGGTCGC
GCCGCGCACACAAATTTTCATTATTCGCGATCAGGATCGCGAAGTGCTGGTAAACAAGAAACTCAAGCAGGCGCAGCGTC
TGTATGAGAAAAACCAGCAGGGGCGGATGACCTTTATGAAAAACATTGGCGATGCGCTGAAAGAACCCGCACAGTCCCTG
GCGGAGAGCGCGGCTAAACTCAATGCCCCGGAAAGCAAACAACTGGCGAATCAGGCAGATGTGCTGGTGCGGCTGGTCGA
TGAAATACAGTTAGCGAACATGCTTGCGGACGATAGCTGGAAAAGTGAGACGGTGCTGTTCTCCGTGCAGGATTTAATTG
ATGAAGTTGTGCCTTCAGTGTTGCCTGCCATCAAGCGTAAAGGTCTGCAACTGCTGATTAACAATCATCTGAAAGCACAC
GATATGCGCCGCGGCGATCGCGATGCCTTACGACGTATTTTGCTGCTACTGATGCAATATGCCGTGACCTCAACGCAATT
GGGAAAAATCACCCTTGAGGTTGATCAGGATGAGTCCTCCGAAGACCGCCTGACGTTCCGCATTCTGGACACCGGAGAAG
GCGTAAGCATTCATGAAATGGATAATTTGCACTTCCCGTTTATCAACCAGACCCAAAACGATCGCTATGGCAAGGCGGAC
CCGCTGGCATTCTGGCTGAGCGATCAACTGGCACGTAAACTGGGCGGTCATTTAAACATCAAAACGCGGGATGGGCTTGG
TACACGCTACTCTGTGCATATCAAAATGCTCGCAGCTGACCCGGAAGTTGAAGAGGAAGAAGAGCGTTTACTGGATGATG
TCTGCGTAATGGTGGATGTTACTTCGGCAGAAATTCGGAATATTGTCACTCGCCAGTTAGAAAATTGGGGTGCAACCTGT
ATCACACCCGATGAAAGATTAATTAGTCAAGATTATGATATCTTTTTAACGGATAATCCGTCTAATCTTACTGCCTCTGG
CTTGCTTTTAAGCGATGATGAGTCTGGCGTACGGGAAATTGGGCCTGGTCAATTGTGCGTCAACTTCAATATGAGCAACG
CTATGCAGGAAGCGGTCTTACAATTAATTGAAGTGCAACTGGCGCAGGAAGAGGTGACAGAATCGCCTCTGGGCGGAGAT
GAAAATGCGCAACTCCATGCCAGCGGCTATTATGCGCTCTTTGTAGACACAGTACCGGATGATGTTAAGAGGCTGTATAC
TGAAGCAGCAACCAGTGACTTTGCTGCGTTAGCCCAAACGGCTCATCGTCTTAAAGGCGTATTTGCCATGCTAAATCTGG
TACCCGGCAAGCAGTTATGTGAAACGCTGGAACATCTGATTCGTGAGAAGGATGTTCCAGGAATAGAAAAATACATCAGC
GACATTGACAGTTATGTCAAGAGCTTGCTGTAG

Upstream 100 bases:

>100_bases
AGAATAGAGAATCATCAATCAGGTAAGAGTCTGGAATTTCACACTGTACCCTTTATACTGCCCTATCACTTCGCGAAGTT
TTAACAGGTCATAAACACGA

Downstream 100 bases:

>100_bases
CAAGGTAGCCTATTACATGAACAATATGAACGTAATTATTGCCGATGACCATCCGATAGTCTTGTTCGGTATTCGCAAAT
CACTTGAGCAAATTGAGTGG

Product: phosphotransfer intermediate protein in two-component regulatory system with RcsBC

Products: NA

Alternate protein names: Phosphotransfer intermediate RcsD [H]

Number of amino acids: Translated: 890; Mature: 890

Protein sequence:

>890_residues
MRQKETTATTRFSLLPGSITRFFLLLIIVLLVTMGVMVQSAVNAWLKDKSYQIVDITHAIQKRVDTWRYVTWQIYDNIAA
TTSPSSGEGLQETRLKQDVYYLEKPRRKTEALIFGSHDNSTLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLVLIST
LPLKDLTSGFKESTVSDIVDSRRAEMLQQANALDERESFSNMRRLAWQNGHYFTLRTTFNQPGHLATVVAFDLPINDLIP
PGMPLDSFRLEPDATATGDNDNEKEGTDSVSIHFNSTKIEISSALNSTDMRLVWQVPYGTLLLDTLQNILLPLLLNIGLL
ALALFGYTTFRHFSSRSTESLPNTAVNNELRILRAINEEIVSLLPLGLLVHDQESNRTVISNKIADHLLPHLNLQNITTM
AEQHQGIIQATINNELYEIRMFRSQVAPRTQIFIIRDQDREVLVNKKLKQAQRLYEKNQQGRMTFMKNIGDALKEPAQSL
AESAAKLNAPESKQLANQADVLVRLVDEIQLANMLADDSWKSETVLFSVQDLIDEVVPSVLPAIKRKGLQLLINNHLKAH
DMRRGDRDALRRILLLLMQYAVTSTQLGKITLEVDQDESSEDRLTFRILDTGEGVSIHEMDNLHFPFINQTQNDRYGKAD
PLAFWLSDQLARKLGGHLNIKTRDGLGTRYSVHIKMLAADPEVEEEEERLLDDVCVMVDVTSAEIRNIVTRQLENWGATC
ITPDERLISQDYDIFLTDNPSNLTASGLLLSDDESGVREIGPGQLCVNFNMSNAMQEAVLQLIEVQLAQEEVTESPLGGD
ENAQLHASGYYALFVDTVPDDVKRLYTEAATSDFAALAQTAHRLKGVFAMLNLVPGKQLCETLEHLIREKDVPGIEKYIS
DIDSYVKSLL

Sequences:

>Translated_890_residues
MRQKETTATTRFSLLPGSITRFFLLLIIVLLVTMGVMVQSAVNAWLKDKSYQIVDITHAIQKRVDTWRYVTWQIYDNIAA
TTSPSSGEGLQETRLKQDVYYLEKPRRKTEALIFGSHDNSTLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLVLIST
LPLKDLTSGFKESTVSDIVDSRRAEMLQQANALDERESFSNMRRLAWQNGHYFTLRTTFNQPGHLATVVAFDLPINDLIP
PGMPLDSFRLEPDATATGDNDNEKEGTDSVSIHFNSTKIEISSALNSTDMRLVWQVPYGTLLLDTLQNILLPLLLNIGLL
ALALFGYTTFRHFSSRSTESLPNTAVNNELRILRAINEEIVSLLPLGLLVHDQESNRTVISNKIADHLLPHLNLQNITTM
AEQHQGIIQATINNELYEIRMFRSQVAPRTQIFIIRDQDREVLVNKKLKQAQRLYEKNQQGRMTFMKNIGDALKEPAQSL
AESAAKLNAPESKQLANQADVLVRLVDEIQLANMLADDSWKSETVLFSVQDLIDEVVPSVLPAIKRKGLQLLINNHLKAH
DMRRGDRDALRRILLLLMQYAVTSTQLGKITLEVDQDESSEDRLTFRILDTGEGVSIHEMDNLHFPFINQTQNDRYGKAD
PLAFWLSDQLARKLGGHLNIKTRDGLGTRYSVHIKMLAADPEVEEEEERLLDDVCVMVDVTSAEIRNIVTRQLENWGATC
ITPDERLISQDYDIFLTDNPSNLTASGLLLSDDESGVREIGPGQLCVNFNMSNAMQEAVLQLIEVQLAQEEVTESPLGGD
ENAQLHASGYYALFVDTVPDDVKRLYTEAATSDFAALAQTAHRLKGVFAMLNLVPGKQLCETLEHLIREKDVPGIEKYIS
DIDSYVKSLL
>Mature_890_residues
MRQKETTATTRFSLLPGSITRFFLLLIIVLLVTMGVMVQSAVNAWLKDKSYQIVDITHAIQKRVDTWRYVTWQIYDNIAA
TTSPSSGEGLQETRLKQDVYYLEKPRRKTEALIFGSHDNSTLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLVLIST
LPLKDLTSGFKESTVSDIVDSRRAEMLQQANALDERESFSNMRRLAWQNGHYFTLRTTFNQPGHLATVVAFDLPINDLIP
PGMPLDSFRLEPDATATGDNDNEKEGTDSVSIHFNSTKIEISSALNSTDMRLVWQVPYGTLLLDTLQNILLPLLLNIGLL
ALALFGYTTFRHFSSRSTESLPNTAVNNELRILRAINEEIVSLLPLGLLVHDQESNRTVISNKIADHLLPHLNLQNITTM
AEQHQGIIQATINNELYEIRMFRSQVAPRTQIFIIRDQDREVLVNKKLKQAQRLYEKNQQGRMTFMKNIGDALKEPAQSL
AESAAKLNAPESKQLANQADVLVRLVDEIQLANMLADDSWKSETVLFSVQDLIDEVVPSVLPAIKRKGLQLLINNHLKAH
DMRRGDRDALRRILLLLMQYAVTSTQLGKITLEVDQDESSEDRLTFRILDTGEGVSIHEMDNLHFPFINQTQNDRYGKAD
PLAFWLSDQLARKLGGHLNIKTRDGLGTRYSVHIKMLAADPEVEEEEERLLDDVCVMVDVTSAEIRNIVTRQLENWGATC
ITPDERLISQDYDIFLTDNPSNLTASGLLLSDDESGVREIGPGQLCVNFNMSNAMQEAVLQLIEVQLAQEEVTESPLGGD
ENAQLHASGYYALFVDTVPDDVKRLYTEAATSDFAALAQTAHRLKGVFAMLNLVPGKQLCETLEHLIREKDVPGIEKYIS
DIDSYVKSLL

Specific function: May serves as a phosphotransfer intermediate between RcsC and RcsB. It may acquire a phosphoryl group at His-842 from RcsC 'Asp-866' and transmit it to 'Asp-56' of RcsB [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein (Probable) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HPt domain [H]

Homologues:

Organism=Escherichia coli, GI1788545, Length=890, Percent_Identity=99.3258426966292, Blast_Score=1820, Evalue=0.0,
Organism=Escherichia coli, GI145693157, Length=469, Percent_Identity=22.1748400852878, Blast_Score=84, Evalue=3e-17,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003594
- InterPro:   IPR008207
- InterPro:   IPR005467 [H]

Pfam domain/function: PF02518 HATPase_c; PF01627 Hpt [H]

EC number: =2.7.13.3 [H]

Molecular weight: Translated: 100363; Mature: 100363

Theoretical pI: Translated: 4.81; Mature: 4.81

Prosite motif: PS50894 HPT ; PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MRQKETTATTRFSLLPGSITRFFLLLIIVLLVTMGVMVQSAVNAWLKDKSYQIVDITHAI
CCCCHHCHHHEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHHH
QKRVDTWRYVTWQIYDNIAATTSPSSGEGLQETRLKQDVYYLEKPRRKTEALIFGSHDNS
HHHHHHHEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCC
TLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLVLISTLPLKDLTSGFKESTVSDIVD
HHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEEEECCHHHHHHHHHHHHHHHHHH
SRRAEMLQQANALDERESFSNMRRLAWQNGHYFTLRTTFNQPGHLATVVAFDLPINDLIP
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCEEEEEEEECCHHHCCC
PGMPLDSFRLEPDATATGDNDNEKEGTDSVSIHFNSTKIEISSALNSTDMRLVWQVPYGT
CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEHHHCCCCCCEEEEECCCHH
LLLDTLQNILLPLLLNIGLLALALFGYTTFRHFSSRSTESLPNTAVNNELRILRAINEEI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH
VSLLPLGLLVHDQESNRTVISNKIADHLLPHLNLQNITTMAEQHQGIIQATINNELYEIR
HHHHHHHHEEECCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEEEECCCHHHHHH
MFRSQVAPRTQIFIIRDQDREVLVNKKLKQAQRLYEKNQQGRMTFMKNIGDALKEPAQSL
HHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH
AESAAKLNAPESKQLANQADVLVRLVDEIQLANMLADDSWKSETVLFSVQDLIDEVVPSV
HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHH
LPAIKRKGLQLLINNHLKAHDMRRGDRDALRRILLLLMQYAVTSTQLGKITLEVDQDESS
HHHHHHCCHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCC
EDRLTFRILDTGEGVSIHEMDNLHFPFINQTQNDRYGKADPLAFWLSDQLARKLGGHLNI
CCEEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEE
KTRDGLGTRYSVHIKMLAADPEVEEEEERLLDDVCVMVDVTSAEIRNIVTRQLENWGATC
EECCCCCCEEEEEEEEEECCCCCCHHHHHHHHHHHHEEECCHHHHHHHHHHHHHHCCCEE
ITPDERLISQDYDIFLTDNPSNLTASGLLLSDDESGVREIGPGQLCVNFNMSNAMQEAVL
ECCCHHHHCCCCEEEEECCCCCCEECCEEEECCCCCHHHCCCCCEEEEECHHHHHHHHHH
QLIEVQLAQEEVTESPLGGDENAQLHASGYYALFVDTVPDDVKRLYTEAATSDFAALAQT
HHHHHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHH
AHRLKGVFAMLNLVPGKQLCETLEHLIREKDVPGIEKYISDIDSYVKSLL
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC
>Mature Secondary Structure
MRQKETTATTRFSLLPGSITRFFLLLIIVLLVTMGVMVQSAVNAWLKDKSYQIVDITHAI
CCCCHHCHHHEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHHH
QKRVDTWRYVTWQIYDNIAATTSPSSGEGLQETRLKQDVYYLEKPRRKTEALIFGSHDNS
HHHHHHHEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCC
TLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLVLISTLPLKDLTSGFKESTVSDIVD
HHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEEEECCHHHHHHHHHHHHHHHHHH
SRRAEMLQQANALDERESFSNMRRLAWQNGHYFTLRTTFNQPGHLATVVAFDLPINDLIP
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCEEEEEEEECCHHHCCC
PGMPLDSFRLEPDATATGDNDNEKEGTDSVSIHFNSTKIEISSALNSTDMRLVWQVPYGT
CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEHHHCCCCCCEEEEECCCHH
LLLDTLQNILLPLLLNIGLLALALFGYTTFRHFSSRSTESLPNTAVNNELRILRAINEEI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH
VSLLPLGLLVHDQESNRTVISNKIADHLLPHLNLQNITTMAEQHQGIIQATINNELYEIR
HHHHHHHHEEECCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEEEECCCHHHHHH
MFRSQVAPRTQIFIIRDQDREVLVNKKLKQAQRLYEKNQQGRMTFMKNIGDALKEPAQSL
HHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH
AESAAKLNAPESKQLANQADVLVRLVDEIQLANMLADDSWKSETVLFSVQDLIDEVVPSV
HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHH
LPAIKRKGLQLLINNHLKAHDMRRGDRDALRRILLLLMQYAVTSTQLGKITLEVDQDESS
HHHHHHCCHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCC
EDRLTFRILDTGEGVSIHEMDNLHFPFINQTQNDRYGKADPLAFWLSDQLARKLGGHLNI
CCEEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEE
KTRDGLGTRYSVHIKMLAADPEVEEEEERLLDDVCVMVDVTSAEIRNIVTRQLENWGATC
EECCCCCCEEEEEEEEEECCCCCCHHHHHHHHHHHHEEECCHHHHHHHHHHHHHHCCCEE
ITPDERLISQDYDIFLTDNPSNLTASGLLLSDDESGVREIGPGQLCVNFNMSNAMQEAVL
ECCCHHHHCCCCEEEEECCCCCCEECCEEEECCCCCHHHCCCCCEEEEECHHHHHHHHHH
QLIEVQLAQEEVTESPLGGDENAQLHASGYYALFVDTVPDDVKRLYTEAATSDFAALAQT
HHHHHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHH
AHRLKGVFAMLNLVPGKQLCETLEHLIREKDVPGIEKYISDIDSYVKSLL
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9097040; 9278503; 2404948; 7984428; 11758943; 11309126 [H]