Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

The map label for this gene is yjhS [H]

Identifier: 209399945

GI number: 209399945

Start: 1183499

End: 1185349

Strand: Direct

Name: yjhS [H]

Synonym: ECH74115_1168

Alternate gene names: 209399945

Gene position: 1183499-1185349 (Clockwise)

Preceding gene: 209399020

Following gene: 209400325

Centisome position: 21.24
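
The coordinate fields above can be cross-checked against one another. The sketch below assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length. Both are assumptions about how this record was built, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_572_075          # "Length" field of NC_011353
START, END = 1_183_499, 1_185_349  # "Start" and "End" fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 1851
print(centisome(START, GENOME_LENGTH))  # 21.24
```

Under these assumptions the results agree with the 1851-base gene sequence and the centisome position of 21.24 listed in this record.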

GC content: 55.48
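
The GC content can be recomputed from the gene sequence given in this record. This is a minimal sketch, assuming the value is the percentage of G and C bases in the gene sequence; `gc_content` is a hypothetical helper, not a tool used to produce the record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Drop whitespace so a multi-line FASTA body can be pasted in directly.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input; paste the full gene sequence to compare with the reported value.
print(gc_content("ATGC"))  # 50.0
```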

Gene sequence:

>1851_bases
ATGGCATTTAAACACTATGATGTTGTCAGGGCGGCGTCGCCGTCAGACCTTGCGGAAAAGCTGACACACAAACTGAAAGA
GGGCTGGCAGCCATACGGCGGACCGGTTGCCATTACGCCGTACACACTGATGCAGGCGGTGGCTATTGAAGGAGATCCAC
AGGTCGGCCCTTCATCTAAGCCGGACTGGTTCTACGTGGTTGTGCTTGCCGGACAGTCCAACGGCATGGCCTACGGTGAA
GGGCTTCCGTTACCGGATTCTTACGATGCTCCGGATCCGCGCATTAAACAGCTGGCGCGCCGCAGCACGGTAACTCCGGG
TGGAGAGAGTTGTACGTATAACGACATCATTCCGGCCGACCACTGCCTGCATGATGTGCAGGATATGAGTACGCTGAATC
ATCCGAAGGCAGACCTGAGCAAAGGGCAGTACGGCTGTGTCGGCCAGGGCTTACATATTGCCAAAAAACTGCTTCCGTAT
ATCCCGAATAACGCGGGGATCCTGCTGGTACCATGCTGTCGTGGTGGTTCTGCATTCACCCAGGGCGCTGAGGGGACATT
CAGTGCGGACACGGGGGCCAGCCAGGATTCGGCACGCTGGGGTGTGGGTAAACCGTTATATCAGGACCTGATTGCGCGCA
CTAAAGCTGCATTACAGAAGAACCCGAAAAATGTGTTGCTGGCGGTGTGCTGGATGCAGGGAGAGTTTGACATGAGCGCC
GCCACCCACGCACAGCAACCTGCGCTGTTTACAGCCATGCTGACACAGTTTCGTGCTGACCTCTCCGTGTTTAACGCGCA
GTGCCATGGTGGCAGTGCTGCAGATGTGCCGTGGATTTGTGGTGATACGACGTATTACTGGAAAAATACATACGCTACCC
AGTACGACACCGTGTACGGCGGGTATAAAAACAGGGAGAGTGAGGGCGTTTATTTTGTGCCCTTCATGACAGACGGTAAC
GGCGTCAATACCGCCACTAACGCGCCGGCAGAAGATCCGGATATTCCGGCATCAGGATATTACGGTGCGGCATCGAGAAC
GAATGGAAACCAGGTATCATCAAACCGCCCGACACATTTCAGTTCATGGGCGCGCAGGAGCATTATTCCGGATCGTCTGG
CAACCGCTATTCTGAACGCAGCCGGGCGCACCTCCGCCTTCATCAGTGGTAAGGCACCGGAAATCAAACCCTCGCCCGGC
GGCAACACGCCATCGGGTCCGTCTGCAGATACGTCCGTTCGCACAATCTCCCTGCTGCCGGCAGCCGGAGAGGCTGCTGC
GCAGGGCTGGAGCATTAAGGATGGCGGAATTCAGTTGTCAGATGGTGTATTTAAGATCACCAGGCAGAGCAATAAAACCT
GGTCCCTGACGCATCCGGTGGATGACGCAATTACCCTGCTGACACAGGGCGGCAGACTGAACTGTAAGTTCCGCCTGTCA
GGCGCACTGACCAACAATCAGTTCGGGCTGGGGATTTATCTGTATACGGATGCTCCCGTTCCTGATGGTGTGGCGATGAC
GGGTACCGGTAATCCGTTCCTGATGTCGTACTTCACTCAGACCACTGACGGCAGAGTGAATCTGATGCATCACAGGAAAG
CCGGAAACACGAAGCTGGGGGAGTTCGGCGATTACGGTAACGACTGGCAGACGCTGGAGCTGGTGTTCACCGCCGGCAGT
GCCACGGTTACTCCGAAACTGAATGGAGTGGCTGGCCCGGCATTCCAGGTTATAAAAGACAGTCTGACACTGGGACTGAA
TGCGCTGACGCTGACGGATGTTACAAAAAATGCAGCGTATGGCGTTGAGATAGAAAGTCTGGTGCTGGAGATAAATGCAC
CGGCAGCATAA
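
The relationship between this gene sequence and the protein sequence listed further down can be illustrated with a small translation routine. The sketch assumes the standard genetic code and reading frame 1, and stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCATTTAAACACTAT"))  # MAFKHY (first six codons of the gene)
print(translate("GCACCGGCAGCATAA"))     # APAA (last four codons plus stop)
```

The 1851 bases correspond to 617 codons; excluding the terminal TAA stop codon leaves the 616 residues reported for the translated protein.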

Upstream 100 bases:

>100_bases
TATTGCGGGCTGTTGTCTCTCTTCTGCCATTGTCCTGTAACTTCCGGACTTCAGCCCGCTCCTCATTTTACTCACAATAT
TATCCCGGCCGGGAGGATTC

Downstream 100 bases:

>100_bases
TAAAAAAAGAGCCAGCGACTGACCTGAAAGAAGACGCTGGCTAAAAGGCCTTATATGTTTGTAGAGACTTATTTTTCACA
GACAGCAATGATGCCTGTCA

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 616; Mature: 615

Protein sequence:

>616_residues
MAFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIEGDPQVGPSSKPDWFYVVVLAGQSNGMAYGE
GLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPY
IPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSA
ATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYGGYKNRESEGVYFVPFMTDGN
GVNTATNAPAEDPDIPASGYYGAASRTNGNQVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQSNKTWSLTHPVDDAITLLTQGGRLNCKFRLS
GALTNNQFGLGIYLYTDAPVPDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQTLELVFTAGS
ATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAYGVEIESLVLEINAPAA

Sequences:

>Translated_616_residues
MAFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIEGDPQVGPSSKPDWFYVVVLAGQSNGMAYGE
GLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPY
IPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSA
ATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYGGYKNRESEGVYFVPFMTDGN
GVNTATNAPAEDPDIPASGYYGAASRTNGNQVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPG
GNTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQSNKTWSLTHPVDDAITLLTQGGRLNCKFRLS
GALTNNQFGLGIYLYTDAPVPDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQTLELVFTAGS
ATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAYGVEIESLVLEINAPAA
>Mature_615_residues
AFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIEGDPQVGPSSKPDWFYVVVLAGQSNGMAYGEG
LPLPDSYDAPDPRIKQLARRSTVTPGGESCTYNDIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYI
PNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSAA
THAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYGGYKNRESEGVYFVPFMTDGNG
VNTATNAPAEDPDIPASGYYGAASRTNGNQVSSNRPTHFSSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPGG
NTPSGPSADTSVRTISLLPAAGEAAAQGWSIKDGGIQLSDGVFKITRQSNKTWSLTHPVDDAITLLTQGGRLNCKFRLSG
ALTNNQFGLGIYLYTDAPVPDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLGEFGDYGNDWQTLELVFTAGSA
TVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAYGVEIESLVLEINAPAA

Specific function: Unknown

COG id: COG2801

COG function: function code L; Transposase and inactivated derivatives

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI1790763, Length=318, Percent_Identity=55.6603773584906, Blast_Score=375, Evalue=1e-105

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005181
- InterPro:   IPR013830 [H]

Pfam domain/function: PF03629 DUF303 [H]

EC number: NA

Molecular weight: Translated: 65613; Mature: 65482

Theoretical pI: Translated: 6.64; Mature: 6.64

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
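
The percentages above can be reproduced from a protein sequence with a short helper. This is a sketch only: `residue_percent` is a hypothetical name, and rounding to one decimal place is assumed from the way the values are printed.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy input; paste the translated or mature sequence to compare with the table.
toy = "MACKM"
print(residue_percent(toy, "C"))   # 20.0
print(residue_percent(toy, "M"))   # 40.0
print(residue_percent(toy, "CM"))  # 60.0
```

Because the combined value is rounded from the combined count, it need not equal the sum of the two rounded values, which would explain why 1.5 and 1.8 appear alongside 3.2 for the translated protein.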

Secondary structure:

>Translated Secondary Structure
MAFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIEGDPQVGPSSK
CCCCCCCCEECCCHHHHHHHHHHHHHHCCCCCCCCEEECHHHHHEEEEECCCCCCCCCCC
PDWFYVVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYNDIIPAD
CCEEEEEEEEECCCCEECCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCHH
HCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFT
HHHHHHHHHHHCCCCCHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEECCCCCHHC
QGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSA
CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHH
ATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
HHHCCCCHHHHHHHHHHHHHHHHEEEECCCCCCCCCCEEECCCEEEECCCCCEECCCEEC
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGNQVSSNRPTHF
CCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHH
SSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPGGNTPSGPSADTSVRTISLLP
HHHHHHCCCHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC
AAGEAAAQGWSIKDGGIQLSDGVFKITRQSNKTWSLTHPVDDAITLLTQGGRLNCKFRLS
CCCHHHHCCCEECCCCEEECCCEEEEEECCCCEEEECCCCCHHHHHEECCCEEEEEEEEE
GALTNNQFGLGIYLYTDAPVPDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLG
EEEECCCCEEEEEEEECCCCCCCEEEECCCCCEEEEEECCCCCCEEEEEEECCCCCCCCC
EFGDYGNDWQTLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
CCCCCCCCCEEEEEEEECCCEEECCCCCCCCCHHHHHHHHHHHHCCEEEEEEECCCCCCC
GVEIESLVLEINAPAA
CEEEEEEEEEECCCCC
>Mature Secondary Structure 
AFKHYDVVRAASPSDLAEKLTHKLKEGWQPYGGPVAITPYTLMQAVAIEGDPQVGPSSK
CCCCCCCEECCCHHHHHHHHHHHHHHCCCCCCCCEEECHHHHHEEEEECCCCCCCCCCC
PDWFYVVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYNDIIPAD
CCEEEEEEEEECCCCEECCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCHH
HCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFT
HHHHHHHHHHHCCCCCHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEECCCCCHHC
QGAEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSA
CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHH
ATHAQQPALFTAMLTQFRADLSVFNAQCHGGSAADVPWICGDTTYYWKNTYATQYDTVYG
HHHCCCCHHHHHHHHHHHHHHHHEEEECCCCCCCCCCEEECCCEEEECCCCCEECCCEEC
GYKNRESEGVYFVPFMTDGNGVNTATNAPAEDPDIPASGYYGAASRTNGNQVSSNRPTHF
CCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHH
SSWARRSIIPDRLATAILNAAGRTSAFISGKAPEIKPSPGGNTPSGPSADTSVRTISLLP
HHHHHHCCCHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC
AAGEAAAQGWSIKDGGIQLSDGVFKITRQSNKTWSLTHPVDDAITLLTQGGRLNCKFRLS
CCCHHHHCCCEECCCCEEECCCEEEEEECCCCEEEECCCCCHHHHHEECCCEEEEEEEEE
GALTNNQFGLGIYLYTDAPVPDGVAMTGTGNPFLMSYFTQTTDGRVNLMHHRKAGNTKLG
EEEECCCCEEEEEEEECCCCCCCEEEECCCCCEEEEEECCCCCCEEEEEEECCCCCCCCC
EFGDYGNDWQTLELVFTAGSATVTPKLNGVAGPAFQVIKDSLTLGLNALTLTDVTKNAAY
CCCCCCCCCEEEEEEEECCCEEECCCCCCCCCHHHHHHHHHHHHCCEEEEEEECCCCCCC
GVEIESLVLEINAPAA
CEEEEEEEEEECCCCC
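
The state strings above use one letter per residue. Assuming the usual three-state convention (H for helix, E for strand, C for coil), their composition can be summarised as follows; `state_composition` is an illustrative helper and is not how the "Structure class" field below was derived.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of residues in each of the states H, E and C."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy input; concatenate the state lines of the record for the real figures.
print(state_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```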

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7610040; 9278503 [H]