Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

Click here to switch to the map view.

The map label for this gene is yjgL [H]

Identifier: 209398851

GI number: 209398851

Start: 5406158

End: 5407894

Strand: Direct

Name: yjgL [H]

Synonym: ECH74115_5774

Alternate gene names: 209398851

Gene position: 5406158-5407894 (Clockwise)

Preceding gene: 209399615

Following gene: 209398001

Centisome position: 97.02

GC content: 31.66

Gene sequence:

>1737_bases
ATGAGCAAAATATCAGATTTGAATTATTCTCAACACATTACATTAGCCGACAATTTTAAACAAAAAAGTGAAGTTTTAAA
TACCTGGCGTGTTGGAATGAATAATTTTGCCCGTAATGCCGGGGGGCAGGATAACACAAGAAATATCCTTAATCCTAAGA
CATTTTTGGAGTTTTTGGTAAAAATATTTACCCTGGGTTATGTGGATTTTAGCAAACGCTCCAACGAAGCGGGAAGAAAT
ATGATGGCTCATATTGAGTCCTCATCTTATATCAAAAATAATGATGGCAGTGAGATAATGAAGTTTGTTATGAATAATCC
TGAAGGGGAACGAGCGGATTCACCCAAGGTGATTATAGAAATTTCACTTTCCACTATTACTACTATGGGGACTCGTCAAG
GACATACAGCCATTATATTTCCACAACCTGATGGTTCGACTAACCGTTATGAAAGAAAGTCCTTTGAAAGAAAAGATGAG
AGTTCATTACACCTGATTACTAACAAGGTTCTGGCGTGTTACCAACGCGAAGCTAACAAGGAAATAGCTCGTCTATTAAA
TAATCATCAGAAGTTAAATAATCTACAGAAGTTAAATAATCTACAGAAGTTAAATAATATACAGAAGTTAAATAATATAC
AGGAGTTAAATAATTCGCAGGAGTTAAATAATTCGCAGGAGTTAAATAACTCGCAGGACTTAAAAAATTCGCAGGTGAGT
TGTAAAGGTTCAGTTGATTTTACGATTACGGATTTATTAGAAAAATCATTGAATAATGCATTATTAGCAATAAGGAACGA
ACATCTGCTATTAATGCCTCATGTATGTAGTGAATCGATTTCATACTTACTGGGCGAAAATGGTATACTTGAAGAAATAG
ATAAGCTCTACGAATTAAATGATCACGGAATTGATAATGACAAAGAAGGTAACAATGAAATTAATGACATCATGATTAAC
CTGTCTCATATTCTTATTGAATCCTTAGATGATGCAAAGGTTAATCTTACACCGGTCATCCATTCGATGTTGATGACTTT
TTTAGAATTGCCATATAATAATGATGTAAAAATACTGGAGTGGTGTTTTAATAAAAGCATGCAATATTTTGATGATTCTG
CAAAGATAGAGCATGCATGCTCCGTAATAAATCATATTAATTTTCGTCGCGATCAGTCTAAAGTAGCTGAGACATTATTT
TTCAATCTCGATAAAGAACCCTATAAAAATAGCCCTGAATTACAGGAGTTGATTTGGAAAAAGTTGGTTGTATATGTCAA
TGATTTTAACTTAAGCAATCGAGAAAAAACATATTTAATACAAAGAATATTTAATAATGTTGAGTCACTATTTAATAAAG
TACCTGTCAGTATTTTAGTTAATGATATTTTTATGAATGATTTTTTTATGAAAAACACTGAGATGATTAATTGGTACTTC
CCTCGGTTACTTAAGAGCTATGAGGATGAAAAGATTTATTTTGATAAGTTAGGGTATAATTTTAATAATAAAGAGTCTAA
TGAAGAGATTATGAAAAATCAACCAAAAGATGTTATTGAAGAAAAACTTAATAATGAATTAAAACTTAGGTTTAGAATGA
TGCAAACTATCTTGAAATCGGAGGTTAATGTATCGCCATTTATTGACCAACAGCGTTTAAATACACTAAATCCTCCGGAA
AATTTACGTATAGCAATAGAAAAATTTGGCTGGAAGAAAAAAACTATCACTGCATAA

Upstream 100 bases:

>100_bases
CCAGGTTTACGGCGAGTTTGTGAAAAGAGCGTTTTTTGATATTTTTTTGTGAGTAAAATTTGTAATGCTTAGACGTTCTT
ATTAACTCAAGGAGTTCGTC

Downstream 100 bases:

>100_bases
AATAATTTGACGCCGGGATGTTTCTGTATTTCCCGGCATCTTTATAGCGATATCAATTATTTACTGAGTGTCGCGACCAT
CACCGCTTTGATGGTGTGCA

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 578; Mature: 577

Protein sequence:

>578_residues
MSKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAGGQDNTRNILNPKTFLEFLVKIFTLGYVDFSKRSNEAGRN
MMAHIESSSYIKNNDGSEIMKFVMNNPEGERADSPKVIIEISLSTITTMGTRQGHTAIIFPQPDGSTNRYERKSFERKDE
SSLHLITNKVLACYQREANKEIARLLNNHQKLNNLQKLNNLQKLNNIQKLNNIQELNNSQELNNSQELNNSQDLKNSQVS
CKGSVDFTITDLLEKSLNNALLAIRNEHLLLMPHVCSESISYLLGENGILEEIDKLYELNDHGIDNDKEGNNEINDIMIN
LSHILIESLDDAKVNLTPVIHSMLMTFLELPYNNDVKILEWCFNKSMQYFDDSAKIEHACSVINHINFRRDQSKVAETLF
FNLDKEPYKNSPELQELIWKKLVVYVNDFNLSNREKTYLIQRIFNNVESLFNKVPVSILVNDIFMNDFFMKNTEMINWYF
PRLLKSYEDEKIYFDKLGYNFNNKESNEEIMKNQPKDVIEEKLNNELKLRFRMMQTILKSEVNVSPFIDQQRLNTLNPPE
NLRIAIEKFGWKKKTITA

Sequences:

>Translated_578_residues
MSKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAGGQDNTRNILNPKTFLEFLVKIFTLGYVDFSKRSNEAGRN
MMAHIESSSYIKNNDGSEIMKFVMNNPEGERADSPKVIIEISLSTITTMGTRQGHTAIIFPQPDGSTNRYERKSFERKDE
SSLHLITNKVLACYQREANKEIARLLNNHQKLNNLQKLNNLQKLNNIQKLNNIQELNNSQELNNSQELNNSQDLKNSQVS
CKGSVDFTITDLLEKSLNNALLAIRNEHLLLMPHVCSESISYLLGENGILEEIDKLYELNDHGIDNDKEGNNEINDIMIN
LSHILIESLDDAKVNLTPVIHSMLMTFLELPYNNDVKILEWCFNKSMQYFDDSAKIEHACSVINHINFRRDQSKVAETLF
FNLDKEPYKNSPELQELIWKKLVVYVNDFNLSNREKTYLIQRIFNNVESLFNKVPVSILVNDIFMNDFFMKNTEMINWYF
PRLLKSYEDEKIYFDKLGYNFNNKESNEEIMKNQPKDVIEEKLNNELKLRFRMMQTILKSEVNVSPFIDQQRLNTLNPPE
NLRIAIEKFGWKKKTITA
>Mature_577_residues
SKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAGGQDNTRNILNPKTFLEFLVKIFTLGYVDFSKRSNEAGRNM
MAHIESSSYIKNNDGSEIMKFVMNNPEGERADSPKVIIEISLSTITTMGTRQGHTAIIFPQPDGSTNRYERKSFERKDES
SLHLITNKVLACYQREANKEIARLLNNHQKLNNLQKLNNLQKLNNIQKLNNIQELNNSQELNNSQELNNSQDLKNSQVSC
KGSVDFTITDLLEKSLNNALLAIRNEHLLLMPHVCSESISYLLGENGILEEIDKLYELNDHGIDNDKEGNNEINDIMINL
SHILIESLDDAKVNLTPVIHSMLMTFLELPYNNDVKILEWCFNKSMQYFDDSAKIEHACSVINHINFRRDQSKVAETLFF
NLDKEPYKNSPELQELIWKKLVVYVNDFNLSNREKTYLIQRIFNNVESLFNKVPVSILVNDIFMNDFFMKNTEMINWYFP
RLLKSYEDEKIYFDKLGYNFNNKESNEEIMKNQPKDVIEEKLNNELKLRFRMMQTILKSEVNVSPFIDQQRLNTLNPPEN
LRIAIEKFGWKKKTITA

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI87082399, Length=604, Percent_Identity=79.635761589404, Blast_Score=946, Evalue=0.0,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 67288; Mature: 67157

Theoretical pI: Translated: 6.26; Mature: 6.26

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAGGQDNTRNILNPKTFLEFLV
CCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH
KIFTLGYVDFSKRSNEAGRNMMAHIESSSYIKNNDGSEIMKFVMNNPEGERADSPKVIIE
HHHHHCCHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCEEEEE
ISLSTITTMGTRQGHTAIIFPQPDGSTNRYERKSFERKDESSLHLITNKVLACYQREANK
EEHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
EIARLLNNHQKLNNLQKLNNLQKLNNIQKLNNIQELNNSQELNNSQELNNSQDLKNSQVS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCHHHCCCCHHCCCCCCE
CKGSVDFTITDLLEKSLNNALLAIRNEHLLLMPHVCSESISYLLGENGILEEIDKLYELN
ECCCCCEEHHHHHHHHHCCCEEEEECCCEEEECHHHHHHHHHHHCCCCHHHHHHHHHHCC
DHGIDNDKEGNNEINDIMINLSHILIESLDDAKVNLTPVIHSMLMTFLELPYNNDVKILE
CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHCCCCCCHHHHH
WCFNKSMQYFDDSAKIEHACSVINHINFRRDQSKVAETLFFNLDKEPYKNSPELQELIWK
HHHHCCHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH
KLVVYVNDFNLSNREKTYLIQRIFNNVESLFNKVPVSILVNDIFMNDFFMKNTEMINWYF
HHHHHEECCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHH
PRLLKSYEDEKIYFDKLGYNFNNKESNEEIMKNQPKDVIEEKLNNELKLRFRMMQTILKS
HHHHHCCCCCEEEEHHCCCCCCCCCCCHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHH
EVNVSPFIDQQRLNTLNPPENLRIAIEKFGWKKKTITA
CCCCCCCCCHHHHCCCCCCHHHEEEHHHCCCCCCCCCC
>Mature Secondary Structure 
SKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAGGQDNTRNILNPKTFLEFLV
CCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH
KIFTLGYVDFSKRSNEAGRNMMAHIESSSYIKNNDGSEIMKFVMNNPEGERADSPKVIIE
HHHHHCCHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCEEEEE
ISLSTITTMGTRQGHTAIIFPQPDGSTNRYERKSFERKDESSLHLITNKVLACYQREANK
EEHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
EIARLLNNHQKLNNLQKLNNLQKLNNIQKLNNIQELNNSQELNNSQELNNSQDLKNSQVS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCHHHCCCCHHCCCCCCE
CKGSVDFTITDLLEKSLNNALLAIRNEHLLLMPHVCSESISYLLGENGILEEIDKLYELN
ECCCCCEEHHHHHHHHHCCCEEEEECCCEEEECHHHHHHHHHHHCCCCHHHHHHHHHHCC
DHGIDNDKEGNNEINDIMINLSHILIESLDDAKVNLTPVIHSMLMTFLELPYNNDVKILE
CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHCCCCCCHHHHH
WCFNKSMQYFDDSAKIEHACSVINHINFRRDQSKVAETLFFNLDKEPYKNSPELQELIWK
HHHHCCHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH
KLVVYVNDFNLSNREKTYLIQRIFNNVESLFNKVPVSILVNDIFMNDFFMKNTEMINWYF
HHHHHEECCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHH
PRLLKSYEDEKIYFDKLGYNFNNKESNEEIMKNQPKDVIEEKLNNELKLRFRMMQTILKS
HHHHHCCCCCEEEEHHCCCCCCCCCCCHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHH
EVNVSPFIDQQRLNTLNPPENLRIAIEKFGWKKKTITA
CCCCCCCCCHHHHCCCCCCHHHEEEHHHCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7610040; 9278503 [H]