Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is mpl [H]
Identifier: 157163707
GI number: 157163707
Start: 4490980
End: 4492353
Strand: Direct
Name: mpl [H]
Synonym: EcHS_A4486
Alternate gene names: 157163707
Gene position: 4490980-4492353 (Clockwise)
Preceding gene: 157163705
Following gene: 157163709
Centisome position: 96.71
GC content: 55.46
Gene sequence:
>1374_bases ATGCGCATTCATATTTTAGGAATTTGTGGCACGTTTATGGGCGGTCTGGCGATGCTGGCGCGCCAGTTAGGCCATGAAGT AACGGGTTCGGACGCCAATGTGTATCCGCCGATGAGCACCTTACTTGAGAAGCAAGGCATTGAGCTGATTCAGGGTTACG ATGCCAGCCAGCTCGAGCCGCAGCCGGATCTGGTGATTATTGGCAACGCCATGACCCGTGGAAATCCGTGTGTGGAAGCG GTACTGGAAAAAAACATCCCTTATATGTCAGGTCCACAGTGGCTGCACGATTTTGTGCTGCGTGACCGCTGGGTGCTGGC CGTTGCCGGTACACATGGCAAAACCACCACCGCGGGAATGGCGACCTGGATTCTGGAACAGTGCGGTTACAAACCGGGAT TTGTGATCGGCGGTGTGCCGGGGAACTTTGAGGTTTCGGCGCGTCTGGGCGAAAGCGACTTCTTTGTTATCGAAGCGGAT GAGTATGACTGCGCCTTCTTCGACAAACGCTCTAAATTTGTCCATTACTGCCCGCGTACGCTGATCCTCAACAACCTTGA GTTCGATCACGCCGATATCTTTGACGACCTGAAAGCGATCCAGAAACAGTTCCACCATCTGGTGCGTATCGTTCCGGGGC AGGGCCGTATTATCTGGCCGGAAAATGACATCAACCTGAAACAGACCATGGCGATGGGCTGCTGGAGCGAGCAGGAGCTG GTGGGTGAGCAGGGGCACTGGCAGGCGAAAAAGCTGACCACCGATGCTTCCGAATGGGAAGTCTTGCTGGATGGCGAAAA AGTGGGCGAAGTGAAATGGTCGCTGGTAGGCGAACATAATATGCACAATGGCCTGATGGCGATTGCAGCGGCTCGCCATG TTGGTGTAGCGCCGGCAGATGCCGCTAACGCGCTGGGTTCGTTTATTAATGCTCGTCGCCGTCTGGAGTTGCGTGGTGAA GCGAATGGCGTCACGGTATATGACGATTTTGCCCATCACCCGACGGCGATTCTGGCAACGCTGGCGGCGCTGCGTGGCAA AGTTGGTGGTACGGCGCGCATTATTGCTGTGCTGGAACCGCGCTCGAATACCATGAAAATGGGGATCTGCAAAGACGATC TGGCACCTTCATTAGGTCGTGCCGATGAAGTCTTCCTGCTGCAACCGGCGCATATTCCGTGGCAGGTGGCAGAAGTGGCA GAAGCCTGCGTTCAGCCTGCACACTGGAGTGGCGATGTGGATACGCTGGCAGATATGGTGGTGAAAACCGCTCAGCCTGG CGACCATATTCTGGTGATGAGCAACGGCGGTTTTGGTGGGATCCATCAGAAACTGCTGGATGGTCTGGCGAAGAAGGCGG AAGCTGCGCAGTAA
Upstream 100 bases:
>100_bases GTGAATCGCGCCAGCAAATTACGGATTATCCTGAAATGCGTTTCTCACTTGCCCGACATATGCGTAAAATGAGCGGCAGA TTAAAAAAGGATAGTGACGT
Downstream 100 bases:
>100_bases TTCGGCCTCAGCCTGAGATAGCATTGCCGGATAAGGCGTTTACGCCGCATCCGGCATTTGAGCATAGTGCCTGATGCGAC GCTTGATGCGTCTTATCCGG
Product: UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl- meso-diaminopimelat e ligase
Products: NA
Alternate protein names: Murein peptide ligase [H]
Number of amino acids: Translated: 457; Mature: 457
Protein sequence:
>457_residues MRIHILGICGTFMGGLAMLARQLGHEVTGSDANVYPPMSTLLEKQGIELIQGYDASQLEPQPDLVIIGNAMTRGNPCVEA VLEKNIPYMSGPQWLHDFVLRDRWVLAVAGTHGKTTTAGMATWILEQCGYKPGFVIGGVPGNFEVSARLGESDFFVIEAD EYDCAFFDKRSKFVHYCPRTLILNNLEFDHADIFDDLKAIQKQFHHLVRIVPGQGRIIWPENDINLKQTMAMGCWSEQEL VGEQGHWQAKKLTTDASEWEVLLDGEKVGEVKWSLVGEHNMHNGLMAIAAARHVGVAPADAANALGSFINARRRLELRGE ANGVTVYDDFAHHPTAILATLAALRGKVGGTARIIAVLEPRSNTMKMGICKDDLAPSLGRADEVFLLQPAHIPWQVAEVA EACVQPAHWSGDVDTLADMVVKTAQPGDHILVMSNGGFGGIHQKLLDGLAKKAEAAQ
Sequences:
>Translated_457_residues MRIHILGICGTFMGGLAMLARQLGHEVTGSDANVYPPMSTLLEKQGIELIQGYDASQLEPQPDLVIIGNAMTRGNPCVEA VLEKNIPYMSGPQWLHDFVLRDRWVLAVAGTHGKTTTAGMATWILEQCGYKPGFVIGGVPGNFEVSARLGESDFFVIEAD EYDCAFFDKRSKFVHYCPRTLILNNLEFDHADIFDDLKAIQKQFHHLVRIVPGQGRIIWPENDINLKQTMAMGCWSEQEL VGEQGHWQAKKLTTDASEWEVLLDGEKVGEVKWSLVGEHNMHNGLMAIAAARHVGVAPADAANALGSFINARRRLELRGE ANGVTVYDDFAHHPTAILATLAALRGKVGGTARIIAVLEPRSNTMKMGICKDDLAPSLGRADEVFLLQPAHIPWQVAEVA EACVQPAHWSGDVDTLADMVVKTAQPGDHILVMSNGGFGGIHQKLLDGLAKKAEAAQ >Mature_457_residues MRIHILGICGTFMGGLAMLARQLGHEVTGSDANVYPPMSTLLEKQGIELIQGYDASQLEPQPDLVIIGNAMTRGNPCVEA VLEKNIPYMSGPQWLHDFVLRDRWVLAVAGTHGKTTTAGMATWILEQCGYKPGFVIGGVPGNFEVSARLGESDFFVIEAD EYDCAFFDKRSKFVHYCPRTLILNNLEFDHADIFDDLKAIQKQFHHLVRIVPGQGRIIWPENDINLKQTMAMGCWSEQEL VGEQGHWQAKKLTTDASEWEVLLDGEKVGEVKWSLVGEHNMHNGLMAIAAARHVGVAPADAANALGSFINARRRLELRGE ANGVTVYDDFAHHPTAILATLAALRGKVGGTARIIAVLEPRSNTMKMGICKDDLAPSLGRADEVFLLQPAHIPWQVAEVA EACVQPAHWSGDVDTLADMVVKTAQPGDHILVMSNGGFGGIHQKLLDGLAKKAEAAQ
Specific function: Reutilizes the intact tripeptide L-alanyl-gamma-D- glutamyl-meso-diaminopimelate by linking it to UDP-N-acetylmuramic acid [H]
COG id: COG0773
COG function: function code M; UDP-N-acetylmuramate-alanine ligase
Gene ontology:
Cell location: Secreted [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the MurCDEF family [H]
Homologues:
Organism=Escherichia coli, GI1790680, Length=457, Percent_Identity=99.781181619256, Blast_Score=942, Evalue=0.0, Organism=Escherichia coli, GI1786279, Length=407, Percent_Identity=28.5012285012285, Blast_Score=113, Evalue=3e-26,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004101 - InterPro: IPR013221 - InterPro: IPR000713 - InterPro: IPR005757 - InterPro: IPR016040 [H]
Pfam domain/function: PF01225 Mur_ligase; PF02875 Mur_ligase_C; PF08245 Mur_ligase_M [H]
EC number: 6.3.2.- [C]
Molecular weight: Translated: 49894; Mature: 49894
Theoretical pI: Translated: 5.78; Mature: 5.78
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 3.3 %Met (Translated Protein) 5.0 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 3.3 %Met (Mature Protein) 5.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MRIHILGICGTFMGGLAMLARQLGHEVTGSDANVYPPMSTLLEKQGIELIQGYDASQLEP CEEEEEEHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCHHHCCCCCCCCCC QPDLVIIGNAMTRGNPCVEAVLEKNIPYMSGPQWLHDFVLRDRWVLAVAGTHGKTTTAGM CCCEEEEECCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCHHHH ATWILEQCGYKPGFVIGGVPGNFEVSARLGESDFFVIEADEYDCAFFDKRSKFVHYCPRT HHHHHHHCCCCCCEEECCCCCCEEEEEEECCCEEEEEECCCCCEEEHHCCCHHHHHCCHH LILNNLEFDHADIFDDLKAIQKQFHHLVRIVPGQGRIIWPENDINLKQTMAMGCWSEQEL HEECCCCCCHHHHHHHHHHHHHHHHHHHEEECCCCEEECCCCCCCHHHHHHHCCCCHHHH VGEQGHWQAKKLTTDASEWEVLLDGEKVGEVKWSLVGEHNMHNGLMAIAAARHVGVAPAD CCCCCCCCHHHCCCCCCCCEEEECCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCH AANALGSFINARRRLELRGEANGVTVYDDFAHHPTAILATLAALRGKVGGTARIIAVLEP HHHHHHHHHHHHHEEEECCCCCCEEEECCCCCCHHHHHHHHHHHHCCCCCCEEEEEEECC RSNTMKMGICKDDLAPSLGRADEVFLLQPAHIPWQVAEVAEACVQPAHWSGDVDTLADMV CCCCEEEECCHHHCCCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHH VKTAQPGDHILVMSNGGFGGIHQKLLDGLAKKAEAAQ HHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MRIHILGICGTFMGGLAMLARQLGHEVTGSDANVYPPMSTLLEKQGIELIQGYDASQLEP CEEEEEEHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCHHHCCCCCCCCCC QPDLVIIGNAMTRGNPCVEAVLEKNIPYMSGPQWLHDFVLRDRWVLAVAGTHGKTTTAGM CCCEEEEECCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCHHHH ATWILEQCGYKPGFVIGGVPGNFEVSARLGESDFFVIEADEYDCAFFDKRSKFVHYCPRT HHHHHHHCCCCCCEEECCCCCCEEEEEEECCCEEEEEECCCCCEEEHHCCCHHHHHCCHH LILNNLEFDHADIFDDLKAIQKQFHHLVRIVPGQGRIIWPENDINLKQTMAMGCWSEQEL HEECCCCCCHHHHHHHHHHHHHHHHHHHEEECCCCEEECCCCCCCHHHHHHHCCCCHHHH VGEQGHWQAKKLTTDASEWEVLLDGEKVGEVKWSLVGEHNMHNGLMAIAAARHVGVAPAD CCCCCCCCHHHCCCCCCCCEEEECCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCH AANALGSFINARRRLELRGEANGVTVYDDFAHHPTAILATLAALRGKVGGTARIIAVLEP HHHHHHHHHHHHHEEEECCCCCCEEEECCCCCCHHHHHHHHHHHHCCCCCCEEEEEEECC RSNTMKMGICKDDLAPSLGRADEVFLLQPAHIPWQVAEVAEACVQPAHWSGDVDTLADMV CCCCEEEECCHHHCCCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHH VKTAQPGDHILVMSNGGFGGIHQKLLDGLAKKAEAAQ HHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7610040; 9278503; 2843822; 7984428; 8808921 [H]