Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
The map label for this gene is nlpD
Identifier: 30064099
GI number: 30064099
Start: 2834455
End: 2835594
Strand: Reverse
Name: nlpD
Synonym: S2958
Alternate gene names: 30064099
Gene position: 2835594-2834455 (Counterclockwise)
Preceding gene: 30064100
Following gene: 30064097
Centisome position: 61.65
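The coordinate fields above are related by simple arithmetic, sketched below. The formula for the centisome position is an assumption inferred from the numbers (position of the gene's first base as a percentage of the genome length); the record itself does not state how the value was derived. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_599_354            # "Length" in the header table
START, END = 2_834_455, 2_835_594    # "Start" and "End" fields
STRAND = "Reverse"                   # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


# On the reverse strand the gene's first base is the higher coordinate,
# which matches the "Gene position: 2835594-2834455" field.
first_base = END if STRAND == "Reverse" else START

print(gene_length(START, END))               # 1140
print(centisome(first_base, GENOME_LENGTH))  # 61.65
```

Using the lower coordinate (2834455) instead would give 61.63, so the reported 61.65 is consistent with the higher coordinate being used.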
GC content: 52.37
Gene sequence:
>1140_bases ATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTC TGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTA CGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTACAGCAGCCACAAATTCAGGCT ACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAA CCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATA TCGCCTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGT CAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCA AGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGG GTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACA GCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAA AGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCG CGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGAT GATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTGAAGGCGGGGCAAAAAATAGC AACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGC GTTATTTGCCGCAGCGATAA
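The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the whole gene, rounded to two decimals; the record does not state the method, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, so the space-separated 80-base chunks
    used in this record can be pasted in directly (without the FASTA header).
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Small illustration on the first three codons of the gene.
print(gc_percent("ATG AGC GCG"))  # 66.67
```

Applied to the full 1140-base sequence, the function should return a value close to the reported 52.37 if the record used the same definition.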
Upstream 100 bases:
>100_bases CGTAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAAC CAATTTTTCCTGGGGGATAA
Downstream 100 bases:
>100_bases ATCGGCGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTG AAAGTTCATGATTTAAATGA
Product: lipoprotein NlpD
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 379; Mature: 378
Protein sequence:
>379_residues MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQA TQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVG QTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPT ASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHND DYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR
Sequences:
>Translated_379_residues MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQA TQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVG QTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPT ASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHND DYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR
>Mature_378_residues SAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQAT QQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQ TLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTA STTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDD YLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR
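The relationship between the 1140-base gene, the 379-residue translated protein, and the 378-residue mature form can be illustrated with a short translation sketch. It uses the standard genetic code (NCBI translation table 1) and assumes, based on comparing the two sequences above, that the record's "Mature" sequence is simply the translated one without the initial methionine. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (first base varies slowest, bases in the order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop the initial methionine (assumed meaning of 'Mature' here)."""
    return protein[1:] if protein.startswith("M") else protein


# First nine codons and last four codons of the gene sequence above.
print(translate("ATGAGCGCGGGAAGCCCAAAATTCACC"))  # MSAGSPKFT
print(translate("CCGCAGCGATAA"))                 # PQR
```

The 1140 bases form 380 codons; the final TAA stop codon is not translated, which leaves 379 residues.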
Specific function: May be involved in stationary-phase survival
COG id: COG0739
COG function: function code M; Membrane proteins related to metalloendopeptidases
Gene ontology:
Cell location: Cell inner membrane; Lipid-anchor (Potential)
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 LysM repeat
Homologues:
Organism=Escherichia coli, GI1789099, Length=379, Percent_Identity=100, Blast_Score=766, Evalue=0.0
Organism=Escherichia coli, GI87082174, Length=264, Percent_Identity=43.5606060606061, Blast_Score=207, Evalue=7e-55
Organism=Escherichia coli, GI87082297, Length=138, Percent_Identity=35.5072463768116, Blast_Score=81, Evalue=9e-17
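The homologue hits above follow a regular pattern of comma-separated `Key=Value` fields plus a bare GI identifier. The sketch below shows one possible way to split such text into one dictionary per hit. It assumes every hit starts with an `Organism=` field and that organism names contain no commas; the function name and the `"GI"` key are illustrative choices, not part of the record.

```python
def parse_homologues(text: str) -> list[dict[str, str]]:
    """Split homologue text into one dict per hit.

    A new hit is started each time an 'Organism' field is seen.
    Bare tokens such as 'GI1789099' are stored under the key 'GI'.
    """
    hits: list[dict[str, str]] = []
    current: dict[str, str] = {}
    # Treat line breaks like commas so both layouts parse the same way.
    for field in text.replace("\n", ",").split(","):
        field = field.strip()
        if not field:
            continue
        key, sep, value = field.partition("=")
        if not sep:                      # bare identifier, e.g. GI1789099
            key, value = "GI", field
        if key == "Organism" and current:
            hits.append(current)
            current = {}
        current[key.strip()] = value.strip()
    if current:
        hits.append(current)
    return hits


example = (
    "Organism=Escherichia coli, GI1789099, Length=379, Percent_Identity=100, "
    "Blast_Score=766, Evalue=0.0, "
    "Organism=Escherichia coli, GI87082174, Length=264, "
    "Percent_Identity=43.5606060606061, Blast_Score=207, Evalue=7e-55,"
)
for hit in parse_homologues(example):
    print(hit["GI"], hit["Length"], float(hit["Evalue"]))
```

Values are kept as strings so that nothing is lost in conversion; callers can cast fields such as `Evalue` with `float()` when needed.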
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NLPD_ECOLI (P0ADA3)
Other databases:
- EMBL: L07869
- EMBL: U29579
- EMBL: U00096
- EMBL: AP009048
- EMBL: D17549
- PIR: B55522
- RefSeq: AP_003309.1
- RefSeq: NP_417222.1
- ProteinModelPortal: P0ADA3
- DIP: DIP-48067N
- STRING: P0ADA3
- SWISS-2DPAGE: P0ADA3
- PRIDE: P0ADA3
- EnsemblBacteria: EBESCT00000001497
- EnsemblBacteria: EBESCT00000001498
- EnsemblBacteria: EBESCT00000015121
- GeneID: 947011
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW2712
- KEGG: eco:b2742
- EchoBASE: EB2034
- EcoGene: EG12111
- eggNOG: COG0739
- GeneTree: EBGT00050000009952
- HOGENOM: HBG754735
- OMA: HEARTEY
- ProtClustDB: PRK10871
- BioCyc: EcoCyc:EG12111-MONOMER
- Genevestigator: P0ADA3
- GO: GO:0001896
- GO: GO:0006508
- InterPro: IPR011055
- InterPro: IPR016047
- InterPro: IPR002886
- InterPro: IPR018392
- InterPro: IPR002482
- PANTHER: PTHR21666:SF7
- SMART: SM00257
Pfam domain/function: PF01476 LysM; PF01551 Peptidase_M23; SSF51261 Dup_hybrid_motif
EC number: NA
Molecular weight: Translated: 40149; Mature: 40018
Theoretical pI: Translated: 9.92; Mature: 9.92
Prosite motif: PS51257 PROKAR_LIPOPROTEIN; PS00013 PROKAR_LIPOPROTEIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.8 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
1.9 %Cys+Met (Mature Protein)
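The Cys/Met percentages can be reproduced by counting residues and dividing by the protein length. The sketch below assumes rounding to one decimal place. To stay short, it uses a stand-in sequence with the same length and the same Cys and Met counts as the translated protein (1 Cys and 7 Met in 379 residues; these counts are inferred from the sequence and are not stated in the record). `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# Stand-in with 7 Met, 1 Cys and 371 other residues (379 in total).
translated = "M" * 7 + "C" + "A" * 371
mature = translated[1:]  # initial Met removed, 378 residues

for label, seq in (("Translated", translated), ("Mature", mature)):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
```

With these counts the sketch gives 0.3, 1.8 and 2.1 for the translated protein and 0.3, 1.6 and 1.9 for the mature protein, in line with the figures listed above.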
Secondary structure:
>Translated Secondary Structure
MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMG
CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCC
TTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCEEECCCEEEEEECCCCCCCCCCCC
STYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAIT
CEEEEECCCEEEEEEEEECCCHHHHHHHCCCCCCEEECCCCEEEECCCCCCEECCCCCEE
QADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPT
ECCCCCCCEEEECCCCCEEEEECCCEEEECCCCCCHHHHHCCCCCCCCEEEEEEEEEECC
ASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRV
CCCCCCCCCCCCCCCCCCEEECCCCCCEEEEECCCCCCCCCEEEECCCCCEEEEECCCEE
VYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLH
EEECCCCCCCCCEEEEEECCCEEEEEECCCEEEEECHHHHHCCCEEEEECCCCCCCEEEE
FEIRYKGKSVNPLRYLPQR
EEEEECCCCCCHHHCCCCC
>Mature Secondary Structure
SAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMG
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCC
TTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCEEECCCEEEEEECCCCCCCCCCCC
STYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAIT
CEEEEECCCEEEEEEEEECCCHHHHHHHCCCCCCEEECCCCEEEECCCCCCEECCCCCEE
QADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPT
ECCCCCCCEEEECCCCCEEEEECCCEEEECCCCCCHHHHHCCCCCCCCEEEEEEEEEECC
ASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRV
CCCCCCCCCCCCCCCCCCEEECCCCCCEEEEECCCCCCCCCEEEECCCCCEEEEECCCEE
VYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLH
EEECCCCCCCCCEEEEEECCCEEEEEECCCEEEEECHHHHHCCCEEEEECCCCCCCEEEE
FEIRYKGKSVNPLRYLPQR
EEEEECCCCCCHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8132457; 9278503; 8208244