Definition | Rhizobium leguminosarum bv. trifolii WSM2304 chromosome, complete genome. |
---|---|
Accession | NC_011369 |
Length | 4,537,948 |
The map label for this gene is pepP [C]
Identifier: 209550340
GI number: 209550340
Start: 2804421
End: 2805572
Strand: Direct
Name: pepP [C]
Synonym: Rleg2_2761
Alternate gene names: 209550340
Gene position: 2804421-2805572 (Clockwise)
Preceding gene: 209550339
Following gene: 209550341
Centisome position: 61.8
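The Start, End, and Centisome position fields above are related by simple arithmetic. The following is a minimal sketch of that relationship. The function names are illustrative and not part of the database, and the centisome formula assumes the value is the start coordinate expressed as a percentage of the genome length, which reproduces the reported figure.

```python
GENOME_LENGTH = 4_537_948          # "Length" field of NC_011369
START, END = 2_804_421, 2_805_572  # "Start" and "End" fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1152
print(round(centisome(START, GENOME_LENGTH), 1))  # 61.8
```

The computed length agrees with the `>1152_bases` header of the gene sequence.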
GC content: 64.06
Gene sequence:
>1152_bases ATGGCACTGCACTTCGAAAAGGCCGAATTCGCAAGCCGGCTTGCGCGCCTCACCGAGAAGATGAAGGAGGAAAAGCTCGA CGCCCTGCTGCTCTTCGCCCAGGAGAGCATGTACTGGCTGACCGGCTACGACACGTTCGGCTACTGCTTCTTCCAGACGC TGGTCGTCAAGAGCGACGGCACCATGGCGTTGCTGACCCGCTCGGCCGATCTTCGCCAGGCTAGGCACACCTCGATCCTC GAGGACATCCATATCTGGGTCGACCGGGTCAATGCCGATCCGACGCTCGACCTGAAGAACCTGCTGGTCGAGATGGATCT GCTCGGCGCCCGCATCGGCGTCGAATATGACACGCACGGCATGACCGGCCGTATCTCGCGCCTGCTCGACGCGCAGTTGA CCACCTTCGGCCAGATCACCGACGCTTCCTACCTCGTCAGTCGCCTGCGCCTTGTCAAAAGCCCGACCGAGGTTGCCTAT GTCGAGCGCGCCGCCGTTCTCGCCGACGATGCGCTCGACGCCGCGATCCGGCTGACGAAGCCCGGCGCCGACGAGGCCGA TATCCTCGCCGCCATGCAGGGCGCGATCTTTTCCGGCGGCGGCGACTATCCCGCCAACGAGTTCATCATCGGCTCCGGCG CCGATGCCCTGCTTTGCCGTTACAAGGCCGGCCGCCGCAAGCTCGACGCCAGCGACCAGCTGACGCTCGAATGGGCCGGC GCTTACGCGCATTACCACGCCGCCATGATGCGCACGATCGTCATCGGCGAGCCGATGCAACGCCACCGCGAACTCTATAA TGCCTGCCGCGAAACCATCGAGGCGATCGAGACGGTGCTGAAGCCGGGCAACAGCTTCGGCGACGTCTTCGACATGCATG CCAGGATCATGGACGAACGCGGCCTTGCCCGCCACCGGCTGAACGCCTGCGGTTATTCGCTCGGCGCCCGCTTCTCGCCC TCCTGGATGGAGCATCAGATGTTCCATGTCGGCAATCCGCAGCCGATCGAGCCGAACATGTCGCTCTTCGTGCACATGAT CATCGCCGATTCCGACACGGGCACGGCGATGACGCTCGGCCAGACCTATCTGACGACGGCGGATGCGCCGCGCGCGCTCT CCCGCCATCCGCTCGATTTCATCGGTCTTTGA
Upstream 100 bases:
>100_bases ACAGCGAGCGCTTCTTTTCCTACCGCAGGACGACACATCGCAAGGAGCCGGACTACGGCCGGCAGATTTCCGCGATATCA ATCAGGGAGACGTGAGCAGA
Downstream 100 bases:
>100_bases CCGATGAGAATCCGGAATTGACAGACATCAGCCGCCGCCCGCATAAATTCGGGCGGGGCTGATGCGAATGTGGGAAGGAC GGATCAAGGGATGACGCGAG
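The GC content field reported for the gene sequence above is the share of G and C bases in that sequence. As a rough sketch of how such a figure can be computed (the function name `gc_content` is illustrative, and the input shown is a toy string, not the gene sequence from this record):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy input for illustration only: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```

Whitespace is stripped first because the sequences in this record are broken into space-separated chunks. Applied to the 1152-base gene sequence, a value of 64.06 would correspond to 738 G or C bases; that count is inferred from the reported percentage rather than stated in the record.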
Product: peptidase M24
Products: N-terminal amino acid including proline [C]
Alternate protein names: NA
Number of amino acids: Translated: 383; Mature: 382
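The two residue counts follow from the gene length. A short sketch, assuming (as the sequences in this record suggest) that the final codon is a stop codon and that the mature form differs from the translated form only by removal of the initiator methionine; `residue_counts` is a hypothetical helper name:

```python
def residue_counts(cds_length: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence length."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = cds_length // 3
    translated = codons - 1  # the last codon is a stop codon, not a residue
    mature = translated - 1  # assumes only the initiator Met is removed
    return translated, mature


print(residue_counts(1152))  # (383, 382)
```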
Protein sequence:
>383_residues MALHFEKAEFASRLARLTEKMKEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDGTMALLTRSADLRQARHTSIL EDIHIWVDRVNADPTLDLKNLLVEMDLLGARIGVEYDTHGMTGRISRLLDAQLTTFGQITDASYLVSRLRLVKSPTEVAY VERAAVLADDALDAAIRLTKPGADEADILAAMQGAIFSGGGDYPANEFIIGSGADALLCRYKAGRRKLDASDQLTLEWAG AYAHYHAAMMRTIVIGEPMQRHRELYNACRETIEAIETVLKPGNSFGDVFDMHARIMDERGLARHRLNACGYSLGARFSP SWMEHQMFHVGNPQPIEPNMSLFVHMIIADSDTGTAMTLGQTYLTTADAPRALSRHPLDFIGL
Sequences:
>Translated_383_residues MALHFEKAEFASRLARLTEKMKEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDGTMALLTRSADLRQARHTSIL EDIHIWVDRVNADPTLDLKNLLVEMDLLGARIGVEYDTHGMTGRISRLLDAQLTTFGQITDASYLVSRLRLVKSPTEVAY VERAAVLADDALDAAIRLTKPGADEADILAAMQGAIFSGGGDYPANEFIIGSGADALLCRYKAGRRKLDASDQLTLEWAG AYAHYHAAMMRTIVIGEPMQRHRELYNACRETIEAIETVLKPGNSFGDVFDMHARIMDERGLARHRLNACGYSLGARFSP SWMEHQMFHVGNPQPIEPNMSLFVHMIIADSDTGTAMTLGQTYLTTADAPRALSRHPLDFIGL
>Mature_382_residues ALHFEKAEFASRLARLTEKMKEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDGTMALLTRSADLRQARHTSILE DIHIWVDRVNADPTLDLKNLLVEMDLLGARIGVEYDTHGMTGRISRLLDAQLTTFGQITDASYLVSRLRLVKSPTEVAYV ERAAVLADDALDAAIRLTKPGADEADILAAMQGAIFSGGGDYPANEFIIGSGADALLCRYKAGRRKLDASDQLTLEWAGA YAHYHAAMMRTIVIGEPMQRHRELYNACRETIEAIETVLKPGNSFGDVFDMHARIMDERGLARHRLNACGYSLGARFSPS WMEHQMFHVGNPQPIEPNMSLFVHMIIADSDTGTAMTLGQTYLTTADAPRALSRHPLDFIGL
Specific function: Unknown
COG id: COG0006
COG function: function code E; Xaa-Pro aminopeptidase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M24 family [H]
Homologues:
Organism=Escherichia coli, GI1789275, Length=167, Percent_Identity=27.5449101796407, Blast_Score=66, Evalue=4e-12
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000587
- InterPro: IPR000994 [H]
Pfam domain/function: PF01321 Creatinase_N; PF00557 Peptidase_M24 [H]
EC number: 3.4.11.9 [C]
Molecular weight: Translated: 42629; Mature: 42498
Theoretical pI: Translated: 5.73; Mature: 5.73
Prosite motif: PS00141 ASP_PROTEASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
4.4 %Met (Translated Protein)
5.5 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
4.2 %Met (Mature Protein)
5.2 %Cys+Met (Mature Protein)
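The Cys/Met percentages above are residue counts divided by sequence length. The sketch below shows one way to compute them; `cys_met_content` is an illustrative name, not a function of the database. To keep the example short it does not repeat the full protein sequence. Instead it builds stand-in strings with the same composition, using counts tallied by hand from the sequences in this record (4 Cys and 17 Met in the 383-residue translated form, 4 Cys and 16 Met in the 382-residue mature form), with alanine as an arbitrary filler.

```python
def cys_met_content(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a protein sequence, rounded to 0.1."""
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    # The combined value is rounded from the unrounded sum.
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Stand-in sequences with the hand-tallied composition (filler residues are arbitrary).
translated = "C" * 4 + "M" * 17 + "A" * (383 - 21)
mature = "C" * 4 + "M" * 16 + "A" * (382 - 20)

print(cys_met_content(translated))  # {'Cys': 1.0, 'Met': 4.4, 'Cys+Met': 5.5}
print(cys_met_content(mature))      # {'Cys': 1.0, 'Met': 4.2, 'Cys+Met': 5.2}
```

Rounding the combined value from the unrounded sum explains why the translated figure is 5.5 rather than 1.0 + 4.4 = 5.4: 21 of 383 residues is about 5.48%.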
Secondary structure:
>Translated Secondary Structure
MALHFEKAEFASRLARLTEKMKEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDG
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCEEEEECCCHHHHHHHHHHHEECCC
TMALLTRSADLRQARHTSILEDIHIWVDRVNADPTLDLKNLLVEMDLLGARIGVEYDTHG
CEEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCEECCCC
MTGRISRLLDAQLTTFGQITDASYLVSRLRLVKSPTEVAYVERAAVLADDALDAAIRLTK
CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHEEEECC
PGADEADILAAMQGAIFSGGGDYPANEFIIGSGADALLCRYKAGRRKLDASDQLTLEWAG
CCCCHHHHHHHHHCCEECCCCCCCCCCEEEECCCHHHHHHHHHCCCCCCCCCCEEEEEHH
AYAHYHAAMMRTIVIGEPMQRHRELYNACRETIEAIETVLKPGNSFGDVFDMHARIMDER
HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC
GLARHRLNACGYSLGARFSPSWMEHQMFHVGNPQPIEPNMSLFVHMIIADSDTGTAMTLG
CHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCEEEEC
QTYLTTADAPRALSRHPLDFIGL
CCEEECCCCCHHHHCCCCHHCCC
>Mature Secondary Structure
ALHFEKAEFASRLARLTEKMKEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDG
CCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCEEEEECCCHHHHHHHHHHHEECCC
TMALLTRSADLRQARHTSILEDIHIWVDRVNADPTLDLKNLLVEMDLLGARIGVEYDTHG
CEEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCEECCCC
MTGRISRLLDAQLTTFGQITDASYLVSRLRLVKSPTEVAYVERAAVLADDALDAAIRLTK
CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHEEEECC
PGADEADILAAMQGAIFSGGGDYPANEFIIGSGADALLCRYKAGRRKLDASDQLTLEWAG
CCCCHHHHHHHHHCCEECCCCCCCCCCEEEECCCHHHHHHHHHCCCCCCCCCCEEEEEHH
AYAHYHAAMMRTIVIGEPMQRHRELYNACRETIEAIETVLKPGNSFGDVFDMHARIMDER
HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC
GLARHRLNACGYSLGARFSPSWMEHQMFHVGNPQPIEPNMSLFVHMIIADSDTGTAMTLG
CHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCEEEEC
QTYLTTADAPRALSRHPLDFIGL
CCEEECCCCCHHHHCCCCHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: Co2+ [C]
Kcat value (1/min): 8100 [C]
Specific activity: 3.45
Km value (mM): 3 {(4-nitro)Phe-Pro-HN-CH2-CH2-NH-o-aminobenzoyl}; 0.22 {(4-nitro)Phe-Pro-Pro-HN-CH2-CH2-NH-o-aminobenzoyl} [C]
Substrates: N-terminal amino acid including proline [C]
Specific reaction: Release of any N-terminal amino acid, including proline, that is linked to proline, even from a dipeptide or tripeptide [C]
General reaction: carboxylic acid amide hydrolysis [C]
Inhibitor: thiazolidide acid; -2-hydroxy-3-amino acidpyrrolidide acid; Pro-methyl ester acid; Pro-Phe-methyl ester acid; 2-hydroxy-3-aminoacyl-Pro-OH dipeptides [C]
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424 [H]