Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is pepP [C]
Identifier: 15889492
GI number: 15889492
Start: 2184125
End: 2185276
Strand: Direct
Name: pepP [C]
Synonym: Atu2216
Alternate gene names: 15889492
Gene position: 2184125-2185276 (Clockwise)
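The coordinates above can be cross-checked against the sequence length and residue count reported elsewhere in this record. The following is a minimal sketch, assuming the Start and End values are 1-based and inclusive (an assumption, though it reproduces the 1152 bases and 383 residues listed below); the function and variable names are illustrative only.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START = 2184125
END = 2185276


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


length_bp = gene_length(START, END)  # bases in the coding sequence
codons = length_bp // 3              # codon count, including the stop codon
residues = codons - 1                # translated residues (stop codon excluded)

print(length_bp, codons, residues)
```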
Preceding gene: 15889491
Following gene: 159185102
Centisome position: 76.86
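The centisome value appears to be the gene start expressed as a percentage of the chromosome length. The sketch below assumes that formula (it is not stated in the record, but it reproduces the listed value from the Start coordinate and the 2,841,580-base length given at the top); the names are hypothetical.

```python
CHROMOSOME_LENGTH = 2_841_580  # from the Length field of this record
GENE_START = 2_184_125         # from the Start field of this record


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of total replicon length (assumed definition)."""
    return 100.0 * position / genome_length


print(round(centisome(GENE_START, CHROMOSOME_LENGTH), 2))
```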
GC content: 61.2
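GC content is conventionally the percentage of G and C bases in the sequence. The sketch below shows one way to compute it; it assumes the listed 61.2 was obtained this way from the gene sequence that follows, which has not been verified here. The function name is illustrative.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    # Spaces and line breaks from wrapped sequence text are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input only: the first 12 bases of the gene, with a wrap-style space.
print(round(gc_percent("ATGGCC CTGCAT"), 1))
```

Pasting the full 1152-base gene sequence into `gc_percent` would be the way to compare against the listed figure.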
Gene sequence:
>1152_bases ATGGCCCTGCATTTCGAGCTTTCGGAATTCGATGCCCGCCGCGAGCGTCTGCTCACGAAAATGGCCGAGGAGAAGCTGGA CGCCCTGCTGCTTTTTGCGCAGGAGAGCATGTATTGGCTGACCGGCTACGACACCTTCGGTTACTGCTTCTTCCAGACGC TTGTGGTCAAGTCGGATGGCTCCATGACGCTGCTGACCCGCTCGGCCGATCTGCGGCAGGCCCGCAATACCTCCGTCATC GACAATATCCTCATCTGGGTCGACCGGCCCAATGCGGACCCCACACTCGATCTGAAAAACCTGCTGAGCGATCTCGATCT GCTCGGCGCCAAGATCGGGGTGGAATACGACACCCACGGCATGACCGGCCGTGTCGCCCGCCTGCTGGACAACCAGCTTG CCAGTTTCTGCCAGATGAGCGACGCCTCTTATCTCGTCAGCACCCTCAGGCTCGTCAAAAGCCAGGCCGAGATCGCTTAT GCCCGCAAGGCCGGGCAACTGGCCGATGAGGCTCTGGATGCGGCATTGCCGCTCATCAAACCGGGCGCGGACGAAGCTGC GATCCTTGCCGCGATGCAAGGGGCAGTTCTGGCCGGCGGCGGTGATTATCCCGCCAATGAGTTCATCGTCGGTTCCGGCG CGGATGCGCTGCTCTGCCGTTACAAGGCCGGACGCCGCAGGCTCGACACCAAGGATCAGCTGACGCTGGAGTGGGCCGGT GTCAGCGCCCATTACCATGCAGCCATGATGCGCACGGTGGTGATCGGCGAACAGGATTTCCGCCAGAAGGAGCTTTACAG CGCCTGCCTGCAAAATATCACCGCCATCGAGGAAGTGCTGCGGCCGGGCAAGACTTTCGGCGATGTCTTCGATGTGCATG CCCGCGTGATGGACGAACGCGGATTGACCCGCCACCGCCTGAACTCCTGCGGTTATTCGCTGGGCGCCCGTTTCTCGCCT TCCTGGATGGAGCACCAGATGTTCCATGCCGGCAATCCGCAGGAAATCACCGCCGACATGACGCTCTTCGTGCATATGAT CGTGATGGATTCCGATTCCGGCACGGCCATGACGCTCGGCCAGACCTATCTGACAACGGAGGGCGCCCCGGAAGCGCTCT CGCGCCATAACCTCGATCTTCTCACAGCTTAA
Upstream 100 bases:
>100_bases CCCGACGAACATCGTTTCTATTCCTACCGCCGCACCACCCACCGCGCCGAACCCGATTACGGCCGGCAGATTTCCGCAAT CGCAATTCTGGAGGACTGAC
Downstream 100 bases:
>100_bases ATCGCCACCGGGTTTGACAGAGCGGGAAAACCGCCCTTACATTGCGCCACCTTGGGGCAAGAAGGAGCGGAACATGAGAG GGAAAATCGCATGAGCCGGA
Product: Xaa-Pro dipeptidase
Products: N-terminal amino acid including proline [C]
Alternate protein names: NA
Number of amino acids: Translated: 383; Mature: 382
Protein sequence:
>383_residues MALHFELSEFDARRERLLTKMAEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDGSMTLLTRSADLRQARNTSVI DNILIWVDRPNADPTLDLKNLLSDLDLLGAKIGVEYDTHGMTGRVARLLDNQLASFCQMSDASYLVSTLRLVKSQAEIAY ARKAGQLADEALDAALPLIKPGADEAAILAAMQGAVLAGGGDYPANEFIVGSGADALLCRYKAGRRRLDTKDQLTLEWAG VSAHYHAAMMRTVVIGEQDFRQKELYSACLQNITAIEEVLRPGKTFGDVFDVHARVMDERGLTRHRLNSCGYSLGARFSP SWMEHQMFHAGNPQEITADMTLFVHMIVMDSDSGTAMTLGQTYLTTEGAPEALSRHNLDLLTA
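The protein sequence is the conceptual translation of the gene sequence listed earlier. The sketch below shows that step using the standard genetic code, built from the usual NCBI-ordered amino acid string; it is an illustration, not the tool that produced this record. The helper name `translate` is hypothetical, and the two test fragments are the first 30 and last 12 bases of the gene sequence above.

```python
from itertools import product

# Standard genetic code in NCBI order (first, second, third base cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.upper().split())  # drop spaces from wrapped sequence text
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCCTGCATTTCGAGCTTTCGGAATTC"))  # N-terminal fragment
print(translate("CTCACAGCTTAA"))                    # C-terminal fragment with stop
```

The two fragments translate to `MALHFELSEF` and `LTA`, matching the start and end of the 383-residue sequence above.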
Sequences:
>Translated_383_residues MALHFELSEFDARRERLLTKMAEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDGSMTLLTRSADLRQARNTSVI DNILIWVDRPNADPTLDLKNLLSDLDLLGAKIGVEYDTHGMTGRVARLLDNQLASFCQMSDASYLVSTLRLVKSQAEIAY ARKAGQLADEALDAALPLIKPGADEAAILAAMQGAVLAGGGDYPANEFIVGSGADALLCRYKAGRRRLDTKDQLTLEWAG VSAHYHAAMMRTVVIGEQDFRQKELYSACLQNITAIEEVLRPGKTFGDVFDVHARVMDERGLTRHRLNSCGYSLGARFSP SWMEHQMFHAGNPQEITADMTLFVHMIVMDSDSGTAMTLGQTYLTTEGAPEALSRHNLDLLTA
>Mature_382_residues ALHFELSEFDARRERLLTKMAEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDGSMTLLTRSADLRQARNTSVID NILIWVDRPNADPTLDLKNLLSDLDLLGAKIGVEYDTHGMTGRVARLLDNQLASFCQMSDASYLVSTLRLVKSQAEIAYA RKAGQLADEALDAALPLIKPGADEAAILAAMQGAVLAGGGDYPANEFIVGSGADALLCRYKAGRRRLDTKDQLTLEWAGV SAHYHAAMMRTVVIGEQDFRQKELYSACLQNITAIEEVLRPGKTFGDVFDVHARVMDERGLTRHRLNSCGYSLGARFSPS WMEHQMFHAGNPQEITADMTLFVHMIVMDSDSGTAMTLGQTYLTTEGAPEALSRHNLDLLTA
Specific function: Unknown
COG id: COG0006
COG function: function code E; Xaa-Pro aminopeptidase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M24 family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000587
- InterPro: IPR000994 [H]
Pfam domain/function: PF01321 Creatinase_N; PF00557 Peptidase_M24 [H]
EC number: 3.4.11.9 [C]
Molecular weight: Translated: 42349; Mature: 42218
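The gap between the two molecular weights is consistent with removal of the initiator methionine in the mature form. The sketch below checks that arithmetic; the average residue mass of methionine (about 131.19 Da) is a standard value supplied here as an assumption, not a figure from this record.

```python
MW_TRANSLATED = 42349      # from the record
MW_MATURE = 42218          # from the record
MET_RESIDUE_MASS = 131.19  # average Met residue mass in Da (assumed standard value)

difference = MW_TRANSLATED - MW_MATURE

# The difference should be close to one Met residue if only the
# N-terminal methionine separates the two forms.
print(difference, abs(difference - MET_RESIDUE_MASS) < 1.0)
```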
Theoretical pI: Translated: 5.02; Mature: 5.02
Prosite motif: PS00141 ASP_PROTEASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
4.2 %Met (Translated Protein)
5.5 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
3.9 %Met (Mature Protein)
5.2 %Cys+Met (Mature Protein)
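These percentages are residue counts divided by sequence length. The sketch below shows the calculation; the function name is illustrative, and the counts used in the check (5 Cys and 16 Met in the 383-residue form, one Met fewer in the mature form) were tallied by hand from the protein sequence above and should be treated as an assumption.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any letter in residues."""
    seq = [aa for aa in protein.upper() if aa.isalpha()]  # ignore spaces
    if not seq:
        raise ValueError("empty protein sequence")
    wanted = residues.upper()
    hits = sum(1 for aa in seq if aa in wanted)
    return 100.0 * hits / len(seq)


# Hand-tallied counts (assumption): 5 Cys, 16 Met in 383 residues;
# the mature form loses the initiator Met, leaving 15 Met in 382 residues.
translated = {"Cys": 100 * 5 / 383, "Met": 100 * 16 / 383, "Cys+Met": 100 * 21 / 383}
mature = {"Cys": 100 * 5 / 382, "Met": 100 * 15 / 382, "Cys+Met": 100 * 20 / 382}

for label, values in (("Translated", translated), ("Mature", mature)):
    print(label, {name: round(value, 1) for name, value in values.items()})
```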
Secondary structure:
>Translated Secondary Structure
MALHFELSEFDARRERLLTKMAEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDG
CEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHHHHEECCC
SMTLLTRSADLRQARNTSVIDNILIWVDRPNADPTLDLKNLLSDLDLLGAKIGVEYDTHG
CEEEEECCHHHHHHCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCEECCCC
MTGRVARLLDNQLASFCQMSDASYLVSTLRLVKSQAEIAYARKAGQLADEALDAALPLIK
CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
PGADEAAILAAMQGAVLAGGGDYPANEFIVGSGADALLCRYKAGRRRLDTKDQLTLEWAG
CCCCHHHHHHHHCCCEEECCCCCCCCCEEEECCCHHHHHEEHHCCCCCCCCCCEEEEEEC
VSAHYHAAMMRTVVIGEQDFRQKELYSACLQNITAIEEVLRPGKTFGDVFDVHARVMDER
CCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHC
GLTRHRLNSCGYSLGARFSPSWMEHQMFHAGNPQEITADMTLFVHMIVMDSDSGTAMTLG
CCHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHEEEEEEEEEECCCCCEEEEC
QTYLTTEGAPEALSRHNLDLLTA
CEEEECCCCHHHHHHCCCEEECC
>Mature Secondary Structure
ALHFELSEFDARRERLLTKMAEEKLDALLLFAQESMYWLTGYDTFGYCFFQTLVVKSDG
EEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHHHHEECCC
SMTLLTRSADLRQARNTSVIDNILIWVDRPNADPTLDLKNLLSDLDLLGAKIGVEYDTHG
CEEEEECCHHHHHHCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCEECCCC
MTGRVARLLDNQLASFCQMSDASYLVSTLRLVKSQAEIAYARKAGQLADEALDAALPLIK
CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
PGADEAAILAAMQGAVLAGGGDYPANEFIVGSGADALLCRYKAGRRRLDTKDQLTLEWAG
CCCCHHHHHHHHCCCEEECCCCCCCCCEEEECCCHHHHHEEHHCCCCCCCCCCEEEEEEC
VSAHYHAAMMRTVVIGEQDFRQKELYSACLQNITAIEEVLRPGKTFGDVFDVHARVMDER
CCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHC
GLTRHRLNSCGYSLGARFSPSWMEHQMFHAGNPQEITADMTLFVHMIVMDSDSGTAMTLG
CCHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHEEEEEEEEEECCCCCEEEEC
QTYLTTEGAPEALSRHNLDLLTA
CEEEECCCCHHHHHHCCCEEECC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: Co2+ [C]
Kcat value (1/min): 8100 [C]
Specific activity: 3.45
Km value (mM): 0.22 {(4-nitro)Phe-Pro-Pro-HN-CH2-CH2-NH-o-aminobenzoyl}; 3 {(4-nitro)Phe-Pro-HN-CH2-CH2-NH-o-aminobenzoyl} [C]
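The kinetic values above use mixed units (kcat per minute, Km in millimolar). The sketch below converts them to per-second and molar units and forms a kcat/Km ratio. The record does not say which substrate the kcat value refers to, so pairing it with the 0.22 mM Km is purely an assumption made for illustration; both values carry the [C] tag.

```python
KCAT_PER_MIN = 8100.0  # from the record, 1/min
KM_MILLIMOLAR = 0.22   # from the record, mM (first listed substrate)

kcat_per_s = KCAT_PER_MIN / 60.0     # convert 1/min to 1/s
km_molar = KM_MILLIMOLAR / 1000.0    # convert mM to M

# Catalytic efficiency in 1/(M*s), under the pairing assumption stated above.
efficiency = kcat_per_s / km_molar

print(kcat_per_s, round(efficiency))
```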
Substrates: N-terminal amino acid including proline [C]
Specific reaction: Release of any N-terminal amino acid, including proline, that is linked to proline, even from a dipeptide or tripeptide [C]
General reaction: carboxylic acid amide hydrolysis [C]
Inhibitor: thiazolidide acid; 2-hydroxy-3-amino acid pyrrolidide acid; Pro-methyl ester acid; Pro-Phe-methyl ester acid; 2-hydroxy-3-aminoacyl-Pro-OH dipeptides [C]
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424 [H]