Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is pepQ [H]

Identifier: 15889355

GI number: 15889355

Start: 2030680

End: 2032521

Strand: Reverse

Name: pepQ [H]

Synonym: Atu2070

Alternate gene names: 15889355

Gene position: 2032521-2030680 (Counterclockwise)
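
The coordinates above can be cross-checked against the 1842-base sequence header and the 613-residue protein listed further down. The following is a minimal Python sketch, assuming the Start and End values are 1-based and inclusive (an assumption that matches the numbers in this record); the name `gene_length` is illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive, 1-based coordinates (assumed)."""
    return abs(end - start) + 1


length = gene_length(2030680, 2032521)  # Start and End fields of this record
codons = length // 3                    # total codons, including the stop codon
residues = codons - 1                   # residues in the translated protein

print(length, codons, residues)
```

Under that assumption the result agrees with the `>1842_bases` header and with the "Number of amino acids" field.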

Preceding gene: 159185037

Following gene: 159185036

Centisome position: 71.53
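
The centisome value appears to be the gene's position expressed as a percentage of the chromosome length. The sketch below assumes the position used is the higher coordinate, 2032521 (the first base of the gene on the reverse strand); this is inferred from the numbers in this record rather than stated by it, and the function name `centisome` is illustrative.

```python
GENOME_LENGTH = 2_841_580  # Length field of this record


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position as a percentage of the chromosome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


print(centisome(2032521))  # higher coordinate of the gene
print(centisome(2030680))  # lower coordinate, for comparison
```

Only the higher coordinate reproduces the listed 71.53; the lower coordinate gives 71.46.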

GC content: 63.3
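
GC content is conventionally the percentage of G and C bases in the sequence. A minimal Python sketch of that calculation follows; `gc_percent` is an illustrative name, and rounding to one decimal place is an assumption chosen to match the precision shown above.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    seq = "".join(seq.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)


print(gc_percent("ATGC"))
```

Passing the gene sequence listed below (without the `>1842_bases` header line) would give the value to compare with this field.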

Gene sequence:

>1842_bases
ATGTTCCAGACCTTCGACAACAAATCCGCACCGCAATTCGGCAAGGCCCGTGTCGAGGCGCTGCGCGCCGGTTTCGATGC
ACTCGGTATCGACGGATTTCTCGTGCCGCGCGCCGATGAATATCAGGGAGAATATGTGCCTGAGTGCGCCGAGCGCCTGT
CATGGCTGACCGGCTTTACTGGTTCGGCCGGCATTGCTCTGGTGACGCGCGCGCAAGCGGTGGTGTTTGTCGATGGCCGA
TATACGACGCAGCTCAAGTCTCAGGTCGATCAGTCGGTCTTTACCGGCGGTGATCTGGTCGGCGCGCCGCCTTCCGTCTG
GCTGTCCGAACATGCGGCACAGGGTTTCCGGCTCGGCATCGATCCATGGCTGCATACCGGCGCCGAGTTGAAGCGGCTGG
AAAAGGCGCTGGCCGGCAAGGGCGGCTCGGTCGTGCTTCTGGAAAAGAACCCGCTCGATGCTCTCTGGCAGGACCGTCCC
GCCGAGCCGCTGGAACCCGTTGTCATCCAGCCGGAAGCCTTTACCGGGATACTGGCGAAGGAAAAGATCGCCTCGCTCGC
GGAAACCGTCTCGGCCAAGGGTGCGGATGCCCTGCTGGTCACCGACCCGTCATCCATCGCCTGGATTTTCAATATTCGCG
GCAATGACGTGCCGCACACGCCGCATCCGCTCGCCCGCGGTATCATTTATGCCGACGGCAAGGCGGATATTTTTCTCGAC
AAGCGCAAGACCGGCATCGAGGCCGAGGCCTATCTCGCGCAACTGGCGACGCAGCTGCCGCCATCGAAGATTGCCGACCG
TCTGCACGCCATTGCCAGCGCCAAAGGCCGGGTGATGGTCGATGCCGATCTGACGCCTGTCGCGCTGACCGGCGCGATCA
CTGCGGCGGGTGGCTCTCTGATTGAAGAGGCCGATCCGGTCCGCCTGCCGCGGGCGCGCAAGAACAAGGCGGAGCTTGCC
GGCTCGGCTGCCGCGCATGTGCAGGATGGTGCGGCGATGGTTGAATATCTCTGCTGGCTGGATCGCCAGCAGCCGGGCAG
CGTCACCGAGATTGCCGCAGTAAAGGCGCTGGAGGCGGCACGCGCCAAGGTGGGGCAGGCGATGCAGAACCCGCTGAAGG
ATGTGTCGTTCGACACGATTTCCGGCGCTGGCGACCATGCCGCCATCATCCATTACCGCGTCACGACGGACACGGACCGC
ATACTTGCCGATGGCGAGATGTTTCTCGTTGATTCCGGTGCGCAATATGTCAACGGCACCACTGATATCACCCGCACCGT
TGCCATCGGCACGGTGCCGGAAGAGCAGAGGCGTTTCTTCACGCTGGTGCTGAAGGGCGTGATTGCCATTAGTGCGGCAC
GGTTTCCGAAGGGAACGCGTGGCTGCGACCTCGATCCGTTGGCGCGCATCGCGCTCTGGAAGGCGGGGGCAGATTATGCC
CATGGCACCGGCCACGGCGTCGGCTCCTATCTCTCCGTGCACGAGGGGCCGCAACGTATTGCAAGGCTTTCGACGCAGGA
GCTTCTCCCCGGCATGATCCTGTCGAACGAGCCGGGTTATTATCGTCCCGGCGCTTTCGGCATCCGGATCGAGAACCTGA
TCTATGTGCGCGAGGCGGAGGAGGTGGCCGGCGGTGACCAGCCGATGTTCTCCTTCGAGACGCTGACCTGGTGCCCGATC
GACCGCCGGCTGGTCGTCGTGTCCCTGCTGACTGACGAGGAACTCGACTGGCTGAACGCCTACCACGCCGACGTTCTGGA
GAAGCTCTCTCCGCTCATCACCGATGAAGAGGTGAAGGCGTGGCTCGTTGCCGCGACGAAGCCGTTGGAGCGGGCCGCCT
AA

Upstream 100 bases:

>100_bases
GGCTTGAAACCGGATTGATCGGCATGGCATGGTCCTATCCGGTCGCTGATCCTATCGCGCGGTTGGCGGCTTTTCTTATG
ACAGTTTCTGAAAGACACAC

Downstream 100 bases:

>100_bases
AACAGCGAGACGTGCCTGATGGCTATGACGATGGCCCAGCCGATGGCGAGCATCGTCAGGCCGGGAAAGCGCAGGCAGAC
CAGAAGGGCTGCCACCATGC

Product: aminopeptidase P

Products: NA

Alternate protein names: X-Pro dipeptidase; Imidodipeptidase; Proline dipeptidase; Prolidase [H]

Number of amino acids: Translated: 613; Mature: 613

Protein sequence:

>613_residues
MFQTFDNKSAPQFGKARVEALRAGFDALGIDGFLVPRADEYQGEYVPECAERLSWLTGFTGSAGIALVTRAQAVVFVDGR
YTTQLKSQVDQSVFTGGDLVGAPPSVWLSEHAAQGFRLGIDPWLHTGAELKRLEKALAGKGGSVVLLEKNPLDALWQDRP
AEPLEPVVIQPEAFTGILAKEKIASLAETVSAKGADALLVTDPSSIAWIFNIRGNDVPHTPHPLARGIIYADGKADIFLD
KRKTGIEAEAYLAQLATQLPPSKIADRLHAIASAKGRVMVDADLTPVALTGAITAAGGSLIEEADPVRLPRARKNKAELA
GSAAAHVQDGAAMVEYLCWLDRQQPGSVTEIAAVKALEAARAKVGQAMQNPLKDVSFDTISGAGDHAAIIHYRVTTDTDR
ILADGEMFLVDSGAQYVNGTTDITRTVAIGTVPEEQRRFFTLVLKGVIAISAARFPKGTRGCDLDPLARIALWKAGADYA
HGTGHGVGSYLSVHEGPQRIARLSTQELLPGMILSNEPGYYRPGAFGIRIENLIYVREAEEVAGGDQPMFSFETLTWCPI
DRRLVVVSLLTDEELDWLNAYHADVLEKLSPLITDEEVKAWLVAATKPLERAA
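
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard genetic code and that the listed gene sequence is already the coding strand (it begins with ATG and ends with TAA); the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code); '*' is stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
print(translate("ATGTTCCAGACCTTCGACAACAAATCCGCA"))  # MFQTFDNKSA
```

The last four codons of the gene sequence, `CGGGCCGCCTAA`, translate to `RAA` followed by a stop, matching the end of the protein sequence.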

Sequences:

>Translated_613_residues
MFQTFDNKSAPQFGKARVEALRAGFDALGIDGFLVPRADEYQGEYVPECAERLSWLTGFTGSAGIALVTRAQAVVFVDGR
YTTQLKSQVDQSVFTGGDLVGAPPSVWLSEHAAQGFRLGIDPWLHTGAELKRLEKALAGKGGSVVLLEKNPLDALWQDRP
AEPLEPVVIQPEAFTGILAKEKIASLAETVSAKGADALLVTDPSSIAWIFNIRGNDVPHTPHPLARGIIYADGKADIFLD
KRKTGIEAEAYLAQLATQLPPSKIADRLHAIASAKGRVMVDADLTPVALTGAITAAGGSLIEEADPVRLPRARKNKAELA
GSAAAHVQDGAAMVEYLCWLDRQQPGSVTEIAAVKALEAARAKVGQAMQNPLKDVSFDTISGAGDHAAIIHYRVTTDTDR
ILADGEMFLVDSGAQYVNGTTDITRTVAIGTVPEEQRRFFTLVLKGVIAISAARFPKGTRGCDLDPLARIALWKAGADYA
HGTGHGVGSYLSVHEGPQRIARLSTQELLPGMILSNEPGYYRPGAFGIRIENLIYVREAEEVAGGDQPMFSFETLTWCPI
DRRLVVVSLLTDEELDWLNAYHADVLEKLSPLITDEEVKAWLVAATKPLERAA
>Mature_613_residues
MFQTFDNKSAPQFGKARVEALRAGFDALGIDGFLVPRADEYQGEYVPECAERLSWLTGFTGSAGIALVTRAQAVVFVDGR
YTTQLKSQVDQSVFTGGDLVGAPPSVWLSEHAAQGFRLGIDPWLHTGAELKRLEKALAGKGGSVVLLEKNPLDALWQDRP
AEPLEPVVIQPEAFTGILAKEKIASLAETVSAKGADALLVTDPSSIAWIFNIRGNDVPHTPHPLARGIIYADGKADIFLD
KRKTGIEAEAYLAQLATQLPPSKIADRLHAIASAKGRVMVDADLTPVALTGAITAAGGSLIEEADPVRLPRARKNKAELA
GSAAAHVQDGAAMVEYLCWLDRQQPGSVTEIAAVKALEAARAKVGQAMQNPLKDVSFDTISGAGDHAAIIHYRVTTDTDR
ILADGEMFLVDSGAQYVNGTTDITRTVAIGTVPEEQRRFFTLVLKGVIAISAARFPKGTRGCDLDPLARIALWKAGADYA
HGTGHGVGSYLSVHEGPQRIARLSTQELLPGMILSNEPGYYRPGAFGIRIENLIYVREAEEVAGGDQPMFSFETLTWCPI
DRRLVVVSLLTDEELDWLNAYHADVLEKLSPLITDEEVKAWLVAATKPLERAA

Specific function: Splits dipeptides with a prolyl residue in the C-terminal position and a nonpolar amino acid in the N-terminal position [H]

COG id: COG0006

COG function: function code E; Xaa-Pro aminopeptidase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M24B family. Archaeal-type prolidase subfamily [H]

Homologues:

Organism=Homo sapiens, GI264681563, Length=609, Percent_Identity=37.9310344827586, Blast_Score=372, Evalue=1e-103,
Organism=Homo sapiens, GI93141226, Length=605, Percent_Identity=34.3801652892562, Blast_Score=337, Evalue=1e-92,
Organism=Homo sapiens, GI264681565, Length=609, Percent_Identity=36.1247947454844, Blast_Score=337, Evalue=2e-92,
Organism=Escherichia coli, GI1788728, Length=175, Percent_Identity=37.7142857142857, Blast_Score=91, Evalue=3e-19,
Organism=Escherichia coli, GI1789275, Length=240, Percent_Identity=26.6666666666667, Blast_Score=66, Evalue=5e-12,
Organism=Caenorhabditis elegans, GI17509539, Length=603, Percent_Identity=36.9817578772803, Blast_Score=341, Evalue=8e-94,
Organism=Caenorhabditis elegans, GI25149105, Length=629, Percent_Identity=29.2527821939587, Blast_Score=259, Evalue=4e-69,
Organism=Saccharomyces cerevisiae, GI6322999, Length=658, Percent_Identity=29.9392097264438, Blast_Score=276, Evalue=7e-75,
Organism=Drosophila melanogaster, GI17137632, Length=611, Percent_Identity=34.0425531914894, Blast_Score=327, Evalue=1e-89,
Organism=Drosophila melanogaster, GI161078230, Length=584, Percent_Identity=29.1095890410959, Blast_Score=242, Evalue=5e-64,
Organism=Drosophila melanogaster, GI21357287, Length=584, Percent_Identity=29.1095890410959, Blast_Score=242, Evalue=5e-64,
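
Each homologue line above is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A minimal Python sketch for turning one such line into a dictionary follows; the function name `parse_homologue` and the choice of numeric types are illustrative assumptions.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        key, sep, value = field.partition("=")
        if sep:
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI264681563
    # Convert the numeric fields; organism and GI stay as strings.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Homo sapiens, GI264681563, Length=609, "
        "Percent_Identity=37.9310344827586, Blast_Score=372, Evalue=1e-103,")
print(parse_homologue(line))
```

Parsed this way, the hits can be sorted by `Evalue` or `Blast_Score` to rank the homologues.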

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000587
- InterPro:   IPR001714
- InterPro:   IPR000994
- InterPro:   IPR001131 [H]

Pfam domain/function: PF01321 Creatinase_N; PF00557 Peptidase_M24 [H]

EC number: 3.4.13.9 [H]

Molecular weight: Translated: 65970; Mature: 65970

Theoretical pI: Translated: 5.13; Mature: 5.13

Prosite motif: PS00141 ASP_PROTEASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
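
The percentages above are residue counts divided by the protein length. A minimal Python sketch of that calculation follows; `cys_met_percent` is an illustrative name, and rounding to one decimal place is an assumption chosen to match the precision shown.

```python
def cys_met_percent(protein: str) -> dict:
    """Percent Cys (C), Met (M) and Cys+Met in a one-letter protein sequence."""
    protein = "".join(protein.split()).upper()  # drop line breaks
    if not protein:
        raise ValueError("empty sequence")
    cys = 100.0 * protein.count("C") / len(protein)
    met = 100.0 * protein.count("M") / len(protein)
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


print(cys_met_percent("MCAAAAAAAA"))  # toy 10-residue sequence, not from the record
```

Applied to the 613-residue sequence listed under "Protein sequence", this should give values to compare with the table above.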

Secondary structure:

>Translated Secondary Structure
MFQTFDNKSAPQFGKARVEALRAGFDALGIDGFLVPRADEYQGEYVPECAERLSWLTGFT
CCCCCCCCCCCCHHHHHHHHHHCCCHHCCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCC
GSAGIALVTRAQAVVFVDGRYTTQLKSQVDQSVFTGGDLVGAPPSVWLSEHAAQGFRLGI
CCCCEEEEEECEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEECC
DPWLHTGAELKRLEKALAGKGGSVVLLEKNPLDALWQDRPAEPLEPVVIQPEAFTGILAK
CHHHHCCHHHHHHHHHHCCCCCCEEEEECCCCHHHHCCCCCCCCCCEEECCCHHHHHHHH
EKIASLAETVSAKGADALLVTDPSSIAWIFNIRGNDVPHTPHPLARGIIYADGKADIFLD
HHHHHHHHHHHCCCCCEEEEECCCCEEEEEEECCCCCCCCCCHHHCCEEEECCCCEEEEE
KRKTGIEAEAYLAQLATQLPPSKIADRLHAIASAKGRVMVDADLTPVALTGAITAAGGSL
CCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEECCCCCEEEECCHHHCCCCH
IEEADPVRLPRARKNKAELAGSAAAHVQDGAAMVEYLCWLDRQQPGSVTEIAAVKALEAA
HHCCCCCCCCCCCCCHHHHCCCHHHHHHCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH
RAKVGQAMQNPLKDVSFDTISGAGDHAAIIHYRVTTDTDRILADGEMFLVDSGAQYVNGT
HHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEECCCCCEEECCCEEEEECCCHHCCCC
TDITRTVAIGTVPEEQRRFFTLVLKGVIAISAARFPKGTRGCDLDPLARIALWKAGADYA
CCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCC
HGTGHGVGSYLSVHEGPQRIARLSTQELLPGMILSNEPGYYRPGAFGIRIENLIYVREAE
CCCCCCCCCHHHHCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCEEEEEEEEEEEEEEHH
EVAGGDQPMFSFETLTWCPIDRRLVVVSLLTDEELDWLNAYHADVLEKLSPLITDEEVKA
HHCCCCCCCEEECEEEECCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCHHHHE
WLVAATKPLERAA
EEEECCCCHHHCC
>Mature Secondary Structure
MFQTFDNKSAPQFGKARVEALRAGFDALGIDGFLVPRADEYQGEYVPECAERLSWLTGFT
CCCCCCCCCCCCHHHHHHHHHHCCCHHCCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCC
GSAGIALVTRAQAVVFVDGRYTTQLKSQVDQSVFTGGDLVGAPPSVWLSEHAAQGFRLGI
CCCCEEEEEECEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEECC
DPWLHTGAELKRLEKALAGKGGSVVLLEKNPLDALWQDRPAEPLEPVVIQPEAFTGILAK
CHHHHCCHHHHHHHHHHCCCCCCEEEEECCCCHHHHCCCCCCCCCCEEECCCHHHHHHHH
EKIASLAETVSAKGADALLVTDPSSIAWIFNIRGNDVPHTPHPLARGIIYADGKADIFLD
HHHHHHHHHHHCCCCCEEEEECCCCEEEEEEECCCCCCCCCCHHHCCEEEECCCCEEEEE
KRKTGIEAEAYLAQLATQLPPSKIADRLHAIASAKGRVMVDADLTPVALTGAITAAGGSL
CCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEECCCCCEEEECCHHHCCCCH
IEEADPVRLPRARKNKAELAGSAAAHVQDGAAMVEYLCWLDRQQPGSVTEIAAVKALEAA
HHCCCCCCCCCCCCCHHHHCCCHHHHHHCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH
RAKVGQAMQNPLKDVSFDTISGAGDHAAIIHYRVTTDTDRILADGEMFLVDSGAQYVNGT
HHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEECCCCCEEECCCEEEEECCCHHCCCC
TDITRTVAIGTVPEEQRRFFTLVLKGVIAISAARFPKGTRGCDLDPLARIALWKAGADYA
CCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCC
HGTGHGVGSYLSVHEGPQRIARLSTQELLPGMILSNEPGYYRPGAFGIRIENLIYVREAE
CCCCCCCCCHHHHCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCEEEEEEEEEEEEEEHH
EVAGGDQPMFSFETLTWCPIDRRLVVVSLLTDEELDWLNAYHADVLEKLSPLITDEEVKA
HHCCCCCCCEEECEEEECCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCHHHHE
WLVAATKPLERAA
EEEECCCCHHHCC
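
The secondary-structure strings above pair each residue with a one-letter state. The sketch below summarises such a string as percentages, assuming the common convention that H is helix, E is strand and C is coil; the record itself does not define these letters, and the name `state_percentages` is illustrative.

```python
from collections import Counter


def state_percentages(states: str) -> dict:
    """Percentage of each state letter in a secondary-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty string")
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in sorted(counts.items())}


# Final line of the prediction in this record (13 residues).
print(state_percentages("EEEECCCCHHHCC"))
```

Concatenating all prediction lines before calling the function gives the overall composition, which can be set against the "Structure class" field.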

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9733678; 11223522; 11210522 [H]