Definition Xanthomonas campestris pv. campestris str. 8004 chromosome, complete genome.
Accession NC_007086
Length 5,148,708

The map label for this gene is ptpA [H]

Identifier: 77761099

GI number: 77761099

Start: 726893

End: 729205

Strand: Reverse

Name: ptpA [H]

Synonym: XC_0611

Alternate gene names: 77761099

Gene position: 729205-726893 (Counterclockwise)

Preceding gene: 66766951

Following gene: 66766947

Centisome position: 14.16
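Note on the coordinate fields: the record does not say how the centisome position is derived. The sketch below assumes it is the gene's 5' coordinate (the higher coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. Under that assumption it reproduces the listed 14.16, whereas the lower coordinate would give 14.12. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_148_708
START, END = 726_893, 729_205   # 1-based, inclusive coordinates
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


length_bp = gene_length(START, END)
# One codon is the stop codon, so it does not contribute a residue.
residues = length_bp // 3 - 1
# Assumption: a reverse-strand gene is located by its 5' end, i.e. END.
five_prime = END if STRAND == "Reverse" else START
centisome_pos = round(centisome(five_prime, GENOME_LENGTH), 2)

print(length_bp, residues, centisome_pos)  # 2313 770 14.16
```

The computed length agrees with the 2313-base gene sequence, and the residue count agrees with the 770-residue translated protein.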

GC content: 66.45

Gene sequence:

>2313_bases
TTGCCGATGCGCCACCTGCTCACCGCCCTGGCCGTTGCCCTGCTGCCCGCACTGGCCAGCGCGCAGGCGCCCAGCGTGAC
TGCCGCCGACTACGCGCGCGCCGAACGCCTGGTCAGCTATCTGGCGCAGCCGCTGGTGGACCACGCCGCAACGCAGGTGA
CCTGGCTGGACCCCACGCATGTGGTGTATGTGGACCATGACGCCAGGGGCGACCGCCTGCTGCAGCTGGACACCGCCACC
GGCAAGACCGCGCCACTGTTCAAGCCTGCCCGGCTGGCCACCGCGCTCAACAGCTTGCTCAAGACCGGCGACAAACCGCT
CAAGGCCGCCACGCTGGCGCCCAAGGTCCGCCTCACCGCCGATGGCCGCTACCGTTTCGAGATCCGCGACACCGACGTGA
TCTGCGATGTGCGCGCCGCCTGCAGCAAGGTCTACGCCAGCGAAAAGGCCGAACCGGGCGTGCTTTCGCCGGACAAGACC
CGCGAGGCCTTCATCCGCAACTGGAACCTGTGGGTGCGTGAGCTGGCCACCGGCAAGGAAACCCAGCTGACCCGCGATGG
CGTGGAAAACTTCGGCTACGCCACCGACAACGCCGGCTGGCAGCACAGCGACAACGCCATCGTTGCGTGGTCGCCGGACT
CGCGCAAGATCGCCACCTTCCAGCAGGACCAGCGCAAGACCGGCGACATGTATCTGGTCAGCACCAAGCTCGGGCATCCG
GAACTGCAGTCGTGGAAATACCCGCTGGCCGGCGACAAGGACGTGACCATGATCGAGCGCGTCATCATCGACGTGCCCAC
CGCCAGCGTGGTGCGCCTGAAGCTGCCGCCGGACCAGCACCGCTCCACCCTGTGCGATGACGTGAGCTGCTCGCCGGGCC
TGTGGGACGACGTGAAGTGGGCGCCGGACAGCAAGACCCTGGCGTTCGCCTCCACCTCGCGCTTCCACAAGCAGGTGTGG
CTGCGCATTGCCGACGCCAGCACCGGCGCGGTGCGTACCGCCTTCGACGAAACCGCCAAGACCTATTACGAGAGCGGCCA
GGTCGCCGCGAACTGGGCCTATCTGCCCGGCAGCAACGAAGCGGTGTGGTTCTCCGAACGCAGCGACTGGGGCCAGCTGT
ATCTGTACGACCTGCAGACCGGCAAGCCCAAACGCGCGATCACCAGCGGCGAAGGCAATGTCACCGAGCTGCTGCGGGTG
GACCCGGTGAGCCGCACCGCGTGGTTTGTCGGCGTGGGCAAGGTGCCGGGCGTGGACCCGTACTACCAGCAACTGTGGAA
GGTGAGCCTGGACGGCGGCGCACCGGTGCTGCTGACGCCGGAAGCGGCCGATCACAGCATTGCACTCTCACCCGATGGCG
CGCGCTTCGTCGATACCTATTCCACCACGCTCACCCCACCGGTCAGCGTGCTGCGCGCTGCGGCCGATGGGCGCACGCTC
AGCACCGTCGCCACCGCCGACATCACCCGCCTGAAGGCCGCCGGCTGGGTCCCGCCGGAACCGATCACGGTGAAGGCGCG
CGACGGCAAGACCACCTTGTACGGCTTGCTGTTCAAGCCCACCCACTTCGATCCGGCACGCAAATATCCGGTGATCGATT
ACATCTACCCCGGCCCGCAGACCGGCTCGGTGCGCGGGCGCGGCTTCTACGCCGGCCATGGCGACAATCGATCGCTCGCG
GAACTGGGCTTCATCGTCATCGCCATCGACGGCATGGGCACGCCCTGGCGCTCCAAGACCTTCCACGACACCTGGTACGC
CAACATGGGCGACAACACCCTGCCCGACCAGGTGGCTGCGGTGAAGGAACTCGGCCAGCGCTACCCGTGGTTCGACACCA
CCCGCGTCGGCATCTGGGGCCACTCCGGTGGCGGCAATGCGTCCACCGGCGCGATGCTGCGCTACCCCGAACTGTTCAAG
GTGGCCTGGTCGGAAAGCGGCAACCACGACAACCGCGGCTACGAAGACGACTGGGCCGAGAAGTACCACGGCGAGCACAT
CGTCAACAAAGACGGCACCTCCAATTACGACGACCAAGCCAACGCGACGCATGCAAGCAAGCTGCAAGGCCGGCTGATGC
TGGTGCACGGCACGCTCGACGACAACGTGCCGCCATATCTCACCCTGTTGGTGGCCGACGCGCTGATCAAGGTCAACAAG
AACTTCGACATGCTGATGCTGCCCAATGCCAAGCATGGCTACGGCGACCTGACCCCGTATGTCACCCGCCGCCGTTGGGA
CTACTTCGTGCAGTATCTGCTCGGTGCTACGCCCCCGGCGCAGTACCAGATGCAGCCGATGCCGAAGCACTGA
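Note on the GC content field: a value such as the 66.45 listed above is conventionally the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming an unambiguous A/C/G/T sequence; `gc_content` is a hypothetical helper, shown here on the first four codons only.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First four codons of the gene sequence above.
first_codons = "TTGCCGATGCGC"
print(round(gc_content(first_codons), 2))  # 66.67
```

Running the same function on the full 2313-base sequence, with line breaks removed, is the way to check the listed value.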

Upstream 100 bases:

>100_bases
TATGGCAGGGTCACAGGCCCCGCCCGCGCACGTAGCATTGCCGGGTTGCGCCACTCTGCTCGGCGCGCGTCCTTTTGATG
AGGTCCACATGCCCACACCG

Downstream 100 bases:

>100_bases
GCACGCCAGTTGCAGCTGGTGTCATCGCGCAACGCAGATGGTTCATGCGTTGCGCAATGGCGCGGCGGCGCGGTACTGCA
CTCACGTCCGCGCCGCGCGC

Product: dipeptidyl peptidase IV

Products: NA

Alternate protein names: PTP; Prolyl tripeptidyl peptidase A [H]

Number of amino acids: Translated: 770; Mature: 769

Protein sequence:

>770_residues
MPMRHLLTALAVALLPALASAQAPSVTAADYARAERLVSYLAQPLVDHAATQVTWLDPTHVVYVDHDARGDRLLQLDTAT
GKTAPLFKPARLATALNSLLKTGDKPLKAATLAPKVRLTADGRYRFEIRDTDVICDVRAACSKVYASEKAEPGVLSPDKT
REAFIRNWNLWVRELATGKETQLTRDGVENFGYATDNAGWQHSDNAIVAWSPDSRKIATFQQDQRKTGDMYLVSTKLGHP
ELQSWKYPLAGDKDVTMIERVIIDVPTASVVRLKLPPDQHRSTLCDDVSCSPGLWDDVKWAPDSKTLAFASTSRFHKQVW
LRIADASTGAVRTAFDETAKTYYESGQVAANWAYLPGSNEAVWFSERSDWGQLYLYDLQTGKPKRAITSGEGNVTELLRV
DPVSRTAWFVGVGKVPGVDPYYQQLWKVSLDGGAPVLLTPEAADHSIALSPDGARFVDTYSTTLTPPVSVLRAAADGRTL
STVATADITRLKAAGWVPPEPITVKARDGKTTLYGLLFKPTHFDPARKYPVIDYIYPGPQTGSVRGRGFYAGHGDNRSLA
ELGFIVIAIDGMGTPWRSKTFHDTWYANMGDNTLPDQVAAVKELGQRYPWFDTTRVGIWGHSGGGNASTGAMLRYPELFK
VAWSESGNHDNRGYEDDWAEKYHGEHIVNKDGTSNYDDQANATHASKLQGRLMLVHGTLDDNVPPYLTLLVADALIKVNK
NFDMLMLPNAKHGYGDLTPYVTRRRWDYFVQYLLGATPPAQYQMQPMPKH

Sequences:

>Translated_770_residues
MPMRHLLTALAVALLPALASAQAPSVTAADYARAERLVSYLAQPLVDHAATQVTWLDPTHVVYVDHDARGDRLLQLDTAT
GKTAPLFKPARLATALNSLLKTGDKPLKAATLAPKVRLTADGRYRFEIRDTDVICDVRAACSKVYASEKAEPGVLSPDKT
REAFIRNWNLWVRELATGKETQLTRDGVENFGYATDNAGWQHSDNAIVAWSPDSRKIATFQQDQRKTGDMYLVSTKLGHP
ELQSWKYPLAGDKDVTMIERVIIDVPTASVVRLKLPPDQHRSTLCDDVSCSPGLWDDVKWAPDSKTLAFASTSRFHKQVW
LRIADASTGAVRTAFDETAKTYYESGQVAANWAYLPGSNEAVWFSERSDWGQLYLYDLQTGKPKRAITSGEGNVTELLRV
DPVSRTAWFVGVGKVPGVDPYYQQLWKVSLDGGAPVLLTPEAADHSIALSPDGARFVDTYSTTLTPPVSVLRAAADGRTL
STVATADITRLKAAGWVPPEPITVKARDGKTTLYGLLFKPTHFDPARKYPVIDYIYPGPQTGSVRGRGFYAGHGDNRSLA
ELGFIVIAIDGMGTPWRSKTFHDTWYANMGDNTLPDQVAAVKELGQRYPWFDTTRVGIWGHSGGGNASTGAMLRYPELFK
VAWSESGNHDNRGYEDDWAEKYHGEHIVNKDGTSNYDDQANATHASKLQGRLMLVHGTLDDNVPPYLTLLVADALIKVNK
NFDMLMLPNAKHGYGDLTPYVTRRRWDYFVQYLLGATPPAQYQMQPMPKH
>Mature_769_residues
PMRHLLTALAVALLPALASAQAPSVTAADYARAERLVSYLAQPLVDHAATQVTWLDPTHVVYVDHDARGDRLLQLDTATG
KTAPLFKPARLATALNSLLKTGDKPLKAATLAPKVRLTADGRYRFEIRDTDVICDVRAACSKVYASEKAEPGVLSPDKTR
EAFIRNWNLWVRELATGKETQLTRDGVENFGYATDNAGWQHSDNAIVAWSPDSRKIATFQQDQRKTGDMYLVSTKLGHPE
LQSWKYPLAGDKDVTMIERVIIDVPTASVVRLKLPPDQHRSTLCDDVSCSPGLWDDVKWAPDSKTLAFASTSRFHKQVWL
RIADASTGAVRTAFDETAKTYYESGQVAANWAYLPGSNEAVWFSERSDWGQLYLYDLQTGKPKRAITSGEGNVTELLRVD
PVSRTAWFVGVGKVPGVDPYYQQLWKVSLDGGAPVLLTPEAADHSIALSPDGARFVDTYSTTLTPPVSVLRAAADGRTLS
TVATADITRLKAAGWVPPEPITVKARDGKTTLYGLLFKPTHFDPARKYPVIDYIYPGPQTGSVRGRGFYAGHGDNRSLAE
LGFIVIAIDGMGTPWRSKTFHDTWYANMGDNTLPDQVAAVKELGQRYPWFDTTRVGIWGHSGGGNASTGAMLRYPELFKV
AWSESGNHDNRGYEDDWAEKYHGEHIVNKDGTSNYDDQANATHASKLQGRLMLVHGTLDDNVPPYLTLLVADALIKVNKN
FDMLMLPNAKHGYGDLTPYVTRRRWDYFVQYLLGATPPAQYQMQPMPKH
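Note on the translated and mature sequences: the gene begins with TTG, which encodes Leu internally but is read as Met when it serves as the initiation codon, and the mature sequence is the translated sequence without its first residue. The sketch below illustrates both points using the standard genetic code. The set of start codons is an assumed subset of those used in bacteria, and treating the mature form as simple removal of the initiator Met is an assumption inferred from the two sequences above.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of initiation codons commonly seen in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, reading a start codon at position 0 as Met."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":          # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"           # initiator codon is read as Met
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Assumed mature form: the translated protein without its initiator Met."""
    return protein[1:] if protein.startswith("M") else protein


print(translate("TTGCCGATGCGC"))          # MPMR (first four codons of the gene)
print(translate("AAGCACTGA"))             # KH   (last two codons plus stop)
print(mature(translate("TTGCCGATGCGC")))  # PMR
```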

Specific function: Serine proteinase. Releases tripeptides from the free amino terminus of proteins. Has a requirement for Pro in the P1 position, but is inactivated by Pro in the P1' position [H]

COG id: COG1506

COG function: function code E; Dipeptidyl aminopeptidases/acylaminoacyl-peptidases

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase S9B family [H]

Homologues:

Organism=Homo sapiens, GI37577089, Length=365, Percent_Identity=31.2328767123288, Blast_Score=150, Evalue=5e-36,
Organism=Homo sapiens, GI18450280, Length=365, Percent_Identity=31.2328767123288, Blast_Score=149, Evalue=7e-36,
Organism=Homo sapiens, GI194394146, Length=365, Percent_Identity=29.3150684931507, Blast_Score=139, Evalue=7e-33,
Organism=Homo sapiens, GI37577091, Length=365, Percent_Identity=26.8493150684932, Blast_Score=95, Evalue=2e-19,
Organism=Homo sapiens, GI16933540, Length=387, Percent_Identity=24.031007751938, Blast_Score=88, Evalue=3e-17,
Organism=Homo sapiens, GI18765694, Length=319, Percent_Identity=21.0031347962382, Blast_Score=70, Evalue=1e-11,
Organism=Caenorhabditis elegans, GI17508017, Length=349, Percent_Identity=28.080229226361, Blast_Score=114, Evalue=2e-25,
Organism=Caenorhabditis elegans, GI17508019, Length=349, Percent_Identity=28.080229226361, Blast_Score=113, Evalue=3e-25,
Organism=Caenorhabditis elegans, GI17564632, Length=489, Percent_Identity=22.6993865030675, Blast_Score=72, Evalue=9e-13,
Organism=Caenorhabditis elegans, GI17564634, Length=489, Percent_Identity=22.6993865030675, Blast_Score=72, Evalue=1e-12,
Organism=Saccharomyces cerevisiae, GI6324793, Length=218, Percent_Identity=27.5229357798165, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI45550825, Length=377, Percent_Identity=27.5862068965517, Blast_Score=114, Evalue=3e-25,
Organism=Drosophila melanogaster, GI45553511, Length=377, Percent_Identity=27.5862068965517, Blast_Score=114, Evalue=3e-25,
Organism=Drosophila melanogaster, GI45551969, Length=379, Percent_Identity=27.9683377308707, Blast_Score=113, Evalue=5e-25,
Organism=Drosophila melanogaster, GI221331178, Length=254, Percent_Identity=29.9212598425197, Blast_Score=87, Evalue=4e-17,
Organism=Drosophila melanogaster, GI17933704, Length=261, Percent_Identity=29.8850574712644, Blast_Score=87, Evalue=4e-17,
Organism=Drosophila melanogaster, GI161083744, Length=261, Percent_Identity=29.8850574712644, Blast_Score=86, Evalue=8e-17,
Organism=Drosophila melanogaster, GI24582032, Length=784, Percent_Identity=21.3010204081633, Blast_Score=78, Evalue=2e-14,
Organism=Drosophila melanogaster, GI24582257, Length=243, Percent_Identity=24.2798353909465, Blast_Score=75, Evalue=3e-13,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001375
- InterPro:   IPR002469 [H]

Pfam domain/function: PF00930 DPPIV_N; PF00326 Peptidase_S9 [H]

EC number: 3.4.14.12 [H]

Molecular weight: Translated: 85157; Mature: 85026

Theoretical pI: Translated: 7.37; Mature: 7.37

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
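Note on the Cys/Met percentages: each value is the count of the named residues divided by the sequence length. The mature protein lacks the initial Met, which is why its Met percentage is lower than that of the translated protein. A minimal sketch, with `residue_percent` as a hypothetical helper applied to the first 80-residue line of the translated sequence only:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return 100.0 * hits / len(protein)


# First line of the translated protein sequence from this record.
n_term = (
    "MPMRHLLTALAVALLPALASAQAPSVTAADYARAERLVSYLAQPLVDHAATQVTWLDPTHVVYVDHDARGDRLLQLDTAT"
)

for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{residue_percent(n_term, residues):.1f} %{label}")

# Dropping the initiator Met (the assumed mature form) lowers the Met share.
print(f"{residue_percent(n_term[1:], 'M'):.1f} %Met (without first residue)")
```

Applying the function to the full translated and mature sequences is how the tabulated values can be checked.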

Secondary structure:

>Translated Secondary Structure
MPMRHLLTALAVALLPALASAQAPSVTAADYARAERLVSYLAQPLVDHAATQVTWLDPTH
CCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCEEEEECCEE
VVYVDHDARGDRLLQLDTATGKTAPLFKPARLATALNSLLKTGDKPLKAATLAPKVRLTA
EEEEECCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCEEEEEE
DGRYRFEIRDTDVICDVRAACSKVYASEKAEPGVLSPDKTREAFIRNWNLWVRELATGKE
CCEEEEEEECCCEEEEHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCH
TQLTRDGVENFGYATDNAGWQHSDNAIVAWSPDSRKIATFQQDQRKTGDMYLVSTKLGHP
HHHHHHHHHHCCCCCCCCCCEECCCEEEEECCCCCEEEHHHHHHCCCCCEEEEEECCCCC
ELQSWKYPLAGDKDVTMIERVIIDVPTASVVRLKLPPDQHRSTLCDDVSCSPGLWDDVKW
HHHHCCCCCCCCCCHHHHHHHHEECCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCCCC
APDSKTLAFASTSRFHKQVWLRIADASTGAVRTAFDETAKTYYESGQVAANWAYLPGSNE
CCCCCEEEEEHHHHHHEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCC
AVWFSERSDWGQLYLYDLQTGKPKRAITSGEGNVTELLRVDPVSRTAWFVGVGKVPGVDP
EEEEECCCCCCEEEEEEECCCCCCCEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCH
YYQQLWKVSLDGGAPVLLTPEAADHSIALSPDGARFVDTYSTTLTPPVSVLRAAADGRTL
HHHHHEEEEECCCCCEEECCCCCCCEEEECCCCCEEEEECCCCCCCHHHHHHHHCCCCEE
STVATADITRLKAAGWVPPEPITVKARDGKTTLYGLLFKPTHFDPARKYPVIDYIYPGPQ
EEEHHHHHHHHHHCCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCC
TGSVRGRGFYAGHGDNRSLAELGFIVIAIDGMGTPWRSKTFHDTWYANMGDNTLPDQVAA
CCCCCCCEEEECCCCCCCHHHCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCCHHHHHH
VKELGQRYPWFDTTRVGIWGHSGGGNASTGAMLRYPELFKVAWSESGNHDNRGYEDDWAE
HHHHHHCCCCCCCEEEEEEECCCCCCCCCCCEEECCHHHEEEECCCCCCCCCCCCCHHHH
KYHGEHIVNKDGTSNYDDQANATHASKLQGRLMLVHGTLDDNVPPYLTLLVADALIKVNK
HHCCCEEECCCCCCCCCCCCCCCHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHEECC
NFDMLMLPNAKHGYGDLTPYVTRRRWDYFVQYLLGATPPAQYQMQPMPKH
CCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
>Mature Secondary Structure 
PMRHLLTALAVALLPALASAQAPSVTAADYARAERLVSYLAQPLVDHAATQVTWLDPTH
CHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCEEEEECCEE
VVYVDHDARGDRLLQLDTATGKTAPLFKPARLATALNSLLKTGDKPLKAATLAPKVRLTA
EEEEECCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCEEEEEE
DGRYRFEIRDTDVICDVRAACSKVYASEKAEPGVLSPDKTREAFIRNWNLWVRELATGKE
CCEEEEEEECCCEEEEHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCH
TQLTRDGVENFGYATDNAGWQHSDNAIVAWSPDSRKIATFQQDQRKTGDMYLVSTKLGHP
HHHHHHHHHHCCCCCCCCCCEECCCEEEEECCCCCEEEHHHHHHCCCCCEEEEEECCCCC
ELQSWKYPLAGDKDVTMIERVIIDVPTASVVRLKLPPDQHRSTLCDDVSCSPGLWDDVKW
HHHHCCCCCCCCCCHHHHHHHHEECCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCCCC
APDSKTLAFASTSRFHKQVWLRIADASTGAVRTAFDETAKTYYESGQVAANWAYLPGSNE
CCCCCEEEEEHHHHHHEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCC
AVWFSERSDWGQLYLYDLQTGKPKRAITSGEGNVTELLRVDPVSRTAWFVGVGKVPGVDP
EEEEECCCCCCEEEEEEECCCCCCCEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCH
YYQQLWKVSLDGGAPVLLTPEAADHSIALSPDGARFVDTYSTTLTPPVSVLRAAADGRTL
HHHHHEEEEECCCCCEEECCCCCCCEEEECCCCCEEEEECCCCCCCHHHHHHHHCCCCEE
STVATADITRLKAAGWVPPEPITVKARDGKTTLYGLLFKPTHFDPARKYPVIDYIYPGPQ
EEEHHHHHHHHHHCCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCC
TGSVRGRGFYAGHGDNRSLAELGFIVIAIDGMGTPWRSKTFHDTWYANMGDNTLPDQVAA
CCCCCCCEEEECCCCCCCHHHCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCCHHHHHH
VKELGQRYPWFDTTRVGIWGHSGGGNASTGAMLRYPELFKVAWSESGNHDNRGYEDDWAE
HHHHHHCCCCCCCEEEEEEECCCCCCCCCCCEEECCHHHEEEECCCCCCCCCCCCCHHHH
KYHGEHIVNKDGTSNYDDQANATHASKLQGRLMLVHGTLDDNVPPYLTLLVADALIKVNK
HHCCCEEECCCCCCCCCCCCCCCHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHEECC
NFDMLMLPNAKHGYGDLTPYVTRRRWDYFVQYLLGATPPAQYQMQPMPKH
CCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA