Definition | Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome. |
---|---|
Accession | NC_004663 |
Length | 6,260,361 |
The map label for this gene is ptpA [H]
Identifier: 29349601
GI number: 29349601
Start: 5527974
End: 5530184
Strand: Reverse
Name: ptpA [H]
Synonym: BT_4193
Alternate gene names: 29349601
Gene position: 5530184-5527974 (Counterclockwise)
Preceding gene: 29349605
Following gene: 29349600
Centisome position: 88.34
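The coordinate fields above can be cross-checked against one another. The sketch below recomputes the gene length from Start and End and the centisome position from the chromosome length. The function names are illustrative. Taking the centisome from the gene's 5' coordinate (End for a reverse-strand gene) is an assumption inferred from the numbers in this record, not a documented rule of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the chromosome.

    Assumption: a reverse-strand gene begins at its higher coordinate.
    """
    five_prime = end if strand == "Reverse" else start
    return round(100.0 * five_prime / genome_length, 2)


# Values copied from this record.
START, END, GENOME_LENGTH = 5527974, 5530184, 6260361

print(gene_length(START, END))                          # 2211
print(centisome(START, END, "Reverse", GENOME_LENGTH))  # 88.34
```

Using Start instead of End in the second call gives 88.3, which is why the 5' end convention is assumed here.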
GC content: 46.49
Gene sequence:
>2211_bases ATGAGAAAAGTAAGTTTAGCCCTGCTCCTCTGCCTTCTCTGCCTCGCCGGAATGGCACAAGGACAAAAGGCACTTGATTT AAAAGATATTACCTCCGGACGCTTCCGCCCGGAAAATATTCAGGGAGTCATCCCCACACCAGATGGAGAGCACTACACAC AAATGAACGCCGATGGTACGCAAATCATCAAATACTCATTCCGCACAGGAGAAAAGGTAGAGGTGATCTTTGACGTGAAC CAGGCACGCGAATGTGACTTCAAGAATTTCGACAGCTATCAGTTCTCACCCGATGGTGACAAATTACTGATCGCTACCAA AACGACCCCGATATACCGGCATTCATACACAGCTGTGCATTATATATACCCTTTGAAACGAAATGATAAGGGAGTTACGA CAAACAACATTATTGAACGGCTGTCCGACGGCGGACCTCAACAGGTCCCCGTCTTTTCTCCAGACGGAACGATGATCGCT TTCGTGCGTGACAATAATATATTTCTCGTAAAGCTGCTCTACGGTAACAGCGAAAGCCAGGTGACCGAAGATGGCAAGCA AAATATGGTTCTCAACGGTATCCCCGACTGGGTATATGAGGAAGAGTTCGGTTTCAACCGCGCCCTGGAGTTCAGTGCCG ATAATACCATGATTGCTTTCATCCGCTTCGACGAGTCGGAAGTTCCTTCCTATTCTTTTCCGATGTTCGCGGGAGAAGCA CCGCAGATTACTCCTTTGAAAGATTATCCGGGAGAATACACCTACAAGTATCCGAAAGCCGGATACCCGAACTCGAAAGT AGAAGTACGTACGTACGACATCAAATCGCACGTGACCCGCACCATGAAGCTTCCGATAGATGCAGACGGATATATCCCCC GCATCCGCTTCACGAAAGATGCCAGCAAACTTGCCGTTATGACATTGAACCGCCATCAGGATCGTTTCGACCTTTATTTT GCCGATCCGCGCTCAACGCTCTGCAAGTTAGTGTTGCGTGATGAGTCACCTTATTATATTAAGGAGAACGTGTTCGATAA TATCAAGTTCTATCCGGAAACTTTCAGTTTGCTTAGCGAACGTGACGGTTTCAGCCATCTTTACTGGTACAGCATGGGGG GAAACCTTATCAAGAAAGTAACCAATGGCAAATATGAAGTGAAGGATTTTCTGGGTTATGACGCAACAGACGGCTCTTTC TATTATACCAGCAATGAGGAAAGCCCCTTGCGTAAAGCAGTCTACAAGATTGATAAGAAAGGCAAAAAGACAAAGTTGTC TCAACGTGAAGGAACCAATACGCCGTTATTCAGCAAATCGATGAAGTATTATATGAACAAATTCTCGAATCTGGATACTC CTATGCTAGTTACGCTGAATGACAACACAGGCAAAACCTTAAAAACGCTTATCACCAACGACCAGTTGAAACAAACGTTA GCCGGCTATGCTATCCCGCAAAAAGAGTTCTTTACCTTCCAGACTACGGACGGGGTAACTTTGAATGGATGGATGATGAA GCCTGTCAACTTCTCCGCATCCAAGAAATATCCGGTGCTGATGTACCAGTATAGCGGTCCGGGGTCACAACAAGTTCTGG ATACTTGGGGAATCAGCTGGGAGACTTACATGGCCAGCCTCGGCTATATCGTTGTATGTGTAGACGGTCGTGGTACAGGA GGCCGCGGAGAAGCGTTCGAGAAGTGCACTTATCTCAAAATCGGCGTGAAAGAAGCCAAAGACCAGGTCGAAACAGCTCT CTATTTAGGCAAACAGCCGTATGTGGATAAAGACCGTATCGGTATCTGGGGATGGAGCTATGGCGGATACATGACATTGA TGAGTATGAGCGAAGGAACTCCGGTATTCAAAGCCGGTGTAGCTGTTGCCGCACCTACGGACTGGCGTTTCTACGACACT 
ATATATACAGAACGTTTCATGCGCACCCCGAAAGAAAACGCAGAAGGATACAAAGAATCATCCGCTTTCACACGGGCTGA CAAACTGCACGGCAATCTGTTGCTGGTACATGGTATGGCAGACGATAATGTTCACTTCCAGAACTGTGCAGAATATGCCG AGCATCTGGTGCAGTTGGGTAAACAATTTGATATGCAAGTATATACCAATCGTAATCATGGCATATACGGCGGCAACACC CGTCAGCATCTATATACACGATTGACCAACTTCTTCCTGAACAATTTGTAA
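The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as it appears in this record: a `>` header token followed by space-separated blocks of bases. The function name `gc_content` is hypothetical, and no result for the full 2211-base sequence is asserted here.

```python
def gc_content(record_text: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in a sequence record.

    Tokens starting with '>' (for example '>2211_bases') are treated as
    headers and skipped; whitespace between sequence blocks is ignored.
    """
    tokens = [t for t in record_text.split() if not t.startswith(">")]
    bases = [b for b in "".join(tokens).upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)


# Toy inputs, not taken from the record.
print(gc_content(">4_bases ATGC"))   # 50.0
print(gc_content("GGCC AATT GC"))    # 60.0
```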
Upstream 100 bases:
>100_bases CAAAAACAGGATGTACTTTGAGGTATTTAAGTTTATTGTTATTTTTGCAGGGTATGAACTTAACCCCCTTAAACAAGTAA AATATAAAACGATTTATAAA
Downstream 100 bases:
>100_bases TGGCTGACAGAGTGCGCAAGCCTGAATGGCTGAAAATAAATATCGGTGCCAACGACCGTTACACCGAAACTAAAAGAATT GTCGACTCCCACTGCCTGCA
Product: dipeptidyl peptidase IV
Products: NA
Alternate protein names: PTP; Prolyl tripeptidyl peptidase A [H]
Number of amino acids: Translated: 736; Mature: 736
Protein sequence:
>736_residues MRKVSLALLLCLLCLAGMAQGQKALDLKDITSGRFRPENIQGVIPTPDGEHYTQMNADGTQIIKYSFRTGEKVEVIFDVN QARECDFKNFDSYQFSPDGDKLLIATKTTPIYRHSYTAVHYIYPLKRNDKGVTTNNIIERLSDGGPQQVPVFSPDGTMIA FVRDNNIFLVKLLYGNSESQVTEDGKQNMVLNGIPDWVYEEEFGFNRALEFSADNTMIAFIRFDESEVPSYSFPMFAGEA PQITPLKDYPGEYTYKYPKAGYPNSKVEVRTYDIKSHVTRTMKLPIDADGYIPRIRFTKDASKLAVMTLNRHQDRFDLYF ADPRSTLCKLVLRDESPYYIKENVFDNIKFYPETFSLLSERDGFSHLYWYSMGGNLIKKVTNGKYEVKDFLGYDATDGSF YYTSNEESPLRKAVYKIDKKGKKTKLSQREGTNTPLFSKSMKYYMNKFSNLDTPMLVTLNDNTGKTLKTLITNDQLKQTL AGYAIPQKEFFTFQTTDGVTLNGWMMKPVNFSASKKYPVLMYQYSGPGSQQVLDTWGISWETYMASLGYIVVCVDGRGTG GRGEAFEKCTYLKIGVKEAKDQVETALYLGKQPYVDKDRIGIWGWSYGGYMTLMSMSEGTPVFKAGVAVAAPTDWRFYDT IYTERFMRTPKENAEGYKESSAFTRADKLHGNLLLVHGMADDNVHFQNCAEYAEHLVQLGKQFDMQVYTNRNHGIYGGNT RQHLYTRLTNFFLNNL
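The protein length follows from the gene length: 2211 bases are 737 codons, and dropping the terminal stop codon leaves 736 residues. The sketch below translates a nucleotide string with the standard genetic code, whose amino acid assignments are shared by the bacterial translation table, and checks the first and last codons of the gene sequence against the protein sequence. The helper name `translate` and the compact table layout are choices made for this illustration, not part of the record.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(b) for b in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First seven and last four codons of the gene sequence in this record.
print(translate("ATGAGAAAAGTAAGTTTAGCC"))  # MRKVSLA
print(translate("AACAATTTGTAA"))           # NNL*
print(2211 // 3 - 1)                       # 736
```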
Sequences:
>Translated_736_residues MRKVSLALLLCLLCLAGMAQGQKALDLKDITSGRFRPENIQGVIPTPDGEHYTQMNADGTQIIKYSFRTGEKVEVIFDVN QARECDFKNFDSYQFSPDGDKLLIATKTTPIYRHSYTAVHYIYPLKRNDKGVTTNNIIERLSDGGPQQVPVFSPDGTMIA FVRDNNIFLVKLLYGNSESQVTEDGKQNMVLNGIPDWVYEEEFGFNRALEFSADNTMIAFIRFDESEVPSYSFPMFAGEA PQITPLKDYPGEYTYKYPKAGYPNSKVEVRTYDIKSHVTRTMKLPIDADGYIPRIRFTKDASKLAVMTLNRHQDRFDLYF ADPRSTLCKLVLRDESPYYIKENVFDNIKFYPETFSLLSERDGFSHLYWYSMGGNLIKKVTNGKYEVKDFLGYDATDGSF YYTSNEESPLRKAVYKIDKKGKKTKLSQREGTNTPLFSKSMKYYMNKFSNLDTPMLVTLNDNTGKTLKTLITNDQLKQTL AGYAIPQKEFFTFQTTDGVTLNGWMMKPVNFSASKKYPVLMYQYSGPGSQQVLDTWGISWETYMASLGYIVVCVDGRGTG GRGEAFEKCTYLKIGVKEAKDQVETALYLGKQPYVDKDRIGIWGWSYGGYMTLMSMSEGTPVFKAGVAVAAPTDWRFYDT IYTERFMRTPKENAEGYKESSAFTRADKLHGNLLLVHGMADDNVHFQNCAEYAEHLVQLGKQFDMQVYTNRNHGIYGGNT RQHLYTRLTNFFLNNL >Mature_736_residues MRKVSLALLLCLLCLAGMAQGQKALDLKDITSGRFRPENIQGVIPTPDGEHYTQMNADGTQIIKYSFRTGEKVEVIFDVN QARECDFKNFDSYQFSPDGDKLLIATKTTPIYRHSYTAVHYIYPLKRNDKGVTTNNIIERLSDGGPQQVPVFSPDGTMIA FVRDNNIFLVKLLYGNSESQVTEDGKQNMVLNGIPDWVYEEEFGFNRALEFSADNTMIAFIRFDESEVPSYSFPMFAGEA PQITPLKDYPGEYTYKYPKAGYPNSKVEVRTYDIKSHVTRTMKLPIDADGYIPRIRFTKDASKLAVMTLNRHQDRFDLYF ADPRSTLCKLVLRDESPYYIKENVFDNIKFYPETFSLLSERDGFSHLYWYSMGGNLIKKVTNGKYEVKDFLGYDATDGSF YYTSNEESPLRKAVYKIDKKGKKTKLSQREGTNTPLFSKSMKYYMNKFSNLDTPMLVTLNDNTGKTLKTLITNDQLKQTL AGYAIPQKEFFTFQTTDGVTLNGWMMKPVNFSASKKYPVLMYQYSGPGSQQVLDTWGISWETYMASLGYIVVCVDGRGTG GRGEAFEKCTYLKIGVKEAKDQVETALYLGKQPYVDKDRIGIWGWSYGGYMTLMSMSEGTPVFKAGVAVAAPTDWRFYDT IYTERFMRTPKENAEGYKESSAFTRADKLHGNLLLVHGMADDNVHFQNCAEYAEHLVQLGKQFDMQVYTNRNHGIYGGNT RQHLYTRLTNFFLNNL
Specific function: Serine proteinase. Releases tripeptides from the free amino terminus of proteins. Has a requirement for Pro in the P1 position, but is inactivated by Pro in the P1' position [H]
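The specificity stated above can be read as a simple positional rule. Releasing a tripeptide from the free amino terminus means cutting between residues 3 and 4, so P1 is residue 3 and P1' is residue 4. The sketch below encodes only that rule; the function name is hypothetical, and the check is a simplification of the annotation text, not a model of the enzyme's real substrate preferences.

```python
def is_candidate_substrate(peptide: str) -> bool:
    """Apply the annotated rule: Pro required at P1, Pro not allowed at P1'.

    P1 is the third residue from the N-terminus and P1' the fourth, because
    the enzyme is described as releasing N-terminal tripeptides.
    """
    peptide = peptide.strip().upper()
    if len(peptide) < 4:
        return False  # no bond after a tripeptide to cleave
    p1, p1_prime = peptide[2], peptide[3]
    return p1 == "P" and p1_prime != "P"


# Toy peptides in one-letter code, not taken from the record.
print(is_candidate_substrate("GAPL"))  # True
print(is_candidate_substrate("GAPP"))  # False
print(is_candidate_substrate("GPAL"))  # False
```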
COG id: COG1506
COG function: function code E; Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase S9B family [H]
Homologues:
Organism=Homo sapiens, GI16933540, Length=772, Percent_Identity=31.6062176165803, Blast_Score=335, Evalue=8e-92
Organism=Homo sapiens, GI18765694, Length=691, Percent_Identity=32.7062228654124, Blast_Score=309, Evalue=5e-84
Organism=Homo sapiens, GI86792778, Length=668, Percent_Identity=27.6946107784431, Blast_Score=263, Evalue=6e-70
Organism=Homo sapiens, GI86792863, Length=668, Percent_Identity=27.6946107784431, Blast_Score=263, Evalue=6e-70
Organism=Homo sapiens, GI86792774, Length=668, Percent_Identity=27.6946107784431, Blast_Score=263, Evalue=6e-70
Organism=Homo sapiens, GI295842359, Length=677, Percent_Identity=26.8833087149188, Blast_Score=242, Evalue=8e-64
Organism=Homo sapiens, GI52426756, Length=677, Percent_Identity=26.8833087149188, Blast_Score=242, Evalue=8e-64
Organism=Homo sapiens, GI295842403, Length=677, Percent_Identity=26.8833087149188, Blast_Score=242, Evalue=8e-64
Organism=Homo sapiens, GI85787627, Length=677, Percent_Identity=27.178729689808, Blast_Score=242, Evalue=1e-63
Organism=Homo sapiens, GI295849272, Length=677, Percent_Identity=26.8833087149188, Blast_Score=242, Evalue=1e-63
Organism=Homo sapiens, GI18450280, Length=712, Percent_Identity=29.0730337078652, Blast_Score=230, Evalue=3e-60
Organism=Homo sapiens, GI37577089, Length=712, Percent_Identity=29.0730337078652, Blast_Score=230, Evalue=5e-60
Organism=Homo sapiens, GI194394146, Length=559, Percent_Identity=27.906976744186, Blast_Score=191, Evalue=3e-48
Organism=Homo sapiens, GI37577091, Length=711, Percent_Identity=26.1603375527426, Blast_Score=162, Evalue=1e-39
Organism=Homo sapiens, GI18450278, Length=496, Percent_Identity=24.7983870967742, Blast_Score=85, Evalue=2e-16
Organism=Caenorhabditis elegans, GI17508017, Length=251, Percent_Identity=36.6533864541833, Blast_Score=164, Evalue=1e-40
Organism=Caenorhabditis elegans, GI17508019, Length=251, Percent_Identity=36.6533864541833, Blast_Score=164, Evalue=1e-40
Organism=Caenorhabditis elegans, GI17550672, Length=684, Percent_Identity=24.1228070175439, Blast_Score=163, Evalue=3e-40
Organism=Caenorhabditis elegans, GI17564634, Length=612, Percent_Identity=23.6928104575163, Blast_Score=149, Evalue=7e-36
Organism=Caenorhabditis elegans, GI17564632, Length=612, Percent_Identity=23.6928104575163, Blast_Score=148, Evalue=8e-36
Organism=Caenorhabditis elegans, GI17552908, Length=258, Percent_Identity=27.1317829457364, Blast_Score=72, Evalue=1e-12
Organism=Saccharomyces cerevisiae, GI6321817, Length=692, Percent_Identity=31.6473988439306, Blast_Score=293, Evalue=5e-80
Organism=Saccharomyces cerevisiae, GI6324793, Length=678, Percent_Identity=27.2861356932153, Blast_Score=228, Evalue=2e-60
Organism=Drosophila melanogaster, GI17933704, Length=768, Percent_Identity=29.1666666666667, Blast_Score=275, Evalue=8e-74
Organism=Drosophila melanogaster, GI221331178, Length=768, Percent_Identity=29.1666666666667, Blast_Score=274, Evalue=2e-73
Organism=Drosophila melanogaster, GI161083744, Length=768, Percent_Identity=29.1666666666667, Blast_Score=273, Evalue=3e-73
Organism=Drosophila melanogaster, GI24582032, Length=672, Percent_Identity=30.5059523809524, Blast_Score=259, Evalue=5e-69
Organism=Drosophila melanogaster, GI45550825, Length=705, Percent_Identity=26.0992907801418, Blast_Score=198, Evalue=1e-50
Organism=Drosophila melanogaster, GI45553511, Length=705, Percent_Identity=26.0992907801418, Blast_Score=198, Evalue=1e-50
Organism=Drosophila melanogaster, GI45551969, Length=705, Percent_Identity=26.0992907801418, Blast_Score=197, Evalue=1e-50
Organism=Drosophila melanogaster, GI24582257, Length=273, Percent_Identity=36.2637362637363, Blast_Score=185, Evalue=1e-46
Organism=Drosophila melanogaster, GI45551475, Length=694, Percent_Identity=23.6311239193084, Blast_Score=164, Evalue=2e-40
Organism=Drosophila melanogaster, GI221372263, Length=694, Percent_Identity=23.6311239193084, Blast_Score=164, Evalue=2e-40
Organism=Drosophila melanogaster, GI221372266, Length=694, Percent_Identity=23.6311239193084, Blast_Score=164, Evalue=3e-40
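The homologue entries follow a regular field pattern, so they can be loaded into structured records for sorting or filtering. A minimal sketch, assuming every entry has the six fields in the order seen above and that the GI number is written without an equals sign; the `Homologue` class and `parse_homologues` function are hypothetical names introduced for this example.

```python
import re
from dataclasses import dataclass


@dataclass
class Homologue:
    organism: str
    gi: int
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# \s* after each comma tolerates entries wrapped across lines.
PATTERN = re.compile(
    r"Organism=([^,]+),\s*GI(\d+),\s*Length=(\d+),\s*"
    r"Percent_Identity=([\d.]+),\s*Blast_Score=(\d+),\s*Evalue=([\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[Homologue]:
    """Extract every homologue entry found in the text, in order."""
    return [
        Homologue(m[1].strip(), int(m[2]), int(m[3]),
                  float(m[4]), int(m[5]), float(m[6]))
        for m in PATTERN.finditer(text)
    ]


# Two entries copied from this record.
sample = (
    "Organism=Homo sapiens, GI16933540, Length=772, "
    "Percent_Identity=31.6062176165803, Blast_Score=335, Evalue=8e-92, "
    "Organism=Saccharomyces cerevisiae, GI6321817, Length=692, "
    "Percent_Identity=31.6473988439306, Blast_Score=293, Evalue=5e-80,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h.blast_score)
print(len(hits), best.organism, best.gi)  # 2 Homo sapiens 16933540
```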
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001375
- InterPro: IPR002469 [H]
Pfam domain/function: PF00930 DPPIV_N; PF00326 Peptidase_S9 [H]
EC number: 3.4.14.12 [H]
Molecular weight: Translated: 84034; Mature: 84034
Theoretical pI: Translated: 8.01; Mature: 8.01
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
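Percentages like these are residue counts divided by the protein length. With 736 residues, the rounded values above would correspond to about 7 Cys and 23 Met, an inference from the percentages and not a count taken from the sequence. The sketch below shows the calculation on a one-letter protein string; `residue_percent` is a hypothetical helper, and the example input is a toy string, not the protein in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    Whitespace in the input is ignored so that sequences wrapped in
    blocks can be pasted directly.
    """
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty protein sequence")
    wanted = set(residues.upper())
    count = sum(1 for aa in sequence if aa in wanted)
    return round(100.0 * count / len(sequence), 1)


# Toy sequence of 10 residues: 2 Cys and 1 Met.
toy = "MKCAL GGCTS"
print(residue_percent(toy, "C"))   # 20.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 30.0

# Consistency check on the record's rounded values, using inferred counts.
print(round(100 * 7 / 736, 1), round(100 * 23 / 736, 1))  # 1.0 3.1
```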
Secondary structure:
>Translated Secondary Structure MRKVSLALLLCLLCLAGMAQGQKALDLKDITSGRFRPENIQGVIPTPDGEHYTQMNADGT CCCHHHHHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCEEECCCCCCCEEEECCCCC QIIKYSFRTGEKVEVIFDVNQARECDFKNFDSYQFSPDGDKLLIATKTTPIYRHSYTAVH EEEEEEECCCCEEEEEEECCCCCCCCCCCCCCEEECCCCCEEEEEECCCCEEECCCEEEE YIYPLKRNDKGVTTNNIIERLSDGGPQQVPVFSPDGTMIAFVRDNNIFLVKLLYGNSESQ EEEEEECCCCCCCHHHHHHHHHCCCCCCCEEECCCCCEEEEEECCCEEEEEEEECCCCCC VTEDGKQNMVLNGIPDWVYEEEFGFNRALEFSADNTMIAFIRFDESEVPSYSFPMFAGEA CCCCCCCCEEEECCCCHHCCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCEECCCC PQITPLKDYPGEYTYKYPKAGYPNSKVEVRTYDIKSHVTRTMKLPIDADGYIPRIRFTKD CCCCCCCCCCCCEEEECCCCCCCCCEEEEEEEEHHHHCEEEEECCCCCCCCCCEEEEECC ASKLAVMTLNRHQDRFDLYFADPRSTLCKLVLRDESPYYIKENVFDNIKFYPETFSLLSE CCEEEEEEECCCCCEEEEEEECCHHHHEEEEEECCCCEEEECCCCCCCEECHHHHHHHHC RDGFSHLYWYSMGGNLIKKVTNGKYEVKDFLGYDATDGSFYYTSNEESPLRKAVYKIDKK CCCCCEEEEEECCCHHHHHHCCCCEEEHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHCC GKKTKLSQREGTNTPLFSKSMKYYMNKFSNLDTPMLVTLNDNTGKTLKTLITNDQLKQTL CCCCCCHHCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHCCHHHHHHH AGYAIPQKEFFTFQTTDGVTLNGWMMKPVNFSASKKYPVLMYQYSGPGSQQVLDTWGISW HHCCCCCHHEEEEECCCCEEECCEEECCCCCCCCCCCCEEEEEECCCCCCHHHHHCCCCH ETYMASLGYIVVCVDGRGTGGRGEAFEKCTYLKIGVKEAKDQVETALYLGKQPYVDKDRI HHHHHCCCEEEEEEECCCCCCCCCHHHCEEEEEEEHHHHHHHHHHHHHCCCCCCCCCCCE GIWGWSYGGYMTLMSMSEGTPVFKAGVAVAAPTDWRFYDTIYTERFMRTPKENAEGYKES EEEEECCCCEEEEEECCCCCCEEECCEEEECCCCCCHHHHHHHHHHHCCCCHHCCCCHHH SAFTRADKLHGNLLLVHGMADDNVHFQNCAEYAEHLVQLGKQFDMQVYTNRNHGIYGGNT HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCH RQHLYTRLTNFFLNNL HHHHHHHHHHHHHHCC >Mature Secondary Structure MRKVSLALLLCLLCLAGMAQGQKALDLKDITSGRFRPENIQGVIPTPDGEHYTQMNADGT CCCHHHHHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCEEECCCCCCCEEEECCCCC QIIKYSFRTGEKVEVIFDVNQARECDFKNFDSYQFSPDGDKLLIATKTTPIYRHSYTAVH EEEEEEECCCCEEEEEEECCCCCCCCCCCCCCEEECCCCCEEEEEECCCCEEECCCEEEE YIYPLKRNDKGVTTNNIIERLSDGGPQQVPVFSPDGTMIAFVRDNNIFLVKLLYGNSESQ EEEEEECCCCCCCHHHHHHHHHCCCCCCCEEECCCCCEEEEEECCCEEEEEEEECCCCCC VTEDGKQNMVLNGIPDWVYEEEFGFNRALEFSADNTMIAFIRFDESEVPSYSFPMFAGEA 
CCCCCCCCEEEECCCCHHCCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCEECCCC PQITPLKDYPGEYTYKYPKAGYPNSKVEVRTYDIKSHVTRTMKLPIDADGYIPRIRFTKD CCCCCCCCCCCCEEEECCCCCCCCCEEEEEEEEHHHHCEEEEECCCCCCCCCCEEEEECC ASKLAVMTLNRHQDRFDLYFADPRSTLCKLVLRDESPYYIKENVFDNIKFYPETFSLLSE CCEEEEEEECCCCCEEEEEEECCHHHHEEEEEECCCCEEEECCCCCCCEECHHHHHHHHC RDGFSHLYWYSMGGNLIKKVTNGKYEVKDFLGYDATDGSFYYTSNEESPLRKAVYKIDKK CCCCCEEEEEECCCHHHHHHCCCCEEEHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHCC GKKTKLSQREGTNTPLFSKSMKYYMNKFSNLDTPMLVTLNDNTGKTLKTLITNDQLKQTL CCCCCCHHCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHCCHHHHHHH AGYAIPQKEFFTFQTTDGVTLNGWMMKPVNFSASKKYPVLMYQYSGPGSQQVLDTWGISW HHCCCCCHHEEEEECCCCEEECCEEECCCCCCCCCCCCEEEEEECCCCCCHHHHHCCCCH ETYMASLGYIVVCVDGRGTGGRGEAFEKCTYLKIGVKEAKDQVETALYLGKQPYVDKDRI HHHHHCCCEEEEEEECCCCCCCCCHHHCEEEEEEEHHHHHHHHHHHHHCCCCCCCCCCCE GIWGWSYGGYMTLMSMSEGTPVFKAGVAVAAPTDWRFYDTIYTERFMRTPKENAEGYKES EEEEECCCCEEEEEECCCCCCEEECCEEEECCCCCCHHHHHHHHHHHCCCCHHCCCCHHH SAFTRADKLHGNLLLVHGMADDNVHFQNCAEYAEHLVQLGKQFDMQVYTNRNHGIYGGNT HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCH RQHLYTRLTNFFLNNL HHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA