Definition Acidovorax citrulli AAC00-1 chromosome, complete genome.
Accession NC_008752
Length 5,352,772

The map label for this gene is degP [H]

Identifier: 120612604

GI number: 120612604

Start: 4394845

End: 4396389

Strand: Reverse

Name: degP [H]

Synonym: Aave_3965

Alternate gene names: 120612604

Gene position: 4396389-4394845 (Counterclockwise)

Preceding gene: 120612605

Following gene: 120612603

Centisome position: 82.13
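
Note on the coordinates above: the gene length and centisome position can be rechecked from the Start, End and Length fields. The sketch below is illustrative only; the function names are hypothetical, and it assumes the centisome value is taken at the first base of the gene as read on the reverse strand (the End coordinate) and expressed as a percentage of the chromosome length. That assumption is consistent with the listed value but is not stated by the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive, 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


# Values copied from the record above.
GENOME_LENGTH = 5_352_772
START, END = 4_394_845, 4_396_389

length = gene_length(START, END)
# Assumption: for a reverse-strand gene the reference point is the End coordinate.
pos = centisome(END, GENOME_LENGTH)

print(length, round(pos, 2))  # 1545 82.13
```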

GC content: 70.81
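
Note on GC content: the value is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as a plain string; the helper name `gc_content` is hypothetical, and line breaks or other non-ACGT characters are ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drop newlines, spaces, etc.
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence listed in this record.
print(gc_content("ATGAACACGA"))  # 40.0
```

Pasting the full 1545-base sequence into the call should give a value comparable to the one listed here, provided the same convention was used by the source database.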

Gene sequence:

>1545_bases
ATGAACACGATCCTCTCCACCCCCCGCCGCCTGACCCTCGCCCTGGTGGTCGCCGGGGCCATGGGGGCGACGGGCGCGGG
GCTCGTCACCAACCTGCATTCCAGCCATGCCGCGGTGCCTCCCGCGGCCACGGGTCCGGTGGCTGCGGCGCCGGCCGCGT
ATTCCGGCGTGTCCGCCCCGGATTTCGCGCAGATCACGGCGCTCAACGGCCCGGCGGTGGTGAACATCAGCGTGACGGGC
ACGACCAAGACCTCATACAACGGCGGGGATGACGACGACGACGATGAGACGCCCTCTGCCCAGCACCAGCAGCCCCGCGG
CGGGCAGGGCATGGATCCGGACGATCCGTTCTTCGAATTCTTCCGCCGCTTCGGCATGCCCGGCATGGGCGGCGGCCGCA
TGCCGCAGCGCGAGGTGCCGATGCGCGGCGAGGGCTCGGGTTTCATCGTGAGCCCGGACGGCGTCATACTGACCAATGCC
CATGTGGTCAAGGGCGCCAGCACGGTCACGGTGAAGCTCACCGACCGGCGCGAATTCCGCGCCAAGGTGCTGGGCTCCGA
TCCCAAGACCGACATCGCCGTCCTGAAAATCGACGCGAAAGACCTGCCCGTGGTGCACCTGGGCGATACGAAAAAGCTCG
CGGTGGGCGAATGGGTGCTGGCCATCGGCTCGCCCTTCGGTTTCGAGAACAGCGTGACGGCGGGCGTGGTGAGCGCCAAG
GGCCGCGCGCTGCCGGACGACAGCTTCGTGCCCTTCATCCAGACGGACGTGGCGGTGAACCCCGGCAACTCCGGCGGCCC
GCTGTTCAACTCGCGCGGCGAGGTGGTGGGCATCAACTCGCAGATCTACAGCCGCTCGGGCGGCTACCAGGGCGTGTCAT
TCGCGATCCCGATCGAAGTGGCGGAGCGGGTGAAGGAGCAGATCCTGGCCACGGGCAAGGCCAGCCATGCGCGCCTGGGC
GTGTCGGTGCAGGAGGTGAACCAGGCGTTCGCCGATTCCTTCCAGCTCGACAAGCCCGAGGGCGCGCTGGTGGCGGGCGT
GGAGCCCGGCGGCCCGGCGGACAAGGCCGGCCTGAAGTCGGGCGACGTGATCCTGCGCATCAACGGCCAGCCCATCGTGG
CGTCCGGCGACCTGCCGGCCTTCGTGGGCCAGTCGGCGCCCGGCTCCACCGCGCGCATGCAGGTGTGGCGCCACGGCAAG
CAGGAAGAGATCACCGCCACCCTGGGCGATGCGAGCGACAAGCCGGCCAAGCTGGCCAGCGCCGACAAGGCGGTGGGCAA
GGGCCAGCTGGGCCTGTCGCTGCGGCCGCTGCAGCCGCAGGAGCGCCGCGAGGCCGGCGTGAGCGGCGGGCTGGTGGTGG
AAGAAGCGCGCGGCCCGGCCGCCCTGGCGGGCGTGGCCCCCGGCGATGTGCTGCTGTCCATCAACGGCACGCCGGCGCAG
AGCATCGAGCAGGTGCGCGCCGCGATGGCCAAGGCGGGCAAGTCGGTGGCGCTGCTGATCCAGCGCGACGGCGACAAGAT
CTTCGTGCCCGTGCCCCTGGGCTGA

Upstream 100 bases:

>100_bases
CCGACACCGCCCTAAGCCTCCCCTAACCTTCCTCTAAGCCCGCTCTAAGCCTCGCTGCGCACACTCCCGTTCATGCCACA
ACCCACCTGAAAGGTTTGCC

Downstream 100 bases:

>100_bases
CCCCTCCCGGGCGGAGGCCGGTTGCCGCCGGCGGAGCCCAGGGGCTCGGCCCGCATTGGGCGGGCCCATGTCCCTGCGGC
CCGGCCGGTTCCTACGCGGC

Product: protease Do

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 514; Mature: 514
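
Note on the residue count: 514 follows from the 1545-base gene sequence, which is 515 codons including the terminal TGA stop codon. A small sketch of that arithmetic, with a hypothetical helper name:

```python
def protein_length(cds_bases: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of cds_bases bases."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    # The stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


print(protein_length(1545))  # 514
```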

Protein sequence:

>514_residues
MNTILSTPRRLTLALVVAGAMGATGAGLVTNLHSSHAAVPPAATGPVAAAPAAYSGVSAPDFAQITALNGPAVVNISVTG
TTKTSYNGGDDDDDDETPSAQHQQPRGGQGMDPDDPFFEFFRRFGMPGMGGGRMPQREVPMRGEGSGFIVSPDGVILTNA
HVVKGASTVTVKLTDRREFRAKVLGSDPKTDIAVLKIDAKDLPVVHLGDTKKLAVGEWVLAIGSPFGFENSVTAGVVSAK
GRALPDDSFVPFIQTDVAVNPGNSGGPLFNSRGEVVGINSQIYSRSGGYQGVSFAIPIEVAERVKEQILATGKASHARLG
VSVQEVNQAFADSFQLDKPEGALVAGVEPGGPADKAGLKSGDVILRINGQPIVASGDLPAFVGQSAPGSTARMQVWRHGK
QEEITATLGDASDKPAKLASADKAVGKGQLGLSLRPLQPQERREAGVSGGLVVEEARGPAALAGVAPGDVLLSINGTPAQ
SIEQVRAAMAKAGKSVALLIQRDGDKIFVPVPLG

Sequences:

>Translated_514_residues
MNTILSTPRRLTLALVVAGAMGATGAGLVTNLHSSHAAVPPAATGPVAAAPAAYSGVSAPDFAQITALNGPAVVNISVTG
TTKTSYNGGDDDDDDETPSAQHQQPRGGQGMDPDDPFFEFFRRFGMPGMGGGRMPQREVPMRGEGSGFIVSPDGVILTNA
HVVKGASTVTVKLTDRREFRAKVLGSDPKTDIAVLKIDAKDLPVVHLGDTKKLAVGEWVLAIGSPFGFENSVTAGVVSAK
GRALPDDSFVPFIQTDVAVNPGNSGGPLFNSRGEVVGINSQIYSRSGGYQGVSFAIPIEVAERVKEQILATGKASHARLG
VSVQEVNQAFADSFQLDKPEGALVAGVEPGGPADKAGLKSGDVILRINGQPIVASGDLPAFVGQSAPGSTARMQVWRHGK
QEEITATLGDASDKPAKLASADKAVGKGQLGLSLRPLQPQERREAGVSGGLVVEEARGPAALAGVAPGDVLLSINGTPAQ
SIEQVRAAMAKAGKSVALLIQRDGDKIFVPVPLG
>Mature_514_residues
MNTILSTPRRLTLALVVAGAMGATGAGLVTNLHSSHAAVPPAATGPVAAAPAAYSGVSAPDFAQITALNGPAVVNISVTG
TTKTSYNGGDDDDDDETPSAQHQQPRGGQGMDPDDPFFEFFRRFGMPGMGGGRMPQREVPMRGEGSGFIVSPDGVILTNA
HVVKGASTVTVKLTDRREFRAKVLGSDPKTDIAVLKIDAKDLPVVHLGDTKKLAVGEWVLAIGSPFGFENSVTAGVVSAK
GRALPDDSFVPFIQTDVAVNPGNSGGPLFNSRGEVVGINSQIYSRSGGYQGVSFAIPIEVAERVKEQILATGKASHARLG
VSVQEVNQAFADSFQLDKPEGALVAGVEPGGPADKAGLKSGDVILRINGQPIVASGDLPAFVGQSAPGSTARMQVWRHGK
QEEITATLGDASDKPAKLASADKAVGKGQLGLSLRPLQPQERREAGVSGGLVVEEARGPAALAGVAPGDVLLSINGTPAQ
SIEQVRAAMAKAGKSVALLIQRDGDKIFVPVPLG

Specific function: Serine protease that is required at high temperature. Involved in the degradation of damaged proteins. It can degrade IciA, Ada, casein and globin. Shares specificity with DegQ. [C]

COG id: COG0265

COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain

Gene ontology:

Cell location: Periplasmic Protein [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 PDZ (DHR) domains [H]

Homologues:

Organism=Homo sapiens, GI4506141, Length=308, Percent_Identity=38.6363636363636, Blast_Score=169, Evalue=9e-42,
Organism=Homo sapiens, GI24308541, Length=258, Percent_Identity=37.984496124031, Blast_Score=159, Evalue=7e-39,
Organism=Homo sapiens, GI22129776, Length=364, Percent_Identity=32.6923076923077, Blast_Score=155, Evalue=7e-38,
Organism=Homo sapiens, GI7019477, Length=259, Percent_Identity=38.996138996139, Blast_Score=144, Evalue=2e-34,
Organism=Escherichia coli, GI1786356, Length=370, Percent_Identity=36.2162162162162, Blast_Score=223, Evalue=2e-59,
Organism=Escherichia coli, GI1789629, Length=406, Percent_Identity=34.7290640394089, Blast_Score=217, Evalue=1e-57,
Organism=Escherichia coli, GI1789630, Length=278, Percent_Identity=34.5323741007194, Blast_Score=166, Evalue=5e-42,
Organism=Drosophila melanogaster, GI24646839, Length=286, Percent_Identity=34.965034965035, Blast_Score=150, Evalue=3e-36,
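
Note on the homologue entries: each line is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below shows one possible way to read such a line into a dictionary; the function name and the `GI` key are hypothetical choices, not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().split(","):
        field = field.strip()
        if not field:  # skip the empty field left by the trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier, stored without the prefix
    # Convert the numeric fields.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786356, Length=370, "
    "Percent_Identity=36.2162162162162, Blast_Score=223, Evalue=2e-59,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1786356 223
```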

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001478
- InterPro:   IPR009003
- InterPro:   IPR011782
- InterPro:   IPR001254
- InterPro:   IPR001940 [H]

Pfam domain/function: PF00595 PDZ; PF00089 Trypsin [H]

EC number: 3.4.21.-

Molecular weight: Translated: 52783; Mature: 52783

Theoretical pI: Translated: 6.42; Mature: 6.42

Prosite motif: PS50106 PDZ

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
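
Note on the Cys/Met figures: each is the share of the named residues in the protein sequence, rounded to one decimal place. A minimal sketch, assuming one-letter amino acid codes and a hypothetical helper name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    protein = "".join(protein.split()).upper()  # remove line breaks and spaces
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(aa in residues for aa in protein)
    return round(100.0 * count / len(protein), 1)


# First 20 residues of the protein sequence in this record: one Met, no Cys.
fragment = "MNTILSTPRRLTLALVVAGA"
print(residue_percent(fragment, "M"), residue_percent(fragment, "C"))  # 5.0 0.0
```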

Secondary structure:

>Translated Secondary Structure
MNTILSTPRRLTLALVVAGAMGATGAGLVTNLHSSHAAVPPAATGPVAAAPAAYSGVSAP
CCCCCCCCHHEEEEEEEHHCCCCCCCHHEEECCCCCCCCCCCCCCCCCCCCCHHCCCCCC
DFAQITALNGPAVVNISVTGTTKTSYNGGDDDDDDETPSAQHQQPRGGQGMDPDDPFFEF
CCEEEEEECCCEEEEEEEEECEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH
FRRFGMPGMGGGRMPQREVPMRGEGSGFIVSPDGVILTNAHVVKGASTVTVKLTDRREFR
HHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEECCEEECCCCEEEEEECCCHHHH
AKVLGSDPKTDIAVLKIDAKDLPVVHLGDTKKLAVGEWVLAIGSPFGFENSVTAGVVSAK
HHHCCCCCCCCEEEEEECCCCCCEEEECCCCEEECCEEEEEECCCCCCCCCCEEEEEECC
GRALPDDSFVPFIQTDVAVNPGNSGGPLFNSRGEVVGINSQIYSRSGGYQGVSFAIPIEV
CCCCCCCCCCCEEEECEEECCCCCCCCCCCCCCCEEEECHHHCCCCCCCCCEEEEECHHH
AERVKEQILATGKASHARLGVSVQEVNQAFADSFQLDKPEGALVAGVEPGGPADKAGLKS
HHHHHHHHHHCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCCCCC
GDVILRINGQPIVASGDLPAFVGQSAPGSTARMQVWRHGKQEEITATLGDASDKPAKLAS
CCEEEEECCCEEEECCCCCHHCCCCCCCCHHHHHHHHCCCCCEEEEEECCCCCCCCHHHH
ADKAVGKGQLGLSLRPLQPQERREAGVSGGLVVEEARGPAALAGVAPGDVLLSINGTPAQ
HHHHCCCCCCCEEECCCCCHHHHHCCCCCCEEEECCCCCEEEECCCCCCEEEEECCCCHH
SIEQVRAAMAKAGKSVALLIQRDGDKIFVPVPLG
HHHHHHHHHHHCCCEEEEEEEECCCEEEEEECCC
>Mature Secondary Structure
MNTILSTPRRLTLALVVAGAMGATGAGLVTNLHSSHAAVPPAATGPVAAAPAAYSGVSAP
CCCCCCCCHHEEEEEEEHHCCCCCCCHHEEECCCCCCCCCCCCCCCCCCCCCHHCCCCCC
DFAQITALNGPAVVNISVTGTTKTSYNGGDDDDDDETPSAQHQQPRGGQGMDPDDPFFEF
CCEEEEEECCCEEEEEEEEECEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH
FRRFGMPGMGGGRMPQREVPMRGEGSGFIVSPDGVILTNAHVVKGASTVTVKLTDRREFR
HHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEECCEEECCCCEEEEEECCCHHHH
AKVLGSDPKTDIAVLKIDAKDLPVVHLGDTKKLAVGEWVLAIGSPFGFENSVTAGVVSAK
HHHCCCCCCCCEEEEEECCCCCCEEEECCCCEEECCEEEEEECCCCCCCCCCEEEEEECC
GRALPDDSFVPFIQTDVAVNPGNSGGPLFNSRGEVVGINSQIYSRSGGYQGVSFAIPIEV
CCCCCCCCCCCEEEECEEECCCCCCCCCCCCCCCEEEECHHHCCCCCCCCCEEEEECHHH
AERVKEQILATGKASHARLGVSVQEVNQAFADSFQLDKPEGALVAGVEPGGPADKAGLKS
HHHHHHHHHHCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCCCCC
GDVILRINGQPIVASGDLPAFVGQSAPGSTARMQVWRHGKQEEITATLGDASDKPAKLAS
CCEEEEECCCEEEECCCCCHHCCCCCCCCHHHHHHHHCCCCCEEEEEECCCCCCCCHHHH
ADKAVGKGQLGLSLRPLQPQERREAGVSGGLVVEEARGPAALAGVAPGDVLLSINGTPAQ
HHHHCCCCCCCEEECCCCCHHHHHCCCCCCEEEECCCCCEEEECCCCCCEEEEECCCCHH
SIEQVRAAMAKAGKSVALLIQRDGDKIFVPVPLG
HHHHHHHHHHHCCCEEEEEEEECCCEEEEEECCC
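
Note on the secondary structure strings: each state line sits under the sequence line it annotates, one character per residue. The letters are assumed here to follow the common convention H = helix, E = strand, C = coil; the record itself does not define them. The sketch below tallies the states in such a string, using a short made-up example rather than the data above.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Fraction of positions in each predicted state (e.g. H, E, C)."""
    states = "".join(states.split()).upper()  # join multi-line input
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {state: n / len(states) for state, n in counts.items()}


# Toy input, not taken from the record.
print(state_fractions("HHHHEECC"))  # {'H': 0.5, 'E': 0.25, 'C': 0.25}
```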

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases) [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 10684935 [H]