| Definition | Shewanella baltica OS195 chromosome, complete genome. |
|---|---|
| Accession | NC_009997 |
| Length | 5,347,283 |
The map label for this gene is dcp [H]
Identifier: 160875854
GI number: 160875854
Start: 3253058
End: 3255202
Strand: Reverse
Name: dcp [H]
Synonym: Sbal195_2743
Alternate gene names: 160875854
Gene position: 3255202-3253058 (Counterclockwise)
Preceding gene: 160875857
Following gene: 160875849
Centisome position: 60.88
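The coordinates above are enough to recompute the gene length and the centisome position. The following is a minimal Python sketch, assuming the centisome value is the gene's first base in the direction of transcription (the End coordinate here, since the gene lies on the reverse strand) expressed as a percentage of the chromosome length. That convention is inferred from the numbers in this record, not documented by it, and the function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


# Values copied from this record.
START, END = 3253058, 3255202
GENOME_LENGTH = 5347283

length = gene_length(START, END)
# Assumption: for a reverse-strand gene, transcription begins at END.
position = centisome(END, GENOME_LENGTH)

print(length)
print(round(position, 2))
```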
GC content: 46.71
Gene sequence:
>2145_bases ATGCGTAAGTCCGTTATTACCACAGCCGTGGCCGCGGCTTTGTTACTTGGCGCTTGTTCAGACAAGCCAGCCGCCAGTGC CAATGAAGCCAAAAATGCCGTAGCGACTCAAACTGCTGCTAAAGCAGAAGCAACCAATGTGTTGCTGAGCAAGAGTCCAC TGCAGGATCAAGCGCCACAATTTAACCTTATCCATACTGCAGATTATGCTCCTGCCTTCGAGCAAGGCATTAAAGCCCAT GATGCTGAAATCGCTGCCATTGTGAATAACAAGCAAGCAGCCAGTTTTGACAACACCATTCTGGCGATGGAAACCTCGGG TGAATTATTAACGCGGGTGTCGCGGACCTTCTTTAATCTCGCCGGTCTGATTTCTGATGATGATTTCCAGAAAACCGAAG CAGATCTTGCACCTAAGCTATCTGCCCATCGCGATAATATTTATCTCGATCCAACGTTATTTGCCCGTGTTGAAGCGGTA TATCAACAAAAAAGCAGCTTAAATGCGGAAGATCAGCGTTTAGTTGGATACTACTATGAGCAGTTTGTTCGCGCTGGCGC GAAATTGTCAGCTGAAGATAAAGCTAAGATGCGCGACTTTAACGCCGAGCTTGCAACCTTAGCGACGGACTTTTCACAAA ACAGCTTAAAATCATTTAAAGATGATGTGATTGTTGTGACGGATAAAGCCCAGTTAGCTGGTTTATCTGACAGCGAAATC GCCACCTTAGCCGCTGCCGCTAAAGCTGCTGGAAAAGAGGGCTATTTGATTACCTTAGTGAACACTACTCGCCAGCCTGT GCTTTCTAGCCTGAGCAACCGTGAACTACGCCAAAAAGTGTGGGAAACTTCGGCGCACAGAGCCGTCGCAACTAATGGCC CATTGATTGTTAAAATGGCGCAAATTCGGGCTAAAAAAGCTAACTTATTGGGATTCGATACTTGGGCATCCTATGCGGTT GCCGATCAAATGGCGAAAACCCCAGCCGCTGTGTATGAAATCCTCGATGATTTAGCGCCAAAAGCCTTAGCACGTGCCAA AGTGGAAGCGGCTGATATTCAAGCTGAAATCAAAAAGGCGGGCGGTGATTTCGAACTGCAACCTTGGGATTGGGCTTACT ACGCCGACAAAGTGCGTAAAGAGAAATACGATCTCGACGAAAGCAGCATCAAACCTTATTTCGAGTTTAATACTGTGCTG CAAGACGGGCTGTTTTTCGCGATGCATAAATTGTATGGCATCAGCTTAAAAGCGCGTACCGATTTACCCGTGTGGAATCC CGATGTCTTAGCCTTTGAAGTCTTCGATAAAGACAACAGCTCTATTGGCTTATTCTACCTTGACCCTTATGCCCGTGAAG GCAAAGGCGGCGGCGCGTGGATGGATGAGTTCGTCACCCAAAGCGGTCTGCTTGGCACTAAGCCCGTTGTTTATAACGCG CTGAATATTCCTAAGCCAGCCGATGGCCCAACGTTAATGACCTTCGATGAAGTGACCACTATGTTCCATGAAATGGGTCA TGCGGTTCACGGTTTATTCTCGCAGGTGAAATATCCAAGCGTGGCTGGTACATCAACTGCACGTGACTTTGTTGAGTTCC CATCCCAGGTTAACGAAGATTGGAATATCGATCCTGCTGTGATTGCTAACTATGCCAAACACTACAAAACGGGTGAGCCA ATTCCTAAGGCACTGCTCGATAAAATCTTAGCCTCAAATAAGTTTGGCCAAGGTTTTGATACCGTTGAGTATCTGTCAGC TGCCTTGTTAGATATGGAATGGCACTCTATCAGCGCCGATACTCAAATTAGCGATGTAGAGAAGTTTGAGCACCAAGTGT TAGTTAAACATGGCCTCGATTTTGCGCCAATACCCCCGCGTTATAAATCGAGCTACTTTAGCCATGCCTTCTCCGGTGGT 
TATTCAGCTGGATATTATGCGTATTTGTGGACCGAAGTGTTTGCCGCCGATGCCTTTGCCTATATGGGCGAGCATGGTGG ATTGAAAGCCGATAATGGTGACAAATTCCGTAAAGAAGTCTTATCTAAAGGTAACAGTGAAGATCTCATGCAAGATTACA TTCACTTTACCGGTAAGAAACCGACAACAGATGCACTGTTAAAGCGCCGCGGTTTAGTCGACTAA
Upstream 100 bases:
>100_bases TTGACTCAGACTGTTTTAAACTGAAAGCCAAAATCCGTTAGCGCGCACCTTTCACCTTAAAAATACAAACGAAAGGGCAG CAACATCCAAGGGAGTTAAT
Downstream 100 bases:
>100_bases TCGATTCGTTGTAAAAAGGCCCAAAGGCTTGTTAGGGAAACCTAGCAAGCCTTTTTTATTGATGTCGGGGCGGATTTATT CATCATTTTTGTTTCAAGAA
Product: peptidyl-dipeptidase Dcp
Products: NA
Alternate protein names: Dipeptidyl carboxypeptidase [H]
Number of amino acids: Translated: 714; Mature: 714
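The protein length follows from the gene length: the 2145-base coding sequence holds 715 codons, and the final one (TAA) is a stop codon that is not translated. A short check in Python; the helper name is illustrative.

```python
def protein_length(cds_bases: int) -> int:
    """Residues encoded by a complete CDS that includes its stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bases // 3 - 1  # drop the terminal stop codon


residues = protein_length(2145)
print(residues)
```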
Protein sequence:
>714_residues MRKSVITTAVAAALLLGACSDKPAASANEAKNAVATQTAAKAEATNVLLSKSPLQDQAPQFNLIHTADYAPAFEQGIKAH DAEIAAIVNNKQAASFDNTILAMETSGELLTRVSRTFFNLAGLISDDDFQKTEADLAPKLSAHRDNIYLDPTLFARVEAV YQQKSSLNAEDQRLVGYYYEQFVRAGAKLSAEDKAKMRDFNAELATLATDFSQNSLKSFKDDVIVVTDKAQLAGLSDSEI ATLAAAAKAAGKEGYLITLVNTTRQPVLSSLSNRELRQKVWETSAHRAVATNGPLIVKMAQIRAKKANLLGFDTWASYAV ADQMAKTPAAVYEILDDLAPKALARAKVEAADIQAEIKKAGGDFELQPWDWAYYADKVRKEKYDLDESSIKPYFEFNTVL QDGLFFAMHKLYGISLKARTDLPVWNPDVLAFEVFDKDNSSIGLFYLDPYAREGKGGGAWMDEFVTQSGLLGTKPVVYNA LNIPKPADGPTLMTFDEVTTMFHEMGHAVHGLFSQVKYPSVAGTSTARDFVEFPSQVNEDWNIDPAVIANYAKHYKTGEP IPKALLDKILASNKFGQGFDTVEYLSAALLDMEWHSISADTQISDVEKFEHQVLVKHGLDFAPIPPRYKSSYFSHAFSGG YSAGYYAYLWTEVFAADAFAYMGEHGGLKADNGDKFRKEVLSKGNSEDLMQDYIHFTGKKPTTDALLKRRGLVD
Sequences:
>Translated_714_residues MRKSVITTAVAAALLLGACSDKPAASANEAKNAVATQTAAKAEATNVLLSKSPLQDQAPQFNLIHTADYAPAFEQGIKAH DAEIAAIVNNKQAASFDNTILAMETSGELLTRVSRTFFNLAGLISDDDFQKTEADLAPKLSAHRDNIYLDPTLFARVEAV YQQKSSLNAEDQRLVGYYYEQFVRAGAKLSAEDKAKMRDFNAELATLATDFSQNSLKSFKDDVIVVTDKAQLAGLSDSEI ATLAAAAKAAGKEGYLITLVNTTRQPVLSSLSNRELRQKVWETSAHRAVATNGPLIVKMAQIRAKKANLLGFDTWASYAV ADQMAKTPAAVYEILDDLAPKALARAKVEAADIQAEIKKAGGDFELQPWDWAYYADKVRKEKYDLDESSIKPYFEFNTVL QDGLFFAMHKLYGISLKARTDLPVWNPDVLAFEVFDKDNSSIGLFYLDPYAREGKGGGAWMDEFVTQSGLLGTKPVVYNA LNIPKPADGPTLMTFDEVTTMFHEMGHAVHGLFSQVKYPSVAGTSTARDFVEFPSQVNEDWNIDPAVIANYAKHYKTGEP IPKALLDKILASNKFGQGFDTVEYLSAALLDMEWHSISADTQISDVEKFEHQVLVKHGLDFAPIPPRYKSSYFSHAFSGG YSAGYYAYLWTEVFAADAFAYMGEHGGLKADNGDKFRKEVLSKGNSEDLMQDYIHFTGKKPTTDALLKRRGLVD >Mature_714_residues MRKSVITTAVAAALLLGACSDKPAASANEAKNAVATQTAAKAEATNVLLSKSPLQDQAPQFNLIHTADYAPAFEQGIKAH DAEIAAIVNNKQAASFDNTILAMETSGELLTRVSRTFFNLAGLISDDDFQKTEADLAPKLSAHRDNIYLDPTLFARVEAV YQQKSSLNAEDQRLVGYYYEQFVRAGAKLSAEDKAKMRDFNAELATLATDFSQNSLKSFKDDVIVVTDKAQLAGLSDSEI ATLAAAAKAAGKEGYLITLVNTTRQPVLSSLSNRELRQKVWETSAHRAVATNGPLIVKMAQIRAKKANLLGFDTWASYAV ADQMAKTPAAVYEILDDLAPKALARAKVEAADIQAEIKKAGGDFELQPWDWAYYADKVRKEKYDLDESSIKPYFEFNTVL QDGLFFAMHKLYGISLKARTDLPVWNPDVLAFEVFDKDNSSIGLFYLDPYAREGKGGGAWMDEFVTQSGLLGTKPVVYNA LNIPKPADGPTLMTFDEVTTMFHEMGHAVHGLFSQVKYPSVAGTSTARDFVEFPSQVNEDWNIDPAVIANYAKHYKTGEP IPKALLDKILASNKFGQGFDTVEYLSAALLDMEWHSISADTQISDVEKFEHQVLVKHGLDFAPIPPRYKSSYFSHAFSGG YSAGYYAYLWTEVFAADAFAYMGEHGGLKADNGDKFRKEVLSKGNSEDLMQDYIHFTGKKPTTDALLKRRGLVD
Specific function: Removes dipeptides from the C-termini of N-blocked tripeptides, tetrapeptides and larger peptides [H]
COG id: COG0339
COG function: function code E; Zn-dependent oligopeptidases
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M3 family [H]
Homologues:
- Organism=Homo sapiens, GI4507491, Length=638, Percent_Identity=30.564263322884, Blast_Score=252, Evalue=1e-66
- Organism=Homo sapiens, GI14149738, Length=642, Percent_Identity=29.7507788161994, Blast_Score=226, Evalue=5e-59
- Organism=Homo sapiens, GI156105687, Length=422, Percent_Identity=29.3838862559242, Blast_Score=172, Evalue=1e-42
- Organism=Escherichia coli, GI1787819, Length=675, Percent_Identity=46.962962962963, Blast_Score=615, Evalue=1e-177
- Organism=Escherichia coli, GI1789913, Length=693, Percent_Identity=32.9004329004329, Blast_Score=354, Evalue=1e-98
- Organism=Caenorhabditis elegans, GI32565901, Length=598, Percent_Identity=24.247491638796, Blast_Score=106, Evalue=5e-23
- Organism=Caenorhabditis elegans, GI71999758, Length=576, Percent_Identity=20.1388888888889, Blast_Score=72, Evalue=1e-12
- Organism=Saccharomyces cerevisiae, GI6319793, Length=595, Percent_Identity=29.7478991596639, Blast_Score=218, Evalue=3e-57
- Organism=Saccharomyces cerevisiae, GI6322715, Length=644, Percent_Identity=23.2919254658385, Blast_Score=145, Evalue=3e-35
- Organism=Drosophila melanogaster, GI20129717, Length=395, Percent_Identity=30.126582278481, Blast_Score=170, Evalue=4e-42
- Organism=Drosophila melanogaster, GI21356111, Length=473, Percent_Identity=26.6384778012685, Blast_Score=156, Evalue=6e-38
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001567 [H]
Pfam domain/function: PF01432 Peptidase_M3 [H]
EC number: 3.4.15.5 [H]
Molecular weight: Translated: 78511; Mature: 78511
Theoretical pI: Translated: 5.35; Mature: 5.35
Prosite motif: PS00013 PROKAR_LIPOPROTEIN; PS00142 ZINC_PROTEASE
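The ZINC_PROTEASE hit can be illustrated with the short zinc-binding consensus HEXXH that is characteristic of zinc metallopeptidases. The sketch below searches a 20-residue fragment copied from the protein sequence above with a regular expression. HEXXH is a simplification used here for illustration only; the full PS00142 pattern is longer and more restrictive, so a match on HEXXH alone does not reproduce the Prosite scan.

```python
import re

# Fragment copied from the protein sequence in this record (not the full protein).
FRAGMENT = "TFDEVTTMFHEMGHAVHGLF"

# Simplified zinc-binding consensus: His-Glu-any-any-His.
HEXXH = re.compile(r"HE..H")

match = HEXXH.search(FRAGMENT)
if match:
    # The offset is 1-based and relative to the fragment, not to the protein.
    print(match.group(), match.start() + 1)
```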
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- Translated protein: 0.1 %Cys; 1.8 %Met; 2.0 %Cys+Met
- Mature protein: 0.1 %Cys; 1.8 %Met; 2.0 %Cys+Met
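These percentages are residue counts divided by the protein length. The following minimal Python sketch applies that calculation to the first 60 residues of the sequence above rather than to the full 714-residue protein, so its output will differ from the values listed here. The function name is illustrative.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(sequence.count(r) for r in residues)
    return 100.0 * hits / len(sequence)


# First 60 residues of the protein sequence in this record.
N_TERMINUS = "MRKSVITTAVAAALLLGACSDKPAASANEAKNAVATQTAAKAEATNVLLSKSPLQDQAPQ"

cys = residue_percent(N_TERMINUS, "C")
met = residue_percent(N_TERMINUS, "M")
both = residue_percent(N_TERMINUS, "CM")
print(f"{cys:.1f} %Cys, {met:.1f} %Met, {both:.1f} %Cys+Met")
```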
Secondary structure:
>Translated Secondary Structure MRKSVITTAVAAALLLGACSDKPAASANEAKNAVATQTAAKAEATNVLLSKSPLQDQAPQ CCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCC FNLIHTADYAPAFEQGIKAHDAEIAAIVNNKQAASFDNTILAMETSGELLTRVSRTFFNL EEEEECCCCCHHHHHCCHHCCCEEEEEECCCCCCCCCCEEEEEECCHHHHHHHHHHHHHH AGLISDDDFQKTEADLAPKLSAHRDNIYLDPTLFARVEAVYQQKSSLNAEDQRLVGYYYE HHCCCCCCHHHHHHHHCCCCCCCCCCEEECHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH QFVRAGAKLSAEDKAKMRDFNAELATLATDFSQNSLKSFKDDVIVVTDKAQLAGLSDSEI HHHHCCCCCCCHHHHHHHHHCCHHHHHHHHCCHHHHHHHCCCEEEEECCHHHCCCCCHHH ATLAAAAKAAGKEGYLITLVNTTRQPVLSSLSNRELRQKVWETSAHRAVATNGPLIVKMA HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCEEEEHH QIRAKKANLLGFDTWASYAVADQMAKTPAAVYEILDDLAPKALARAKVEAADIQAEIKKA HHHHHHHCEEEEHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC GGDFELQPWDWAYYADKVRKEKYDLDESSIKPYFEFNTVLQDGLFFAMHKLYGISLKART CCCCCCCCCCHHHHHHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHCCEEEECC DLPVWNPDVLAFEVFDKDNSSIGLFYLDPYAREGKGGGAWMDEFVTQSGLLGTKPVVYNA CCCCCCCCEEEEEEECCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCHHEEC LNIPKPADGPTLMTFDEVTTMFHEMGHAVHGLFSQVKYPSVAGTSTARDFVEFPSQVNED CCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCC WNIDPAVIANYAKHYKTGEPIPKALLDKILASNKFGQGFDTVEYLSAALLDMEWHSISAD CCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCHHCCCCCC TQISDVEKFEHQVLVKHGLDFAPIPPRYKSSYFSHAFSGGYSAGYYAYLWTEVFAADAFA CCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH YMGEHGGLKADNGDKFRKEVLSKGNSEDLMQDYIHFTGKKPTTDALLKRRGLVD HHHCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHCCCCC >Mature Secondary Structure MRKSVITTAVAAALLLGACSDKPAASANEAKNAVATQTAAKAEATNVLLSKSPLQDQAPQ CCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCC FNLIHTADYAPAFEQGIKAHDAEIAAIVNNKQAASFDNTILAMETSGELLTRVSRTFFNL EEEEECCCCCHHHHHCCHHCCCEEEEEECCCCCCCCCCEEEEEECCHHHHHHHHHHHHHH AGLISDDDFQKTEADLAPKLSAHRDNIYLDPTLFARVEAVYQQKSSLNAEDQRLVGYYYE HHCCCCCCHHHHHHHHCCCCCCCCCCEEECHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH QFVRAGAKLSAEDKAKMRDFNAELATLATDFSQNSLKSFKDDVIVVTDKAQLAGLSDSEI HHHHCCCCCCCHHHHHHHHHCCHHHHHHHHCCHHHHHHHCCCEEEEECCHHHCCCCCHHH 
ATLAAAAKAAGKEGYLITLVNTTRQPVLSSLSNRELRQKVWETSAHRAVATNGPLIVKMA HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCEEEEHH QIRAKKANLLGFDTWASYAVADQMAKTPAAVYEILDDLAPKALARAKVEAADIQAEIKKA HHHHHHHCEEEEHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC GGDFELQPWDWAYYADKVRKEKYDLDESSIKPYFEFNTVLQDGLFFAMHKLYGISLKART CCCCCCCCCCHHHHHHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHCCEEEECC DLPVWNPDVLAFEVFDKDNSSIGLFYLDPYAREGKGGGAWMDEFVTQSGLLGTKPVVYNA CCCCCCCCEEEEEEECCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCHHEEC LNIPKPADGPTLMTFDEVTTMFHEMGHAVHGLFSQVKYPSVAGTSTARDFVEFPSQVNED CCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCC WNIDPAVIANYAKHYKTGEPIPKALLDKILASNKFGQGFDTVEYLSAALLDMEWHSISAD CCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCHHCCCCCC TQISDVEKFEHQVLVKHGLDFAPIPPRYKSSYFSHAFSGGYSAGYYAYLWTEVFAADAFA CCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH YMGEHGGLKADNGDKFRKEVLSKGNSEDLMQDYIHFTGKKPTTDALLKRRGLVD HHHCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8226676; 9097039; 9278503 [H]