Definition: Shewanella amazonensis SB2B chromosome, complete genome.
Accession: NC_008700
Length: 4,306,142 bp

The map label for this gene is vpr [H]

Identifier: 119773882

GI number: 119773882

Start: 910321

End: 915498

Strand: Reverse

Name: vpr [H]

Synonym: Sama_0744

Alternate gene names: 119773882

Gene position: 915498-910321 (Counterclockwise)

Preceding gene: 119773883

Following gene: 119773879

Centisome position: 21.26
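
The coordinates above can be cross-checked against the sequence length and the centisome value. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome is the gene's 5' coordinate (915498 on the reverse strand) expressed as a percentage of the chromosome length. The function names are illustrative and not part of the record.

```python
GENOME_LENGTH = 4_306_142       # chromosome length from the record header
START, END = 910_321, 915_498   # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


# Assumption: on the reverse strand the gene begins at the larger coordinate.
print(gene_length(START, END))        # 5178
print(centisome(END, GENOME_LENGTH))  # 21.26
```

Under these assumptions both values agree with the record: 5178 bases and a centisome position of 21.26.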

GC content: 55.18
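
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation is given here; the helper name `gc_content` is hypothetical, and the record does not state how its own value was computed or rounded.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and other whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input spanning two lines, as in a wrapped FASTA record.
print(gc_content("ATGC\nGGCC"))  # 75.0
```

Passing the full gene sequence (with its line breaks) to the same function gives the figure to compare against the value listed in the record.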

Gene sequence:

>5178_bases
ATGACAGTAAAACATCCAATCAAAGCCAGCGCAGCTGCTGTGTTCGGTGTCCTGTACCTGGGGATGTCGGGCTACGCCGC
TGCAGAGATTGGCATGGCGAAAGCCGACAAGGGCGGCTTCTATGTGCCCACCTTTACCAGCGACGATATCAAGGCATTCA
ATGCGTCGCGTAAAGAAGATCAAACCGGCGACCTCTTCCTCGTCCCCGGCAAAGTGAATCACGTACTGAATCGCCGTCAG
CACCAAGTGTTTGAGTTCGACGACTCCATCAAGGGCGAACATACCTTTATCGTGCAGTTCGATGACAAGCCCGTCGCTAC
CTACGATGGTGGTGTGACAGGCTACGCGGCCACCAAGCCTTTGATGATGCAAAAGTCCGGTGCACTGAATCCAGGCCAGG
CTCAGGCAGCTGAAGTAGTGCACTACCAGTCCATGCTAAGGAGCAAACAACAGAGTGTCCTGAATCAGGCCAGTGCCCAC
GGCGCCCGCTTCGAACTGAAGAACCAGTTTACCCTGGCCAACAACGCCGCCACGGTTCGCATGACCCAGGAAGACGCCGC
GCGTATGGCGCAGGTTCCCGGCGTGAAAAAGATTACCCCTACCCGGGTGTTCAAGTTGCGTACCGACCGTGGTCCGGAGT
TTATTCATGCCGATTCAGCCTGGAACGGCAATACCAGCTCTGGTCTGAAGGCCCAGGGTGAAGGCATGGTCGTGGGGATT
ATCGACACAGGTGTGAATACCGACCATCCAGCGTTTGCCTCAGATGCCGACTTTACTGCCAGCCATGAAAAATTGGGCGG
TCAGTATCTGGGAGACTGTCAAACCGATGCCAGCCTCTGTAACGATAAGCTGATTGGCGTTTACTCCTACGAGGTGATTA
CTGAAGTCTATAACGCGCCCGAGTTTCAGGATTACTCCTGGCAGAGCAAGCTTATCCGTCCCCGCAACGGTGAAGATTAC
AACGGCCACGGCTCCCACACCGCCAGCACCGCCGCAGGGAACCGCATTGAAAATACACCGCTGCAAGCGGCCAACGGCGA
CAAGGTCAGCGACGGCGTAAACTTGCCATTCAACTTCGATCACACCTCCGGTGTTGCGCCCCGCGCCCACATTATTTCTT
ACCAGGTGTGCTGGCCAGGGAGCGGTGGTGATCCATACGCAGGTTGTCCCGAAGAAGCCATTCTCGCCGCCTTTGAAGAT
GCCATCCGCGACGGCGTGGACGTGATTAACTTCTCCATTGGTGGCGGCGAAAACTTCCCCTGGGAAGACCCAATGGAACT
GGCATTCCTGTCTGCCCGCGAAGCCGGTATCTCAGTGGCCGCCGCTGCCGGTAACTCTGGCCCGTACTTCTACAGTGCTG
ACCATACCTCTCCCTGGGTAACCACTGTGGGGGCATCCACCCACGACAGAACCCTGGATGCGGGTAAAACCAGTATCACG
GCCTTTGAATCCACCGGGCCTGCCTACACCATTCCGAAAAATGACATCGTGGGCAAAGGCTTTACCGAAGAAATTTCCGG
TCAGTTTGTGCTGGCTGAAAACTATCCCGACCCCAACCCCAATGATGGCTATGCGGCCAAGACCTGTAATGCACCTTTCC
CGGCCGGCACCTTCACCGCTGACCAAATTGTCGTCTGTGAGCGTGGCGATATTCCCCGCGTAGATAAGGCAATCAACGTA
CAGGCAGGCGGCGCCGGTGGTTTGGTGTTGCAAAACGTCAGCTATAACGACCCTCTGGTGGCCGACCGTTTTGTAATCCC
CGGGATCAACGTGTCTTCCAGCGTGGGTTACAGCCTGAAAAACTGGATAAACCGCAGTAACGGCACCGCCCGCGGCACCA
TCACAGCCCATGTGAACGACTATCTGCTGGACGAAGAAAAAGGCAATCTGTTGGCCTACTTCAGCTCCATGGGCCCAAGC
CGCTATATCGACAACCTGGTACCGGATGTCACTGCGCCCGGCGTAAATATCTATGCCGCCAACGCCGATGACCAACCCTT
TACCAATTATCCATCGGCCAGCGACTGGACCATGATGAGCGGAACTTCCATGGCCTCGCCCCATGTGGCCGGCGCCATGA
CGCTGCTGACCCAGTTGCACCCTGACTGGACACCGGCGGAGATCCAATCGGCATTGATGCTGACCGCCAACGAAGTCAAA
TACCAACCTTATGCCGGCGCAACACCCGCTGAACTGCCATACCACTTCATGGCAGGTGCCGGTGCCATTGATGTGGCAAA
GGCCGACGCCACTGGGCTTATCATGGATGAAACCATTGATGGCTATATGGCTGCCAACCCCAATAACGGTGGCATAGTTA
ACTGGCTGAACCTGCCATCCATGGTTGACATGAACTGTGAGAAAGAATGCACCTGGATGCGCACAGTCAAGGCCACCAAA
GACGGCAGCTGGTCAGTCGGCACTGAAGTACGGGAGGACGGTGCTACCCTGGTTGCTACGCCGAACCAATTCAGCCTGAA
GGCCGGGGAAACCCAAACCATCATGGTCAAAATGACGGTACCGAGCATCAATCGCTACGCGGTCGATCCGGATGACGGTG
ATTCACCATGGGAAAGCAATACCAACTACGCCCTCTTTAATGGCAAGCTGATGCTGACCGAGAGCACCGGCAACTCACCT
GAGTTGCATATGCCTGTGGTGGCTCTGTCCAACTACGACCAACTGCCTTTTGCCAAGCAAATCGAGTTCAACCGTGAACA
GGGCTCAGAAACTTTTATCGTAAATACCGACAACTACAGCCAGTTTACCCCAAGATACTATGGGCTGGTTAAACCCGAAG
TGGAACAGCATGAGTTGGGTCTGGTGAGTCCTATCATCAATATGGCCAACGTGGAAAAGTGGGGCCTCAGCAAGGTGGTG
GTGCCAGAAGGCACAAAGCGCCTGATGGTGGAGGTACAGTCGGCCGAGGTCATTGGCTATGACAACAATCAGAACCCGCG
CTATATCAAACAAGCTCCCGTGCTGACCGTGGGTCTGGATGCCAATGGCAACGATGGTTTTACCCCCTCCCAGGAGGAGA
TCGATGCCGACTACTACGCCCTCAGGAACGAGTTTTTTTCCGAGATGAAGTGTCAGTCATCATCCTCTGCGGTGCAGAAC
TACTGTGACATAGTGGACCCGACTCCCGGCACCTATTGGATTGCTCTGATCAACGTTGGCAGTGGCGAGCAGAAGTATAA
GGTGAATACCGCCGTTGCCGTGATAGGTAACGACAGCGCAGCGGGCAACTTCCACCTTGAAGGTCCGGCATCCCACGATG
GCAATGGCAACTACCAGCTGACCCTGAATTGGGATCTGCCGGAAGCGGCGGAAGGCGACGTGTTCTACGGTGGTTTCGAC
ATGGGTAACATGCCTGGCGAAGAAGGCACCCTGGGCTTTACCTCCCTCACCCTGAGACGCGGTAAGGACAATGTGTCCTT
TAACCTCAGCCAGGACAAGGCCCGCAATATGGATGTGATCGAAATCGATCTGTCTATGCTGCCCAACCTGGAAACTCAGG
ACAGAGATTTCAGCTTCAAGCTGACACTGCCCGATGGTATGCGTTTGGCGCCTGAAACCCTCAAGACGGTTAATGACAAA
GCCCTGACCAATCTCGAAATGGATGAAAAGGGCTTCAGCCTCAGTGGTAACCAGCCAAGCACCCGCAATATCCAGCGGGA
GTATGTGGTGACCAACAGCCTGACGTCTGCCCAGTGCCGTACTCCCATCATTGATGAGTACTCGGACGGTGGCTATATCG
ACTTGCATGAGTTCGGGATGCAGCCCGACCAAGTCTGGCACGTGGGAGACCATCGTGCCTATAACGATGTCCCTATGAAC
TGGCTGTTCTGGGGCATGGATCAGGAGCAATTCAAACTCTATAACCAGGACAACGGTGGCTTTATCCGTATGCACGCCGT
GGGCGCCATGCAGTTCAACAGTGCCTATTGGATGATGAACTACGTGCGTGGTCCAGGCTTCCTGTTCGAGTCAATCAACC
CCTTCTGGCGCGGCAGCTTCGAAGCCAAGAACCGTCGTCACTGGGAAGATCCCTGGGGTTTGACCATTGCGGCTCAGTAC
GATGCAGACCGTCCTGATCTCGGCGATCTGCTGTTTATGGAGTTTGATAACGTCACAGATAAGCAAACCGGCGATGAATA
TGACTATGAAGTGATATTGCGCCCCAATCTGGACTTCCGTGATAATCGCTTCGAGATGATCTTTGCCTACGACAACCTGG
GTGCAAACCTGGCGAAGGGTACTATTTTTGTCGAAGGCTTTGACAGCCCTTACTCTACTAACGTGGGTCCGAAAGATGGC
TATCTCTACACCATGGTTGGCTTTGACAATCTGGATGAGGTGCTCGAAGACAATATGGTGATGTGTTTCGACTACCAGGG
TCCCGAGCAGAGCGCCATCGACATGAAGGTGAAGGCGGTTGTCCAGCCTGAAGCCGTGGGTAAGACGCTGGAAATCCTGC
TGACCCATAGCGTAGAAGGTCAGGCCGAAAAAACCACCAGCCGCACTATTGTGGTCAACAGCGATCTGAAAGTCGCTGCC
ATGCCCGATATGCAAGTGGCCGAAGACGGTGAATTGAGCGGCATCGAAGTCTTCTATCTTGACGCCAACAAGGTCGGCAA
CCACCTGCTGGTGAGTGGCGACCATGTGACGGCCACTGTCGATGGCAGCAGCTTCAGCCTCAAGCCCGATGCCGACTTCT
TCGGTGAAACCCTGGTGACTGTGACGGTTCAGGACAACGAGCATGCAAGCGATCAGGCCAGCACCAGCTTTATGCTGACA
GTCACCCCGGAGCAGGATGCACCGGTTGCCAAAACCGCCGAGGCTGAGATTGCCATCACAGAGGGTCAAACCATTACCCT
GGATGCCAGCAGCTCAGTAGATATGGACGGTGACTCTCTGACATTCAGCTGGGATGGCCCGGGCACCTTCAGTGACGACA
GCGCTGCTGTCACTAAGGTAACCGGCCTGTCAGTGGGTGAACACAGTTTCACCGTGACAGTGTCTGACGGTATGGATGAA
GCCGAGGCAGAGGTCATAGTGAAGGTGGCCGCAGCTCCCGTCACTGAAACCACTCCAGCCAACAACAGCTCAGGCGGCAG
CCTGGGTTGGATGGCCTTGCTGCTGATGGCAGCCGGAGCACTGCGCCGTCGCCATTAA

Upstream 100 bases:

>100_bases
ATTATTGAAAAGTGAATATGTGCTTTTCGATAGCCGGCAGGGAATGGCGGCACCGACAATTGTGGATGGAAACATAATAA
CGACAAGAACGGAGTCATTA

Downstream 100 bases:

>100_bases
AACCCCTTGGTGGCCAGGATCTTTTTCTGGTCACACTTCCCCCCAAACGAAGTGAAAACCAAAGACCGATGGCAACTCCA
TCGGTCTTTTTTTCAGCGCC

Product: serine protease

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1725; Mature: 1724
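
The two residue counts follow from the gene length. The sketch below assumes the coding sequence ends in a single stop codon and that the mature form differs from the translated form only by removal of the initiator methionine, which is consistent with the sequences listed in this record. The function names are illustrative.

```python
def translated_length(n_bases: int) -> int:
    """Residues encoded by a coding sequence that includes its stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1  # the final codon is the stop codon


def mature_length(n_bases: int) -> int:
    """Assumption: the mature form only lacks the initiator methionine."""
    return translated_length(n_bases) - 1


print(translated_length(5178))  # 1725
print(mature_length(5178))      # 1724
```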

Protein sequence:

>1725_residues
MTVKHPIKASAAAVFGVLYLGMSGYAAAEIGMAKADKGGFYVPTFTSDDIKAFNASRKEDQTGDLFLVPGKVNHVLNRRQ
HQVFEFDDSIKGEHTFIVQFDDKPVATYDGGVTGYAATKPLMMQKSGALNPGQAQAAEVVHYQSMLRSKQQSVLNQASAH
GARFELKNQFTLANNAATVRMTQEDAARMAQVPGVKKITPTRVFKLRTDRGPEFIHADSAWNGNTSSGLKAQGEGMVVGI
IDTGVNTDHPAFASDADFTASHEKLGGQYLGDCQTDASLCNDKLIGVYSYEVITEVYNAPEFQDYSWQSKLIRPRNGEDY
NGHGSHTASTAAGNRIENTPLQAANGDKVSDGVNLPFNFDHTSGVAPRAHIISYQVCWPGSGGDPYAGCPEEAILAAFED
AIRDGVDVINFSIGGGENFPWEDPMELAFLSAREAGISVAAAAGNSGPYFYSADHTSPWVTTVGASTHDRTLDAGKTSIT
AFESTGPAYTIPKNDIVGKGFTEEISGQFVLAENYPDPNPNDGYAAKTCNAPFPAGTFTADQIVVCERGDIPRVDKAINV
QAGGAGGLVLQNVSYNDPLVADRFVIPGINVSSSVGYSLKNWINRSNGTARGTITAHVNDYLLDEEKGNLLAYFSSMGPS
RYIDNLVPDVTAPGVNIYAANADDQPFTNYPSASDWTMMSGTSMASPHVAGAMTLLTQLHPDWTPAEIQSALMLTANEVK
YQPYAGATPAELPYHFMAGAGAIDVAKADATGLIMDETIDGYMAANPNNGGIVNWLNLPSMVDMNCEKECTWMRTVKATK
DGSWSVGTEVREDGATLVATPNQFSLKAGETQTIMVKMTVPSINRYAVDPDDGDSPWESNTNYALFNGKLMLTESTGNSP
ELHMPVVALSNYDQLPFAKQIEFNREQGSETFIVNTDNYSQFTPRYYGLVKPEVEQHELGLVSPIINMANVEKWGLSKVV
VPEGTKRLMVEVQSAEVIGYDNNQNPRYIKQAPVLTVGLDANGNDGFTPSQEEIDADYYALRNEFFSEMKCQSSSSAVQN
YCDIVDPTPGTYWIALINVGSGEQKYKVNTAVAVIGNDSAAGNFHLEGPASHDGNGNYQLTLNWDLPEAAEGDVFYGGFD
MGNMPGEEGTLGFTSLTLRRGKDNVSFNLSQDKARNMDVIEIDLSMLPNLETQDRDFSFKLTLPDGMRLAPETLKTVNDK
ALTNLEMDEKGFSLSGNQPSTRNIQREYVVTNSLTSAQCRTPIIDEYSDGGYIDLHEFGMQPDQVWHVGDHRAYNDVPMN
WLFWGMDQEQFKLYNQDNGGFIRMHAVGAMQFNSAYWMMNYVRGPGFLFESINPFWRGSFEAKNRRHWEDPWGLTIAAQY
DADRPDLGDLLFMEFDNVTDKQTGDEYDYEVILRPNLDFRDNRFEMIFAYDNLGANLAKGTIFVEGFDSPYSTNVGPKDG
YLYTMVGFDNLDEVLEDNMVMCFDYQGPEQSAIDMKVKAVVQPEAVGKTLEILLTHSVEGQAEKTTSRTIVVNSDLKVAA
MPDMQVAEDGELSGIEVFYLDANKVGNHLLVSGDHVTATVDGSSFSLKPDADFFGETLVTVTVQDNEHASDQASTSFMLT
VTPEQDAPVAKTAEAEIAITEGQTITLDASSSVDMDGDSLTFSWDGPGTFSDDSAAVTKVTGLSVGEHSFTVTVSDGMDE
AEAEVIVKVAAAPVTETTPANNSSGGSLGWMALLLMAAGALRRRH

Sequences:

>Translated_1725_residues
MTVKHPIKASAAAVFGVLYLGMSGYAAAEIGMAKADKGGFYVPTFTSDDIKAFNASRKEDQTGDLFLVPGKVNHVLNRRQ
HQVFEFDDSIKGEHTFIVQFDDKPVATYDGGVTGYAATKPLMMQKSGALNPGQAQAAEVVHYQSMLRSKQQSVLNQASAH
GARFELKNQFTLANNAATVRMTQEDAARMAQVPGVKKITPTRVFKLRTDRGPEFIHADSAWNGNTSSGLKAQGEGMVVGI
IDTGVNTDHPAFASDADFTASHEKLGGQYLGDCQTDASLCNDKLIGVYSYEVITEVYNAPEFQDYSWQSKLIRPRNGEDY
NGHGSHTASTAAGNRIENTPLQAANGDKVSDGVNLPFNFDHTSGVAPRAHIISYQVCWPGSGGDPYAGCPEEAILAAFED
AIRDGVDVINFSIGGGENFPWEDPMELAFLSAREAGISVAAAAGNSGPYFYSADHTSPWVTTVGASTHDRTLDAGKTSIT
AFESTGPAYTIPKNDIVGKGFTEEISGQFVLAENYPDPNPNDGYAAKTCNAPFPAGTFTADQIVVCERGDIPRVDKAINV
QAGGAGGLVLQNVSYNDPLVADRFVIPGINVSSSVGYSLKNWINRSNGTARGTITAHVNDYLLDEEKGNLLAYFSSMGPS
RYIDNLVPDVTAPGVNIYAANADDQPFTNYPSASDWTMMSGTSMASPHVAGAMTLLTQLHPDWTPAEIQSALMLTANEVK
YQPYAGATPAELPYHFMAGAGAIDVAKADATGLIMDETIDGYMAANPNNGGIVNWLNLPSMVDMNCEKECTWMRTVKATK
DGSWSVGTEVREDGATLVATPNQFSLKAGETQTIMVKMTVPSINRYAVDPDDGDSPWESNTNYALFNGKLMLTESTGNSP
ELHMPVVALSNYDQLPFAKQIEFNREQGSETFIVNTDNYSQFTPRYYGLVKPEVEQHELGLVSPIINMANVEKWGLSKVV
VPEGTKRLMVEVQSAEVIGYDNNQNPRYIKQAPVLTVGLDANGNDGFTPSQEEIDADYYALRNEFFSEMKCQSSSSAVQN
YCDIVDPTPGTYWIALINVGSGEQKYKVNTAVAVIGNDSAAGNFHLEGPASHDGNGNYQLTLNWDLPEAAEGDVFYGGFD
MGNMPGEEGTLGFTSLTLRRGKDNVSFNLSQDKARNMDVIEIDLSMLPNLETQDRDFSFKLTLPDGMRLAPETLKTVNDK
ALTNLEMDEKGFSLSGNQPSTRNIQREYVVTNSLTSAQCRTPIIDEYSDGGYIDLHEFGMQPDQVWHVGDHRAYNDVPMN
WLFWGMDQEQFKLYNQDNGGFIRMHAVGAMQFNSAYWMMNYVRGPGFLFESINPFWRGSFEAKNRRHWEDPWGLTIAAQY
DADRPDLGDLLFMEFDNVTDKQTGDEYDYEVILRPNLDFRDNRFEMIFAYDNLGANLAKGTIFVEGFDSPYSTNVGPKDG
YLYTMVGFDNLDEVLEDNMVMCFDYQGPEQSAIDMKVKAVVQPEAVGKTLEILLTHSVEGQAEKTTSRTIVVNSDLKVAA
MPDMQVAEDGELSGIEVFYLDANKVGNHLLVSGDHVTATVDGSSFSLKPDADFFGETLVTVTVQDNEHASDQASTSFMLT
VTPEQDAPVAKTAEAEIAITEGQTITLDASSSVDMDGDSLTFSWDGPGTFSDDSAAVTKVTGLSVGEHSFTVTVSDGMDE
AEAEVIVKVAAAPVTETTPANNSSGGSLGWMALLLMAAGALRRRH
>Mature_1724_residues
TVKHPIKASAAAVFGVLYLGMSGYAAAEIGMAKADKGGFYVPTFTSDDIKAFNASRKEDQTGDLFLVPGKVNHVLNRRQH
QVFEFDDSIKGEHTFIVQFDDKPVATYDGGVTGYAATKPLMMQKSGALNPGQAQAAEVVHYQSMLRSKQQSVLNQASAHG
ARFELKNQFTLANNAATVRMTQEDAARMAQVPGVKKITPTRVFKLRTDRGPEFIHADSAWNGNTSSGLKAQGEGMVVGII
DTGVNTDHPAFASDADFTASHEKLGGQYLGDCQTDASLCNDKLIGVYSYEVITEVYNAPEFQDYSWQSKLIRPRNGEDYN
GHGSHTASTAAGNRIENTPLQAANGDKVSDGVNLPFNFDHTSGVAPRAHIISYQVCWPGSGGDPYAGCPEEAILAAFEDA
IRDGVDVINFSIGGGENFPWEDPMELAFLSAREAGISVAAAAGNSGPYFYSADHTSPWVTTVGASTHDRTLDAGKTSITA
FESTGPAYTIPKNDIVGKGFTEEISGQFVLAENYPDPNPNDGYAAKTCNAPFPAGTFTADQIVVCERGDIPRVDKAINVQ
AGGAGGLVLQNVSYNDPLVADRFVIPGINVSSSVGYSLKNWINRSNGTARGTITAHVNDYLLDEEKGNLLAYFSSMGPSR
YIDNLVPDVTAPGVNIYAANADDQPFTNYPSASDWTMMSGTSMASPHVAGAMTLLTQLHPDWTPAEIQSALMLTANEVKY
QPYAGATPAELPYHFMAGAGAIDVAKADATGLIMDETIDGYMAANPNNGGIVNWLNLPSMVDMNCEKECTWMRTVKATKD
GSWSVGTEVREDGATLVATPNQFSLKAGETQTIMVKMTVPSINRYAVDPDDGDSPWESNTNYALFNGKLMLTESTGNSPE
LHMPVVALSNYDQLPFAKQIEFNREQGSETFIVNTDNYSQFTPRYYGLVKPEVEQHELGLVSPIINMANVEKWGLSKVVV
PEGTKRLMVEVQSAEVIGYDNNQNPRYIKQAPVLTVGLDANGNDGFTPSQEEIDADYYALRNEFFSEMKCQSSSSAVQNY
CDIVDPTPGTYWIALINVGSGEQKYKVNTAVAVIGNDSAAGNFHLEGPASHDGNGNYQLTLNWDLPEAAEGDVFYGGFDM
GNMPGEEGTLGFTSLTLRRGKDNVSFNLSQDKARNMDVIEIDLSMLPNLETQDRDFSFKLTLPDGMRLAPETLKTVNDKA
LTNLEMDEKGFSLSGNQPSTRNIQREYVVTNSLTSAQCRTPIIDEYSDGGYIDLHEFGMQPDQVWHVGDHRAYNDVPMNW
LFWGMDQEQFKLYNQDNGGFIRMHAVGAMQFNSAYWMMNYVRGPGFLFESINPFWRGSFEAKNRRHWEDPWGLTIAAQYD
ADRPDLGDLLFMEFDNVTDKQTGDEYDYEVILRPNLDFRDNRFEMIFAYDNLGANLAKGTIFVEGFDSPYSTNVGPKDGY
LYTMVGFDNLDEVLEDNMVMCFDYQGPEQSAIDMKVKAVVQPEAVGKTLEILLTHSVEGQAEKTTSRTIVVNSDLKVAAM
PDMQVAEDGELSGIEVFYLDANKVGNHLLVSGDHVTATVDGSSFSLKPDADFFGETLVTVTVQDNEHASDQASTSFMLTV
TPEQDAPVAKTAEAEIAITEGQTITLDASSSVDMDGDSLTFSWDGPGTFSDDSAAVTKVTGLSVGEHSFTVTVSDGMDEA
EAEVIVKVAAAPVTETTPANNSSGGSLGWMALLLMAAGALRRRH

Specific function: Not required for growth or sporulation [H]

COG id: COG1404

COG function: function code O; Subtilisin-like serine proteases

Gene ontology:

Cell location: Secreted [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase S8 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000209
- InterPro:   IPR022398
- InterPro:   IPR015500
- InterPro:   IPR010259
- InterPro:   IPR003137 [H]

Pfam domain/function: PF05922 Inhibitor_I9; PF02225 PA; PF00082 Peptidase_S8 [H]

EC number: NA

Molecular weight: Translated: 187618; Mature: 187487

Theoretical pI: Translated: 4.29; Mature: 4.29

Prosite motif: PS00136 SUBTILASE_ASP; PS00138 SUBTILASE_SER

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
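
Percentages of this kind can be recomputed from a protein sequence by counting residues. This is a minimal sketch; the helper name `residue_percent` and the rounding to one decimal place are assumptions, not the record's documented method.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()  # drop line breaks and spaces
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


toy = "MKCAAAAAAA"  # toy 10-residue sequence, not taken from the record
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applying the function to the translated and mature sequences separately gives the two sets of figures to compare with the table above.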

Secondary structure:

>Translated Secondary Structure
MTVKHPIKASAAAVFGVLYLGMSGYAAAEIGMAKADKGGFYVPTFTSDDIKAFNASRKED
CCCCCCCCHHHHHHHHHHHHCCCCCCHHHCCCEECCCCCEEEEECCCCCHHHHCCCCCCC
QTGDLFLVPGKVNHVLNRRQHQVFEFDDSIKGEHTFIVQFDDKPVATYDGGVTGYAATKP
CCCCEEEECCHHHHHHHHHHHEEEECCCCCCCCEEEEEEECCCCCEEECCCEECCCCCCC
LMMQKSGALNPGQAQAAEVVHYQSMLRSKQQSVLNQASAHGARFELKNQFTLANNAATVR
EEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCEEEEECCCEEEE
MTQEDAARMAQVPGVKKITPTRVFKLRTDRGPEFIHADSAWNGNTSSGLKAQGEGMVVGI
EEHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCEEEECCCCCCCCCCCCEECCCEEEEEE
IDTGVNTDHPAFASDADFTASHEKLGGQYLGDCQTDASLCNDKLIGVYSYEVITEVYNAP
EECCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCHHHCCCCEEEEEHHHHHHHHHCCC
EFQDYSWQSKLIRPRNGEDYNGHGSHTASTAAGNRIENTPLQAANGDKVSDGVNLPFNFD
CCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECC
HTSGVAPRAHIISYQVCWPGSGGDPYAGCPEEAILAAFEDAIRDGVDVINFSIGGGENFP
CCCCCCCCEEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCC
WEDPMELAFLSAREAGISVAAAAGNSGPYFYSADHTSPWVTTVGASTHDRTLDAGKTSIT
CCCCHHHEEEEHHCCCEEEEEECCCCCCEEEECCCCCCEEEEECCCCCCCCCCCCCCEEE
AFESTGPAYTIPKNDIVGKGFTEEISGQFVLAENYPDPNPNDGYAAKTCNAPFPAGTFTA
EEECCCCEEECCCCCCCCCCCHHHHCCCEEEECCCCCCCCCCCCEEEECCCCCCCCCCCC
DQIVVCERGDIPRVDKAINVQAGGAGGLVLQNVSYNDPLVADRFVIPGINVSSSVGYSLK
CEEEEECCCCCCCCCCEEEEECCCCCCEEEEECCCCCCCEECEEEECCCCCCCCCCHHHH
NWINRSNGTARGTITAHVNDYLLDEEKGNLLAYFSSMGPSRYIDNLVPDVTAPGVNIYAA
HHHCCCCCCEEEEEEEECCCEEEECCCCCEEEEEECCCCHHHHHHHCCCCCCCCCEEEEE
NADDQPFTNYPSASDWTMMSGTSMASPHVAGAMTLLTQLHPDWTPAEIQSALMLTANEVK
CCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHEEEEEECCEE
YQPYAGATPAELPYHFMAGAGAIDVAKADATGLIMDETIDGYMAANPNNGGIVNWLNLPS
ECCCCCCCCCCCCCHHHCCCCEEEEECCCCCEEEEECCCCCEEEECCCCCCEEEEECCCC
MVDMNCEKECTWMRTVKATKDGSWSVGTEVREDGATLVATPNQFSLKAGETQTIMVKMTV
EECCCCCCCCEEEEEEEECCCCCCCCCCHHHCCCCEEEECCCCEEEECCCCEEEEEEEEC
PSINRYAVDPDDGDSPWESNTNYALFNGKLMLTESTGNSPELHMPVVALSNYDQLPFAKQ
CCCCEEEECCCCCCCCCCCCCCEEEEECEEEEEECCCCCCCEEEEEEEECCCCCCCCHHH
IEFNREQGSETFIVNTDNYSQFTPRYYGLVKPEVEQHELGLVSPIINMANVEKWGLSKVV
EECCCCCCCEEEEEECCCCCCCCCCEEECCCCCCCHHHCCHHHHHHHHHCCCCCCCEEEE
VPEGTKRLMVEVQSAEVIGYDNNQNPRYIKQAPVLTVGLDANGNDGFTPSQEEIDADYYA
CCCCCCEEEEEEECCEEEEECCCCCCCEEECCCEEEEEECCCCCCCCCCCHHHHCHHHHH
LRNEFFSEMKCQSSSSAVQNYCDIVDPTPGTYWIALINVGSGEQKYKVNTAVAVIGNDSA
HHHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEEEEEECCCCCCEEEEEEEEEEEECCCC
AGNFHLEGPASHDGNGNYQLTLNWDLPEAAEGDVFYGGFDMGNMPGEEGTLGFTSLTLRR
CCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCCCCCCCCEEEEEEEEEC
GKDNVSFNLSQDKARNMDVIEIDLSMLPNLETQDRDFSFKLTLPDGMRLAPETLKTVNDK
CCCCEEEEECCHHHCCCEEEEEEHHHCCCCCCCCCCEEEEEECCCCCEECHHHHHHCCHH
ALTNLEMDEKGFSLSGNQPSTRNIQREYVVTNSLTSAQCRTPIIDEYSDGGYIDLHEFGM
HHCCCEECCCCCEECCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEHHHHCC
QPDQVWHVGDHRAYNDVPMNWLFWGMDQEQFKLYNQDNGGFIRMHAVGAMQFNSAYWMMN
CCCCEEECCCCCCCCCCCCCEEEECCCHHHEEEEECCCCCEEEEEEEEEEEECCEEEEEE
YVRGPGFLFESINPFWRGSFEAKNRRHWEDPWGLTIAAQYDADRPDLGDLLFMEFDNVTD
ECCCCCCHHHCCCHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEEEECCCCC
KQTGDEYDYEVILRPNLDFRDNRFEMIFAYDNLGANLAKGTIFVEGFDSPYSTNVGPKDG
CCCCCCCCEEEEEECCCCCCCCCEEEEEEECCCCCCEECCEEEEEECCCCCCCCCCCCCC
YLYTMVGFDNLDEVLEDNMVMCFDYQGPEQSAIDMKVKAVVQPEAVGKTLEILLTHSVEG
EEEEEECCCCHHHHHHCCEEEEEECCCCCCCCEEEEEEEEECHHHHCCEEEEEEEECCCC
QAEKTTSRTIVVNSDLKVAAMPDMQVAEDGELSGIEVFYLDANKVGNHLLVSGDHVTATV
CCCCCCCEEEEEECCEEEEECCCCEECCCCCCCCEEEEEEEHHHCCCEEEEECCEEEEEE
DGSSFSLKPDADFFGETLVTVTVQDNEHASDQASTSFMLTVTPEQDAPVAKTAEAEIAIT
CCCEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEE
EGQTITLDASSSVDMDGDSLTFSWDGPGTFSDDSAAVTKVTGLSVGEHSFTVTVSDGMDE
CCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCEEEEEECCEECCCEEEEEECCCCCC
AEAEVIVKVAAAPVTETTPANNSSGGSLGWMALLLMAAGALRRRH
CCCEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
TVKHPIKASAAAVFGVLYLGMSGYAAAEIGMAKADKGGFYVPTFTSDDIKAFNASRKED
CCCCCCCHHHHHHHHHHHHCCCCCCHHHCCCEECCCCCEEEEECCCCCHHHHCCCCCCC
QTGDLFLVPGKVNHVLNRRQHQVFEFDDSIKGEHTFIVQFDDKPVATYDGGVTGYAATKP
CCCCEEEECCHHHHHHHHHHHEEEECCCCCCCCEEEEEEECCCCCEEECCCEECCCCCCC
LMMQKSGALNPGQAQAAEVVHYQSMLRSKQQSVLNQASAHGARFELKNQFTLANNAATVR
EEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCEEEEECCCEEEE
MTQEDAARMAQVPGVKKITPTRVFKLRTDRGPEFIHADSAWNGNTSSGLKAQGEGMVVGI
EEHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCEEEECCCCCCCCCCCCEECCCEEEEEE
IDTGVNTDHPAFASDADFTASHEKLGGQYLGDCQTDASLCNDKLIGVYSYEVITEVYNAP
EECCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCHHHCCCCEEEEEHHHHHHHHHCCC
EFQDYSWQSKLIRPRNGEDYNGHGSHTASTAAGNRIENTPLQAANGDKVSDGVNLPFNFD
CCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECC
HTSGVAPRAHIISYQVCWPGSGGDPYAGCPEEAILAAFEDAIRDGVDVINFSIGGGENFP
CCCCCCCCEEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCC
WEDPMELAFLSAREAGISVAAAAGNSGPYFYSADHTSPWVTTVGASTHDRTLDAGKTSIT
CCCCHHHEEEEHHCCCEEEEEECCCCCCEEEECCCCCCEEEEECCCCCCCCCCCCCCEEE
AFESTGPAYTIPKNDIVGKGFTEEISGQFVLAENYPDPNPNDGYAAKTCNAPFPAGTFTA
EEECCCCEEECCCCCCCCCCCHHHHCCCEEEECCCCCCCCCCCCEEEECCCCCCCCCCCC
DQIVVCERGDIPRVDKAINVQAGGAGGLVLQNVSYNDPLVADRFVIPGINVSSSVGYSLK
CEEEEECCCCCCCCCCEEEEECCCCCCEEEEECCCCCCCEECEEEECCCCCCCCCCHHHH
NWINRSNGTARGTITAHVNDYLLDEEKGNLLAYFSSMGPSRYIDNLVPDVTAPGVNIYAA
HHHCCCCCCEEEEEEEECCCEEEECCCCCEEEEEECCCCHHHHHHHCCCCCCCCCEEEEE
NADDQPFTNYPSASDWTMMSGTSMASPHVAGAMTLLTQLHPDWTPAEIQSALMLTANEVK
CCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHEEEEEECCEE
YQPYAGATPAELPYHFMAGAGAIDVAKADATGLIMDETIDGYMAANPNNGGIVNWLNLPS
ECCCCCCCCCCCCCHHHCCCCEEEEECCCCCEEEEECCCCCEEEECCCCCCEEEEECCCC
MVDMNCEKECTWMRTVKATKDGSWSVGTEVREDGATLVATPNQFSLKAGETQTIMVKMTV
EECCCCCCCCEEEEEEEECCCCCCCCCCHHHCCCCEEEECCCCEEEECCCCEEEEEEEEC
PSINRYAVDPDDGDSPWESNTNYALFNGKLMLTESTGNSPELHMPVVALSNYDQLPFAKQ
CCCCEEEECCCCCCCCCCCCCCEEEEECEEEEEECCCCCCCEEEEEEEECCCCCCCCHHH
IEFNREQGSETFIVNTDNYSQFTPRYYGLVKPEVEQHELGLVSPIINMANVEKWGLSKVV
EECCCCCCCEEEEEECCCCCCCCCCEEECCCCCCCHHHCCHHHHHHHHHCCCCCCCEEEE
VPEGTKRLMVEVQSAEVIGYDNNQNPRYIKQAPVLTVGLDANGNDGFTPSQEEIDADYYA
CCCCCCEEEEEEECCEEEEECCCCCCCEEECCCEEEEEECCCCCCCCCCCHHHHCHHHHH
LRNEFFSEMKCQSSSSAVQNYCDIVDPTPGTYWIALINVGSGEQKYKVNTAVAVIGNDSA
HHHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEEEEEECCCCCCEEEEEEEEEEEECCCC
AGNFHLEGPASHDGNGNYQLTLNWDLPEAAEGDVFYGGFDMGNMPGEEGTLGFTSLTLRR
CCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCCCCCCCCEEEEEEEEEC
GKDNVSFNLSQDKARNMDVIEIDLSMLPNLETQDRDFSFKLTLPDGMRLAPETLKTVNDK
CCCCEEEEECCHHHCCCEEEEEEHHHCCCCCCCCCCEEEEEECCCCCEECHHHHHHCCHH
ALTNLEMDEKGFSLSGNQPSTRNIQREYVVTNSLTSAQCRTPIIDEYSDGGYIDLHEFGM
HHCCCEECCCCCEECCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEHHHHCC
QPDQVWHVGDHRAYNDVPMNWLFWGMDQEQFKLYNQDNGGFIRMHAVGAMQFNSAYWMMN
CCCCEEECCCCCCCCCCCCCEEEECCCHHHEEEEECCCCCEEEEEEEEEEEECCEEEEEE
YVRGPGFLFESINPFWRGSFEAKNRRHWEDPWGLTIAAQYDADRPDLGDLLFMEFDNVTD
ECCCCCCHHHCCCHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEEEECCCCC
KQTGDEYDYEVILRPNLDFRDNRFEMIFAYDNLGANLAKGTIFVEGFDSPYSTNVGPKDG
CCCCCCCCEEEEEECCCCCCCCCEEEEEEECCCCCCEECCEEEEEECCCCCCCCCCCCCC
YLYTMVGFDNLDEVLEDNMVMCFDYQGPEQSAIDMKVKAVVQPEAVGKTLEILLTHSVEG
EEEEEECCCCHHHHHHCCEEEEEECCCCCCCCEEEEEEEEECHHHHCCEEEEEEEECCCC
QAEKTTSRTIVVNSDLKVAAMPDMQVAEDGELSGIEVFYLDANKVGNHLLVSGDHVTATV
CCCCCCCEEEEEECCEEEEECCCCEECCCCCCCCEEEEEEEHHHCCCEEEEECCEEEEEE
DGSSFSLKPDADFFGETLVTVTVQDNEHASDQASTSFMLTVTPEQDAPVAKTAEAEIAIT
CCCEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEE
EGQTITLDASSSVDMDGDSLTFSWDGPGTFSDDSAAVTKVTGLSVGEHSFTVTVSDGMDE
CCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCEEEEEECCEECCCEEEEEECCCCCC
AEAEVIVKVAAAPVTETTPANNSSGGSLGWMALLLMAAGALRRRH
CCCEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC
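
The secondary structure blocks above interleave a line of residues with a line of per-residue states. The sketch below shows one way to split such a block and summarize its composition. It assumes the common three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out, and the function names are hypothetical.

```python
from collections import Counter


def parse_interleaved(lines):
    """Split alternating residue / state lines into two joined strings."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    if len(lines) % 2 != 0:
        raise ValueError("expected an even number of non-empty lines")
    sequence = "".join(lines[0::2])  # lines 1, 3, 5, ... hold residues
    states = "".join(lines[1::2])    # lines 2, 4, 6, ... hold states
    if len(sequence) != len(states):
        raise ValueError("sequence and state strings differ in length")
    return sequence, states


def state_percentages(states):
    """Percentage of each state (H, E, C), rounded to one decimal place."""
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy block, not taken from the record.
seq, ss = parse_interleaved(["MTVK", "CCHH", "HP", "EE"])
print(seq, ss)                # MTVKHP CCHHEE
print(state_percentages(ss))  # {'H': 33.3, 'E': 33.3, 'C': 33.3}
```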

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1938892; 7934828; 9384377; 10658653 [H]