Definition: Yersinia pseudotuberculosis YPIII chromosome, complete genome.
Accession: NC_010465
Length: 4,689,441 bp

The map label for this gene is clpV1 [H]

Identifier: 170022648

GI number: 170022648

Start: 429095

End: 431698

Strand: Direct

Name: clpV1 [H]

Synonym: YPK_0395

Alternate gene names: 170022648

Gene position: 429095-431698 (Clockwise)

Preceding gene: 170022647

Following gene: 170022649

Centisome position: 9.15
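The position fields can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either convention, but both reproduce the listed values.

```python
# Values copied from this record.
GENOME_LENGTH = 4_689_441       # bp
START, END = 429_095, 431_698   # assumed 1-based, inclusive

# Gene length in bases.
gene_length = END - START + 1

# Assumption: centisome = start coordinate as a percentage of genome length.
centisome = round(100 * START / GENOME_LENGTH, 2)

# The final codon is a stop codon, so it contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, centisome, protein_length)
```

Under these assumptions the result agrees with the 2604-base gene sequence, the centisome position of 9.15, and the 867-residue protein listed in this record.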

GC content: 52.96

Gene sequence:

>2604_bases
ATGATTCAAATTGACTTGCCCACACTCGTAAACCGCCTTAATCCCATTGCCCGCCATTCTCTGGAAGCGGCGGCGGCGCA
TTGCGTCAGTCAGCAGGAAGCTGAAATTACGGTCTCCCAAGTGCTACTGCAAATGATCTCCACACCATTATGTGATGTCA
GGTTAATCCTGAGCCATGCGGGTGTTGAGGAGGATGAACTACGGGAGTCGCTCGATCAACGTGTTTCAGGCTATCAAGCT
ATTACCCAGGCCTACCCGAGTTTTTCCCCATTACTGGTGGAATGGTTACAAGACAGTTGGTTGCTGGCATCGACAGAGAT
GGAACACAGCCAACTCCGTAGCGGCGTGATGTTATTGACGTTATTGCTCAGCCCGTCGCGCTATCTGGTCCCCACGGCCA
ACCGGCTACTGTCCCCGATTAACCGTGAGTTACTGCGTCAGAATTTTGCTAACTGGACGGCAGACTCAGCAGAAACACCG
CGAGCAGAAAAAGGGGCTGAAGCTGGCAACGGCGCAGAAATCAATGGTGACAGCTTGCTGGCCCGCTATGCCAGCAATAT
GACCGAACAGGCCCGTAACGGCGAATTAGACCCTGTGCTGTGTCGGGATACCGAAATTGATCTGATGATCGATATTCTGT
GCCGCCGCCGCAAAAATAACCCGATAGTGGTGGGTGAAGCGGGGGTAGGGAAAAGTGCGCTGATCGAAGGTTTAGCCCTG
CGTATCGTGGATAATCAGGTGCCAGAAAAACTGCGTAACAGCGAGTTGATGACGCTGGATTTAGGCGCGCTTCAGGCTGG
TGCGGCAGTGAAAGGCGAATTTGAGAAGCGTTTCAAGGGCATCATGGCGGAAATTGCGCAGTCCACGACGCCAATTATCC
TGTTTATTGATGAAGCACACACGCTGATCGGGGCCGGTAACCAACAGGGCGGATTAGACATTTCCAATCTGCTCAAACCC
GCTTTAGCGCGTGGTGAACTGAAAACGATTGCGGCAACCACCTGGAGCGAATACAAAAAATACTTTGAAAAAGATGCTGC
GCTGTCGCGCCGTTTCCAGTTGGTCAAAGTTTCTGAACCCAGCGCGCAAGAGGCCACGATTATCATGCGTGGCCTACGTA
CAGTTTATGAACAGGCGCATGGCGTCCTGATCGATGATGAAGCATTGCAAGCTGCGGCGGTGCTCAGTGACCGCTATATT
TCAGGGCGGCAGTTGCCAGATAAAGCCATCGACGTGTTAGATACCGCAGCCGCACGTGTTGCCATCAATCTCACCTCTGC
ACCGCGTCAGGTTTCGGCACTGAAAAACGAACTCTACCATCAGGGGATGGAAATCGAGATGCTGGAGCGGGAGCAACGCC
TCAGCTTAAGCCGCCCGGATGAGCGGTTATCCGTTTTACAACAGCAGCGAATTGAGATCGAACAGCAACTCATTGCCCTG
AATACCGGTTGGGAAAAGCAACAACATTTGGTCCAGCAGATTATCGCACTGAGAGCGGTCCTGTTGGCACAAGAAGAGAG
CGCCACAGATGAGCAAGTGGTCAATCTGACAGCGCTGAGCGATGAGTTGGAACGGTTGCAACAGCACCAGACGCTGGTCT
CGCCCCATGTGGATAAGAGCCAGATTGCGGCGGTGATCGCGGAATGGACCGGTGTACCGCTAAACCGCCTTTCTCAGAGT
GAATTGGCAGTGGTCACTGAATTGCCCTCTTATCTGGGGCAGCAGATCAAAGGACAAGAGACGGCTATCCATTGCTTGCA
CCAACACTTGCTGACCGCAAGGGCGGATTTACGTCGCCCAGGTCGCCCAATGGGGGCTTTCCTGTTGGTTGGGCCTAGCG
GCGTGGGTAAAACGGAAACCGTGCTACAGATTGCCGACCTGCTCTATGGTGGCCGCCAATATCTCACCACCATCAATATG
TCCGAATTCCAAGAGAAGCATACCGTCTCGCGCCTGATTGGTTCACCTCCGGGCTACGTCGGGTATGGTGAAGGCGGCGT
ACTGACCGAAGCTATTCGTCAAAAACCTTACTCGGTAGTGCTACTGGATGAAGTGGAAAAAGCCCACCCCGATGTGCTGA
ATCTGTTTTATCAGGCCTTCGATAAGGGCGAGTTGGCGGACGGTGAAGGCCGCATCATCGATTGCAAAAATATCGTATTT
TTCCTGACCTCCAATCTGGGATACCAGACCATTGTTGATCATGCAGACGAGCCTGCATTGCTCAATGAACGGCTCTACCC
CGAGCTATCGGCATTCTTTAAACCTGCGCTACTGGCCCGCATGGAAGTGGTTCCTTATCTGTCGCTGGGCATGGAAACGC
TGCAAATTATCATTCACGGCAAACTGAACCGTCTGGATACGCTGCTGCGCCAACGCTTTAGTGCTGATGTGGTCATTGAA
CCTGAAGTGATCGACGAAATCCTGCTACGCGCGACCCGTGCCGAAAATGGTGCGCGTATGCTCGAGTCGATTATCGACGG
TGCGCTATTACCACCGGTTTCTCTGTTGTTATTACAGAAAGTGGCCGCTGGCACCGCTATCAGCCACATTCGGATCGCGG
TAGAGGGGAATGTGTTTACTGCACAGGTTGAGGGAGCGATATGA
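A GC content figure like the one listed for this gene can be recomputed from the sequence itself. The function below is a minimal sketch; the name `gc_percent` is hypothetical, and it assumes GC content means the percentage of G and C bases over the full sequence length.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove line breaks and spaces so a multi-line FASTA body can be pasted in.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# Toy input, not the gene itself: 2 of 4 bases are G or C.
print(gc_percent("ATGC"))
```

Passing the 2604-base gene sequence (without its `>2604_bases` header) to the same function should give a figure comparable to the listed GC content, provided the same definition was used.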

Upstream 100 bases:

>100_bases
CTGCTGGTGGTGATTTATCTGTTTTATGCCATCCACCTGCACACCCAAAGTCAGGATATTTTGCAACAGCTAAACAATTT
ATTGAGCTAGGAAAGCACAA

Downstream 100 bases:

>100_bases
GACAACGGTTAAAGAGTGTCCTCGCACTGCTGGACAATGACTCAACCGAACAACTGATCCATCGCTTTCTGACGATTAAC
CACCACCGCCAACGATTTAG

Product: type VI secretion ATPase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 867; Mature: 867

Protein sequence:

>867_residues
MIQIDLPTLVNRLNPIARHSLEAAAAHCVSQQEAEITVSQVLLQMISTPLCDVRLILSHAGVEEDELRESLDQRVSGYQA
ITQAYPSFSPLLVEWLQDSWLLASTEMEHSQLRSGVMLLTLLLSPSRYLVPTANRLLSPINRELLRQNFANWTADSAETP
RAEKGAEAGNGAEINGDSLLARYASNMTEQARNGELDPVLCRDTEIDLMIDILCRRRKNNPIVVGEAGVGKSALIEGLAL
RIVDNQVPEKLRNSELMTLDLGALQAGAAVKGEFEKRFKGIMAEIAQSTTPIILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPSAQEATIIMRGLRTVYEQAHGVLIDDEALQAAAVLSDRYI
SGRQLPDKAIDVLDTAAARVAINLTSAPRQVSALKNELYHQGMEIEMLEREQRLSLSRPDERLSVLQQQRIEIEQQLIAL
NTGWEKQQHLVQQIIALRAVLLAQEESATDEQVVNLTALSDELERLQQHQTLVSPHVDKSQIAAVIAEWTGVPLNRLSQS
ELAVVTELPSYLGQQIKGQETAIHCLHQHLLTARADLRRPGRPMGAFLLVGPSGVGKTETVLQIADLLYGGRQYLTTINM
SEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGELADGEGRIIDCKNIVF
FLTSNLGYQTIVDHADEPALLNERLYPELSAFFKPALLARMEVVPYLSLGMETLQIIIHGKLNRLDTLLRQRFSADVVIE
PEVIDEILLRATRAENGARMLESIIDGALLPPVSLLLLQKVAAGTAISHIRIAVEGNVFTAQVEGAI
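The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds a standard genetic code table and translates until the first stop codon; the helper name `translate` is hypothetical, and the use of the standard codon assignments is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATTCAAATTGACTTGCCCACACTCGTA"))  # MIQIDLPTLV
```

The last twelve bases of the gene, `GGAGCGATATGA`, translate to `GAI` followed by a stop, which matches the C-terminal end of the protein sequence.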

Sequences:

>Translated_867_residues
MIQIDLPTLVNRLNPIARHSLEAAAAHCVSQQEAEITVSQVLLQMISTPLCDVRLILSHAGVEEDELRESLDQRVSGYQA
ITQAYPSFSPLLVEWLQDSWLLASTEMEHSQLRSGVMLLTLLLSPSRYLVPTANRLLSPINRELLRQNFANWTADSAETP
RAEKGAEAGNGAEINGDSLLARYASNMTEQARNGELDPVLCRDTEIDLMIDILCRRRKNNPIVVGEAGVGKSALIEGLAL
RIVDNQVPEKLRNSELMTLDLGALQAGAAVKGEFEKRFKGIMAEIAQSTTPIILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPSAQEATIIMRGLRTVYEQAHGVLIDDEALQAAAVLSDRYI
SGRQLPDKAIDVLDTAAARVAINLTSAPRQVSALKNELYHQGMEIEMLEREQRLSLSRPDERLSVLQQQRIEIEQQLIAL
NTGWEKQQHLVQQIIALRAVLLAQEESATDEQVVNLTALSDELERLQQHQTLVSPHVDKSQIAAVIAEWTGVPLNRLSQS
ELAVVTELPSYLGQQIKGQETAIHCLHQHLLTARADLRRPGRPMGAFLLVGPSGVGKTETVLQIADLLYGGRQYLTTINM
SEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGELADGEGRIIDCKNIVF
FLTSNLGYQTIVDHADEPALLNERLYPELSAFFKPALLARMEVVPYLSLGMETLQIIIHGKLNRLDTLLRQRFSADVVIE
PEVIDEILLRATRAENGARMLESIIDGALLPPVSLLLLQKVAAGTAISHIRIAVEGNVFTAQVEGAI
>Mature_867_residues
MIQIDLPTLVNRLNPIARHSLEAAAAHCVSQQEAEITVSQVLLQMISTPLCDVRLILSHAGVEEDELRESLDQRVSGYQA
ITQAYPSFSPLLVEWLQDSWLLASTEMEHSQLRSGVMLLTLLLSPSRYLVPTANRLLSPINRELLRQNFANWTADSAETP
RAEKGAEAGNGAEINGDSLLARYASNMTEQARNGELDPVLCRDTEIDLMIDILCRRRKNNPIVVGEAGVGKSALIEGLAL
RIVDNQVPEKLRNSELMTLDLGALQAGAAVKGEFEKRFKGIMAEIAQSTTPIILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPSAQEATIIMRGLRTVYEQAHGVLIDDEALQAAAVLSDRYI
SGRQLPDKAIDVLDTAAARVAINLTSAPRQVSALKNELYHQGMEIEMLEREQRLSLSRPDERLSVLQQQRIEIEQQLIAL
NTGWEKQQHLVQQIIALRAVLLAQEESATDEQVVNLTALSDELERLQQHQTLVSPHVDKSQIAAVIAEWTGVPLNRLSQS
ELAVVTELPSYLGQQIKGQETAIHCLHQHLLTARADLRRPGRPMGAFLLVGPSGVGKTETVLQIADLLYGGRQYLTTINM
SEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGELADGEGRIIDCKNIVF
FLTSNLGYQTIVDHADEPALLNERLYPELSAFFKPALLARMEVVPYLSLGMETLQIIIHGKLNRLDTLLRQRFSADVVIE
PEVIDEILLRATRAENGARMLESIIDGALLPPVSLLLLQKVAAGTAISHIRIAVEGNVFTAQVEGAI

Specific function: Required for secretion of hcp1 probably by providing the energy source for its translocation [H]

COG id: COG0542

COG function: function code O; ATPases with chaperone activity, ATP-binding subunit

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the clpA/clpB family [H]

Homologues:

Organism=Homo sapiens, GI13540606, Length=320, Percent_Identity=33.125, Blast_Score=156, Evalue=9e-38
Organism=Escherichia coli, GI1788943, Length=662, Percent_Identity=42.1450151057402, Blast_Score=503, Evalue=1e-143
Organism=Escherichia coli, GI1787109, Length=274, Percent_Identity=44.5255474452555, Blast_Score=248, Evalue=2e-66
Organism=Saccharomyces cerevisiae, GI6323002, Length=698, Percent_Identity=36.676217765043, Blast_Score=433, Evalue=1e-122
Organism=Saccharomyces cerevisiae, GI6320464, Length=686, Percent_Identity=37.0262390670554, Blast_Score=423, Evalue=1e-119
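Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows; the function name `parse_homologue` and the choice of a plain dictionary are illustrative assumptions, not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from this record into a dictionary."""
    record = {}
    # Tolerate an optional trailing comma at the end of the line.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the numeric part as a string
    # Convert numeric fields when they are present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI13540606, Length=320, "
    "Percent_Identity=33.125, Blast_Score=156, Evalue=9e-38"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed this way, the hits can be sorted by `Evalue` or `Blast_Score`, for example to rank the best-scoring match first.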

Paralogues:

None

Copy number: 560 molecules/cell in glucose minimal media [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR013093
- InterPro:   IPR003959
- InterPro:   IPR017729
- InterPro:   IPR018368
- InterPro:   IPR001270
- InterPro:   IPR019489
- InterPro:   IPR023150 [H]

Pfam domain/function: PF00004 AAA; PF07724 AAA_2; PF10431 ClpB_D2-small [H]

EC number: NA

Molecular weight: Translated: 95649; Mature: 95649

Theoretical pI: Translated: 4.94; Mature: 4.94

Prosite motif: PS00870 CLPAB_1; PS00871 CLPAB_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
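The Cys/Met percentages are simple residue frequencies. The sketch below shows one way to compute them; the name `residue_percent` is hypothetical, and rounding to one decimal place is an assumption chosen to match the precision shown in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100 * count / len(protein), 1)


# Toy input: the first ten residues of the protein in this record.
print(residue_percent("MIQIDLPTLV", "M"))   # one Met in ten residues
print(residue_percent("MIQIDLPTLV", "CM"))  # Cys and Met combined
```

For the full 867-residue sequence, a manual count gives 6 Cys and 16 Met, which corresponds to 6/867 ≈ 0.7 %, 16/867 ≈ 1.8 %, and 22/867 ≈ 2.5 % as listed.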

Secondary structure:

>Translated Secondary Structure
MIQIDLPTLVNRLNPIARHSLEAAAAHCVSQQEAEITVSQVLLQMISTPLCDVRLILSHA
CEEECHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHC
GVEEDELRESLDQRVSGYQAITQAYPSFSPLLVEWLQDSWLLASTEMEHSQLRSGVMLLT
CCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCEEEEECHHHHHHHHHHHHHHH
LLLSPSRYLVPTANRLLSPINRELLRQNFANWTADSAETPRAEKGAEAGNGAEINGDSLL
HHHCCCCEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH
ARYASNMTEQARNGELDPVLCRDTEIDLMIDILCRRRKNNPIVVGEAGVGKSALIEGLAL
HHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHH
RIVDNQVPEKLRNSELMTLDLGALQAGAAVKGEFEKRFKGIMAEIAQSTTPIILFIDEAH
HHHHHHHHHHHCCCCEEEEECCHHHCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCC
TLIGAGNQQGGLDISNLLKPALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEP
CEEECCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEECCC
SAQEATIIMRGLRTVYEQAHGVLIDDEALQAAAVLSDRYISGRQLPDKAIDVLDTAAARV
CCHHHHHHHHHHHHHHHHHCCEEEEHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHE
AINLTSAPRQVSALKNELYHQGMEIEMLEREQRLSLSRPDERLSVLQQQRIEIEQQLIAL
EEEECCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH
NTGWEKQQHLVQQIIALRAVLLAQEESATDEQVVNLTALSDELERLQQHQTLVSPHVDKS
CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH
QIAAVIAEWTGVPLNRLSQSELAVVTELPSYLGQQIKGQETAIHCLHQHLLTARADLRRP
HHHHHHHHHCCCCHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC
GRPMGAFLLVGPSGVGKTETVLQIADLLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYV
CCCCCEEEEECCCCCCCHHHHHHHHHHHHCCHHHEEECCHHHHHHHHHHHHHHCCCCCCC
GYGEGGVLTEAIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGELADGEGRIIDCKNIVF
CCCCCCHHHHHHHCCCCEEEEEHHHHHHCHHHHHHHHHHCCCCCCCCCCCCEEECCCEEE
FLTSNLGYQTIVDHADEPALLNERLYPELSAFFKPALLARMEVVPYLSLGMETLQIIIHG
EEECCCCHHHHHCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
KLNRLDTLLRQRFSADVVIEPEVIDEILLRATRAENGARMLESIIDGALLPPVSLLLLQK
CHHHHHHHHHHHCCCCEEECHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHH
VAAGTAISHIRIAVEGNVFTAQVEGAI
HHHCCCEEEEEEEEECCEEEEEECCCC
>Mature Secondary Structure
MIQIDLPTLVNRLNPIARHSLEAAAAHCVSQQEAEITVSQVLLQMISTPLCDVRLILSHA
CEEECHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHC
GVEEDELRESLDQRVSGYQAITQAYPSFSPLLVEWLQDSWLLASTEMEHSQLRSGVMLLT
CCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCEEEEECHHHHHHHHHHHHHHH
LLLSPSRYLVPTANRLLSPINRELLRQNFANWTADSAETPRAEKGAEAGNGAEINGDSLL
HHHCCCCEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH
ARYASNMTEQARNGELDPVLCRDTEIDLMIDILCRRRKNNPIVVGEAGVGKSALIEGLAL
HHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHH
RIVDNQVPEKLRNSELMTLDLGALQAGAAVKGEFEKRFKGIMAEIAQSTTPIILFIDEAH
HHHHHHHHHHHCCCCEEEEECCHHHCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCC
TLIGAGNQQGGLDISNLLKPALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEP
CEEECCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEECCC
SAQEATIIMRGLRTVYEQAHGVLIDDEALQAAAVLSDRYISGRQLPDKAIDVLDTAAARV
CCHHHHHHHHHHHHHHHHHCCEEEEHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHE
AINLTSAPRQVSALKNELYHQGMEIEMLEREQRLSLSRPDERLSVLQQQRIEIEQQLIAL
EEEECCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH
NTGWEKQQHLVQQIIALRAVLLAQEESATDEQVVNLTALSDELERLQQHQTLVSPHVDKS
CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH
QIAAVIAEWTGVPLNRLSQSELAVVTELPSYLGQQIKGQETAIHCLHQHLLTARADLRRP
HHHHHHHHHCCCCHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC
GRPMGAFLLVGPSGVGKTETVLQIADLLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYV
CCCCCEEEEECCCCCCCHHHHHHHHHHHHCCHHHEEECCHHHHHHHHHHHHHHCCCCCCC
GYGEGGVLTEAIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGELADGEGRIIDCKNIVF
CCCCCCHHHHHHHCCCCEEEEEHHHHHHCHHHHHHHHHHCCCCCCCCCCCCEEECCCEEE
FLTSNLGYQTIVDHADEPALLNERLYPELSAFFKPALLARMEVVPYLSLGMETLQIIIHG
EEECCCCHHHHHCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
KLNRLDTLLRQRFSADVVIEPEVIDEILLRATRAENGARMLESIIDGALLPPVSLLLLQK
CHHHHHHHHHHHCCCCEEECHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHH
VAAGTAISHIRIAVEGNVFTAQVEGAI
HHHCCCEEEEEEEEECCEEEEEECCCC
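The secondary structure blocks interleave 60-residue sequence lines with a line of per-residue states. The sketch below separates the two and summarizes the state composition. It assumes the conventional reading of the letters (H = helix, E = strand, C = coil), which the record does not spell out; the function names are hypothetical.

```python
from collections import Counter


def split_interleaved(block: str) -> tuple[str, str]:
    """Split alternating sequence/state lines into two joined strings."""
    lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
    sequence = "".join(lines[0::2])  # lines 1, 3, 5, ... hold residues
    states = "".join(lines[1::2])    # lines 2, 4, 6, ... hold H/E/C states
    return sequence, states


def state_percent(states: str) -> dict:
    """Percentage of residues in each assumed state (H, E, C)."""
    counts = Counter(states)
    return {s: round(100 * counts[s] / len(states), 1) for s in "HEC"}


# Toy block in the same two-line layout (not taken from the record).
seq, ss = split_interleaved("MKTAYIAK\nCHHHHEEC\n")
print(seq, ss, state_percent(ss))
```

Applied to the full block (without its `>` header line), the two returned strings should have the same length as the protein sequence.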

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Serine endopeptidases [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 10984043 [H]