Definition: Helicobacter pylori Shi470, complete genome.
Accession: NC_010698
Length: 1,608,548

The map label for this gene is vspIM [H]

Identifier: 188527264

GI number: 188527264

Start: 452573

End: 455023

Strand: Direct

Name: vspIM [H]

Synonym: HPSH_02355

Alternate gene names: 188527264

Gene position: 452573-455023 (Clockwise)

Preceding gene: 188527263

Following gene: 188527265

Centisome position: 28.14
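
The gene length and centisome position follow from the coordinates above. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of genome length; the record does not state its formula, and the function names are illustrative.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
GENOME_LENGTH = 1_608_548
START = 452_573
END = 455_023


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive start..end interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 2451
print(centisome(START, GENOME_LENGTH))  # 28.14
```

Both results agree with the 2451-base gene sequence and the centisome value listed in this record.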

GC content: 32.84
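
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch, assuming that definition and two-decimal rounding (the record does not say how its value was derived); `gc_content` is an illustrative name. The example input is the first 12 bases of the gene sequence, not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, rounded to 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First 12 bases of the gene: 1 G and 3 C out of 12.
print(gc_content("ATGCTTTCAAAC"))  # 33.33
```

Applying the same function to the full 2451-base sequence should give a value comparable to the one listed, provided the record uses this definition.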

Gene sequence:

>2451_bases
ATGCTTTCAAACGCTCTTTCTGTTGCAGAAATCGCTCGCCTAGTCAATGTTTCTCATAGCAGCGTGCATAACTGGATCAA
AACCAATCTTTTAGAGAAAATAGAGATTGATTCAAAAATTTATGTGAAAACAAGCTCTTTTTTAGATTTTTGCCGCAACC
ATTTAGGGAAAAACAAGCTTAACAAATACGCTAACAAATCCTTAAAAGGCGCGCATAACCATCAAGAATTGATTTTAAAA
TACCTAAAAATATTAGAAAATAGTTCTGATTTAGAAAAGTTGGGTTCTTATTATGAAGAAGAGCTTTCTAACACTACCAG
AAATTTAGAAGGCATTTACTACACTCCTAATAAAATAGTAGAACAACTTTTCACTCTCCCTAAAGATTTTGATGCTTCTC
AAGCGATTTTTTGCGATCCGGCTGTGGGGAGCGGGAATTTCATCATGCATGCTTTAAAGCTGGGGTTTAAGGTTGAAAAT
ATTTATGGCTATGATACGGACGCTTTTGCTATCGCTTTGACTAAAAAGCGTATTAAAGAGCGTTATCATTTGGATTGCCC
TAATATTGTGCAAAAAGATTTTTTAAATTTAAAACACACCCCACAATTTGATTGCATTTTCACTAACCCGCCATGGGGTA
AGAAATACAATCAAAACCAAAAAGAAAATTTCAAACAGTGCTTTAACCTCTCTCAAAGCCTAGATAGCGCGTCGCTCTTT
TTTATAGCGAGTTTGAATTGCTTAAAAGAAAACGCTCATTTGGGGCTATTATTACCCGAAAGTTGTTTGAATATTGATGC
GTTTAGTAAAATGCGAGAAATGGCTCTGAAGTTTCAAATTAGAAGCCTTATTGATTTCAACAAACCCTTTAAAACCCTAA
TGACTAAGGCTGTTGGTTTGGCGCTTAAAAAAGCCCCTAACAAGAATCAAAAAATCTCATGCTTTTATCAAAGTAGTGAG
TTCAAACGCTCGCCCTCTTCTTTTTTAAACAACCCTAAAAAGATTTTTAATATCCATTGCTCTAGCAAAGAAAATAAAAT
TTTAGACCACCTTTTTTCCCTTCCTCATATAACTTTAAAAAATAACGCTCATTTTGCTTTAGGGATTGTTACAGGCAACA
ATAAAGAAAAATTACACTCCAAACAAGAAAAAAATACCATTCCTATTTTTAGGGGTTCAGATATTTTAAAAGACAGATTA
AAAGCCCCTAGCCAATTCATTAACGCTGATCTAAAAGACTGCCAGCAAGTCGCTCCCTTAAGCCTTTATCAGTCTAGAGA
AAAAATCGTGTATAAATTCATTTCTTCAAAACTTGTCTTTTTTTATGATAATAAGCAACGCCTTTTTTTAAATAGCGCGA
ACATGTTTGTTTTAAAAGAAAATTTTCCTATCAACGCTAATGCGCTAAAAGAATTATTAAACAGCGATTTAATGCAATTT
ATTTTTGAATCGCTTTTTAAAACGCATAAAATTTTAAGAAAAGATTTGGAATGCTTGCCCCTATTTGTGCAATTTATCAA
CAATAGTTTTGATGAAAAATTTTATTTGAAAAATTTAGGGATAGAAAAAAAGACCCTAAACATTTTACAATCAGGAAAAA
CCATGCACATCGCTTGTCTTTTGGCTTTAGGGGATAACCTCATCACGCTTAGCCTTTTAAAAGAAATCGCTTCCAAACAG
CAACAACCCCTTAAAATCCTAGGCACTCATTTGACTTTAAAAATCGCCAGGCTTTTAGAATGCGAAAAACATTTTGAAAT
CATCCCTCTTTTTGAAAATGTCCCTGCTTTTTATGACCTTAAAAAACAAGGAGTTTTTTGGGCGATGAAGGATTTTTTAT
TGTTGTTAAAAGCGATTAAAAAGCATCAAATCAAACATTTGATTTTAGAAAAACAGGATTTTAGAAGCGCTCTTTTAACC
ACATTCATTCCCATAACCGCTCCTAATAAAGACATTAAAAATGTTTATCAAAACCGCCAGGAGTTGTTTTCTCAAATTTA
TGGGCATGTTTTTAATCATTCTCCATATCTTATGAATTTAAAAAACCCCAAAAAGATTTTAATCAACCCTTTCACAAGAT
CAATAGAGCGAAGTATCCCTTTAGAGCATTTGCAAATCGTTTTAAAACTCTTAAAACCCTTTTGTGTTACGCTTTTAGAT
TTTGAAGAACGATACGCTTTTTTAAAAGATAGAGTTACTCATTATCGCGTTAAAACCAGTTTAGAAGAAGTTAAAAACCT
GATTTTAGAAAGCGATTTGTATATAGGGGGGGATTCGTTTTTGATCCATTTGGCTTACTATTTAAAGAAAAATTATTTTA
TCTTTTTTTATAGGGATAATGACGATTTCATGCCGCCTAATGGTAAGAATGAAAATTTTCTAAAAGCCCACAAAAGCCAT
TTTATAGAACAGGATTTAGCCAAAAAATTCCGCCATTTGGGGCTATTATAA

Upstream 100 bases:

>100_bases
CCTTTTATGAAACGCATGGCAAAGGGTTAAACACTTCCCTCTTTTTCAAACGCCTTGTGGTGTTTAATGTGAGTTATGTT
TATAGTTTTTAGGGGGTAAA

Downstream 100 bases:

>100_bases
TATTGTGTTATACTTCTAAATTCAATTTTGCTTGTTAGGACACTTATGAAAAATATTAGAAATATCGCTGTAATCGCGCA
TGTTGATCATGGGAAAACCA

Product: type II adenine specific methyltransferase

Products: NA

Alternate protein names: M.VspI; Adenine-specific methyltransferase VspI [H]

Number of amino acids: Translated: 816; Mature: 816

Protein sequence:

>816_residues
MLSNALSVAEIARLVNVSHSSVHNWIKTNLLEKIEIDSKIYVKTSSFLDFCRNHLGKNKLNKYANKSLKGAHNHQELILK
YLKILENSSDLEKLGSYYEEELSNTTRNLEGIYYTPNKIVEQLFTLPKDFDASQAIFCDPAVGSGNFIMHALKLGFKVEN
IYGYDTDAFAIALTKKRIKERYHLDCPNIVQKDFLNLKHTPQFDCIFTNPPWGKKYNQNQKENFKQCFNLSQSLDSASLF
FIASLNCLKENAHLGLLLPESCLNIDAFSKMREMALKFQIRSLIDFNKPFKTLMTKAVGLALKKAPNKNQKISCFYQSSE
FKRSPSSFLNNPKKIFNIHCSSKENKILDHLFSLPHITLKNNAHFALGIVTGNNKEKLHSKQEKNTIPIFRGSDILKDRL
KAPSQFINADLKDCQQVAPLSLYQSREKIVYKFISSKLVFFYDNKQRLFLNSANMFVLKENFPINANALKELLNSDLMQF
IFESLFKTHKILRKDLECLPLFVQFINNSFDEKFYLKNLGIEKKTLNILQSGKTMHIACLLALGDNLITLSLLKEIASKQ
QQPLKILGTHLTLKIARLLECEKHFEIIPLFENVPAFYDLKKQGVFWAMKDFLLLLKAIKKHQIKHLILEKQDFRSALLT
TFIPITAPNKDIKNVYQNRQELFSQIYGHVFNHSPYLMNLKNPKKILINPFTRSIERSIPLEHLQIVLKLLKPFCVTLLD
FEERYAFLKDRVTHYRVKTSLEEVKNLILESDLYIGGDSFLIHLAYYLKKNYFIFFYRDNDDFMPPNGKNENFLKAHKSH
FIEQDLAKKFRHLGLL
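
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the standard codon table in the usual TCAG order and assumes it applies here (the bacterial table assigns the same amino acids to elongation codons); `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCTTTCAAACGCTCTTTCTGTTGCAGAAATCGCTCGCCTAGTCAAT"))  # MLSNALSVAEIARLVN
print(translate("GGGCTATTATAA"))                                      # GLL
```

The 2451 bases correspond to 816 coding codons plus one stop codon, which matches the 816 residues listed.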

Sequences:

>Translated_816_residues
MLSNALSVAEIARLVNVSHSSVHNWIKTNLLEKIEIDSKIYVKTSSFLDFCRNHLGKNKLNKYANKSLKGAHNHQELILK
YLKILENSSDLEKLGSYYEEELSNTTRNLEGIYYTPNKIVEQLFTLPKDFDASQAIFCDPAVGSGNFIMHALKLGFKVEN
IYGYDTDAFAIALTKKRIKERYHLDCPNIVQKDFLNLKHTPQFDCIFTNPPWGKKYNQNQKENFKQCFNLSQSLDSASLF
FIASLNCLKENAHLGLLLPESCLNIDAFSKMREMALKFQIRSLIDFNKPFKTLMTKAVGLALKKAPNKNQKISCFYQSSE
FKRSPSSFLNNPKKIFNIHCSSKENKILDHLFSLPHITLKNNAHFALGIVTGNNKEKLHSKQEKNTIPIFRGSDILKDRL
KAPSQFINADLKDCQQVAPLSLYQSREKIVYKFISSKLVFFYDNKQRLFLNSANMFVLKENFPINANALKELLNSDLMQF
IFESLFKTHKILRKDLECLPLFVQFINNSFDEKFYLKNLGIEKKTLNILQSGKTMHIACLLALGDNLITLSLLKEIASKQ
QQPLKILGTHLTLKIARLLECEKHFEIIPLFENVPAFYDLKKQGVFWAMKDFLLLLKAIKKHQIKHLILEKQDFRSALLT
TFIPITAPNKDIKNVYQNRQELFSQIYGHVFNHSPYLMNLKNPKKILINPFTRSIERSIPLEHLQIVLKLLKPFCVTLLD
FEERYAFLKDRVTHYRVKTSLEEVKNLILESDLYIGGDSFLIHLAYYLKKNYFIFFYRDNDDFMPPNGKNENFLKAHKSH
FIEQDLAKKFRHLGLL
>Mature_816_residues
MLSNALSVAEIARLVNVSHSSVHNWIKTNLLEKIEIDSKIYVKTSSFLDFCRNHLGKNKLNKYANKSLKGAHNHQELILK
YLKILENSSDLEKLGSYYEEELSNTTRNLEGIYYTPNKIVEQLFTLPKDFDASQAIFCDPAVGSGNFIMHALKLGFKVEN
IYGYDTDAFAIALTKKRIKERYHLDCPNIVQKDFLNLKHTPQFDCIFTNPPWGKKYNQNQKENFKQCFNLSQSLDSASLF
FIASLNCLKENAHLGLLLPESCLNIDAFSKMREMALKFQIRSLIDFNKPFKTLMTKAVGLALKKAPNKNQKISCFYQSSE
FKRSPSSFLNNPKKIFNIHCSSKENKILDHLFSLPHITLKNNAHFALGIVTGNNKEKLHSKQEKNTIPIFRGSDILKDRL
KAPSQFINADLKDCQQVAPLSLYQSREKIVYKFISSKLVFFYDNKQRLFLNSANMFVLKENFPINANALKELLNSDLMQF
IFESLFKTHKILRKDLECLPLFVQFINNSFDEKFYLKNLGIEKKTLNILQSGKTMHIACLLALGDNLITLSLLKEIASKQ
QQPLKILGTHLTLKIARLLECEKHFEIIPLFENVPAFYDLKKQGVFWAMKDFLLLLKAIKKHQIKHLILEKQDFRSALLT
TFIPITAPNKDIKNVYQNRQELFSQIYGHVFNHSPYLMNLKNPKKILINPFTRSIERSIPLEHLQIVLKLLKPFCVTLLD
FEERYAFLKDRVTHYRVKTSLEEVKNLILESDLYIGGDSFLIHLAYYLKKNYFIFFYRDNDDFMPPNGKNENFLKAHKSH
FIEQDLAKKFRHLGLL

Specific function: This methylase recognizes the double-stranded sequence ATTAAT, causes specific methylation on A-5 on both strands, and protects the DNA from cleavage by the VspI endonuclease [H]
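
The recognition sequence ATTAAT is its own reverse complement, which is why the same site is present on both strands. A small sketch illustrating this and locating sites in a sequence; the helper names and the test strings are made up for illustration and are not taken from the genome.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_sites(seq: str, site: str = "ATTAAT") -> list[int]:
    """1-based start positions of every match, including overlapping ones."""
    seq = seq.upper()
    hits = []
    pos = seq.find(site)
    while pos != -1:
        hits.append(pos + 1)
        pos = seq.find(site, pos + 1)
    return hits


print(reverse_complement("ATTAAT"))  # ATTAAT
print(find_sites("CCATTAATGG"))      # [3]
print(find_sites("ATTAATTAAT"))      # [1, 5]
```

Because the site is palindromic, scanning one strand is enough to find every occurrence on both.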

COG id: COG0827

COG function: function code L; Adenine-specific DNA methylase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the N(4)/N(6)-methyltransferase family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003356
- InterPro:   IPR002052
- InterPro:   IPR002296 [H]

Pfam domain/function: PF02384 N6_Mtase [H]

EC number: 2.1.1.72 [H]

Molecular weight: Translated: 94738; Mature: 94738

Theoretical pI: Translated: 9.89; Mature: 9.89

Prosite motif: PS00092 N6_MTASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
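
Percentages like these can be reproduced by counting residues in the protein sequence. A minimal sketch, assuming the values are residue counts divided by sequence length and rounded to one decimal; `residue_percent` is an illustrative name. The example uses only the first 20 residues of the protein, so its output differs from the full-length figures above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


fragment = "MLSNALSVAEIARLVNVSHS"  # first 20 residues of the protein
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 5.0
```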

Secondary structure:

>Translated Secondary Structure
MLSNALSVAEIARLVNVSHSSVHNWIKTNLLEKIEIDSKIYVKTSSFLDFCRNHLGKNKL
CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHCHHHH
NKYANKSLKGAHNHQELILKYLKILENSSDLEKLGSYYEEELSNTTRNLEGIYYTPNKIV
HHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCEEEECHHHHH
EQLFTLPKDFDASQAIFCDPAVGSGNFIMHALKLGFKVENIYGYDTDAFAIALTKKRIKE
HHHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHH
RYHLDCPNIVQKDFLNLKHTPQFDCIFTNPPWGKKYNQNQKENFKQCFNLSQSLDSASLF
HHCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHH
FIASLNCLKENAHLGLLLPESCLNIDAFSKMREMALKFQIRSLIDFNKPFKTLMTKAVGL
HHHHHHHHHCCCCEEEECCHHHCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
ALKKAPNKNQKISCFYQSSEFKRSPSSFLNNPKKIFNIHCSSKENKILDHLFSLPHITLK
HHHCCCCCCCEEEEEEECCHHCCCCHHHHCCCCEEEEEECCCCCHHHHHHHHCCCCEEEE
NNAHFALGIVTGNNKEKLHSKQEKNTIPIFRGSDILKDRLKAPSQFINADLKDCQQVAPL
CCCEEEEEEEECCCHHHHHHHHHCCCCCEECCCHHHHHHHCCCHHHHCCCHHHHHHHCCH
SLYQSREKIVYKFISSKLVFFYDNKQRLFLNSANMFVLKENFPINANALKELLNSDLMQF
HHHHHHHHHHHHHHHCCEEEEEECCCEEEEECCCEEEEECCCCCCHHHHHHHHHHHHHHH
IFESLFKTHKILRKDLECLPLFVQFINNSFDEKFYLKNLGIEKKTLNILQSGKTMHIACL
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHHCCCEEEEEEH
LALGDNLITLSLLKEIASKQQQPLKILGTHLTLKIARLLECEKHFEIIPLFENVPAFYDL
HHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEHHHCCCHHHHH
KKQGVFWAMKDFLLLLKAIKKHQIKHLILEKQDFRSALLTTFIPITAPNKDIKNVYQNRQ
HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH
ELFSQIYGHVFNHSPYLMNLKNPKKILINPFTRSIERSIPLEHLQIVLKLLKPFCVTLLD
HHHHHHHHHHHCCCCEEEECCCCCEEEECHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
FEERYAFLKDRVTHYRVKTSLEEVKNLILESDLYIGGDSFLIHLAYYLKKNYFIFFYRDN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHHHCCEEEEEEECC
DDFMPPNGKNENFLKAHKSHFIEQDLAKKFRHLGLL
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MLSNALSVAEIARLVNVSHSSVHNWIKTNLLEKIEIDSKIYVKTSSFLDFCRNHLGKNKL
CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHCHHHH
NKYANKSLKGAHNHQELILKYLKILENSSDLEKLGSYYEEELSNTTRNLEGIYYTPNKIV
HHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCEEEECHHHHH
EQLFTLPKDFDASQAIFCDPAVGSGNFIMHALKLGFKVENIYGYDTDAFAIALTKKRIKE
HHHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHH
RYHLDCPNIVQKDFLNLKHTPQFDCIFTNPPWGKKYNQNQKENFKQCFNLSQSLDSASLF
HHCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHH
FIASLNCLKENAHLGLLLPESCLNIDAFSKMREMALKFQIRSLIDFNKPFKTLMTKAVGL
HHHHHHHHHCCCCEEEECCHHHCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
ALKKAPNKNQKISCFYQSSEFKRSPSSFLNNPKKIFNIHCSSKENKILDHLFSLPHITLK
HHHCCCCCCCEEEEEEECCHHCCCCHHHHCCCCEEEEEECCCCCHHHHHHHHCCCCEEEE
NNAHFALGIVTGNNKEKLHSKQEKNTIPIFRGSDILKDRLKAPSQFINADLKDCQQVAPL
CCCEEEEEEEECCCHHHHHHHHHCCCCCEECCCHHHHHHHCCCHHHHCCCHHHHHHHCCH
SLYQSREKIVYKFISSKLVFFYDNKQRLFLNSANMFVLKENFPINANALKELLNSDLMQF
HHHHHHHHHHHHHHHCCEEEEEECCCEEEEECCCEEEEECCCCCCHHHHHHHHHHHHHHH
IFESLFKTHKILRKDLECLPLFVQFINNSFDEKFYLKNLGIEKKTLNILQSGKTMHIACL
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHHCCCEEEEEEH
LALGDNLITLSLLKEIASKQQQPLKILGTHLTLKIARLLECEKHFEIIPLFENVPAFYDL
HHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEHHHCCCHHHHH
KKQGVFWAMKDFLLLLKAIKKHQIKHLILEKQDFRSALLTTFIPITAPNKDIKNVYQNRQ
HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH
ELFSQIYGHVFNHSPYLMNLKNPKKILINPFTRSIERSIPLEHLQIVLKLLKPFCVTLLD
HHHHHHHHHHHCCCCEEEECCCCCEEEECHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
FEERYAFLKDRVTHYRVKTSLEEVKNLILESDLYIGGDSFLIHLAYYLKKNYFIFFYRDN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHHHCCEEEEEEECC
DDFMPPNGKNENFLKAHKSHFIEQDLAKKFRHLGLL
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8493116; 7607528 [H]