Definition Vibrio splendidus LGP32 chromosome 1, complete genome.
Accession NC_011753
Length 3,299,303

The map label for this gene is hsdS [H]

Identifier: 218709370

GI number: 218709370

Start: 1490971

End: 1492539

Strand: Direct

Name: hsdS [H]

Synonym: VS_1379

Alternate gene names: 218709370

Gene position: 1490971-1492539 (Clockwise)

Preceding gene: 218709369

Following gene: 218709371

Centisome position: 45.19
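
The positional fields above are consistent with simple arithmetic on the coordinates. The record does not say how it derives them, so the following is only a sketch under two assumptions: that the centisome position is the start coordinate expressed as a percentage of the chromosome length, and that gene length is counted inclusively. The function names are illustrative, not part of any database API.

```python
def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return end - start + 1


# Values taken from this record.
START, END, GENOME_LENGTH = 1490971, 1492539, 3299303

print(centisome(START, GENOME_LENGTH))  # 45.19
print(gene_length(START, END))          # 1569
# 1569 bases = 523 codons; dropping the stop codon leaves 522 residues.
print(gene_length(START, END) // 3 - 1)  # 522
```

Under these assumptions the results match the listed centisome position (45.19), the 1569-base gene sequence, and the 522-residue translated product.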

GC content: 43.28

Gene sequence:

>1569_bases
ATGAGTGAGTTGCCGAAAGGTTGGATTGCTTGTACACCTTCAGATCTAGCGAATGACCCGAAAAACGAAATTGTTGATGG
CCCGTTTGGCTCAAATCTAAAAGCTTCTGAATATACTGATGAAGGAACACCAATAGTTCGAATTCAAAATGTAAAGCGCA
TGGCTTTTTTAAACAAGAACATAAAATATGTTACGGATGAAAAAGCAGAGTTCTTGAAGAGGCATAGCTTTAAGTCAGGT
GATTTGCTATTAACGAAACTTGGTGAGCCTCTAGGGCTGACGTGTATTGCTCCTGAGTACCTAAATGAAGGCATAATTGT
TGCTGATATTGTTAGGTTACGACCGAACCCAGAAGTTAACCGCAAGTGTTTGGCTTATCTTCTTAACTCTGAAGGCGTAA
TCAAACAGATAAATGCACACACTAAAGGCTCTACACGAGCTCGAATCAATTTATCTGTTGTTCGAAACTTAAATATAAAT
TTACCACCACTAGCCGAACAAAAACGCATCGTCGAAAAAATCGATGAGGTATTGGCACAGGTCGACACCATCAAAGCCCG
CCTAGATGGTATCCCAGATTTGCTAAAACGCTTCCGCCAATCGGTTCTCACTTCTGCAGTGTCGGGTAAGTTGACGGAAG
AGTGGCGTGAAGAGCAAGATGCTTATCCAACACTTAATGAGCTTAAAGCGACAATCGAACAAGAACGCTTTGAAATATGG
TGCTCTGCTGAGCTAAATAAAAAAATTTCTAAGGGGAAACCTCCTGCTAACGATAAATGGAAGGAAAAGTACCAACCGGG
AAACCCTAAGCATAATGATTCAAATAAACGTACAGCTGTTGAAGAAATTAAAGCACCGTGGTTGCTAACATCTTTAGACG
CAGTGTCTATTCTGACGACTGGTAAAACGCCGTCTACGGCAAAAGATGAATATTGGAATGGAGATACAATGTTTGTATCT
CCAGCCCAAATTCATCCTGAGGGTTATCTTCATAACCCCTCAAGGTATGTATCGAAGGCAGGGTGCCAGATTGTTCCCTT
GATCTCAAAAGGTTCGACGCTAATTGTTTGTATTGGAACCGTAGGGAAGGTCGGATTGTTAACTGAAGATGTGGTTATCA
ATCAACAAATTAATGCAATAACCCCTCTGCCTAGCGTCACTCATAAGTATATGTACTACTGGTGTAAAACGTTGTATCCA
TGGATTATTGATACAGCCCGAGCCACAGTTAACGCAGCGATACTTAACAAGAGTACGATGTCTACGGCACCTTTTGCATT
ACCGCCGCTTGAAGAACAAAAAGAAATCGTTCGCCTTGTCGACCAATACTTCGCGTTCGCTGACACCATTGAAGCGCAAG
TGAAAAAAGCGCAGGCGAGAGTAGATAACCTAACCCAAAGCATCTTGGCAAAAGCCTTCCGAGGTGAATTGGTGGCGCAA
GATCCAAACGATGAACCCGCCGACAAACTGTTAGCGCGCATTGCCGAGGCTCGCAAAGAGGCCGAAGCCCTAGCGAAGGC
GGCCAAAAAAGCTGAAGCGGCAAAGAAAAGAGCCGCAAAAAGTGCATAA
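
A GC content figure like the one listed above can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence; the record does not state its convention (for example, how ambiguous bases are treated), so exact agreement with the listed 43.28 is not guaranteed. The function name is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in wrapped FASTA text) are ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# Usage with the first 20 bases of the gene sequence above.
print(round(gc_content("ATGAGTGAGTTGCCGAAAGG"), 2))
```

Pasting the full 1569-base sequence into `gc_content` gives the value to compare against the GC content field.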

Upstream 100 bases:

>100_bases
ACCAACTGATGCAAGCACTCGGTGCGAATGACGAAGCAGAAGGGCAAAAGCAGTTACTTGAGGAAGCCTTTGGCTTGGCA
GAAAAGCCGGAGGCTGAGTA

Downstream 100 bases:

>100_bases
GTTCTAAGCTTTACTTTGTTGTATTAATAAAGGGGCGAGAGCCCCTTTTTAGTGTCTGTCATGTGGGATTTTTAAATGTT
TAAAGCGATTATTGAGTTAG

Product: type I restriction enzyme EcoKI S subunit

Products: NA

Alternate protein names: S.EcoBI; Type I restriction enzyme EcoBI specificity protein; S protein [H]

Number of amino acids: Translated: 522; Mature: 521

Protein sequence:

>522_residues
MSELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKNIKYVTDEKAEFLKRHSFKSG
DLLLTKLGEPLGLTCIAPEYLNEGIIVADIVRLRPNPEVNRKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNIN
LPPLAEQKRIVEKIDEVLAQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQERFEIW
CSAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAVEEIKAPWLLTSLDAVSILTTGKTPSTAKDEYWNGDTMFVS
PAQIHPEGYLHNPSRYVSKAGCQIVPLISKGSTLIVCIGTVGKVGLLTEDVVINQQINAITPLPSVTHKYMYYWCKTLYP
WIIDTARATVNAAILNKSTMSTAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGELVAQ
DPNDEPADKLLARIAEARKEAEALAKAAKKAEAAKKRAAKSA

Sequences:

>Translated_522_residues
MSELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKNIKYVTDEKAEFLKRHSFKSG
DLLLTKLGEPLGLTCIAPEYLNEGIIVADIVRLRPNPEVNRKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNIN
LPPLAEQKRIVEKIDEVLAQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQERFEIW
CSAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAVEEIKAPWLLTSLDAVSILTTGKTPSTAKDEYWNGDTMFVS
PAQIHPEGYLHNPSRYVSKAGCQIVPLISKGSTLIVCIGTVGKVGLLTEDVVINQQINAITPLPSVTHKYMYYWCKTLYP
WIIDTARATVNAAILNKSTMSTAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGELVAQ
DPNDEPADKLLARIAEARKEAEALAKAAKKAEAAKKRAAKSA
>Mature_521_residues
SELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKNIKYVTDEKAEFLKRHSFKSGD
LLLTKLGEPLGLTCIAPEYLNEGIIVADIVRLRPNPEVNRKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNINL
PPLAEQKRIVEKIDEVLAQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQERFEIWC
SAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAVEEIKAPWLLTSLDAVSILTTGKTPSTAKDEYWNGDTMFVSP
AQIHPEGYLHNPSRYVSKAGCQIVPLISKGSTLIVCIGTVGKVGLLTEDVVINQQINAITPLPSVTHKYMYYWCKTLYPW
IIDTARATVNAAILNKSTMSTAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGELVAQD
PNDEPADKLLARIAEARKEAEALAKAAKKAEAAKKRAAKSA
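
The translated and mature sequences above can be related to the gene sequence with a short sketch. It assumes the standard genetic code, and treats the mature form as the translated form minus the initiator methionine, which is what the two listings above show. The helper names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop a leading initiator Met, if present (assumed maturation step)."""
    return protein[1:] if protein.startswith("M") else protein


# First eight codons of the gene sequence in this record.
print(translate("ATGAGTGAGTTGCCGAAAGGTTGG"))  # MSELPKGW
```

The output matches the first eight residues of the translated sequence; `mature` applied to the full translation yields the 521-residue form.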

Specific function: The M and S subunits together form a methyltransferase (MTase) that methylates two adenine residues in complementary strands of a bipartite DNA recognition sequence. In the presence of the R subunit the complex can also act as an endonuclease, binding to

COG id: COG0732

COG function: function code V; Restriction endonuclease S subunits

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the type-I restriction system S methylase family [H]

Homologues:

Organism=Escherichia coli, GI1790807, Length=116, Percent_Identity=42.2413793103448, Blast_Score=91, Evalue=2e-19

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000055 [H]

Pfam domain/function: PF01420 Methylase_S [H]

EC number: NA

Molecular weight: Translated: 58021; Mature: 57890

Theoretical pI: Translated: 9.15; Mature: 9.15

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
0.8 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
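
The percentages above are what a simple residue count gives. This is a minimal sketch, assuming each figure is the count of the named residues divided by the sequence length and rounded to one decimal place; the function name is illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(cleaned.count(r) for r in residues.upper())
    return round(100.0 * count / len(cleaned), 1)


# Usage with the first 20 residues of the translated sequence in this record.
fragment = "MSELPKGWIACTPSDLANDP"
print(residue_percent(fragment, "C"))
print(residue_percent(fragment, "M"))
print(residue_percent(fragment, "CM"))
```

A manual count of the translated sequence gives 7 Cys and 5 Met in 522 residues, which rounds to the listed 1.3, 1.0 and 2.3; removing the initiator Met leaves 4 Met in 521 residues, giving 0.8 and 2.1 for the mature protein.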

Secondary structure:

>Translated Secondary Structure
MSELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKN
CCCCCCCCEEECCHHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHCCC
IKYVTDEKAEFLKRHSFKSGDLLLTKLGEPLGLTCIAPEYLNEGIIVADIVRLRPNPEVN
CCEECCHHHHHHHHHCCCCCCEEHECCCCCCCEEEECHHHHCCCEEEEHHHHCCCCCCCC
RKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNINLPPLAEQKRIVEKIDEVLAQ
HHHHHHHHCCCCHHHHHCCCCCCCCEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHH
VDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQERFEIW
HHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHEEE
CSAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAVEEIKAPWLLTSLDAVSILTT
ECHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHCCCHHEEHHHHEEEEEC
GKTPSTAKDEYWNGDTMFVSPAQIHPEGYLHNPSRYVSKAGCQIVPLISKGSTLIVCIGT
CCCCCCCCCCCCCCCEEEEECCEECCCCCCCCHHHHHHHCCCEEEEEECCCCEEEEEECC
VGKVGLLTEDVVINQQINAITPLPSVTHKYMYYWCKTLYPWIIDTARATVNAAILNKSTM
CCCCCCEEHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
STAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGELVAQ
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEC
DPNDEPADKLLARIAEARKEAEALAKAAKKAEAAKKRAAKSA
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
SELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKN
CCCCCCCEEECCHHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHCCC
IKYVTDEKAEFLKRHSFKSGDLLLTKLGEPLGLTCIAPEYLNEGIIVADIVRLRPNPEVN
CCEECCHHHHHHHHHCCCCCCEEHECCCCCCCEEEECHHHHCCCEEEEHHHHCCCCCCCC
RKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNINLPPLAEQKRIVEKIDEVLAQ
HHHHHHHHCCCCHHHHHCCCCCCCCEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHH
VDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQERFEIW
HHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHEEE
CSAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAVEEIKAPWLLTSLDAVSILTT
ECHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHCCCHHEEHHHHEEEEEC
GKTPSTAKDEYWNGDTMFVSPAQIHPEGYLHNPSRYVSKAGCQIVPLISKGSTLIVCIGT
CCCCCCCCCCCCCCCEEEEECCEECCCCCCCCHHHHHHHCCCEEEEEECCCCEEEEEECC
VGKVGLLTEDVVINQQINAITPLPSVTHKYMYYWCKTLYPWIIDTARATVNAAILNKSTM
CCCCCCEEHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
STAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGELVAQ
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEC
DPNDEPADKLLARIAEARKEAEALAKAAKKAEAAKKRAAKSA
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: Hydrolase; Acting on ester bonds [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 6304321 [H]