Definition Haemophilus influenzae Rd KW20 chromosome, complete genome.
Accession NC_000907
Length 1,830,138


The map label for this gene is hsdS

Identifier: 16273200

GI number: 16273200

Start: 1366454

End: 1367833

Strand: Reverse

Name: hsdS

Synonym: HI1286

Alternate gene names: NA

Gene position: 1367833-1366454 (Counterclockwise)

Preceding gene: 30995437

Following gene: 16273199

Centisome position: 74.74
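
The coordinate fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates, and assumes the centisome position is the gene's first base on the reverse strand (1367833) expressed as a percentage of the chromosome length. The record does not state either formula, so this is a consistency check rather than a definition.

```python
# Values copied from the record; the formulas are assumptions, not documented by the source.
GENOME_LENGTH = 1_830_138
START, END = 1_366_454, 1_367_833  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# The gene is on the reverse strand, so its first base is taken to be END.
position = centisome(END, GENOME_LENGTH)
print(length, round(position, 2))
```

Under these assumptions the length agrees with the 1380-base gene sequence header and the percentage agrees with the centisome field.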

GC content: 35.22

Gene sequence:

>1380_bases
ATGAGTGATTGGAAAGAGTATTCTCTGGGTGATATATCAAGAAATATATCAAGAAGGTTTGACTTTAATGCATACCCAAA
TGTTGTATTTATTAACACTGGGGATGTATTAAATAATAAATTTCTACATTGCGAGATATCAAACGTCAAAGATTTACCAG
GGCAAGCTAAGAAAGCCATAAAAAAAGGCGATATTCTATATAGTGAAATAAGACCAGGTAATGGACGATATTTATTTGTA
GATAATGATTTAGACAATTATGTTGTTTCCACAAAATTTATGGTAATTGAGCCTAATGCTAATATTGTATTACCAGAATT
CCTATTTTTACTACTAATTAGCAATGAAACAACCGAATATTTTAAAATGATAGCTGAATCAAGATCTGGAACATTTCCAC
AGATCACATTTGATTCGGTTTCAAGTTTATCTTTAAATATACCAGATAAAGAAACACAACAAAAAATCCTTGATATTATT
ACCCCATTAGATGACAAAATAGAACTCAACACCCAAATCAACCAAACCTTGGAACAAATCGCCCAAGCGCTGTTTAAAAG
CTGGTTTGTCGATTTCGATCCCGTGCGTGCCAAAGCCCAAGCCCTTTCGGACGGCATGAGCCTTGAACAAGCGGAACTTG
CCGCCATGCAGGCAATCAGCGGAAAAACACCCGAAGAACTGACCGCACTTTCACAAACACAGCCTGACCGCTACGCCGAA
TTAGCCGAAACCGCCAAAGCGTTTCCGTGTGAGATGGTGGAGGTTGATGGGGTGGAGGTTGATGGGGTTGAGGTGCCGAG
GGGGTGGGAAATGAAAGCCTTATCAGATTTAGGTCAAATTATCTGCGGGAAAACACCATCAAAATCCAACAAAGAATTTT
ATGGTGATGATGTGCCATTTATTAAGATTCCAGATATGCACAATCAAGTATTTATTACTCAAACAACAGATAACTTGAGT
GTAGTAGGTGCAAATTACCAATCTAAAAAATATATTCCAGCAAAAAGCATTTGTGTAAGTTGTATTGCTACTGTCGGATT
GGTTTCAATGACATCTAAGCCATCTCATACAAACCAACAAATTAATTCAATTATTCCAGATGATGAACAATCCTGTGAGT
TTTTATATTTATCGTTAAAACAGCCATCAATGACAAAATATCTAAAAGATTTGGCAAGTGGTGGCACTGCAACTTTAAAT
TTAAACACAAGCACATTCTCTAAAATAGAGATAATTACACCATCAAAAGAAATTATCTATATTTTTCAAAAAAAAGTTGT
TTCTATTTTTGAAAAGACCTTATCAAATTCTATTGAAAATAAGAGACTAACTGAAATAAGAGATTTATTGCTGCCTAGAT
TGTTGAATGGGGAAATTTAA
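
The GC content field (35.22) is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical and the short input is only the first four codons of the gene, used to keep the example small.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G+C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First four codons of the gene sequence, for illustration only.
first_codons = "ATGAGTGATTGG"
print(round(gc_content(first_codons), 2))
```

Pasting the full 1380-base sequence into `gc_content` (line breaks are ignored) would give the value for the whole gene.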

Upstream 100 bases:

>100_bases
GCAGAAAAAATGCAAAATCTGACCGCTCTTTTGAAGGAGCAATTTGCAAAAAGTGCGGAATTGGAAGCTGAGATTAAGAA
GAATTTAGGGGGATTGGGTT

Downstream 100 bases:

>100_bases
TTACTCACGAGATTGATTTCACTTCTGCCAATAAACGAAGAAATTAATCTATTAGCTCCCCAAAGACAGCAAATACACAA
AAGGAACACCTATGCTCAAC
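
Because the gene lies on the reverse strand and the gene sequence begins with ATG, the sequences in this record appear to be written in the gene's own 5' to 3' orientation. Converting between that orientation and the forward strand of the chromosome takes a reverse complement. The sketch below shows one way to do it; the function name is hypothetical and only unambiguous A/C/G/T bases are handled.

```python
# Translation table for unambiguous DNA bases in either case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA string, ignoring whitespace."""
    bases = "".join(sequence.split())
    return bases.translate(_COMPLEMENT)[::-1]


# First two codons of the gene, as they would read on the opposite strand.
print(reverse_complement("ATGAGT"))
```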

Product: type I restriction/modification specificity protein

Products: NA

Alternate protein names: S.HindVIIP; Type I restriction enzyme HindVIIP specificity protein; S protein

Number of amino acids: Translated: 459; Mature: 458

Protein sequence:

>459_residues
MSDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAIKKGDILYSEIRPGNGRYLFV
DNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEYFKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDII
TPLDDKIELNTQINQTLEQIAQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE
LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPFIKIPDMHNQVFITQTTDNLS
VVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLN
LNTSTFSKIEIITPSKEIIYIFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI
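
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates that step with the standard genetic code; the compact table construction and the function name `translate` are illustrative choices, not something the record specifies. The two test inputs are the first 36 bases and the last 18 bases of the gene sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGTGATTGGAAAGAGTATTCTCTGGGTGATATA"))
print(translate("TTGAATGGGGAAATTTAA"))
```

The two outputs match the first twelve and last five residues of the protein sequence, which suggests the record was produced with the same reading frame.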

Sequences:

>Translated_459_residues
MSDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAIKKGDILYSEIRPGNGRYLFV
DNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEYFKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDII
TPLDDKIELNTQINQTLEQIAQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE
LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPFIKIPDMHNQVFITQTTDNLS
VVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLN
LNTSTFSKIEIITPSKEIIYIFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI
>Mature_458_residues
SDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAIKKGDILYSEIRPGNGRYLFVD
NDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEYFKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDIIT
PLDDKIELNTQINQTLEQIAQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAEL
AETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPFIKIPDMHNQVFITQTTDNLSV
VGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLNL
NTSTFSKIEIITPSKEIIYIFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI

Specific function: The M and S subunits together form a methyltransferase (MTase) that methylates two adenine residues in complementary strands of a bipartite DNA recognition sequence. Subunit S dictates DNA sequence specificity.

COG id: COG0732

COG function: function code V; Restriction endonuclease S subunits

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the type-I restriction system S methylase family

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): T1SH_HAEIN (P44152)

Other databases:

- EMBL:   L42023
- PIR:   H64024
- RefSeq:   NP_439438.1
- REBASE:   5608
- GeneID:   950231
- GenomeReviews:   L42023_GR
- KEGG:   hin:HI1286
- NMPDR:   fig|71421.1.peg.1226
- TIGR:   HI_1286
- HOGENOM:   HBG679476
- OMA:   EPNANIV
- ProtClustDB:   CLSK2393461
- BioCyc:   HINF71421:HI_1286-MONOMER
- InterPro:   IPR000055

Pfam domain/function: PF01420 Methylase_S

EC number: NA

Molecular weight: Translated: 51402; Mature: 51271
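
The two molecular weights differ by about the mass of one methionine residue, consistent with the mature sequence lacking the initiator Met. The check below is a rough sketch: the average residue mass of methionine (about 131.19 Da) is a value supplied here, not taken from the record, and the record does not say how its weights were computed.

```python
# Value from the record.
TRANSLATED_MW = 51402

# Assumed average mass of a methionine residue in a peptide chain (Da).
MET_RESIDUE_MASS = 131.19

# Removing the N-terminal Met shortens the chain by one residue.
mature_estimate = TRANSLATED_MW - MET_RESIDUE_MASS
print(round(mature_estimate))
```

The rounded estimate agrees with the mature weight listed above.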

Theoretical pI: Translated: 4.66; Mature: 4.66

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
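
These percentages can be reproduced by counting C and M residues in the protein sequence. The sketch below assumes the mature protein is the translated sequence without its first residue, as the two sequences in this record show; the helper name `percent` is hypothetical.

```python
# Translated protein sequence copied from the record (459 residues).
TRANSLATED = (
    "MSDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAIKKGDILYSEIRPGNGRYLFV"
    "DNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEYFKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDII"
    "TPLDDKIELNTQINQTLEQIAQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE"
    "LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPFIKIPDMHNQVFITQTTDNLS"
    "VVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLN"
    "LNTSTFSKIEIITPSKEIIYIFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI"
)
MATURE = TRANSLATED[1:]  # drop the initiator Met


def percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


for label, protein in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(protein, "C"), percent(protein, "M"), percent(protein, "CM"))
```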

Secondary structure:

>Translated Secondary Structure
MSDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAI
CCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEECCCHHCCCEEEEECCCCCCCCCHHHHHH
KKGDILYSEIRPGNGRYLFVDNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEY
HHCCEEEEECCCCCCEEEEEECCCCCEEEEEEEEEECCCCCEEHHHHHHHHHHCCCHHHH
FKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDIITPLDDKIELNTQINQTLEQI
HHHHHHCCCCCCCEEECCCCCEEEEECCCHHHHHHHHHHHCCCCCCEEECHHHHHHHHHH
AQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE
HHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHHHH
LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPF
HHHHHHCCCCEEEEECCEEECCEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE
IKIPDMHNQVFITQTTDNLSVVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQ
EECCCCCCEEEEEEECCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHH
INSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLNLNTSTFSKIEIITPSKEIIY
HHHCCCCCCCCCEEEEEEECCCHHHHHHHHHHCCCEEEEEECCCCCCEEEEECCCHHEEH
IFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI
HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure 
SDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAI
CCCCCCCHHHHHHHHHHCCCCCCCCCEEEEECCCHHCCCEEEEECCCCCCCCCHHHHHH
KKGDILYSEIRPGNGRYLFVDNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEY
HHCCEEEEECCCCCCEEEEEECCCCCEEEEEEEEEECCCCCEEHHHHHHHHHHCCCHHHH
FKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDIITPLDDKIELNTQINQTLEQI
HHHHHHCCCCCCCEEECCCCCEEEEECCCHHHHHHHHHHHCCCCCCEEECHHHHHHHHHH
AQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE
HHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHHHH
LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPF
HHHHHHCCCCEEEEECCEEECCEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE
IKIPDMHNQVFITQTTDNLSVVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQ
EECCCCCCEEEEEEECCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHH
INSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLNLNTSTFSKIEIITPSKEIIY
HHHCCCCCCCCCEEEEEEECCCHHHHHHHHHHCCCEEEEEECCCCCCEEEEECCCHHEEH
IFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI
HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCC
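
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The letters H, E and C are presumably helix, strand and coil, although the record does not define them. The sketch below shows one way to stitch such a block back into two aligned strings and summarise the state composition; the function names and the toy input are hypothetical.

```python
from collections import Counter


def pair_lines(lines):
    """Join alternating residue/state lines into two aligned strings."""
    stripped = (line.strip() for line in lines)
    rows = [row for row in stripped if row and not row.startswith(">")]
    if len(rows) % 2:
        raise ValueError("expected alternating residue and state lines")
    sequence = "".join(rows[0::2])
    states = "".join(rows[1::2])
    if len(sequence) != len(states):
        raise ValueError("residue and state strings differ in length")
    return sequence, states


def state_fractions(states):
    """Fraction of residues assigned to each state letter."""
    counts = Counter(states)
    return {state: counts[state] / len(states) for state in sorted(counts)}


# Toy block for illustration only; not data from the record.
toy = [">Toy Secondary Structure", "MSDW", "CCHH", "KEYS", "HHEE"]
sequence, states = pair_lines(toy)
print(sequence, states, state_fractions(states))
```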

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7542800