Definition Streptococcus pneumoniae D39, complete genome.
Accession NC_008533
Length 2,046,115

Click here to switch to the map view.

The map label for this gene is epsH [H]

Identifier: 116517146

GI number: 116517146

Start: 79054

End: 79593

Strand: Direct

Name: epsH [H]

Synonym: SPD_0078

Alternate gene names: 116517146

Gene position: 79054-79593 (Clockwise)

Preceding gene: 116515848

Following gene: 116516097

Centisome position: 3.86

GC content: 44.63

Gene sequence:

>540_bases
ATGAAGTTATTGTCTATCGCAATTTCTAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTGGAGTCGCTAGTGATTGG
TGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTTAGCTAGCA
AGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGGCTTGGCAGAGGCT
TCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTCGTGCCTACTTGAAAATTCTTGAAACCTTGCA
GGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGACCAATTTTGTCTATGAAAAGGAAGGGCAGTCTCGTAAGA
AGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGATTTTTGGCTGGGACCAGGTCGGAAATTTCTCCAAAGGCCAG
TATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCGTGCTAGCCAGTTCTAA

Upstream 100 bases:

>100_bases
GACGAAGTCAGTAACATCTATACGGCAAGGCGACGTTGACGCGGTTTGAAGAGATTTTCGAAGAGTATAAGAAAAAATCA
GTCCCCTAAAGGAGTAGATT

Downstream 100 bases:

>100_bases
CTGCCTGAACATACTTTTTATGTCGATAATCTCTTTGTCTTTACGCCCCTTCAGCAGGTCAAGACCATGTACTATCTGCC
TGTCGATTTCTATCGTTATT

Product: glycosyl transferase, group 2 family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 179; Mature: 179

Protein sequence:

>179_residues
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEA
SGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQ
YTMMHSLIYRTDLLRASQF

Sequences:

>Translated_179_residues
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEA
SGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQ
YTMMHSLIYRTDLLRASQF
>Mature_179_residues
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEA
SGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQ
YTMMHSLIYRTDLLRASQF

Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]

COG id: COG0463

COG function: function code M; Glycosyltransferases involved in cell wall biogenesis

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family [H]

Homologues:

Organism=Escherichia coli, GI1790044, Length=176, Percent_Identity=32.9545454545455, Blast_Score=73, Evalue=1e-14,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001173 [H]

Pfam domain/function: PF00535 Glycos_transf_2 [H]

EC number: 2.-.-.- [C]

Molecular weight: Translated: 20209; Mature: 20209

Theoretical pI: Translated: 5.28; Mature: 5.28

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVR
CCHHEEEHHCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCHHHHH
AIYQENKCHGGAVNRGLAEASGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFV
HHHHHCCCCCCHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE
TNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQYTMMHSLIYRTDLLRASQF
EHEEECCCCCHHHHHCCCHHHHHHHHHHCCHHCCCCCCCHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVR
CCHHEEEHHCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCHHHHH
AIYQENKCHGGAVNRGLAEASGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFV
HHHHHCCCCCCHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE
TNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQYTMMHSLIYRTDLLRASQF
EHEEECCCCCHHHHHCCCHHHHHHHHHHCCHHCCCCCCCHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8969506; 9384377 [H]