| Definition | Streptococcus pneumoniae D39, complete genome. |
|---|---|
| Accession | NC_008533 |
| Length | 2,046,115 |
Click here to switch to the map view.
The map label for this gene is epsH [H]
Identifier: 116517146
GI number: 116517146
Start: 79054
End: 79593
Strand: Direct
Name: epsH [H]
Synonym: SPD_0078
Alternate gene names: 116517146
Gene position: 79054-79593 (Clockwise)
Preceding gene: 116515848
Following gene: 116516097
Centisome position: 3.86
GC content: 44.63
Gene sequence:
>540_bases ATGAAGTTATTGTCTATCGCAATTTCTAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTGGAGTCGCTAGTGATTGG TGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTTAGCTAGCA AGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGGCTTGGCAGAGGCT TCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTCGTGCCTACTTGAAAATTCTTGAAACCTTGCA GGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGACCAATTTTGTCTATGAAAAGGAAGGGCAGTCTCGTAAGA AGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGATTTTTGGCTGGGACCAGGTCGGAAATTTCTCCAAAGGCCAG TATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCGTGCTAGCCAGTTCTAA
Upstream 100 bases:
>100_bases GACGAAGTCAGTAACATCTATACGGCAAGGCGACGTTGACGCGGTTTGAAGAGATTTTCGAAGAGTATAAGAAAAAATCA GTCCCCTAAAGGAGTAGATT
Downstream 100 bases:
>100_bases CTGCCTGAACATACTTTTTATGTCGATAATCTCTTTGTCTTTACGCCCCTTCAGCAGGTCAAGACCATGTACTATCTGCC TGTCGATTTCTATCGTTATT
Product: glycosyl transferase, group 2 family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 179; Mature: 179
Protein sequence:
>179_residues MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEA SGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQ YTMMHSLIYRTDLLRASQF
Sequences:
>Translated_179_residues MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEA SGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQ YTMMHSLIYRTDLLRASQF >Mature_179_residues MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEA SGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQ YTMMHSLIYRTDLLRASQF
Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]
COG id: COG0463
COG function: function code M; Glycosyltransferases involved in cell wall biogenesis
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyltransferase 2 family [H]
Homologues:
Organism=Escherichia coli, GI1790044, Length=176, Percent_Identity=32.9545454545455, Blast_Score=73, Evalue=1e-14,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001173 [H]
Pfam domain/function: PF00535 Glycos_transf_2 [H]
EC number: 2.-.-.- [C]
Molecular weight: Translated: 20209; Mature: 20209
Theoretical pI: Translated: 5.28; Mature: 5.28
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.7 %Cys (Translated Protein) 2.2 %Met (Translated Protein) 3.9 %Cys+Met (Translated Protein) 1.7 %Cys (Mature Protein) 2.2 %Met (Mature Protein) 3.9 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVR CCHHEEEHHCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCHHHHH AIYQENKCHGGAVNRGLAEASGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFV HHHHHCCCCCCHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE TNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQYTMMHSLIYRTDLLRASQF EHEEECCCCCHHHHHCCCHHHHHHHHHHCCHHCCCCCCCHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVR CCHHEEEHHCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCHHHHH AIYQENKCHGGAVNRGLAEASGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFV HHHHHCCCCCCHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE TNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQYTMMHSLIYRTDLLRASQF EHEEECCCCCHHHHHCCCHHHHHHHHHHCCHHCCCCCCCHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 8969506; 9384377 [H]