Definition Streptococcus pneumoniae D39, complete genome.
Accession NC_008533
Length 2,046,115 bp

The map label for this gene is cps2T

Identifier: 116516453

GI number: 116516453

Start: 318727

End: 319911

Strand: Direct

Name: cps2T

Synonym: SPD_0320

Alternate gene names: NA

Gene position: 318727-319911 (Clockwise)

Preceding gene: 116516773

Following gene: 116516251

Centisome position: 15.58
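
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes that the gene length is end minus start plus one (inclusive coordinates) and that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either definition, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    # Values taken from this record.
    print(gene_length(318727, 319911))
    print(centisome(318727, 2046115))
```

Under these assumptions the results agree with the 1185-base gene sequence and the centisome position listed in this record.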

GC content: 40.25%
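
A GC content figure of this kind is usually the percentage of G and C bases in the gene sequence. The following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, not part of any tool used to build this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so that a multi-line
    FASTA body can be passed in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Toy input; paste the 1185-base gene sequence to compare with the listed value.
    print(gc_content("ATGC"))
```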

Gene sequence:

>1185_bases
ATGAAGAAGTCAGTTTATATCATTGGTTCAAAGGGGATTCCTGCCAAGTATGGAGGATTTGAAACCTTTGTTGAGAAATT
AACAGAATATCAAAAAGATGGGAACATCCAATACTATGTTGCCTGCATACGCGAAAATTCTGCAAAATCAGGATTTACAG
CAGATACATTTGAGTACAATGGTGCTATTTGTTACAACATTGATGTGCCTAATATTGGTCCTGCTAGAGCCATTGCTTAC
GATATTGCAGCGGTCAATAAGGCTATTGAATTGGCTAAGGGAAACAAGGACGAGGCTCCCATTTTTTACATTCTAGCTTG
TCGTATCGGACCTTTTATTTCTGGACTTAAGAAAAAAATTCGTTCGATCGGAGGCCGTTTGCTGGTAAATCCAGATGGTC
ATGAGTGGCTTCGGGCTAAATGGAGCCTGCCAGTTCGGAAGTATTGGAAATTTTCGGAACAGTTGATGGTCAAACATGCA
GATTTATTAGTCTGTGATAGCAAAAATATCGAAAAATATGTCCGAGAGGACTATAAACAGTATCAGCCCAAGACGACCTA
TATCGCTTATGGTACAGATACTACCCCTTCAAGTCTGAAATCAGAAGATGCCAAAGTTCGAAACTGGTATCGTGAAAAGG
GAGTAAGCGAAAATGGCTATTATCTAGTGGTGGGACGATTTGTTCCCGAAAACAACTATGAGACCATGATTCGTGAATTT
ATGAAGTCTAATTCTAAAAAAGACTTTGTTCTCATTACAAATGTGGAACAGAATAAGTTTTACGATCAGTTGCTCAAAGA
TACAGGCTTCGACAAAGACCCGAGAGTCAAATTTGTTGGGACTGTCTATGACCAAGAATTGCTCAAATACATCCGAGAGA
ATGCTTTTGCCTATTTTCACGGTCATGAAGTGGGAGGGACCAATCCATCCTTGCTTGAAGCATTGGCATCCACTAGGCTT
AATCTATTACTAGATGTAGGTTTTAACCGTGAAGTCGGAGAGAATGGTGCCATCTATTGGAGAAAAGATGAGCTAGCGCG
CGTTATTGAAGCAGTGGAGCAATTTGATGAGAACGCCATTTCTGAACTAGATAAAAAATCTAGCCAACGAATTGCAGAAG
CTTTTACGTGGGAAGAGATAGTGGTGGATTATGAGGAAGAGTTTGAAGGGGGAAAAAGTGAGTAA
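
The gene sequence above is 1185 bases, i.e. 395 codons including the terminal TAA stop, which is consistent with the 394-residue protein listed in this record. A minimal translation sketch follows; it assumes the standard codon assignments (shared by the bacterial genetic code for sense codons), and `translate` is an illustrative helper name.

```python
# Build the standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the gene sequence in this record.
    print(translate("ATGAAGAAGTCAGTTTATATC"))
```

Applied to the first 21 bases this gives MKKSVYI, the start of the protein sequence listed in this record.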

Upstream 100 bases:

>100_bases
ATTTGGAAAGACATTGAAATTTTATTGAAGACAGTTAAAGTAGTATTTATGAGAGATGGAGCGAAGTAGTTTACTTTTGT
TTTAGACTACTAGGAGAAAA

Downstream 100 bases:

>100_bases
CAAGCAAATTGCGATTATGATGGCAACTTATAATGGAGCTAAATACATTGGAGAACAGATAGACTCTATTCTTAGGCAAA
CTTATCAAGATTGGAAATTA

Product: glycosyl transferase, group 1 family protein

Products: NA

Alternate protein names: Glycosyltransferase; Glycosyl Transferase; Glycosyltransferase RfaG; Glycosyl Transferase Group 1 Family Protein; Group 1 Glycosyl Transferase; Rhamnosyl Transferase WchF; Polysaccharide Biosynthesis Protein/Rhamnosyl Transferase; Rhamnosyl Transferase; Glycosyl Transferase Family Protein; Glycosyl-(Rhamnosyl) Transferase; Glycosyltransferase Protein; Alpha-(1,2)-Rhamnosyltransferase; A-Glycosyltransferase; Rhamnosyltransferase RgpA; Polysaccharide Export Protein; Glycosyltransferase-Like Protein; Glycosyltransferase Family 4 Protein

Number of amino acids: Translated: 394; Mature: 394

Protein sequence:

>394_residues
MKKSVYIIGSKGIPAKYGGFETFVEKLTEYQKDGNIQYYVACIRENSAKSGFTADTFEYNGAICYNIDVPNIGPARAIAY
DIAAVNKAIELAKGNKDEAPIFYILACRIGPFISGLKKKIRSIGGRLLVNPDGHEWLRAKWSLPVRKYWKFSEQLMVKHA
DLLVCDSKNIEKYVREDYKQYQPKTTYIAYGTDTTPSSLKSEDAKVRNWYREKGVSENGYYLVVGRFVPENNYETMIREF
MKSNSKKDFVLITNVEQNKFYDQLLKDTGFDKDPRVKFVGTVYDQELLKYIRENAFAYFHGHEVGGTNPSLLEALASTRL
NLLLDVGFNREVGENGAIYWRKDELARVIEAVEQFDENAISELDKKSSQRIAEAFTWEEIVVDYEEEFEGGKSE

Sequences:

>Translated_394_residues
MKKSVYIIGSKGIPAKYGGFETFVEKLTEYQKDGNIQYYVACIRENSAKSGFTADTFEYNGAICYNIDVPNIGPARAIAY
DIAAVNKAIELAKGNKDEAPIFYILACRIGPFISGLKKKIRSIGGRLLVNPDGHEWLRAKWSLPVRKYWKFSEQLMVKHA
DLLVCDSKNIEKYVREDYKQYQPKTTYIAYGTDTTPSSLKSEDAKVRNWYREKGVSENGYYLVVGRFVPENNYETMIREF
MKSNSKKDFVLITNVEQNKFYDQLLKDTGFDKDPRVKFVGTVYDQELLKYIRENAFAYFHGHEVGGTNPSLLEALASTRL
NLLLDVGFNREVGENGAIYWRKDELARVIEAVEQFDENAISELDKKSSQRIAEAFTWEEIVVDYEEEFEGGKSE
>Mature_394_residues
MKKSVYIIGSKGIPAKYGGFETFVEKLTEYQKDGNIQYYVACIRENSAKSGFTADTFEYNGAICYNIDVPNIGPARAIAY
DIAAVNKAIELAKGNKDEAPIFYILACRIGPFISGLKKKIRSIGGRLLVNPDGHEWLRAKWSLPVRKYWKFSEQLMVKHA
DLLVCDSKNIEKYVREDYKQYQPKTTYIAYGTDTTPSSLKSEDAKVRNWYREKGVSENGYYLVVGRFVPENNYETMIREF
MKSNSKKDFVLITNVEQNKFYDQLLKDTGFDKDPRVKFVGTVYDQELLKYIRENAFAYFHGHEVGGTNPSLLEALASTRL
NLLLDVGFNREVGENGAIYWRKDELARVIEAVEQFDENAISELDKKSSQRIAEAFTWEEIVVDYEEEFEGGKSE

Specific function: Unknown

COG id: COG0438

COG function: function code M; Glycosyltransferase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight (Da): Translated: 45167; Mature: 45167
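
A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that method and a commonly used table of average residue masses; the record does not say how its value was derived, so small differences from the listed figure are possible. `molecular_weight` is a hypothetical helper name.

```python
# Average residue masses in daltons (amino acid minus one water); assumed table.
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # added once for the N- and C-terminal H and OH


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    residues = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[r] for r in residues) + WATER_MASS


if __name__ == "__main__":
    # Toy input; paste the 394-residue sequence to compare with the listed value.
    print(round(molecular_weight("GG"), 2))
```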

Theoretical pI: Translated: 5.89; Mature: 5.89

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
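
The percentages above can be reproduced by counting residues in the protein sequence listed in this record. The sketch below assumes the values are residue counts divided by the total length, rounded to one decimal place; `residue_percent` is an illustrative helper name.

```python
# Protein sequence copied from this record (394 residues).
PROTEIN = (
    "MKKSVYIIGSKGIPAKYGGFETFVEKLTEYQKDGNIQYYVACIRENSAKSGFTADTFEYNGAICYNIDVPNIGPARAIAY"
    "DIAAVNKAIELAKGNKDEAPIFYILACRIGPFISGLKKKIRSIGGRLLVNPDGHEWLRAKWSLPVRKYWKFSEQLMVKHA"
    "DLLVCDSKNIEKYVREDYKQYQPKTTYIAYGTDTTPSSLKSEDAKVRNWYREKGVSENGYYLVVGRFVPENNYETMIREF"
    "MKSNSKKDFVLITNVEQNKFYDQLLKDTGFDKDPRVKFVGTVYDQELLKYIRENAFAYFHGHEVGGTNPSLLEALASTRL"
    "NLLLDVGFNREVGENGAIYWRKDELARVIEAVEQFDENAISELDKKSSQRIAEAFTWEEIVVDYEEEFEGGKSE"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(1 for r in protein if r in residues)
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    print(residue_percent(PROTEIN, "C"))   # cysteine
    print(residue_percent(PROTEIN, "M"))   # methionine
    print(residue_percent(PROTEIN, "CM"))  # cysteine plus methionine
```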

Secondary structure:

>Translated Secondary Structure
MKKSVYIIGSKGIPAKYGGFETFVEKLTEYQKDGNIQYYVACIRENSAKSGFTADTFEYN
CCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCEEEC
GAICYNIDVPNIGPARAIAYDIAAVNKAIELAKGNKDEAPIFYILACRIGPFISGLKKKI
CEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEHHHHHHHHHHHHHH
RSIGGRLLVNPDGHEWLRAKWSLPVRKYWKFSEQLMVKHADLLVCDSKNIEKYVREDYKQ
HHCCCEEEECCCCCHHHHEECCCCHHHHHHHHHHHHHHHCCEEEECCCCHHHHHHHHHHH
YQPKTTYIAYGTDTTPSSLKSEDAKVRNWYREKGVSENGYYLVVGRFVPENNYETMIREF
CCCCEEEEEECCCCCCHHHHCCHHHHHHHHHHCCCCCCCEEEEEEEECCCCCHHHHHHHH
MKSNSKKDFVLITNVEQNKFYDQLLKDTGFDKDPRVKFVGTVYDQELLKYIRENAFAYFH
HHCCCCCCEEEEEECHHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCEEEEE
GHEVGGTNPSLLEALASTRLNLLLDVGFNREVGENGAIYWRKDELARVIEAVEQFDENAI
CCCCCCCCHHHHHHHHHHHEEEEEEECCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHH
SELDKKSSQRIAEAFTWEEIVVDYEEEFEGGKSE
HHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCC
>Mature Secondary Structure
MKKSVYIIGSKGIPAKYGGFETFVEKLTEYQKDGNIQYYVACIRENSAKSGFTADTFEYN
CCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCEEEC
GAICYNIDVPNIGPARAIAYDIAAVNKAIELAKGNKDEAPIFYILACRIGPFISGLKKKI
CEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEHHHHHHHHHHHHHH
RSIGGRLLVNPDGHEWLRAKWSLPVRKYWKFSEQLMVKHADLLVCDSKNIEKYVREDYKQ
HHCCCEEEECCCCCHHHHEECCCCHHHHHHHHHHHHHHHCCEEEECCCCHHHHHHHHHHH
YQPKTTYIAYGTDTTPSSLKSEDAKVRNWYREKGVSENGYYLVVGRFVPENNYETMIREF
CCCCEEEEEECCCCCCHHHHCCHHHHHHHHHHCCCCCCCEEEEEEEECCCCCHHHHHHHH
MKSNSKKDFVLITNVEQNKFYDQLLKDTGFDKDPRVKFVGTVYDQELLKYIRENAFAYFH
HHCCCCCCEEEEEECHHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCEEEEE
GHEVGGTNPSLLEALASTRLNLLLDVGFNREVGENGAIYWRKDELARVIEAVEQFDENAI
CCCCCCCCHHHHHHHHHHHEEEEEEECCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHH
SELDKKSSQRIAEAFTWEEIVVDYEEEFEGGKSE
HHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCC
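
The prediction strings above pair each residue with one state letter. Assuming the usual three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, the composition can be summarised as below; `state_fractions` is a hypothetical helper.

```python
from collections import Counter


def state_fractions(prediction: str) -> dict:
    """Fraction of residues in each predicted state (assumed H, E, C)."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: counts[state] / len(states) for state in "HEC"}


if __name__ == "__main__":
    # Toy input; concatenate the H/E/C lines of the record for the real profile.
    print(state_fractions("HHHHEECC"))
```

A mix of helix and strand states is what would be expected for the "Alpha Beta" structure class given in this record.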

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA