Definition: Escherichia coli UTI89 chromosome, complete genome.
Accession: NC_007946
Length: 5,065,741 bp

The map label for this gene is kpsD [H]

Identifier: 91212356

GI number: 91212356

Start: 3291411

End: 3293090

Strand: Direct

Name: kpsD [H]

Synonym: UTI89_C3364

Alternate gene names: 91212356

Gene position: 3291411-3293090 (Clockwise)

Preceding gene: 91212355

Following gene: 91212357

Centisome position: 64.97
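
The coordinate fields above are mutually consistent, and a short check makes that explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of the chromosome length. The record states neither convention, but both reproduce the listed values. The function names are illustrative only.

```python
GENOME_LENGTH = 5_065_741          # Length field of NC_007946
START, END = 3_291_411, 3_293_090  # Start and End fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1680
print(round(centisome(START, GENOME_LENGTH), 2))  # 64.97
```

The computed length matches the 1680-base gene sequence header, and 1680 / 3 = 560 codons, that is 559 residues plus one stop codon.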

GC content: 52.02
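
GC content is conventionally the percentage of bases that are G or C. The sketch below assumes that definition (the record does not say how its value was derived) and uses a hypothetical helper name, `gc_percent`. It is shown on the first ten bases of the gene sequence rather than on the full 1680 bases.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence in this record: 3 G, 0 C.
print(gc_percent("GTGATGAAAT"))  # 30.0
```

Passing the complete gene sequence, with line breaks removed, should give a value comparable to the GC content field if the record uses the same definition.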

Gene sequence:

>1680_bases
GTGATGAAATTATTTAAATCAATTTTACTGATTGCCGCCTGTCACGCGGCGCAGGCCAGCGCGGCCATTGATATTAACGC
TGACCCAAACCTTACAGGAGCCGCGCCGCTTACCGGTATTCTGAACGGGCAACAGTCGGATACGCAAAACATGAGCGGCT
TCGACAATACCCCGCCGCCCTCACCGCCGGTGGTAATGAGCCGTATGTTTGGTGCTCAACTTTTCAACGGCACCAGCGCG
GATAGCGGTGCGACGGTAGGATTCAACCCTGACTATATTCTGAATCCGGGTGATAGCATTCAGGTTCGCTTGTGGGGTGC
GTTCACCTTTGATGGTGCGTTACAGGTTGATCCCAAAGGTAATATTTTCCTGCCGAACGTTGGTCCGGTGAAAGTTGCTG
GCGTCAGTAATAGTCAGCTAAATGCCCTGGTCACATCCAAAGTGAAGGAAGTATACCAGTCCAACGTCAACGTCTACGCC
TCCTTATTACAGGCGCAGCCAGTAAAAGTGTACGTGACCGGATTTGTGCGTAATCCTGGTCTGTATGGCGGTGTGACGTC
TGATTCGTTACTCAATTATCTGATCAAGGCTGGCGGCGTTGATCCAGAGCGCGGAAGTTACGTTGATATTGTGGTCAAGC
GCGGTAACCGCGTGCGCTCCAACGTCAACCTGTACGACTTCCTGCTGAACGGCAAACTGGGGCTTTCGCAGTTCGCCGAT
GGTGACACCATCATCGTCGGGCCGCGTCAGCATACTTTCAGCGTTCAGGGCGATGTCTTTAACAGCTACGACTTTGAGTT
CCGCGAAAGCAGCATTCCCGTAACGGAAGCGTTGAGCTGGGCGCGCCCTAAGCCTGGCGCGACTCACATTACGATTATGC
GTAAACAGGGGCTGCAAAAACGCAGCGAATACTATCCGATCAGTTCTGCGCCAGGCCGTATGTTGCAAAATGGCGATACC
TTAATCGTGAGCACTGACCGCTATGCCGGCACCATTCAGGTGCGGGTTGAAGGCGCACACTCCGGTGAACATGCCATGGT
ACTGCCTTATGGTTCCACTATGCGTGCGGTTCTGGAAAAAGTCCGCCCGAACAGCATGTCGCAGATGAACGCGGTTCAGC
TTTATCGCCCATCAGTAGCTCAGCGTCAGAAAGAGATGCTGAATCTCTCGCTGCAAAAACTGGAGGAAGCATCACTTTCT
GCCCAGTCCTCCACCAAAGAAGAAGCCAGCCTGCGAATGCAGGAAGCGCAACTGATCAGCCGCTTTGTGGCGAAAGCACG
CACCGTAGTTCCGAAAGGTGAAGTGATCCTCAACGAATCCAATATTGATTCTGTTCTGCTTGAAGATGGCGACGTCATCA
ATATTCCGGAGAAAACATCGCTGGTTATGGTTCATGGCGAAGTGCTGTTCCCGAACGCGGTGAGCTGGCAGAAGGGTATG
ACCACCGAGGATTACATCGAGAAATGTGGTGGCCTGACGCAGAAATCGGGTAACGCCAGAATTATCGTCATTCGTCAGAA
CGGTGCTGCCGTCAACGCAGAAGATGTGGATTCACTCAAACCGGGCGATGAGATTATGGTTCTGCCGAAATATGAATCGA
AAAACATTGAAGTTACCCGTGGTATTTCCACCATCCTCTATCAGCTGGCGGTGGGTGCAAAAGTGATTCTGTCTTTGTAA

Upstream 100 bases:

>100_bases
CCTGCTGGTTACTGGTGTGCTGCCTGCTGTTCGGCACCCTGAAACTGTTGCTGGCTGTTATTGAAGATCACCGAGACTAA
CGCTGTCGCTGAATGAGTTT

Downstream 100 bases:

>100_bases
GGAGTTGAAATGAGCAAAGCAGTTATTGTCATTCCGGCTCGTTATGGCTCATCGCGCCTGCCGGGTAAGCCACTGCTCGA
TATTGTTGGTAAACCGATGA
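
For a gene on the direct strand, flanking regions like the two 100-base blocks above can be sliced out of the chromosome sequence. This is a sketch under stated assumptions: 1-based inclusive gene coordinates, a direct-strand gene, and no wrap-around at the origin of the circular chromosome. The function `flanks` and the toy sequence are hypothetical and are not part of the record.

```python
def flanks(chromosome: str, start: int, end: int, width: int = 100):
    """Return (upstream, downstream) flanks of a direct-strand gene.

    start and end are 1-based inclusive; flanks are truncated at the
    ends of the string instead of wrapping around the origin.
    """
    upstream = chromosome[max(0, start - 1 - width):start - 1]
    downstream = chromosome[end:end + width]
    return upstream, downstream


# Toy chromosome: the "gene" CCCGGG occupies positions 6-11.
toy = "ACGTA" + "CCCGGG" + "TTGCA"
print(flanks(toy, 6, 11, width=3))  # ('GTA', 'TTG')
```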

Product: polysialic acid transport protein KpsD

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 559; Mature: 559

Protein sequence:

>559_residues
MMKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNMSGFDNTPPPSPPVVMSRMFGAQLFNGTSA
DSGATVGFNPDYILNPGDSIQVRLWGAFTFDGALQVDPKGNIFLPNVGPVKVAGVSNSQLNALVTSKVKEVYQSNVNVYA
SLLQAQPVKVYVTGFVRNPGLYGGVTSDSLLNYLIKAGGVDPERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFAD
GDTIIVGPRQHTFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQKRSEYYPISSAPGRMLQNGDT
LIVSTDRYAGTIQVRVEGAHSGEHAMVLPYGSTMRAVLEKVRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLS
AQSSTKEEASLRMQEAQLISRFVAKARTVVPKGEVILNESNIDSVLLEDGDVINIPEKTSLVMVHGEVLFPNAVSWQKGM
TTEDYIEKCGGLTQKSGNARIIVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTRGISTILYQLAVGAKVILSL
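
The protein sequence follows from the gene sequence by codon-wise translation. The gene starts with GTG, which is read as methionine when it acts as the initiation codon. The sketch below builds the standard codon table and applies that start-codon rule. It assumes the standard code is appropriate here and that the input is an in-frame coding sequence; `translate` and `START_CODONS` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # common bacterial initiation codons


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence up to the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("GTGATGAAATTATTTAAA"))  # MMKLFK
```

The output matches the first six residues of the protein sequence above.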

Sequences:

>Translated_559_residues
MMKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNMSGFDNTPPPSPPVVMSRMFGAQLFNGTSA
DSGATVGFNPDYILNPGDSIQVRLWGAFTFDGALQVDPKGNIFLPNVGPVKVAGVSNSQLNALVTSKVKEVYQSNVNVYA
SLLQAQPVKVYVTGFVRNPGLYGGVTSDSLLNYLIKAGGVDPERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFAD
GDTIIVGPRQHTFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQKRSEYYPISSAPGRMLQNGDT
LIVSTDRYAGTIQVRVEGAHSGEHAMVLPYGSTMRAVLEKVRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLS
AQSSTKEEASLRMQEAQLISRFVAKARTVVPKGEVILNESNIDSVLLEDGDVINIPEKTSLVMVHGEVLFPNAVSWQKGM
TTEDYIEKCGGLTQKSGNARIIVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTRGISTILYQLAVGAKVILSL
>Mature_559_residues
MMKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNMSGFDNTPPPSPPVVMSRMFGAQLFNGTSA
DSGATVGFNPDYILNPGDSIQVRLWGAFTFDGALQVDPKGNIFLPNVGPVKVAGVSNSQLNALVTSKVKEVYQSNVNVYA
SLLQAQPVKVYVTGFVRNPGLYGGVTSDSLLNYLIKAGGVDPERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFAD
GDTIIVGPRQHTFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQKRSEYYPISSAPGRMLQNGDT
LIVSTDRYAGTIQVRVEGAHSGEHAMVLPYGSTMRAVLEKVRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLS
AQSSTKEEASLRMQEAQLISRFVAKARTVVPKGEVILNESNIDSVLLEDGDVINIPEKTSLVMVHGEVLFPNAVSWQKGM
TTEDYIEKCGGLTQKSGNARIIVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTRGISTILYQLAVGAKVILSL

Specific function: Involved in the translocation of the polysialic acid capsule across the outer membrane to the cell surface. May function as the periplasmic binding element of the PSA transport system, in which it transiently interacts with the membrane component of the t

COG id: COG1596

COG function: function code M; Periplasmic protein involved in polysaccharide export

Gene ontology:

Cell location: Periplasm [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: To E. coli K5 kpsD [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003715 [H]

Pfam domain/function: PF02563 Poly_export [H]

EC number: NA

Molecular weight: Translated: 60503; Mature: 60503

Theoretical pI: Translated: 7.64; Mature: 7.64

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
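
The percentages above are residue counts divided by protein length, rounded to one decimal place. The listed values are consistent with 2 Cys and 16 Met in 559 residues (0.4 %, 2.9 % and 3.2 % combined). A minimal sketch, with the hypothetical helper `residue_percent` and only the first 20 residues of the protein as input:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues, to one decimal place."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# First 20 residues of the protein sequence in this record.
fragment = "MMKLFKSILLIAACHAAQAS"
print(residue_percent(fragment, "C"))   # 5.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 15.0
```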

Secondary structure:

>Translated Secondary Structure
MMKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNMSGFDNTPPP
CHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCC
SPPVVMSRMFGAQLFNGTSADSGATVGFNPDYILNPGDSIQVRLWGAFTFDGALQVDPKG
CCHHHHHHHHHHHHCCCCCCCCCCEECCCCCEEECCCCEEEEEEEEEEEECCEEEECCCC
NIFLPNVGPVKVAGVSNSQLNALVTSKVKEVYQSNVNVYASLLQAQPVKVYVTGFVRNPG
CEECCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCHHHHHHHHHCCCEEEEEEEEEECCC
LYGGVTSDSLLNYLIKAGGVDPERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFAD
CCCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEEEECCCCCHHHCCC
GDTIIVGPRQHTFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQK
CCEEEECCCCEEEEECCCCCCCCCCEECCCCCCHHHHHHHCCCCCCCEEEEEEEHHCCHH
RSEYYPISSAPGRMLQNGDTLIVSTDRYAGTIQVRVEGAHSGEHAMVLPYGSTMRAVLEK
HHHCCCCCCCCCHHHCCCCEEEEECCCCCEEEEEEEECCCCCCEEEEEECCHHHHHHHHH
VRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLSAQSSTKEEASLRMQEAQLIS
HCCCCHHHHCHHEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
RFVAKARTVVPKGEVILNESNIDSVLLEDGDVINIPEKTSLVMVHGEVLFPNAVSWQKGM
HHHHHHHHCCCCCCEEEECCCCCEEEECCCCEEECCCCCEEEEEECCEECCCCCCCCCCC
TTEDYIEKCGGLTQKSGNARIIVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTR
CHHHHHHHHCCCCCCCCCEEEEEEECCCCEECHHHHHCCCCCCCEEEEECCCCCCEEEEH
GISTILYQLAVGAKVILSL
HHHHHHHHHHHCCEEEEEC
>Mature Secondary Structure
MMKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNMSGFDNTPPP
CHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCC
SPPVVMSRMFGAQLFNGTSADSGATVGFNPDYILNPGDSIQVRLWGAFTFDGALQVDPKG
CCHHHHHHHHHHHHCCCCCCCCCCEECCCCCEEECCCCEEEEEEEEEEEECCEEEECCCC
NIFLPNVGPVKVAGVSNSQLNALVTSKVKEVYQSNVNVYASLLQAQPVKVYVTGFVRNPG
CEECCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCHHHHHHHHHCCCEEEEEEEEEECCC
LYGGVTSDSLLNYLIKAGGVDPERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFAD
CCCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEEEECCCCCHHHCCC
GDTIIVGPRQHTFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQK
CCEEEECCCCEEEEECCCCCCCCCCEECCCCCCHHHHHHHCCCCCCCEEEEEEEHHCCHH
RSEYYPISSAPGRMLQNGDTLIVSTDRYAGTIQVRVEGAHSGEHAMVLPYGSTMRAVLEK
HHHCCCCCCCCCHHHCCCCEEEEECCCCCEEEEEEEECCCCCCEEEEEECCHHHHHHHHH
VRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLSAQSSTKEEASLRMQEAQLIS
HCCCCHHHHCHHEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
RFVAKARTVVPKGEVILNESNIDSVLLEDGDVINIPEKTSLVMVHGEVLFPNAVSWQKGM
HHHHHHHHCCCCCCEEEECCCCCEEEECCCCEEECCCCCEEEEEECCEECCCCCCCCCCC
TTEDYIEKCGGLTQKSGNARIIVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTR
CHHHHHHHHCCCCCCCCCEEEEEEECCCCEECHHHHHCCCCCCCEEEEECCCCCCEEEEH
GISTILYQLAVGAKVILSL
HHHHHHHHHHHCCEEEEEC
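
The secondary structure blocks above alternate a sequence line with a line of per-residue states. The sketch below tallies those states. It assumes the common convention H = helix, E = strand, C = coil, which the record does not define, and the function names are illustrative.

```python
from collections import Counter


def state_string(block_lines: list) -> str:
    """Join the state lines of an interleaved block (every second line)."""
    return "".join(block_lines[1::2])


def structure_composition(states: str) -> dict:
    """Percentage of H, E and C states, to one decimal place."""
    counts = Counter(states)
    total = len(states)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}


# Toy block: two sequence lines, each followed by its state line.
toy_block = ["MKLFK", "HHHHC", "AVGAK", "CEEEE"]
print(structure_composition(state_string(toy_block)))
# {'H': 40.0, 'E': 40.0, 'C': 20.0}
```

A protein with substantial shares of both H and E would fit the "Alpha Beta" structure class listed in this record.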

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8021185 [H]