Definition: Escherichia coli IAI39 chromosome, complete genome.
Accession: NC_011750
Length: 5,132,068 bp

The map label for this gene is sipD [H]

Identifier: 218702500

GI number: 218702500

Start: 4423335

End: 4424684

Strand: Direct

Name: sipD [H]

Synonym: ECIAI39_4256

Alternate gene names: 218702500

Gene position: 4423335-4424684 (Clockwise)

Preceding gene: 218702499

Following gene: 218702501

Centisome position: 86.19
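
The coordinates listed above agree with each other: the inclusive span 4423335-4424684 covers 1350 bases, and the start coordinate expressed as a percentage of the 5,132,068-base genome reproduces the centisome value. A minimal Python sketch, assuming the centisome is derived from the start coordinate (the record does not state the formula; the function names are illustrative):

```python
def gene_length(start: int, end: int) -> int:
    """Length of a gene given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length, rounded to 2 places.

    Assumption: the record's 'Centisome position' uses the start coordinate.
    """
    return round(100.0 * start / genome_length, 2)


GENOME_LENGTH = 5_132_068          # 'Length' field of this record
START, END = 4_423_335, 4_424_684  # 'Start' and 'End' fields

print(gene_length(START, END))          # 1350
print(centisome(START, GENOME_LENGTH))  # 86.19
```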

GC content: 40.3

Gene sequence:

>1350_bases
ATGATTAATAAAGATAATAATTTAAATATAGATATTCTACGTCAGCAGAATGCAAACATTCCTCTTCAAGAGGCATCCTC
GCAAAGTGAGGAGCCTCATCAACTGGGAAACAGCAAAAGCGAGGGTAATTTGCATAACAGCGCCAGTAAGGATAATAATA
TTGGCGCTATGCTCAAGAAAGATGTTGAAAAACTATCAACTTTAGTCAGTGAACGTGGCCCGCGAAATGAAGACTGGAAA
TGCCAGCATGACATACAGCTTAACTTCGTTAAAAACATGCTCAGAAATTTCCCAACTGAGAACCTGTCAGAGAAAGGATC
CGCGCTCTTACACGAAAGTTATTTGCTGATAAAAAACGTTCTGCAGCTTACAGAAAATGTTAATACTATTGACGTAAAAT
TAAAGGATACCGCTACAATATTGACCCTGCATAACTCTGTGAGAAAAACCCTGGCTAACGTTGCTCACAACACAACCGGG
TCAAGTGCAATGGCAACGACCAAAGGTACCTTAAATCACGCTATCGCTTCTTTAGACAGTTCTGTGCCTGAAATGAGAAG
TTCAGCAACCAGCAGTACTTTAACGGCCAGTAGTTCAAACGAATTTAAAAGTGATAAATATATATGCAGTGATATTGCTG
ATTTGATGAGTGTGCTCGGTAGCGACTACCTCGAAATCTATGCAAATAGCGTTGAGATAATGAGCGCATACTGGCAGGAC
TTCAGTGAACATATCCAAAGCAATATGGGAAAATGGACTCACTCTAATAAAAAAGGCGATGCGATAGTTTTTGACGTCAA
TGCATTACAAAAAGCCTTGATGCATTTTTATTATATCGATAAATATCCTAATGGTGATTTCCACTATCATTACAATCCAG
ACTATGTTTTATACCCTCCGGCACCTGCTGACAAAATTGGTGTACCACTAGAAGAAGCAGAAAAATGGTGCGCCGCTTTA
GGACTTCCGGTAATTCCACCCGATCCCAAACACCGCACACCTTCTCCGATTGTCGAAGTTGAACCGCAAGGTAGCGGTCT
GTATGTTATTATTCCCAACCCTCAGATTATTGACAGTATGAGTCAGTCTTCAGACTCAATGGTGCATAGGGACGATAAAG
GGAAGGAAAAAAATATATCTAAAGAATTCACCGGTTATGAAATAAGCACAGCTGAATACCAGGCATGGCTGGCGGGTTAT
AACGGCCAGACAGAAAACATGAAAACCGACGTGCAGGTTATCACGACTAAATACAGTACAGCCAACTCCACCTACGATAC
AATTATCAAATTATTATCTTCAACCATTACTGCGTTGTTTGATTCAGCGAAAGATTATTTACGTTTCTGA
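
GC content is the percentage of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the wrapped sequence lines are joined into one string and that only unambiguous A/C/G/T bases count toward the total (the helper name is illustrative, not part of the record):

```python
def gc_content(seq: str) -> float:
    """Percentage of G+C among the A/C/G/T bases of a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and spaces
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("no A/C/G/T bases found")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / acgt


# Toy input; paste the full 1350-base sequence to check the record's value.
print(gc_content("ATGC"))  # 50.0
```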

Upstream 100 bases:

>100_bases
ATAATACAAACCATTATTGATACTGATAGTGCTATTGTTCAAGGAATTCGCTCATAAATTATTGCAATAATAGTTTCTTA
TATTTAAGGAGGAATGACAC

Downstream 100 bases:

>100_bases
TTAAGTAATACCCACAATTATCGCCCACTATCTGGTCCCGAAAAATGGGACCAGCGTTTCATATTATTAAGCATCAGCTT
TCATTCATTTCTTCATCACA

Product: hypothetical protein

Products: NA

Alternate protein names: Salmonella invasion protein D [H]

Number of amino acids: Translated: 449; Mature: 449

Protein sequence:

>449_residues
MINKDNNLNIDILRQQNANIPLQEASSQSEEPHQLGNSKSEGNLHNSASKDNNIGAMLKKDVEKLSTLVSERGPRNEDWK
CQHDIQLNFVKNMLRNFPTENLSEKGSALLHESYLLIKNVLQLTENVNTIDVKLKDTATILTLHNSVRKTLANVAHNTTG
SSAMATTKGTLNHAIASLDSSVPEMRSSATSSTLTASSSNEFKSDKYICSDIADLMSVLGSDYLEIYANSVEIMSAYWQD
FSEHIQSNMGKWTHSNKKGDAIVFDVNALQKALMHFYYIDKYPNGDFHYHYNPDYVLYPPAPADKIGVPLEEAEKWCAAL
GLPVIPPDPKHRTPSPIVEVEPQGSGLYVIIPNPQIIDSMSQSSDSMVHRDDKGKEKNISKEFTGYEISTAEYQAWLAGY
NGQTENMKTDVQVITTKYSTANSTYDTIIKLLSSTITALFDSAKDYLRF
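
The 1350-base gene sequence corresponds to 450 codons: 449 residues followed by the terminating TGA. The sketch below shows how such a translation can be reproduced with the standard genetic code; the table construction and the function name are illustrative assumptions, not the method used to build this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGATTAATAAAGATAATAATTTAAATATA"))  # MINKDNNLNI
print(translate("TTACGTTTCTGA"))                    # LRF
```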

Sequences:

>Translated_449_residues
MINKDNNLNIDILRQQNANIPLQEASSQSEEPHQLGNSKSEGNLHNSASKDNNIGAMLKKDVEKLSTLVSERGPRNEDWK
CQHDIQLNFVKNMLRNFPTENLSEKGSALLHESYLLIKNVLQLTENVNTIDVKLKDTATILTLHNSVRKTLANVAHNTTG
SSAMATTKGTLNHAIASLDSSVPEMRSSATSSTLTASSSNEFKSDKYICSDIADLMSVLGSDYLEIYANSVEIMSAYWQD
FSEHIQSNMGKWTHSNKKGDAIVFDVNALQKALMHFYYIDKYPNGDFHYHYNPDYVLYPPAPADKIGVPLEEAEKWCAAL
GLPVIPPDPKHRTPSPIVEVEPQGSGLYVIIPNPQIIDSMSQSSDSMVHRDDKGKEKNISKEFTGYEISTAEYQAWLAGY
NGQTENMKTDVQVITTKYSTANSTYDTIIKLLSSTITALFDSAKDYLRF
>Mature_449_residues
MINKDNNLNIDILRQQNANIPLQEASSQSEEPHQLGNSKSEGNLHNSASKDNNIGAMLKKDVEKLSTLVSERGPRNEDWK
CQHDIQLNFVKNMLRNFPTENLSEKGSALLHESYLLIKNVLQLTENVNTIDVKLKDTATILTLHNSVRKTLANVAHNTTG
SSAMATTKGTLNHAIASLDSSVPEMRSSATSSTLTASSSNEFKSDKYICSDIADLMSVLGSDYLEIYANSVEIMSAYWQD
FSEHIQSNMGKWTHSNKKGDAIVFDVNALQKALMHFYYIDKYPNGDFHYHYNPDYVLYPPAPADKIGVPLEEAEKWCAAL
GLPVIPPDPKHRTPSPIVEVEPQGSGLYVIIPNPQIIDSMSQSSDSMVHRDDKGKEKNISKEFTGYEISTAEYQAWLAGY
NGQTENMKTDVQVITTKYSTANSTYDTIIKLLSSTITALFDSAKDYLRF

Specific function: Required for translocation of effector proteins via the type III secretion system SPI-1, which is essential for efficient bacterial internalization. Probably acts by modulating the secretion of sipA, sipB, and sipC [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Secreted. Note=Secreted via the type III secretion system 1 (SPI-1 TTSS) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the invasin protein D family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR009483
- InterPro:   IPR013386 [H]

Pfam domain/function: PF06511 IpaD [H]

EC number: NA

Molecular weight: Translated: 50029; Mature: 50029

Theoretical pI: Translated: 5.57; Mature: 5.57

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
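
These percentages are residue counts divided by the protein length; the values above are consistent with 3 Cys and 12 Met in 449 residues. A minimal sketch of the calculation, with an illustrative helper name and a toy sequence:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return 100.0 * hits / len(protein)


toy = "MCAAAAAAAA"  # toy 10-residue sequence, not the record's protein
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```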

Secondary structure:

>Translated Secondary Structure
MINKDNNLNIDILRQQNANIPLQEASSQSEEPHQLGNSKSEGNLHNSASKDNNIGAMLKK
CCCCCCCCCEEEEECCCCCCCHHHCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHH
DVEKLSTLVSERGPRNEDWKCQHDIQLNFVKNMLRNFPTENLSEKGSALLHESYLLIKNV
HHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHCHHHHHHHHHHHHHH
LQLTENVNTIDVKLKDTATILTLHNSVRKTLANVAHNTTGSSAMATTKGTLNHAIASLDS
HHHHCCCCEEEEEEECCEEEEHHHHHHHHHHHHHHCCCCCCCCEEECCHHHHHHHHHHHH
SVPEMRSSATSSTLTASSSNEFKSDKYICSDIADLMSVLGSDYLEIYANSVEIMSAYWQD
HHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH
FSEHIQSNMGKWTHSNKKGDAIVFDVNALQKALMHFYYIDKYPNGDFHYHYNPDYVLYPP
HHHHHHHHCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEECC
APADKIGVPLEEAEKWCAALGLPVIPPDPKHRTPSPIVEVEPQGSGLYVIIPNPQIIDSM
CCCHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCHHHHHH
SQSSDSMVHRDDKGKEKNISKEFTGYEISTAEYQAWLAGYNGQTENMKTDVQVITTKYST
HCCCCCHHCCCCCCCCCCCCHHCCCCEEEHHHHHHHHHCCCCCCCCCCCCEEEEEEECCC
ANSTYDTIIKLLSSTITALFDSAKDYLRF
CCCHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MINKDNNLNIDILRQQNANIPLQEASSQSEEPHQLGNSKSEGNLHNSASKDNNIGAMLKK
CCCCCCCCCEEEEECCCCCCCHHHCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHH
DVEKLSTLVSERGPRNEDWKCQHDIQLNFVKNMLRNFPTENLSEKGSALLHESYLLIKNV
HHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHCHHHHHHHHHHHHHH
LQLTENVNTIDVKLKDTATILTLHNSVRKTLANVAHNTTGSSAMATTKGTLNHAIASLDS
HHHHCCCCEEEEEEECCEEEEHHHHHHHHHHHHHHCCCCCCCCEEECCHHHHHHHHHHHH
SVPEMRSSATSSTLTASSSNEFKSDKYICSDIADLMSVLGSDYLEIYANSVEIMSAYWQD
HHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH
FSEHIQSNMGKWTHSNKKGDAIVFDVNALQKALMHFYYIDKYPNGDFHYHYNPDYVLYPP
HHHHHHHHCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEECC
APADKIGVPLEEAEKWCAALGLPVIPPDPKHRTPSPIVEVEPQGSGLYVIIPNPQIIDSM
CCCHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCHHHHHH
SQSSDSMVHRDDKGKEKNISKEFTGYEISTAEYQAWLAGYNGQTENMKTDVQVITTKYST
HCCCCCHHCCCCCCCCCCCCHHCCCCEEEHHHHHHHHHCCCCCCCCCCCCEEEEEEECCC
ANSTYDTIIKLLSSTITALFDSAKDYLRF
CCCHHHHHHHHHHHHHHHHHHHHHHHHCC
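
Each secondary-structure block above alternates a line of residues with a line of per-residue states. The sketch below pairs those lines and summarises the state composition. It assumes the conventional three-state code (H = helix, E = strand, C = coil), which the record itself does not define, and the function names are illustrative.

```python
from collections import Counter


def parse_ss_block(lines):
    """Join alternating residue/state lines into (sequence, states)."""
    rows = [ln.strip() for ln in lines]
    rows = [ln for ln in rows if ln and not ln.startswith(">")]
    if len(rows) % 2:
        raise ValueError("expected residue/state line pairs")
    seq = "".join(rows[0::2])     # lines 1, 3, 5, ... hold residues
    states = "".join(rows[1::2])  # lines 2, 4, 6, ... hold states
    if len(seq) != len(states):
        raise ValueError("sequence and state strings differ in length")
    return seq, states


def state_percentages(states):
    """Percentage of each of H, E and C in the state string (1 decimal place)."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy block in the same layout as the record.
seq, states = parse_ss_block([">toy", "MINK", "CCHH", "DN", "EE"])
print(seq, states)                # MINKDN CCHHEE
print(state_percentages(states))  # {'H': 33.3, 'E': 33.3, 'C': 33.3}
```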

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8748032; 8522512; 11677609; 10692170 [H]