Definition: Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession: NC_004631
Length: 4,791,961 bp

The map label for this gene is sipB [H]

Identifier: 29143146

GI number: 29143146

Start: 2866971

End: 2868752

Strand: Reverse

Name: sipB [H]

Synonym: t2787

Alternate gene names: 29143146

Gene position: 2868752-2866971 (Counterclockwise)

Preceding gene: 29143147

Following gene: 29143145

Centisome position: 59.87
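
The position fields above are mutually consistent, which a little arithmetic can confirm. The sketch below is illustrative only: the function names are hypothetical, and the assumption that the centisome value is taken from the higher coordinate (the 5' end of a reverse-strand gene) is inferred from the numbers in this record, not from a stated definition.

```python
# Values copied from the record fields above.
GENOME_LENGTH = 4_791_961  # "Length" of NC_004631
START = 2_866_971          # "Start"
END = 2_868_752            # "End"


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))        # 1782
print(centisome(END, GENOME_LENGTH))  # 59.87
```

Both results agree with the record: the gene sequence is labelled 1782 bases and the centisome position is 59.87.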

GC content: 52.3%

Gene sequence:

>1782_bases
ATGGTAAATGACGCAAGTAGCATTAGCCGTAGCGGATATACCCAAAATCCGCGCCTCGCTGAGGCGGCTTTTGAAGGCGT
TCGTAAGAACACGGACTTTTTAAAAGCGGCGGATAAAGCTTTTAAAGATGTGGTGGCAACGAAAGCGGGCGACCTTAAAG
CCGGAACAAAGTCCGGCGAGAGCGCTATTAATACGGTGGGTCTAAAGCCGCCTACGGACGCCGCCCGGGAAAAACTCTCC
AGCGAAGGGCAATTGACATTACTGCTTGGCAAGTTAATGACACTACTGGGCGATGTTTCGCTGTCTCAACTGGAGTCTCG
TCTGGCGGTATGGCAGGCGATGATTGAGTCACAAAAAGAGATGGGGATTCAGGTATCGAAAGAATTCCAGACGGCTCTGG
GAGAGGCTCAGGAGGCGACGGATCTCTATGAAGCCAGCATCAAAAAGACGGATACCGCCAAGAGTGTTTATGACGCTGCG
GCCAAAAAACTGACGCAGGCGCAAAATAAATTGCAATCGCTGGACCCAGCTGACCCCGGCTATGCACAAGCTGAAGCCGC
GGTAGAACAGGCCGGAAAAGAAGCGACAGAGGCGAAAGAGGCCTTAGATAAGGCCACGGATGCGACGGTTAAAGCAGGCA
CAGACGCCAAAGCGAAAGCCGAGAAAGCGGATAACATTCTGACCAAATTCCAGGGAACGGCTAATGCCGCCTCTCAGAAT
CAGGTTTCCCAGGGTGAGCAGGATAATCTGTCAAATGTCGCCCGCCTCACTATGCTCATGGCCATGTTTATTGAGATTGT
GGGCAAAAATACGGAAGAAAGCCTGCAAAACGATCTTGCGCTTTTCAACGCCTTGCAGGAAGGGCGTCAGGCGGAGATGG
AAAAGAAATCGGCTGAATTCCAGGAAGAGACGCGCAAAGCCGAGGAAACGAACCGCATTATGGGATGTATCGGGAAAGTC
CTCGGCGCGCTGCTAACCATTGTCAGCGTTGTGGCCGCTGTTTTTACCGGTGGGGCGAGTCTGGCGCTGGCTGCGGTGGG
ACTTGCGGTAATGGTGGCCGATGAAATTGTGAAGGCGGCGACGGGGGTGTCGTTTATTCAGCAGGCGCTAAACCCGATTA
TGGAGCATGTGCTGAAGCCGTTAATGGAGCTGATTGGCAAGGCGATTACCAAAGCGCTGGAAGGATTAGGCGTCGATAAG
AAAACGGCAGAGATGGCAGGCAGCATTGTTGGTGCGATTGTCGCCGCTATTGCCATGGTAGCGGTCATTGTGGTGGTCGC
AGTTGTCGGGAAAGGCGCGGCGGCGAAACTGGGTAACGCGCTGAGCAAAATGATGGGCGAAACGATTAAGAAGTTGGTGC
CTAACGTGCTGAAACAGTTGGCACAAAACGGCAGCAAACTCTTTACCCAGGGGATGCAACGTATTACTAGCGGCCTGGGT
AATGTGGGTAGCAAGATGGGCCTGCAAACGAATGCCTTAAGTAAAGAGCTGGTAGGTAATACCCTAAATAAAGTGGCGTT
GGGCATGGAAGTCACGAATACCGCAGCCCAGTCAGCCGGTGGGGTTGCCGAGGGGGTATTTATTAAAAATGCCAGCGAGG
CGCTTGCTGATTTTATGCTCGCCCGTTTTGCCATGGATCAGATTCAGCAGTGGCTTAAACAATCCGTAGAAATATTTGGT
GAAAACCAGAAGGTAACGGCGGAACTGCAAAAAGCCATGTCTTCTGCGGTACAGCAAAATGCGGATGCTTCGCGTTTTAT
TCTGCGCCAGAGTCGCGCATAA
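
A GC content figure such as the one reported for this gene can be recomputed from a FASTA-style block like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases in the sequence; `gc_percent` is a hypothetical helper, and the toy input is not taken from the record.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G and C bases in a FASTA-style block (header lines skipped)."""
    bases = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    ).upper()
    if not bases:
        raise ValueError("no sequence data found")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 1)


# Toy example: 6 of 8 bases are G or C.
print(gc_percent(">toy\nATGC\nGGCC\n"))  # 75.0
```

Passing the full `>1782_bases` block to the same function would give the value to compare against the GC content field.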

Upstream 100 bases:

>100_bases
GTACTGAAGATGAGTCTCTGCGGGCAAAAGCGTTGGTCTATCTGGAGGCGCTAAAAACGGCGGAGACAGAGCAGCACAGT
GAACAAGAAAAGGAATAATT

Downstream 100 bases:

>100_bases
AAACTGCCAAAATAAAGGGAGAAAAATATGTTAATTAGTAATGTGGGAATAAATCCCGCCGCTTATTTAAATAATCATTC
TGTTGAGAATAGTTCACAGA

Product: pathogenicity island 1 effector protein

Products: NA

Alternate protein names: Effector protein sipB [H]

Number of amino acids: Translated: 593; Mature: 593
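
The residue count follows from the gene length: 1782 bases make 594 codons, and the last codon (TAA in the gene sequence above) is a stop, leaving 593 residues. The sketch below shows that arithmetic; `protein_length` is a hypothetical helper that assumes a complete coding sequence ending in one stop codon.

```python
def protein_length(cds_bases: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = cds_bases // 3
    # The terminal stop codon does not code for a residue.
    return codons - 1 if has_stop_codon else codons


print(protein_length(1782))  # 593
```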

Protein sequence:

>593_residues
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLS
SEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA
AKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCIGKV
LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDK
KTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVEIFG
ENQKVTAELQKAMSSAVQQNADASRFILRQSRA

Sequences:

>Translated_593_residues
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLS
SEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA
AKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCIGKV
LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDK
KTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVEIFG
ENQKVTAELQKAMSSAVQQNADASRFILRQSRA
>Mature_593_residues
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLS
SEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA
AKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCIGKV
LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDK
KTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVEIFG
ENQKVTAELQKAMSSAVQQNADASRFILRQSRA

Specific function: Required for entry into the host cell through presentation or delivery of sipC at the host cell plasma membrane. Along with sipC, is necessary for the transfer of other effector proteins into the host cell. Induces macrophage apoptosis either by binding a

COG id: NA

COG function: NA

Gene ontology:

Cell location: Secreted. Host cell membrane; Multi-pass membrane protein (By similarity). Note: Secreted via the type III secretion system 1 (SPI-1 TTSS) and inserted into the host cell plasma membrane (By similarity) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the invasin protein B family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR006972
- InterPro:   IPR003895 [H]

Pfam domain/function: PF04888 SseC [H]

EC number: NA

Molecular weight: Translated: 62421 Da; Mature: 62421 Da

Theoretical pI: Translated: 6.27; Mature: 6.27

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2% Cys     (Translated Protein)
3.7% Met     (Translated Protein)
3.9% Cys+Met (Translated Protein)
0.2% Cys     (Mature Protein)
3.7% Met     (Mature Protein)
3.9% Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE
CCCCHHHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH
SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE
HHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH
MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAAKKLTQAQNKLQSLDPADPG
CCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
HHHHHHHHHHHCHHHHHHHHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHH
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF
HHCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH
TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML
HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHH
ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA
HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHCCC
>Mature Secondary Structure
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE
CCCCHHHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH
SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE
HHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH
MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAAKKLTQAQNKLQSLDPADPG
CCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
HHHHHHHHHHHCHHHHHHHHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHH
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF
HHCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH
TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML
HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHH
ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA
HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA