Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

Click here to switch to the map view.

The map label for this gene is stgD [H]

Identifier: 29143956

GI number: 29143956

Start: 3770026

End: 3771093

Strand: Direct

Name: stgD [H]

Synonym: t3662

Alternate gene names: 29143956

Gene position: 3770026-3771093 (Clockwise)

Preceding gene: 29143955

Following gene: 29143958

Centisome position: 78.67

GC content: 46.25

Gene sequence:

>1068_bases
ATGCGCTTATGGACAATAATATTATTTTCATGCTTTATGGTTCTTATTTCGCCTGTTTGCCGGGCTGGCGATGGCATTTG
TCATGCGGTTAAAGGGACTTATATTTATAACCTTAATCTCAATGACGCCCGGATTCCCGCAGAAAAAAATAAAGCCGGAA
CAGAGGTCAGGGATCTGGAGACATTAACATCGTCAGAATCATATAAAGTTGGCTGTAGCTGTTTGACCCATTTTTCATCG
ACGTTTCGTGAAGTCTATTATACCGCTCGTTCGCCGCTGAGTATTGATACCACCAGAAATGGTTACACCTACTATACGCT
TAACGATAACCTAAGTATTGCAACGTCGATTGCTGTTCTTGGACGAGGCTTTATTGCGGTTCCCTTTGAGGCTGAACCTA
ATGTAGTAAAAGATGGGATGAATTGTTATACCGATACCGTTGAGGGCAATCTCGCAACGTTATATACCGGGTCGGAGGTC
AAGGTCTCTTTCCTTATCAATAAACCGTTTGTCGGTCAGGTTGCCATTCCTGGCACGATTGTGGCTAATTTATACGGGGG
GCTTGATGCCAGCTCCTCGACTGCCAGTACCGACAAATTGGCGGAAGTCAGAATTGTTGGCGATATCGTGGCGCCGCAGA
GCTGTGAAATTGATTCCGGCCAGGTGATAGAGGTGAATTTCGGTAAGATCCCGGTTGCGGATTTCTCGACAACGCAAGGC
ACCGCTGCTGCGGGTCATAAAGTGACTAAAACGGTACAGGTGAAATGTACCGGAATGTTGGATGAGAATATCGTCTATTC
GACATTTAATGCTGATCCCGTTGATTCCAGCGCAAATATGATGAAAGTTCTGGGTAATGATGATGTCGGAATTATGATTT
ATGACAAATGGGATCGGATGGTTAAGGTGACAGGCGGAAAAATGGATATGGATATGGGCGTCAATAATGGCGGCGCCGAG
ACTAACTCATTGACTTTTTCTGCCGCGCCTGCCAGCGCGACGGGAGCGCGGCCTCAACCTGGTACATTTGAAGCTTACGC
CACGATCACGCTGGAAATAACGAACTAA

Upstream 100 bases:

>100_bases
ATGAATCAACAACATGTGCCGTTAATTATACGTTAGGTCATAAAACAGAGAATGCTGAACTGGTGCAGCAAGCCGTCACT
TGCCAATAAGGAATTTAATT

Downstream 100 bases:

>100_bases
TCTAATTACAAAATAAGTCATATCAATGAACTACGGCCCTGGCTCAATCCAGGTGGATCGGAAATCAAACCATCCCAGGT
TATTCAGGTGGATCCCCCTG

Product: fimbrial protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 355; Mature: 355

Protein sequence:

>355_residues
MRLWTIILFSCFMVLISPVCRAGDGICHAVKGTYIYNLNLNDARIPAEKNKAGTEVRDLETLTSSESYKVGCSCLTHFSS
TFREVYYTARSPLSIDTTRNGYTYYTLNDNLSIATSIAVLGRGFIAVPFEAEPNVVKDGMNCYTDTVEGNLATLYTGSEV
KVSFLINKPFVGQVAIPGTIVANLYGGLDASSSTASTDKLAEVRIVGDIVAPQSCEIDSGQVIEVNFGKIPVADFSTTQG
TAAAGHKVTKTVQVKCTGMLDENIVYSTFNADPVDSSANMMKVLGNDDVGIMIYDKWDRMVKVTGGKMDMDMGVNNGGAE
TNSLTFSAAPASATGARPQPGTFEAYATITLEITN

Sequences:

>Translated_355_residues
MRLWTIILFSCFMVLISPVCRAGDGICHAVKGTYIYNLNLNDARIPAEKNKAGTEVRDLETLTSSESYKVGCSCLTHFSS
TFREVYYTARSPLSIDTTRNGYTYYTLNDNLSIATSIAVLGRGFIAVPFEAEPNVVKDGMNCYTDTVEGNLATLYTGSEV
KVSFLINKPFVGQVAIPGTIVANLYGGLDASSSTASTDKLAEVRIVGDIVAPQSCEIDSGQVIEVNFGKIPVADFSTTQG
TAAAGHKVTKTVQVKCTGMLDENIVYSTFNADPVDSSANMMKVLGNDDVGIMIYDKWDRMVKVTGGKMDMDMGVNNGGAE
TNSLTFSAAPASATGARPQPGTFEAYATITLEITN
>Mature_355_residues
MRLWTIILFSCFMVLISPVCRAGDGICHAVKGTYIYNLNLNDARIPAEKNKAGTEVRDLETLTSSESYKVGCSCLTHFSS
TFREVYYTARSPLSIDTTRNGYTYYTLNDNLSIATSIAVLGRGFIAVPFEAEPNVVKDGMNCYTDTVEGNLATLYTGSEV
KVSFLINKPFVGQVAIPGTIVANLYGGLDASSSTASTDKLAEVRIVGDIVAPQSCEIDSGQVIEVNFGKIPVADFSTTQG
TAAAGHKVTKTVQVKCTGMLDENIVYSTFNADPVDSSANMMKVLGNDDVGIMIYDKWDRMVKVTGGKMDMDMGVNNGGAE
TNSLTFSAAPASATGARPQPGTFEAYATITLEITN

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Fimbrium (Potential) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the fimbrial protein family [H]

Homologues:

Organism=Escherichia coli, GI1787173, Length=361, Percent_Identity=25.4847645429363, Blast_Score=86, Evalue=5e-18,
Organism=Escherichia coli, GI145693100, Length=273, Percent_Identity=25.2747252747253, Blast_Score=72, Evalue=6e-14,
Organism=Escherichia coli, GI1789534, Length=381, Percent_Identity=26.7716535433071, Blast_Score=69, Evalue=5e-13,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008966
- InterPro:   IPR000259 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 38016; Mature: 38016

Theoretical pI: Translated: 4.63; Mature: 4.63

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.3 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
5.4 %Cys+Met (Translated Protein)
2.3 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
5.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MRLWTIILFSCFMVLISPVCRAGDGICHAVKGTYIYNLNLNDARIPAEKNKAGTEVRDLE
CCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEEECCCCCCCCCCCCCCCCCHHHHH
TLTSSESYKVGCSCLTHFSSTFREVYYTARSPLSIDTTRNGYTYYTLNDNLSIATSIAVL
HHCCCCCEEEHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCEEEEEECCCCCHHHHHHHH
GRGFIAVPFEAEPNVVKDGMNCYTDTVEGNLATLYTGSEVKVSFLINKPFVGQVAIPGTI
CCCEEEEECCCCCCHHHCCCCEEEECCCCCEEEEEECCCEEEEEEECCCCCCEEECCCEE
VANLYGGLDASSSTASTDKLAEVRIVGDIVAPQSCEIDSGQVIEVNFGKIPVADFSTTQG
EEEHHCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCEEEECCCCCCCC
TAAAGHKVTKTVQVKCTGMLDENIVYSTFNADPVDSSANMMKVLGNDDVGIMIYDKWDRM
CCCCCCEEEEEEEEEEEEEECCCEEEEECCCCCCCCCCCEEEEECCCCEEEEEEECCCEE
VKVTGGKMDMDMGVNNGGAETNSLTFSAAPASATGARPQPGTFEAYATITLEITN
EEEECCEEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEEEEEECC
>Mature Secondary Structure
MRLWTIILFSCFMVLISPVCRAGDGICHAVKGTYIYNLNLNDARIPAEKNKAGTEVRDLE
CCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEEECCCCCCCCCCCCCCCCCHHHHH
TLTSSESYKVGCSCLTHFSSTFREVYYTARSPLSIDTTRNGYTYYTLNDNLSIATSIAVL
HHCCCCCEEEHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCEEEEEECCCCCHHHHHHHH
GRGFIAVPFEAEPNVVKDGMNCYTDTVEGNLATLYTGSEVKVSFLINKPFVGQVAIPGTI
CCCEEEEECCCCCCHHHCCCCEEEECCCCCEEEEEECCCEEEEEEECCCCCCEEECCCEE
VANLYGGLDASSSTASTDKLAEVRIVGDIVAPQSCEIDSGQVIEVNFGKIPVADFSTTQG
EEEHHCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCEEEECCCCCCCC
TAAAGHKVTKTVQVKCTGMLDENIVYSTFNADPVDSSANMMKVLGNDDVGIMIYDKWDRM
CCCCCCEEEEEEEEEEEEEECCCEEEEECCCCCCCCCCCEEEEECCCCEEEEEEECCCEE
VKVTGGKMDMDMGVNNGGAETNSLTFSAAPASATGARPQPGTFEAYATITLEITN
EEEECCEEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7721701; 11677609 [H]