Definition: Chlamydia muridarum Nigg, complete genome.
Accession: NC_002620
Length: 1,072,950 bp

The map label for this gene is gspD

Identifier: 15835475

GI number: 15835475

Start: 995109

End: 997388

Strand: Reverse

Name: gspD

Synonym: TC0861

Alternate gene names: 15835475

Gene position: 997388-995109 (Counterclockwise)

Preceding gene: 15835476

Following gene: 15835474

Centisome position: 92.96
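
The gene length and centisome position follow from the coordinates by simple arithmetic. A minimal sketch in Python, assuming (the record does not state this) that the centisome is the gene's start coordinate on its own strand, here 997388 for the reverse strand, expressed as a percentage of the genome length. The function names are illustrative only.

```python
GENOME_LENGTH = 1_072_950  # bp, taken from the Length field of this record


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position on the chromosome as a percentage of the genome length."""
    return 100.0 * position / genome_length


print(gene_length(995109, 997388))   # 2280, matching the ">2280_bases" header
print(round(centisome(997388), 2))   # 92.96, matching the reported value
```

Using the lower coordinate (995109) instead gives about 92.75, which is why the sketch assumes the higher coordinate is the one the record reports.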

GC content: 41.75

Gene sequence:

>2280_bases
GTGAAGAATGTTTTGCGCTATGGTTTCATAGGAGCGTTCTGTTTTGGGAGTTTAGATATTCCGGTGTTTTCCATCACCGT
TGCGGAAAAATTAGCTTCCATAGAAGGAAAAACAGAAGCTCAGGCTCCTCTTGCTCACATTTCTTCCTTTAACTCCGAGT
TAAAGGAAGCGAATGCTCTACTCAAGTCTTTATATGATGAAGCTTTATCTTTACGATCTCTAGGAGAAACTTCTCAAGAA
GTTTGGAACGATTTACGGGATCGTTTGATCAGCGCGAAACAACGGGTGCGGGCGTTAGAAGATCTATGGTCGGCAGAGGT
CTCAGAGAAAGGGGGTGATCCTGAAGATTATGCTCTTTGGAATCATCCTGAGACTACTATTTACAACCTTGTTAGCGATT
ATGGTGATGAACAAAGTATTTACTTGATCCCTCAAAATGTGGGGGCAATGCGAATAACAGCCATGTCAAAGTTGGTTGTG
CCTAAAGAGGGATTTGAAGAGTGTTTATCGTTGCTTTTGGCTCGTTTAGGTATTGGGGTGAGACAGGTGAGCCCCTGGAT
CAAAGAGCTCTATTTAACGAGTAAGGAAGAGACTGGAGTAGTAGGTATCTTTGGAGCTAGACAAGACCTCGATGTTCTGC
CTTCAACCGCTCATATTGCTTTTGTTCTTTCTTCTAAAAATTTAGACGCACGATCTGATGTACAAGCTTTGCGGAAGTTT
GCAAACAGCGATACCATGTTGATTGATTTTATCGGCGGGAAAATTTGGCTGTTTGGAGTGGTTCATGAAATCACTGAGCT
TCTCAAAATTTATGAATTTTTACAGTCAGATAATATTCGACAAGAGCATCGGATAGTATCTTTGTCTAAGATAGACCCTT
TCGAAATGTTAGCTATTTTGAAAGCAGCTTTTCGAGAAGATTTAGCTAAAGAAGGGGAAGATTCTGCGGGCGTAGGATTA
AAAGTTGTTCCCTTGCAGAATCATGGACGCTCTCTTTTCTTAAGTGGAGCTCTTCCTATAGTTCAAAAGGCGATCGATCT
CATTCGCGAGTTGGAAGAAGGAATAGAGAATCCTACAGATAAAACAGTGTTTTGGTATAACGTCAAACACTCAGATCCTC
AAGAGCTTGCAGCTTTGCTCTCTCAAGTTCATGATATTTTTTCAAGCGGTTCGGGGATAGCGGGAAGTCAGGATACTAGC
GTATCTGCTAATAAGTCTGGGGCAGCCTCGAATGGATTAGCTGTGCAGATAGATACGTCTATCGGGGGAACCTCGAAGGA
AGGCTCCACCAAATATGGGAGCTTTATTGCCGACTCTAAGACCGGAACTTTGATTATGGTCATTGAGAAAGAGGCTCTTC
CAAAAATTAAGATGTTATTGAAAAAACTCGATGTGCCGAAAAAAATGGTTCGTATAGAGGTCTTGCTTTTCGAAAGAAAA
CTATCGAGCCAGCGTAAATCGGGGTTAAATCTATTACGTTTAGGAGAAGAGGTTTGTAAACAGGGGACTCAAGCAGTTTC
TTGGGCAAATGGTGGAATTTTAGAATTCTTGTTTAAGGGAGGAGCAAAGGGGATTGTCCCTAGTTATGACTTTGCTTATC
AATTCCTTATGGCTCAAGAAGATGTGCGCATTAATGCGAGCCCTTCTGTAGTAACGATGAACCAAACCCCAGCTAGAATT
GCGATTGTGGAAGAAATGTCGATAGCAGTTTCTTCAGAGAAAGACAAAGCGCAATATAATCGAGCTCAGTACGGGATTAT
GATTAAGATCCTGCCGGTAATTAATATTGGGGAGGAAGATGGGAAGAGTTTCATCACTCTAGAAACAGATATTACGTTTG
ATTCAACAGGGAAAAATCAAGCAGATCGTCCTGATGTTACTCGAAGAAATATTACAAATAAGGTGCGAATTCAGGATGGG
GAAACTGTAATCATTGGGGGATTGCGGTGTAATCAAACTATGGATTCTCGAGATGGGATTCCTTTCTTAGGAGAATTGCC
AGGAATAGGCAAATTATTTGGCATGGATGCCACTTCGGATTCACAGACAGAGATGTTTATGTTTATTACTCCAAAGATTT
TGGATAATCCTGTTGAGGAAGAAGAAAAGTTGGAATGTGCCTTTCTAGCTTCTCGTCCTGGTGAGAATGAGGATTTTCTA
AGAGCTGTAGTGTCAGGTCAGCAGGCTGCTAAACAGGCTATGGAGAAAAAAGAGTCTATCGCATGGAGAGAAGAAACTCA
TAGTTTGCGGGAAGGAGTGGAGTACGATGGCCGAGAATAA
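
The GC content field can be recomputed from the gene sequence above. The following is a small sketch, assuming GC content is defined as the percentage of G and C bases over the whole sequence; how the source database treats ambiguous bases is not stated, so `gc_percent` is a hypothetical helper rather than the database's own method.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a wrapped FASTA body
    can be pasted in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence: four G, no C.
print(gc_percent("GTGAAGAATG"))  # 40.0
```

Passing the full 2280-base sequence to the same function gives a value to compare against the reported 41.75.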

Upstream 100 bases:

>100_bases
CTCCTTGCCACTAAAGCAGGTAAGTGAGATGGCGCCGCAGGTGCATGTAGTAGGACGGGAGAGCGGGGATAAAAATGCTC
GTGTAGGGGGAGGATCTTAA

Downstream 100 bases:

>100_bases
GTGTTCCATGCAAGATCTTTTAGATCGCCTTCCTTACTCCTTTTTAAAGAAAAATTATCTGCTGCCTATAGATGACTTAG
GGGATAAGATCGTATTGGCG

Product: general secretion pathway protein D

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 759; Mature: 759

Protein sequence:

>759_residues
MKNVLRYGFIGAFCFGSLDIPVFSITVAEKLASIEGKTEAQAPLAHISSFNSELKEANALLKSLYDEALSLRSLGETSQE
VWNDLRDRLISAKQRVRALEDLWSAEVSEKGGDPEDYALWNHPETTIYNLVSDYGDEQSIYLIPQNVGAMRITAMSKLVV
PKEGFEECLSLLLARLGIGVRQVSPWIKELYLTSKEETGVVGIFGARQDLDVLPSTAHIAFVLSSKNLDARSDVQALRKF
ANSDTMLIDFIGGKIWLFGVVHEITELLKIYEFLQSDNIRQEHRIVSLSKIDPFEMLAILKAAFREDLAKEGEDSAGVGL
KVVPLQNHGRSLFLSGALPIVQKAIDLIRELEEGIENPTDKTVFWYNVKHSDPQELAALLSQVHDIFSSGSGIAGSQDTS
VSANKSGAASNGLAVQIDTSIGGTSKEGSTKYGSFIADSKTGTLIMVIEKEALPKIKMLLKKLDVPKKMVRIEVLLFERK
LSSQRKSGLNLLRLGEEVCKQGTQAVSWANGGILEFLFKGGAKGIVPSYDFAYQFLMAQEDVRINASPSVVTMNQTPARI
AIVEEMSIAVSSEKDKAQYNRAQYGIMIKILPVINIGEEDGKSFITLETDITFDSTGKNQADRPDVTRRNITNKVRIQDG
ETVIIGGLRCNQTMDSRDGIPFLGELPGIGKLFGMDATSDSQTEMFMFITPKILDNPVEEEEKLECAFLASRPGENEDFL
RAVVSGQQAAKQAMEKKESIAWREETHSLREGVEYDGRE
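
The gene sequence begins with GTG while the protein begins with M: 2280 bases give 760 codons, that is, 759 residues plus a stop codon. A minimal translation sketch follows, assuming the standard codon assignments and assuming that the first codon is always read as methionine at initiation, as is usual for bacterial alternative start codons. `translate_cds` is an illustrative name, not a function of any particular library.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading the start codon as Met.

    Translation stops at the first in-frame stop codon.
    """
    cds = "".join(cds.split()).upper()
    if not cds or len(cds) % 3:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    protein = ["M"]  # assumption: initiator codon (ATG, GTG, TTG) gives Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene, followed by a stop codon.
print(translate_cds("GTGAAGAATGTTTTGCGCTATGGTTTCATATAA"))  # MKNVLRYGFI
```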

Sequences:

>Translated_759_residues
MKNVLRYGFIGAFCFGSLDIPVFSITVAEKLASIEGKTEAQAPLAHISSFNSELKEANALLKSLYDEALSLRSLGETSQE
VWNDLRDRLISAKQRVRALEDLWSAEVSEKGGDPEDYALWNHPETTIYNLVSDYGDEQSIYLIPQNVGAMRITAMSKLVV
PKEGFEECLSLLLARLGIGVRQVSPWIKELYLTSKEETGVVGIFGARQDLDVLPSTAHIAFVLSSKNLDARSDVQALRKF
ANSDTMLIDFIGGKIWLFGVVHEITELLKIYEFLQSDNIRQEHRIVSLSKIDPFEMLAILKAAFREDLAKEGEDSAGVGL
KVVPLQNHGRSLFLSGALPIVQKAIDLIRELEEGIENPTDKTVFWYNVKHSDPQELAALLSQVHDIFSSGSGIAGSQDTS
VSANKSGAASNGLAVQIDTSIGGTSKEGSTKYGSFIADSKTGTLIMVIEKEALPKIKMLLKKLDVPKKMVRIEVLLFERK
LSSQRKSGLNLLRLGEEVCKQGTQAVSWANGGILEFLFKGGAKGIVPSYDFAYQFLMAQEDVRINASPSVVTMNQTPARI
AIVEEMSIAVSSEKDKAQYNRAQYGIMIKILPVINIGEEDGKSFITLETDITFDSTGKNQADRPDVTRRNITNKVRIQDG
ETVIIGGLRCNQTMDSRDGIPFLGELPGIGKLFGMDATSDSQTEMFMFITPKILDNPVEEEEKLECAFLASRPGENEDFL
RAVVSGQQAAKQAMEKKESIAWREETHSLREGVEYDGRE
>Mature_759_residues
MKNVLRYGFIGAFCFGSLDIPVFSITVAEKLASIEGKTEAQAPLAHISSFNSELKEANALLKSLYDEALSLRSLGETSQE
VWNDLRDRLISAKQRVRALEDLWSAEVSEKGGDPEDYALWNHPETTIYNLVSDYGDEQSIYLIPQNVGAMRITAMSKLVV
PKEGFEECLSLLLARLGIGVRQVSPWIKELYLTSKEETGVVGIFGARQDLDVLPSTAHIAFVLSSKNLDARSDVQALRKF
ANSDTMLIDFIGGKIWLFGVVHEITELLKIYEFLQSDNIRQEHRIVSLSKIDPFEMLAILKAAFREDLAKEGEDSAGVGL
KVVPLQNHGRSLFLSGALPIVQKAIDLIRELEEGIENPTDKTVFWYNVKHSDPQELAALLSQVHDIFSSGSGIAGSQDTS
VSANKSGAASNGLAVQIDTSIGGTSKEGSTKYGSFIADSKTGTLIMVIEKEALPKIKMLLKKLDVPKKMVRIEVLLFERK
LSSQRKSGLNLLRLGEEVCKQGTQAVSWANGGILEFLFKGGAKGIVPSYDFAYQFLMAQEDVRINASPSVVTMNQTPARI
AIVEEMSIAVSSEKDKAQYNRAQYGIMIKILPVINIGEEDGKSFITLETDITFDSTGKNQADRPDVTRRNITNKVRIQDG
ETVIIGGLRCNQTMDSRDGIPFLGELPGIGKLFGMDATSDSQTEMFMFITPKILDNPVEEEEKLECAFLASRPGENEDFL
RAVVSGQQAAKQAMEKKESIAWREETHSLREGVEYDGRE

Specific function: Essential for the formation of pili. Involved in the biogenesis of type 4 fimbriae, probably by serving as a "porthole" that allows passage of the fimbriae through the outer membrane [H]

COG id: COG1450

COG function: function code NU; Type II secretory pathway, component PulD

Gene ontology:

Cell location: Cell outer membrane (Probable) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the GSP D family [H]

Homologues:

Organism=Escherichia coli, GI87082242, Length=302, Percent_Identity=22.8476821192053, Blast_Score=79, Evalue=1e-15
Organism=Escherichia coli, GI1789793, Length=286, Percent_Identity=23.0769230769231, Blast_Score=69, Evalue=1e-12
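
Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier. A small parsing sketch, assuming this layout holds for every entry; `parse_hit` and the `"GI"` dictionary key are hypothetical names chosen for illustration.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values.

    A trailing comma, if present, is tolerated.
    """
    hit = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare identifier such as GI87082242
    for key in ("Length", "Blast_Score"):
        if key in hit:
            hit[key] = int(hit[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in hit:
            hit[key] = float(hit[key])
    return hit


hit = parse_hit(
    "Organism=Escherichia coli, GI87082242, Length=302, "
    "Percent_Identity=22.8476821192053, Blast_Score=79, Evalue=1e-15"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Rounding the identity on output, rather than in the stored value, keeps the original precision available for later filtering.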

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001775
- InterPro:   IPR005644
- InterPro:   IPR013355
- InterPro:   IPR011662
- InterPro:   IPR004846
- InterPro:   IPR004845 [H]

Pfam domain/function: PF00263 Secretin; PF03958 Secretin_N; PF07660 STN [H]

EC number: NA

Molecular weight: Translated: 83773; Mature: 83773
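
A molecular weight of this kind is typically the sum of average residue masses plus one water molecule for the free termini. The sketch below makes that assumption; the record does not say which mass table or rounding the database used, so a recomputed value may differ slightly from 83773. `molecular_weight` is an illustrative helper.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(protein: str) -> float:
    """Average molecular weight in daltons of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Free glycine: one residue plus water.
print(round(molecular_weight("G"), 2))  # 75.07
```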

Theoretical pI: Translated: 4.85; Mature: 4.85

Prosite motif: PS00307 LECTIN_LEGUME_BETA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
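
The Cys/Met percentages are residue counts divided by the protein length. A brief sketch under that assumption; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


# First 20 residues of the protein: one Met and one Cys.
fragment = "MKNVLRYGFIGAFCFGSLDI"
print(residue_percent(fragment, "C"))   # 5.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 10.0
```

Applied to the full 759-residue sequence and rounded to one decimal place, the same function gives values to compare with the table above.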

Secondary structure:

>Translated Secondary Structure
MKNVLRYGFIGAFCFGSLDIPVFSITVAEKLASIEGKTEAQAPLAHISSFNSELKEANAL
CCCHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH
LKSLYDEALSLRSLGETSQEVWNDLRDRLISAKQRVRALEDLWSAEVSEKGGDPEDYALW
HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
NHPETTIYNLVSDYGDEQSIYLIPQNVGAMRITAMSKLVVPKEGFEECLSLLLARLGIGV
CCCHHHHHHHHHHCCCCCEEEEEECCCCCEEEHHHHHHCCCHHHHHHHHHHHHHHHCCCH
RQVSPWIKELYLTSKEETGVVGIFGARQDLDVLPSTAHIAFVLSSKNLDARSDVQALRKF
HHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHH
ANSDTMLIDFIGGKIWLFGVVHEITELLKIYEFLQSDNIRQEHRIVSLSKIDPFEMLAIL
CCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHEEEECCCCHHHHHHHH
KAAFREDLAKEGEDSAGVGLKVVPLQNHGRSLFLSGALPIVQKAIDLIRELEEGIENPTD
HHHHHHHHHHCCCCCCCCCEEEEEECCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCC
KTVFWYNVKHSDPQELAALLSQVHDIFSSGSGIAGSQDTSVSANKSGAASNGLAVQIDTS
CEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECC
IGGTSKEGSTKYGSFIADSKTGTLIMVIEKEALPKIKMLLKKLDVPKKMVRIEVLLFERK
CCCCCCCCCCCCCCEEECCCCCEEEEEEECCCCHHHHHHHHHHCCCHHHHHHHHHHHHHH
LSSQRKSGLNLLRLGEEVCKQGTQAVSWANGGILEFLFKGGAKGIVPSYDFAYQFLMAQE
HHHHHHHCCHHHHHHHHHHHCCCCEEECCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHCC
DVRINASPSVVTMNQTPARIAIVEEMSIAVSSEKDKAQYNRAQYGIMIKILPVINIGEED
CEEECCCCCEEEECCCCCEEEEEEHHHHEECCCHHHHHHHHHHCCEEEEEEEEEECCCCC
GKSFITLETDITFDSTGKNQADRPDVTRRNITNKVRIQDGETVIIGGLRCNQTMDSRDGI
CCEEEEEEEEEEECCCCCCCCCCCCCHHHCCCCEEEEECCCEEEEECEEECCCCCCCCCC
PFLGELPGIGKLFGMDATSDSQTEMFMFITPKILDNPVEEEEKLECAFLASRPGENEDFL
CCCCCCCCCCHHCCCCCCCCCCCEEEEEECHHHHCCCCCHHHHCEEEEEECCCCCCHHHH
RAVVSGQQAAKQAMEKKESIAWREETHSLREGVEYDGRE
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
>Mature Secondary Structure
MKNVLRYGFIGAFCFGSLDIPVFSITVAEKLASIEGKTEAQAPLAHISSFNSELKEANAL
CCCHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH
LKSLYDEALSLRSLGETSQEVWNDLRDRLISAKQRVRALEDLWSAEVSEKGGDPEDYALW
HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
NHPETTIYNLVSDYGDEQSIYLIPQNVGAMRITAMSKLVVPKEGFEECLSLLLARLGIGV
CCCHHHHHHHHHHCCCCCEEEEEECCCCCEEEHHHHHHCCCHHHHHHHHHHHHHHHCCCH
RQVSPWIKELYLTSKEETGVVGIFGARQDLDVLPSTAHIAFVLSSKNLDARSDVQALRKF
HHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHH
ANSDTMLIDFIGGKIWLFGVVHEITELLKIYEFLQSDNIRQEHRIVSLSKIDPFEMLAIL
CCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHEEEECCCCHHHHHHHH
KAAFREDLAKEGEDSAGVGLKVVPLQNHGRSLFLSGALPIVQKAIDLIRELEEGIENPTD
HHHHHHHHHHCCCCCCCCCEEEEEECCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCC
KTVFWYNVKHSDPQELAALLSQVHDIFSSGSGIAGSQDTSVSANKSGAASNGLAVQIDTS
CEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECC
IGGTSKEGSTKYGSFIADSKTGTLIMVIEKEALPKIKMLLKKLDVPKKMVRIEVLLFERK
CCCCCCCCCCCCCCEEECCCCCEEEEEEECCCCHHHHHHHHHHCCCHHHHHHHHHHHHHH
LSSQRKSGLNLLRLGEEVCKQGTQAVSWANGGILEFLFKGGAKGIVPSYDFAYQFLMAQE
HHHHHHHCCHHHHHHHHHHHCCCCEEECCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHCC
DVRINASPSVVTMNQTPARIAIVEEMSIAVSSEKDKAQYNRAQYGIMIKILPVINIGEED
CEEECCCCCEEEECCCCCEEEEEEHHHHEECCCHHHHHHHHHHCCEEEEEEEEEECCCCC
GKSFITLETDITFDSTGKNQADRPDVTRRNITNKVRIQDGETVIIGGLRCNQTMDSRDGI
CCEEEEEEEEEEECCCCCCCCCCCCCHHHCCCCEEEEECCCEEEEECEEECCCCCCCCCC
PFLGELPGIGKLFGMDATSDSQTEMFMFITPKILDNPVEEEEKLECAFLASRPGENEDFL
CCCCCCCCCCHHCCCCCCCCCCCEEEEEECHHHHCCCCCHHHHCEEEEEECCCCCCHHHH
RAVVSGQQAAKQAMEKKESIAWREETHSLREGVEYDGRE
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
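
The secondary structure blocks interleave residue lines with state lines of equal length. The sketch below separates the two and summarizes the state composition. It assumes the common convention H = helix, E = strand, C = coil, which this record does not define; both function names are illustrative.

```python
from collections import Counter


def split_interleaved(lines: list) -> tuple:
    """Split alternating residue/state lines into two joined strings."""
    sequence = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(sequence) != len(states):
        raise ValueError("residue and state lines differ in length")
    return sequence, states


def state_fractions(states: str) -> dict:
    """Fraction of positions assigned to each of the states H, E and C."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: counts[s] / len(states) for s in "HEC"}


# Toy input in the same layout as the blocks above.
seq, ss = split_interleaved(["MKNV", "CCCH", "LRYG", "HHHH"])
print(seq, ss)              # MKNVLRYG CCCHHHHH
print(state_fractions(ss))  # {'H': 0.625, 'E': 0.0, 'C': 0.375}
```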

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 7901733; 10984043 [H]