| Definition | Streptococcus pneumoniae D39, complete genome. |
|---|---|
| Accession | NC_008533 |
| Length | 2,046,115 |
The map label for this gene is pfbA
Identifier: 116515361
GI number: 116515361
Start: 1630597
End: 1632723
Strand: Reverse
Name: pfbA
Synonym: SPD_1617
Alternate gene names: 116515361
Gene position: 1632723-1630597 (Counterclockwise)
Preceding gene: 116516120
Following gene: 116516072
Centisome position: 79.8
GC content: 31.55
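The coordinate fields above are related by simple arithmetic. The following is a minimal sketch and not the database's own code. The function names are hypothetical. It assumes that the centisome is taken from the first coordinate listed under "Gene position" (the larger one for this reverse-strand gene) and that the final codon is a stop codon.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_046_115          # "Length" of NC_008533
START, END = 1_630_597, 1_632_723  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to one decimal."""
    return round(100.0 * position / genome_length, 1)


def residue_count(length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START, END)
print(length_bp)                      # 2127
print(centisome(END, GENOME_LENGTH))  # 79.8
print(residue_count(length_bp))       # 708
```

Under these assumptions the results agree with the 2127-base gene sequence, the centisome position of 79.8, and the 708 amino acids listed in this record.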
Gene sequence:
>2127_bases ATGAAATATTTTGTTCCTAATGAGGTATTCAGTATTCGTAAATTAAAGGTGGGGACTTGCTCGGTACTATTGGCAATTTC AATTTTGGGAAGCCAAGGTATTTTATCGGATGAAGTTGTTACTAGTTCTTCACCGATGGCTACAAAAGAGTCTTCTAATG CAATTACTAATGATTTAGATAATTCACCAACTGTTAATCAGAATCGTTCTGCTGAAATGATTGCCTCTAATTCAACCACT AATGGTTTAGATAATTCGTTAAGTGTTAATAGTATCAGCTCTAATGGTACTATTCGTTCCAATTCACAATTAGACAACAG AACAGTTGAATCTACAGTAACATCTACTAATGAAAATAAGAGTTATAAGGAAGATGTTATAAGTGACAGAATTATCAAAA AAGAATTTGAAGATACTGCTTTAAGTGTAAAAGATTATGGTGCGGTAGGTGATGGGATTCATGATGATCGACAAGCAATT CAAGATGCAATAGATGCTGCAGCTCAAGGGCTAGGTGGAGGAAATGTATATTTTCCTGAAGGAACTTATTTAGTAAAAGA AATTGTTTTTTTAAAAAGTCATACACACTTAGAATTGAATGAGAAAGCTACAATTCTAAATGGTATAAATATTAAGAATC ACCCTTCCATTGTTTTTATGACAGGTTTATTTACGGATGATGGTGCGCAAGTAGAATGGGGCCCAACAGAAGATATTAGT TATTCTGGTGGTACGATTGATATGAACGGTGCTTTGAATGAAGAAGGAACTAAAGCAAAAAATCTACCACTTATAAATTC TTCAGGTGCATTTGCTATTGGGAATTCAAATAACGTAACTATAAAAAATGTAACATTCAAGGATAGTTATCAAGGGCATG CTATTCAAATTGCAGGTTCGAAAAATGTATTAGTTGATAATTCTCGTTTTCTTGGGCAAGCCTTACCCAAAACGATGAAG GATGGGCAAATCATAAGTAAGGAGAGCATTCAGATTGAACCATTAACTAGAAAAGGTTTTCCTTATGCCTTGAATGATGA TGGGAAAAAATCTGAAAATGTGACTATTCAAAATTCCTATTTTGGCAAAAGTGATAAATCTGGGGAATTAGTAACAGCAA TTGGCACACACTATCAAACATTGTCGACACAGAACCCCTCTAATATTAAAATTTTAAATAATCATTTTGATAACATGATG TATGCAGGTGTACGTTTTACAGGATTCACTGATGTATTAATCAAAGGAAATCGCTTTGATAAGAAAGTTAAAGGAGAGAG TGTACATTATCGAGAAAGCGGAGCAGCTTTAGTAAATGCTTATAGCTATAAAAACACTAAAGACCTATTAGATTTAAATA AACAGGTGGTTATCGCCGAAAATATATTTAATATTGCCGATCCTAAAACAAAAGCGATACGAGTTGCAAAAGATAGTGCA GAATATTTAGGAAAAGTATCAGATATTACTGTAACAAAAAATGTAATTAATAATAATTCTAAGGAAACAGAACAACCAAA TATTGAATTATTACGAGTTAGTGATAATTTAGTAGTGTCAGAGAATAGTATATTCGGTGGTAAAGAAGGAATTGTTATTG AGGATTCAAAGGGTAAAATAACCGTTTTAAATAACCAATTTTATAATTTATCCGGTAAGTATATATCATTCATCAAATCT AATGCAAACGGGAAAGAACCTGTTATACGTGATAGCGATGGTAATTTCAATATTGTAACGGAGAATGGGCTTTACAAAAT TGTAACAAATAATTTAAGTGATAAAAACGAAAAAGAAAAAAACAAAGAGGAAAAACAATCTAATTCAAATAATGTAATTG ATAGTAACCAGAAGAACGGAGAGTTTAACTCAAGTAAAGATAATAGACAAATGAATGACAAGATCGACAATAAACAAGAT 
AATAAGACAGAAGAAGTAAACTATAAAATAGTTGGAGATGGCAGAGAAACTGAAAATCATATTAATAAATCTAAAGAAAT AGTAGATGTAAAACAAAAATTACCAAAGACTGGTTCGAACAAGATTATGGAACTATTCTTAACAGTGACAGGAATTGGTT TACTTTTGACACTAAAAGGGTTGAAGTATTATGGTAAAGATAAATAA
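The GC content field can in principle be recomputed from the gene sequence above. The sketch below assumes the record layout used here: a ">" header token followed by the sequence in space-separated chunks. The function name `gc_content` is hypothetical, and whether the database counts ambiguous bases the same way is not known.

```python
def gc_content(record_text: str) -> float:
    """Return the percentage of G and C bases in a flattened FASTA-like record."""
    # Drop header tokens (those starting with '>') and join the sequence chunks.
    chunks = [tok for tok in record_text.split() if not tok.startswith(">")]
    sequence = "".join(chunks).upper()
    if not sequence:
        raise ValueError("no sequence found")
    gc = sequence.count("G") + sequence.count("C")
    return round(100.0 * gc / len(sequence), 2)


# Toy input in the same layout as the record (not the real gene).
print(gc_content(">8_bases ATGC ATGC"))  # 50.0
```

Passing the full text of the "Gene sequence" entry, including the ">2127_bases" header, yields a percentage that can be compared with the value reported under "GC content".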
Upstream 100 bases:
>100_bases GGAAAATATATGATAAAGATAATGACAGCGGTGTCATTCTATCTATTTTAAGAAAAGTAATAATCAATTGTTAAAAATAG TAAAAAAATTGGAGGTTCTG
Downstream 100 bases:
>100_bases AATTTGTTCGATACAAGGAAGTTCCGTAGAAAATGAGGATATTGTAGGTTCACAAAATCAGTATTTTTGGATTATTGGTG GAGCTACAGATTTATATAAT
Product: cell wall surface anchor family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 708; Mature: 708
Protein sequence:
>708_residues MKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITNDLDNSPTVNQNRSAEMIASNSTT NGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDTALSVKDYGAVGDGIHDDRQAI QDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVFMTGLFTDDGAQVEWGPTEDIS YSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAGSKNVLVDNSRFLGQALPKTMK DGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHYQTLSTQNPSNIKILNNHFDNMM YAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVIAENIFNIADPKTKAIRVAKDSA EYLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENSIFGGKEGIVIEDSKGKITVLNNQFYNLSGKYISFIKS NANGKEPVIRDSDGNFNIVTENGLYKIVTNNLSDKNEKEKNKEEKQSNSNNVIDSNQKNGEFNSSKDNRQMNDKIDNKQD NKTEEVNYKIVGDGRETENHINKSKEIVDVKQKLPKTGSNKIMELFLTVTGIGLLLTLKGLKYYGKDK
Sequences:
>Translated_708_residues MKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITNDLDNSPTVNQNRSAEMIASNSTT NGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDTALSVKDYGAVGDGIHDDRQAI QDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVFMTGLFTDDGAQVEWGPTEDIS YSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAGSKNVLVDNSRFLGQALPKTMK DGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHYQTLSTQNPSNIKILNNHFDNMM YAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVIAENIFNIADPKTKAIRVAKDSA EYLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENSIFGGKEGIVIEDSKGKITVLNNQFYNLSGKYISFIKS NANGKEPVIRDSDGNFNIVTENGLYKIVTNNLSDKNEKEKNKEEKQSNSNNVIDSNQKNGEFNSSKDNRQMNDKIDNKQD NKTEEVNYKIVGDGRETENHINKSKEIVDVKQKLPKTGSNKIMELFLTVTGIGLLLTLKGLKYYGKDK
>Mature_708_residues MKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITNDLDNSPTVNQNRSAEMIASNSTT NGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDTALSVKDYGAVGDGIHDDRQAI QDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVFMTGLFTDDGAQVEWGPTEDIS YSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAGSKNVLVDNSRFLGQALPKTMK DGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHYQTLSTQNPSNIKILNNHFDNMM YAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVIAENIFNIADPKTKAIRVAKDSA EYLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENSIFGGKEGIVIEDSKGKITVLNNQFYNLSGKYISFIKS NANGKEPVIRDSDGNFNIVTENGLYKIVTNNLSDKNEKEKNKEEKQSNSNNVIDSNQKNGEFNSSKDNRQMNDKIDNKQD NKTEEVNYKIVGDGRETENHINKSKEIVDVKQKLPKTGSNKIMELFLTVTGIGLLLTLKGLKYYGKDK
Specific function: Acts as a fibronectin-dependent adhesin and invasin. Binds host (in this case human) fibronectin, plasmin, plasminogen, and human serum albumin. Where the bacteria adhere to human cells there is major recruitment of microvilli which seem to fuse to cover
COG id: COG5434
COG function: function code M; Endopolygalacturonase
Gene ontology:
Cell location: Secreted, cell wall; Peptidoglycan-anchor (Probable)
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 5 PbH1 repeats
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): PFBA_STRR6 (Q8CYC9)
Other databases:
- EMBL: AE007317
- PIR: B98078
- RefSeq: NP_359244.1
- ProteinModelPortal: Q8CYC9
- SMR: Q8CYC9
- STRING: Q8CYC9
- EnsemblBacteria: EBSTRT00000014826
- GeneID: 933108
- GenomeReviews: AE007317_GR
- KEGG: spr:spr1652
- eggNOG: COG5434
- GeneTree: EBGT00050000029768
- HOGENOM: HBG702072
- OMA: VHYRENG
- ProtClustDB: CLSK560218
- BioCyc: SPNE171101:SPR1652-MONOMER
- GO: GO:0016020
- GO: GO:0009405
- InterPro: IPR005877
- InterPro: IPR006626
- InterPro: IPR012334
- InterPro: IPR011050
- Gene3D: G3DSA:2.160.20.10
- SMART: SM00710
- TIGRFAMs: TIGR01168
Pfam domain/function: PF04650 YSIRK_signal; SSF51126 Pectin_lyas_like
EC number: NA
Molecular weight: Translated: 77721; Mature: 77721
Theoretical pI: Translated: 6.46; Mature: 6.46
Prosite motif: PS50847 GRAM_POS_ANCHORING
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.1 %Cys (Translated Protein)
1.4 %Met (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.1 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
1.6 %Cys+Met (Mature Protein)
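The Cys/Met percentages are residue counts divided by the protein length. The sketch below shows that calculation under the assumption that values are rounded to one decimal place. The function name `residue_percent` is hypothetical and this is not the database's own code. Because each figure is rounded separately, the combined Cys+Met value need not equal the sum of the two rounded parts.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    # Remove the spaces used to chunk the sequence in the record.
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty protein sequence")
    count = sum(sequence.count(r) for r in residues.upper())
    return round(100.0 * count / len(sequence), 1)


# Toy sequence (not the real protein): 1 Cys and 1 Met in 4 residues.
toy = "MKCA"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```

Applied to the 708-residue sequence listed under "Protein sequence", the same function gives figures that can be compared with those reported here.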
Secondary structure:
>Translated Secondary Structure
MKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITNDLD
CCCCCCCCCEEEEEEECCHHHHHHHHHHHCCCCCCCCCEECCCCCCCCCCCCCCHHCCCC
NSPTVNQNRSAEMIASNSTTNGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENK
CCCCCCCCCCCEEEECCCCCCCCCCCEEEEEECCCCEEECCCCCCCCCHHHHHCCCCCCC
SYKEDVISDRIIKKEFEDTALSVKDYGAVGDGIHDDRQAIQDAIDAAAQGLGGGNVYFPE
HHHHHHHHHHHHHHHHHHHEEEHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEECCC
GTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVFMTGLFTDDGAQVEWGPTEDIS
CHHHHHHHHHHHCCCEEEECCCEEEEECCCCCCCCCEEEEEEEECCCCCEEECCCCCCCC
YSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAGS
CCCCEEEECCCCCCCCCCCCCCCEEECCCCEEECCCCCEEEEEEEECCCCCCCEEEEECC
KNVLVDNSRFLGQALPKTMKDGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSY
CCEEEECCHHHHHHHHHHHCCCCEEECCCEEEEEECCCCCCEEECCCCCCCCCEEEEECC
FGKSDKSGELVTAIGTHYQTLSTQNPSNIKILNNHFDNMMYAGVRFTGFTDVLIKGNRFD
CCCCCCCCCEEEEECCCEEEECCCCCCCEEEEECCCCCEEEECEEECCCEEEEEECCCCC
KKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVIAENIFNIADPKTKAIRVAKDSA
CCCCCCCEEEECCCCEEEEEECCCCCHHHHCCCCCEEEEECHHCCCCCCCEEEEEECCCH
EYLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENSIFGGKEGIVIEDSKGKI
HHCCCCCCEEEEHHHHCCCCCCCCCCCEEEEEECCCEEEECCCCCCCCCCEEEECCCCEE
TVLNNQFYNLSGKYISFIKSNANGKEPVIRDSDGNFNIVTENGLYKIVTNNLSDKNEKEK
EEEECCEEECCCCEEEEEECCCCCCCCEEECCCCCEEEEECCCEEEEEECCCCCCCHHHH
NKEEKQSNSNNVIDSNQKNGEFNSSKDNRQMNDKIDNKQDNKTEEVNYKIVGDGRETENH
HHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCEEEEEEEEECCCCHHHH
INKSKEIVDVKQKLPKTGSNKIMELFLTVTGIGLLLTLKGLKYYGKDK
HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHEEHHHHHHHCCCCC
>Mature Secondary Structure
MKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITNDLD
CCCCCCCCCEEEEEEECCHHHHHHHHHHHCCCCCCCCCEECCCCCCCCCCCCCCHHCCCC
NSPTVNQNRSAEMIASNSTTNGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENK
CCCCCCCCCCCEEEECCCCCCCCCCCEEEEEECCCCEEECCCCCCCCCHHHHHCCCCCCC
SYKEDVISDRIIKKEFEDTALSVKDYGAVGDGIHDDRQAIQDAIDAAAQGLGGGNVYFPE
HHHHHHHHHHHHHHHHHHHEEEHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEECCC
GTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVFMTGLFTDDGAQVEWGPTEDIS
CHHHHHHHHHHHCCCEEEECCCEEEEECCCCCCCCCEEEEEEEECCCCCEEECCCCCCCC
YSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAGS
CCCCEEEECCCCCCCCCCCCCCCEEECCCCEEECCCCCEEEEEEEECCCCCCCEEEEECC
KNVLVDNSRFLGQALPKTMKDGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSY
CCEEEECCHHHHHHHHHHHCCCCEEECCCEEEEEECCCCCCEEECCCCCCCCCEEEEECC
FGKSDKSGELVTAIGTHYQTLSTQNPSNIKILNNHFDNMMYAGVRFTGFTDVLIKGNRFD
CCCCCCCCCEEEEECCCEEEECCCCCCCEEEEECCCCCEEEECEEECCCEEEEEECCCCC
KKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVIAENIFNIADPKTKAIRVAKDSA
CCCCCCCEEEECCCCEEEEEECCCCCHHHHCCCCCEEEEECHHCCCCCCCEEEEEECCCH
EYLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENSIFGGKEGIVIEDSKGKI
HHCCCCCCEEEEHHHHCCCCCCCCCCCEEEEEECCCEEEECCCCCCCCCCEEEECCCCEE
TVLNNQFYNLSGKYISFIKSNANGKEPVIRDSDGNFNIVTENGLYKIVTNNLSDKNEKEK
EEEECCEEECCCCEEEEEECCCCCCCCEEECCCCCEEEEECCCEEEEEECCCCCCCHHHH
NKEEKQSNSNNVIDSNQKNGEFNSSKDNRQMNDKIDNKQDNKTEEVNYKIVGDGRETENH
HHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCEEEEEEEEECCCCHHHH
INKSKEIVDVKQKLPKTGSNKIMELFLTVTGIGLLLTLKGLKYYGKDK
HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHEEHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11544234