Definition: Yersinia pestis CO92 chromosome, complete genome.
Accession: NC_003143
Length: 4,653,728

The map label for this gene is yfbK [C]

Identifier: 218930054

GI number: 218930054

Start: 3359114

End: 3360643

Strand: Direct

Name: yfbK [C]

Synonym: YPO3007

Alternate gene names: 218930054

Gene position: 3359114-3360643 (Clockwise)

Preceding gene: 218930049

Following gene: 218930055

Centisome position: 72.18
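
The coordinates above are enough to recompute the gene length and the centisome position. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome value is the start coordinate expressed as a percentage of the chromosome length. The function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    print(gene_length(3359114, 3360643))          # 1530
    print(round(centisome(3359114, 4653728), 2))  # 72.18
```

Both results agree with the 1530-base gene sequence and the centisome position listed in this record.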

GC content: 46.54
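
A GC content value of this kind can be recomputed from the gene sequence. The sketch below assumes the value is the percentage of G and C bases over the whole gene; `gc_content` is a hypothetical helper, and it strips the line breaks of wrapped FASTA text before counting.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace so that wrapped FASTA lines can be passed in directly.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    print(gc_content("ATGC"))  # 50.0
```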

Gene sequence:

>1530_bases
TTGGCCGTTTTACAAATTTTTGACAACTTTATACAGCAAACAGACAACTTAGCGCATGATAAGATGAATAATCGTGGCAC
TACTCTTCGAGTCTTGCACAGAGGTACTATCGTGTTACTAAAACCAGCCATTTTAATAAAAACTGACATGCTAATAAAAC
CAGACATCCTGATAAAACCGGCCACGCTAATAAAACAGGCCGCTCGAATAAAGCAGATGACCCTATTGTTGCTATTATTC
CTTTTTTCGCTGTTTGGGGTAGCCAAGGCGGCGACACAAGTGGTCAATGTCAAATCAGAACTTGCTGCGCCCGTTATGCT
GGCTAATAGCGAAGATAAAAATTACCTGAAAATTTCTCTTACCGGTTTTAATCTCGACAGCACCCGTCGTAGCCCAATCA
ATCTGGCGCTGGTCATTGATCGTTCTACGTCAATGAGCGGTGAGCGCATCGAGAAAGCCAGAGAAGCGGCGATTTTAGCG
GTTAATATGCTTAACATCACCGATACGCTATCGGTGGTGGCCTACGATAACCACGCCGAGGTGATCATTCCGGCCACGAA
AGTCACTGATAAGCCAGCGCTGATTGCCAGCATTCAACAGCACATTCACCCAAGGGGAATGACCGCCTTGTTTGCTGGTG
TCAGTATGGGTATTGGTCAAGTGGATAAACACCTGAACCGTGAGCAGGTCAATCGCATCATCCTTATCTCTGATGGTCAG
GCGAATACTGGCCCCACCTCAATCAGCGAACTTTCCGATCTGGCCCGCATGGCGGCTAAAAAAGGGATTGCCATCACCAC
TATCGGGCTGGGCCAGGATTATAACGAGGATCTGATGACTGCCATTGCGGGTTATAGCGACGGTAACCACACCTTTGTCG
CTAACTCGGCAGATCTGGAAAAGGCGTTCACCAAAGAATTCCAAGATGTGATGTCCGTCGTCGCACAGGATATCGTTGTT
CAGATTAAGACTGGCGATAAGGTGAAACCGGTACGACTACTGGGGCGCGATGGCGATATCCTCGGCAATACAGTGAATGT
GAAACTGAATCAGCTTTACTCTAATCAGGAAAAATACATTCTACTGGAGGTGATTCCGGAAAAAGGCACTGACAAGCAGC
AAAAAGATCTGGCCGATGTCAGTATCAGCTACCTGAATCTCAGCAGTAAAAAACAAGATCAGATTAATGAACGAGTGACC
GTCAGCTACAGCCAATCCGTAGAAAAGGTCAACGATGCTGTACAAGAAGAAGTATTAGCGGAGTCAGAGATTCAAAAAAC
AGCGCTGGCCAATGATGAAGCCATTAAATTGATCGACGCGGGGCGCAAGGATGAAGCGAAGAAAGTGCTAGAGTCCAATG
CCTCAAAACTGGACAGCATGTCGTTCTCCAGCCCGGTTGCAGAGAAAAAAGTGCGAGAGAGTACGAATAAACAGCGTAAA
TTAGCGGATGACATTGACAGTAAAGATGCCGCCACTTACCGTAAAGAGCTGAAAGAACAGAACTACAACGTCAAACAGCA
GCAGAAGTAA

Upstream 100 bases:

>100_bases
TGGGAATTTATTTTTCTCTTATTCCCCCCAGAATTCCTATATGGTTATTCATTCCTCTGACTACATTACCGGCTATTCAG
CACGGTGTTCATGACGCTAA

Downstream 100 bases:

>100_bases
AAATAAAAGAGGTTATTGACAAAGTGCCCGCGACGGAGAGAACAGGCAGATCGTAAAGACGCCGTAAATACATCCATGTA
GGCTCGAGCCGCGCCATCCT

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 509; Mature: 508
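
The two counts follow from the 1530-base gene: 510 codons, minus the stop codon, give 509 residues, and the mature form here is the translated form without its first residue. The sketch below assumes the standard genetic code, assumes that the first codon (TTG in this gene) is read as Met when it acts as the initiator, and assumes that "mature" means only removal of that initiator Met. `translate` and `mature` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for index in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon always encodes Met, even when it is TTG.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop the initiator Met (assumed to be the only processing step)."""
    return protein[1:] if protein.startswith("M") else protein


if __name__ == "__main__":
    # First ten codons of the gene sequence, followed by a stop codon.
    fragment = "TTGGCCGTTTTACAAATTTTTGACAACTTTTAA"
    print(translate(fragment))          # MAVLQIFDNF
    print(mature(translate(fragment)))  # AVLQIFDNF
```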

Protein sequence:

>509_residues
MAVLQIFDNFIQQTDNLAHDKMNNRGTTLRVLHRGTIVLLKPAILIKTDMLIKPDILIKPATLIKQAARIKQMTLLLLLF
LFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRSTSMSGERIEKAREAAILA
VNMLNITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNRIILISDGQ
ANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVV
QIKTGDKVKPVRLLGRDGDILGNTVNVKLNQLYSNQEKYILLEVIPEKGTDKQQKDLADVSISYLNLSSKKQDQINERVT
VSYSQSVEKVNDAVQEEVLAESEIQKTALANDEAIKLIDAGRKDEAKKVLESNASKLDSMSFSSPVAEKKVRESTNKQRK
LADDIDSKDAATYRKELKEQNYNVKQQQK

Sequences:

>Translated_509_residues
MAVLQIFDNFIQQTDNLAHDKMNNRGTTLRVLHRGTIVLLKPAILIKTDMLIKPDILIKPATLIKQAARIKQMTLLLLLF
LFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRSTSMSGERIEKAREAAILA
VNMLNITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNRIILISDGQ
ANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVV
QIKTGDKVKPVRLLGRDGDILGNTVNVKLNQLYSNQEKYILLEVIPEKGTDKQQKDLADVSISYLNLSSKKQDQINERVT
VSYSQSVEKVNDAVQEEVLAESEIQKTALANDEAIKLIDAGRKDEAKKVLESNASKLDSMSFSSPVAEKKVRESTNKQRK
LADDIDSKDAATYRKELKEQNYNVKQQQK
>Mature_508_residues
AVLQIFDNFIQQTDNLAHDKMNNRGTTLRVLHRGTIVLLKPAILIKTDMLIKPDILIKPATLIKQAARIKQMTLLLLLFL
FSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRSTSMSGERIEKAREAAILAV
NMLNITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNRIILISDGQA
NTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQ
IKTGDKVKPVRLLGRDGDILGNTVNVKLNQLYSNQEKYILLEVIPEKGTDKQQKDLADVSISYLNLSSKKQDQINERVTV
SYSQSVEKVNDAVQEEVLAESEIQKTALANDEAIKLIDAGRKDEAKKVLESNASKLDSMSFSSPVAEKKVRESTNKQRKL
ADDIDSKDAATYRKELKEQNYNVKQQQK

Specific function: Unknown

COG id: COG2304

COG function: function code R; Uncharacterized protein containing a von Willebrand factor type A (vWA) domain

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 VWFA domain [H]

Homologues:

Organism=Escherichia coli, GI1788606, Length=230, Percent_Identity=26.9565217391304, Blast_Score=85, Evalue=1e-17,
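
A homologue entry like the one above can be split into named fields for further processing. This is a minimal sketch, assuming fields are comma-separated `key=value` pairs and that a bare token starting with `GI` is the GI number. `parse_homologue` is a hypothetical helper, and it would mis-split an organism name that itself contains a comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one comma-separated homologue entry into a dict of strings."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    return record


if __name__ == "__main__":
    entry = ("Organism=Escherichia coli, GI1788606, Length=230, "
             "Percent_Identity=26.9565217391304, Blast_Score=85, Evalue=1e-17,")
    hit = parse_homologue(entry)
    print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 2))
```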

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002035 [H]

Pfam domain/function: PF00092 VWA [H]

EC number: NA

Molecular weight: Translated: 56098; Mature: 55967
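
The difference between the two weights, 131, corresponds to a single methionine residue, which fits a mature form that lacks only the initiator Met. The sketch below checks that arithmetic, assuming an approximate average residue mass for Met of 131.19 Da (a textbook value, not taken from this record).

```python
MET_RESIDUE_MASS = 131.19  # approximate average mass of one Met residue, in Da


def mass_without_initiator_met(translated_mass: float) -> float:
    """Mass after removing the N-terminal Met residue from the translated protein."""
    return translated_mass - MET_RESIDUE_MASS


if __name__ == "__main__":
    print(round(mass_without_initiator_met(56098)))  # 55967
```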

Theoretical pI: Translated: 8.96; Mature: 8.96

Prosite motif: PS50234 VWFA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
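
Percentages of this kind are residue counts divided by the protein length. The sketch below is one way to compute them, assuming values are rounded to one decimal place; `residue_percent` is an illustrative helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(residue) for residue in residues.upper())
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    print(residue_percent("MACM", "M"))   # 50.0
    print(residue_percent("MACM", "CM"))  # 75.0
```

The listed values are consistent with 13 Met in 509 residues (2.6) and 12 Met in 508 residues (2.4), as expected when only the initiator Met is removed.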

Secondary structure:

>Translated Secondary Structure
MAVLQIFDNFIQQTDNLAHDKMNNRGTTLRVLHRGTIVLLKPAILIKTDMLIKPDILIKP
CHHHHHHHHHHHHHCCHHHHCCCCCCCEEEEEECCEEEEECCHHHEEECCEECCCCEECC
ATLIKQAARIKQMTLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCEEEEEE
TGFNLDSTRRSPINLALVIDRSTSMSGERIEKAREAAILAVNMLNITDTLSVVAYDNHAE
EEECCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHEEEEEEECCCCEEEEEEECCCCE
VIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNRIILISDGQ
EEEECCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHCCHHHCCEEEEEECCC
ANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLE
CCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHCCCCCCEEEEECCHHHH
KAFTKEFQDVMSVVAQDIVVQIKTGDKVKPVRLLGRDGDILGNTVNVKLNQLYSNQEKYI
HHHHHHHHHHHHHHHHHHEEEEECCCCCCCEEEECCCCCCCCCEEEEEEEHHHCCCCCEE
LLEVIPEKGTDKQQKDLADVSISYLNLSSKKQDQINERVTVSYSQSVEKVNDAVQEEVLA
EEEEECCCCCCHHHHHHHHHEEEEEECCCCHHHHCCCEEEEEHHHHHHHHHHHHHHHHHH
ESEIQKTALANDEAIKLIDAGRKDEAKKVLESNASKLDSMSFSSPVAEKKVRESTNKQRK
HHHHHHHHCCCCCEEEEECCCCCHHHHHHHHCCHHHHHCCCCCCCHHHHHHHHHHHHHHH
LADDIDSKDAATYRKELKEQNYNVKQQQK
HHHCCCCCHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
AVLQIFDNFIQQTDNLAHDKMNNRGTTLRVLHRGTIVLLKPAILIKTDMLIKPDILIKP
HHHHHHHHHHHHHCCHHHHCCCCCCCEEEEEECCEEEEECCHHHEEECCEECCCCEECC
ATLIKQAARIKQMTLLLLLFLFSLFGVAKAATQVVNVKSELAAPVMLANSEDKNYLKISL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCEEEEEE
TGFNLDSTRRSPINLALVIDRSTSMSGERIEKAREAAILAVNMLNITDTLSVVAYDNHAE
EEECCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHEEEEEEECCCCEEEEEEECCCCE
VIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNRIILISDGQ
EEEECCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHCCHHHCCEEEEEECCC
ANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLE
CCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHCCCCCCEEEEECCHHHH
KAFTKEFQDVMSVVAQDIVVQIKTGDKVKPVRLLGRDGDILGNTVNVKLNQLYSNQEKYI
HHHHHHHHHHHHHHHHHHEEEEECCCCCCCEEEECCCCCCCCCEEEEEEEHHHCCCCCEE
LLEVIPEKGTDKQQKDLADVSISYLNLSSKKQDQINERVTVSYSQSVEKVNDAVQEEVLA
EEEEECCCCCCHHHHHHHHHEEEEEECCCCHHHHCCCEEEEEHHHHHHHHHHHHHHHHHH
ESEIQKTALANDEAIKLIDAGRKDEAKKVLESNASKLDSMSFSSPVAEKKVRESTNKQRK
HHHHHHHHCCCCCEEEEECCCCCHHHHHHHHCCHHHHHCCCCCCCHHHHHHHHHHHHHHH
LADDIDSKDAATYRKELKEQNYNVKQQQK
HHHCCCCCHHHHHHHHHHHCCCCCCCCCC
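
The interleaved layout above (a residue line followed by a state line of equal length) can be flattened and summarized. This is a minimal sketch, assuming the usual three-state convention in which H is helix, E is strand, and C is coil; the function names are hypothetical.

```python
from collections import Counter


def parse_interleaved(lines):
    """Join alternating residue/state lines into (sequence, states)."""
    rows = [s for s in (line.strip() for line in lines)
            if s and not s.startswith(">")]
    sequence = "".join(rows[0::2])  # residue lines
    states = "".join(rows[1::2])    # state lines
    if len(sequence) != len(states):
        raise ValueError("sequence and state lines differ in length")
    return sequence, states


def state_percentages(states: str) -> dict:
    """Percentage of positions in each of the H, E and C states."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


if __name__ == "__main__":
    block = [">example", "MAVL", "CHHH", "QI", "EE"]
    sequence, states = parse_interleaved(block)
    print(sequence, states)           # MAVLQI CHHHEE
    print(state_percentages(states))  # {'H': 50.0, 'E': 33.3, 'C': 16.7}
```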

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8590279; 8905231 [H]