Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is yfbK

Identifier: 157161758

GI number: 157161758

Start: 2425566

End: 2427293

Strand: Reverse

Name: yfbK

Synonym: EcHS_A2419

Alternate gene names: 157161758

Gene position: 2427293-2425566 (Counterclockwise)

Preceding gene: 157161761

Following gene: 157161757

Centisome position: 52.27
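
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome value is the gene's start on its own strand (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record does not state either convention, and the function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of the genome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 4_643_538            # "Length" field of the record
START, END = 2_425_566, 2_427_293    # "Start" and "End" fields

length = gene_length(START, END)
print(length)                                    # bases in the gene
print(length // 3 - 1)                           # residues, excluding the stop codon
print(round(centisome(END, GENOME_LENGTH), 2))   # assumed centisome convention
```

Under these assumptions the results agree with the 1728-base gene sequence, the 575-residue product, and the centisome value listed in the record.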

GC content: 47.16
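
A GC content figure like the one above is normally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the value refers to the 1728-base gene sequence listed below; `gc_content` is an illustrative name, not part of the source database.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in wrapped FASTA text) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First nine bases of the gene sequence, used here only as a short input.
print(round(gc_content("ATGCGAAAT"), 2))
```

Passing the full wrapped gene sequence as one string should give a value to compare against the GC content field.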

Gene sequence:

>1728_bases
ATGCGAAATAAAAATATAATCATGTTGCTTATGAGTAGTTTGATTTTGTCAGGATGTGGGCCGCAACCTGAGAATAAGGA
AAGTCAGCAACAACAACCCAGTACTCCCACAGAGCAGCAAGTGCTTGCCGCGCAGCAAGCTGCAATAAAAGAGGCTGAGC
AAAGCGCCGCCGCCGCGAAAGCCTTGGCCCAGCAAGAAGTGCAACAATATTCAGACAAACAGGCTTTACAGGGGCGATTG
CAGGAAGCGCCAACATTTGCAAGAGCGGCTAAAGCAAAAGCTACACATATCGCAAATCCAGGAACCGCTCGCTACCAGCA
GTTCGATGATAATCCGGTTAAGCAGGTAGCGCAAAATCCGTTGGCGACGTTTAGTCTTGACGTTGACACTGGCAGTTATG
CGAATGTAAGGCGTTTCCTCAATCAAGGGCTGTTACCTCCGCCAGACGCTGTGCGGGTGGAGGAGATAGTCAATTATTTC
CCGTCTGATTGGGATATCAAAGACAAACAATCTATTCCGGCCTCTAAGCCAATACCTTTCGCTATGCGCTACGAATTGGC
ACCTGCACCATGGAATGAACAGCGAACATTGCTGAAAGTTGATATCCTGGCGAAAGATCGCAAAAGTGAAGAGTTACCAG
CTTCTAATCTGGTCTTTCTTATCGACACTTCTGGTTCAATGATTTCTGATGAACGTTTGCCACTTATCCAGTCTTCGTTG
AAATTATTGGTCAAAGAACTTCGTGAGCAGGATAACATTGCCATCGTGACCTACGCTGGCGACTCCCGTATTGCATTGCC
TTCTATCTCCGGGAGTCATAAGGCGGAAATTAATGCCGCAATTGATTCGCTGGATGCCGAAGGCAGTACCAATGGCGGTG
CCGGGCTGGAACTGGCTTATCAGCAGGCGACGAAGGGGTTTATTAAGGGCGGCATCAATCGCATTTTATTAGCCACTGAC
GGTGACTTTAACGTTGGCATTGACGATCCAAAATCGATTGAATCAATGGTCAAAAAACAGCGGGAGTCTGGTGTTACTCT
GTCGACGTTTGGCGTGGGGAATAGCAATTACAACGAGGCAATGATGGTGCGAATTGCCGATGTTGGTAACGGCAACTACA
GCTACATTGATACCCTCTCTGAAGCGCAGAAAGTATTGAATAGTGAAATGCGGCAGATGTTGATTACCGTAGCAAAAGAT
GTCAAAGCGCAAATTGAGTTTAACCCCGCGTGGGTAACGGAATACCGTCAGATTGGTTATGAAAAGCGCCAACTTCGGGT
GGAACATTTTAATAACGACAACGTTGATGCAGGGGATATAGGCGCAGGCAAACATATAACGTTGTTATTCGAATTAACGC
TGAACGGGCAAAAAGCATCAATTGATAAGTTACGCTATGCCCCGGATAACAAATTAGCGAAATCGGACAAAACGAAAGAA
CTGGCCTGGTTAAAAATTCGCTGGAAATACCCGCAGGGAAAAGAAAGTCAGTTAGTTGAATTCCCGCTGGGGCCAACAAT
AAACGCGCCCTCTGAAGATATGCGTTTTCGCGCAGCAGTAGCTGCATATGGGCAAAAGTTACGCGGTTCTGAATACCTGA
ACAATACCTCCTGGCAGCAGATCAAACAGTGGGCTCAGCAGGCAAAAGGAGAAGATCCACAGGGTTACAGGGCGGAATTT
ATTCGCCTGATTGAACTGGCGGATGGTGTGACTGACATCAGTCAGTGA

Upstream 100 bases:

>100_bases
TTTGCTATGGGTTTCGATATTGTATTTTATTAAGATTAGCAGGATTATACAAAGAGTATATTTTATGTCTGGTGCCTGAG
TTTATTTAAAAGGATTTTAT

Downstream 100 bases:

>100_bases
TGACTGTTTAGCAAACTATGTTCGACCAGTCAGCATATTTGCTGACTGGTCGAATTAATTAACAATGATGTTAACTCACT
CTTTTGCCTGATGCTCTATT

Product: von Willebrand factor type A domain-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 575; Mature: 575

Protein sequence:

>575_residues
MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL
QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF
PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL
KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD
GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD
VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE
LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEF
IRLIELADGVTDISQ
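
The protein sequence above can be checked against the gene sequence by translating it codon by codon. This is a sketch using the standard genetic code, built from the conventional TCAG ordering; it assumes the listed gene sequence is already the coding strand, so no reverse complement is needed. The name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCGAAATAAAAATATAATCATG"))  # first 24 bases of the gene
print(translate("ATCAGTCAGTGA"))              # last 12 bases of the gene
```

The two fragments correspond to the start (MRNKNIIM) and end (ISQ) of the listed protein sequence.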

Sequences:

>Translated_575_residues
MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL
QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF
PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL
KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD
GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD
VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE
LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEF
IRLIELADGVTDISQ
>Mature_575_residues
MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL
QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF
PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL
KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD
GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD
VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE
LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEF
IRLIELADGVTDISQ

Specific function: Unknown

COG id: COG2304

COG function: function code R; Uncharacterized protein containing a von Willebrand factor type A (vWA) domain

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: To Synechocystis PCC 6803 sll0103

Homologues:

Organism=Escherichia coli, GI1788606, Length=575, Percent_Identity=100, Blast_Score=1178, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YFBK_ECOLI (P76481)

Other databases:

- EMBL:   U00096
- EMBL:   AP009048
- PIR:   D64998
- RefSeq:   AP_002868.1
- RefSeq:   NP_416773.1
- ProteinModelPortal:   P76481
- SMR:   P76481
- IntAct:   P76481
- STRING:   P76481
- EnsemblBacteria:   EBESCT00000004284
- EnsemblBacteria:   EBESCT00000016979
- GeneID:   946743
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW2265
- KEGG:   eco:b2270
- EchoBASE:   EB3848
- EcoGene:   EG14095
- eggNOG:   COG2304
- GeneTree:   EBGT00050000012129
- HOGENOM:   HBG503062
- OMA:   EDFNNDK
- ProtClustDB:   CLSK891769
- BioCyc:   EcoCyc:G7177-MONOMER
- Genevestigator:   P76481
- InterPro:   IPR021908
- InterPro:   IPR022156
- InterPro:   IPR002035
- SMART:   SM00327

Pfam domain/function: PF12034 DUF3520; PF00092 VWA; PF12450 vWF_A

EC number: NA

Molecular weight: Translated: 63635; Mature: 63635

Theoretical pI: Translated: 5.68; Mature: 5.68

Prosite motif: PS51257 PROKAR_LIPOPROTEIN; PS50234 VWFA; PS00013 PROKAR_LIPOPROTEIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
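
The percentages above are simple residue frequencies. A minimal sketch of the calculation follows, assuming each value is the residue count divided by the sequence length and rounded to one decimal place; `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(seq), 1)


# Short illustrative input: the first 20 residues of the protein.
fragment = "MRNKNIIMLLMSSLILSGCG"
print(residue_percent(fragment, "C"))
print(residue_percent(fragment, "M"))
print(residue_percent(fragment, "CM"))
```

The listed figures appear consistent with 1 Cys and 11 Met in 575 residues, since 1/575, 11/575 and 12/575 round to 0.2 %, 1.9 % and 2.1 %.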

Secondary structure:

>Translated Secondary Structure
MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK
CCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP
HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHCCCCCHHHHHHCCC
LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF
CEEEEEECCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCE
AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL
EEEEEECCCCCCCCCEEEEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHH
KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY
HHHHHHHHCCCCEEEEEEECCCEEEECCCCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHH
QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA
HHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCE
MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY
EEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHCCC
EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE
HHHHEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCCCHHHHCCCCCCCCCCCCCCCC
LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ
EEEEEEEEECCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHCCCCHHH
IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ
HHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHCCC
>Mature Secondary Structure
MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK
CCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP
HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHCCCCCHHHHHHCCC
LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF
CEEEEEECCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCE
AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL
EEEEEECCCCCCCCCEEEEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHH
KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY
HHHHHHHHCCCCEEEEEEECCCEEEECCCCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHH
QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA
HHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCE
MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY
EEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHCCC
EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE
HHHHEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCCCHHHHCCCCCCCCCCCCCCCC
LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ
EEEEEEEEECCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHCCCCHHH
IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ
HHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHCCC
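
The prediction strings above can be summarised as the fraction of residues in each state. The sketch below assumes the usual three-state code, which the record does not spell out: H for helix, E for strand, and C for coil. `state_fractions` is an illustrative name.

```python
from collections import Counter


def state_fractions(prediction: str) -> dict[str, float]:
    """Fraction of residues in each secondary-structure state.

    Assumes a three-state string: H (helix), E (strand), C (coil).
    """
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {s: counts.get(s, 0) / len(states) for s in "HEC"}


# Last line of the prediction above, used as a short input.
print(state_fractions("HHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHCCC"))
```

Concatenating all prediction lines (not the residue lines) before calling the function gives the composition for the whole protein. A substantial share of both H and E would be in line with the "Alpha Beta" structure class listed below.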

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9278503