Definition Yersinia pestis CO92 chromosome, complete genome.
Accession NC_003143
Length 4,653,728

The map label for this gene is yfiQ [H]

Identifier: 218930297

GI number: 218930297

Start: 3644013

End: 3646655

Strand: Direct

Name: yfiQ [H]

Synonym: YPO3272

Alternate gene names: 218930297

Gene position: 3644013-3646655 (Clockwise)

Preceding gene: 218930296

Following gene: 218930298

Centisome position: 78.3
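
The coordinate fields above can be cross-checked against one another. The sketch below assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not define either convention, and the function names are illustrative.

```python
GENOME_LENGTH = 4_653_728          # chromosome length from the record header
START, END = 3_644_013, 3_646_655  # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 2643
print(round(centisome(START, GENOME_LENGTH), 1))  # 78.3
```

Under these assumptions the computed length matches the 2643_bases header of the gene sequence, and the percentage matches the listed centisome position.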

GC content: 52.14
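
GC content is conventionally the percentage of G and C bases in a sequence. The following is a minimal sketch of that calculation, not necessarily the method used to produce the value above. The sample string is only the first 80 bases of the gene, so its result will differ from the 52.14 reported for the full 2,643 bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 80 bases of the gene sequence, copied from the record.
first_line = (
    "ATGAGCCAACGTGGATTAGAAGCACTGTTACGTCCGAAATCTATCGCAGTGATCGGTGCT"
    "TCTGAAAAACCTGAGCGAGC"
)
print(f"{gc_content(first_line):.2f}")
```

To reproduce the record's figure, pass the complete gene sequence with line breaks removed.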

Gene sequence:

>2643_bases
ATGAGCCAACGTGGATTAGAAGCACTGTTACGTCCGAAATCTATCGCAGTGATCGGTGCTTCTGAAAAACCTGAGCGAGC
GGGTTTTCTGATGATGCGCAACTTGCTGGATGGCGGTTTCAATGGACCGATTCTCCCGGTAACGCCCAATCACAAAGCGG
TGTGCGGTGTCTTGGCTTACGCCAATATCGCCAGCCTACCCATCACTCCTGATTTAGCGATTTTGTGCACCCATGATTGC
CGTAATCTGACGCTACTGGAAGACCTTGGGAACCGGGGCTGTAAAGCCGCCATTATTTTGTCTGCTGCACCGGAGCAGTT
CCCAGAACTGAAAGCCTGCGCTCAACGCCACCATATGCGGCTACTCGGGCCTAATAGCCTGGGGCTACTGGCCCCGTGGC
AAGGGCTGAATGCCAGCTTTTCACCCGTCCCCATCAAAAAAGGGCGGCTGGCGTTTATCTCCCAGTCTGCAGCCGTGGCC
AATACGATCCTTGATTGGGCGCAGCAACGGGAAGTTGGCTTTTCTTACTTTATTGCACTGGGCGACAGCTTGGATATTGA
TGTCGATGACCTACTCGATTTCCTTGCCCGAGACAGCAAAACCAGCGCGATTATGCTGTATATCGAACATATCAGCGATG
CACGCCGCTTCTTATCCGCTTCCCGCAGTGCTTCACGCAATAAACCGGTTTTGGTGGTGAAAAGTGGCCGCAGTCAGCGC
GCTCAACAGCTCTTGAATGGTCAGCAAGGATTGGATGCCGCCTATGATGCCGCTATCCAGCGGGCCGGGTTGCTGCGAGT
ACAGGATACCCATGAATTGTTCTCTGCCGTTGAGACTCTCAGCCATATGCGCCCACTGCGCGGTGAACGGCTGTTGATTG
TCAGCAATGGTTCAGCACCGGCGGCGATGGCGCTGGATGAACTTATCCGCCGTAATGGTAAATTGGCGACCTTGTCGGAT
GCCACGCAATCCGCACTGAGTGAAGCTCTGCCGCCTTTTGTTGCGCTACGTAACCCAATAGACCTACGGGATGATGCCAG
TGCTGAGCGCTATTTAGCGGCACTCAAGCCGCTATTGGACAGTAGTGATTACGATACGCTACTCCTGATCCACTCGCCCA
GCGCCGCCGCACCGGGAGCCAAAACGGCTGAATTATTGATTTCAGCTATTCGCCAACATCCGCGCGGTAAACGCATCACG
TTACTCACCAATTGGTGTGGTGAATATTCATCACAAGATGCCCGTCGCTTGTTTACTGAAGCGGGTATTCCAACCTATCG
CACCCCGGAAGGCGCGATCACGGCTTTTATGCATATGGTGGAGTATCGTCGTAATCAGAAACAACTGAAAGAGACCCCGG
CATTACCAATAGGTTTGACCGCCAATACTGCCCATGTCCATCAGCTAATTCGCCAGGCCTTGGCCGAAGGGGCAACCCAG
CTTGATACCCATGAAGTGCAACCCATTCTTGAAGCGTATGGCCTCAGGACATTGCCAACCTGGATTGCCAGCGACAGTGT
AGAAGCCGTACATATTGCTGAACGACTGGGCTATCCGGTCGCAATTAAACTTCGCTCGCCTGATATCCCGCATAAATCGG
AAGTTCAGGGGGTCATGTTGTATTTGCGCACCGCCATTGAAGTGCAGCGGGCGGCAGACGATATCCTCGATCGGGTAAAG
CGGACCTATCCACAAGCACGAATTCATGGTTTGCTGGTGCAAAGCATGGCGAATCGAGCAGGAGCACAAGAATTGCGTAT
TGCTGTGGAGCAAGATGCTATTTTCGGCCCGTTGATCATGTTAGGTGAAGGCGGTATTGAGTGGCACCACGAGACACAAG
CCGCCGTCGCACTACCGCCTCTCAATATGGTGCTGGCGCGCTACCTGATTATACAGGCCGTCAAAGGGGGGAAAATTCGC
AGCCGCGGATCACTACAACCTTTGGATATTCCAGGGCTAAGCCGCTTGCTGGTACAAGTTTCCAACTTGATCCTCGACTG
CCCCGAAATCACCCGTCTGGATATTCACCCGGTACTGGCCTCCGGCAGTGAGTTCACGCTGCTGGATGTGTCGATGCAAT
TAGCCCCTGTCACTGGTGACCCTCAAGCTCGTCTGGCAATTCGCCCGTATCCGCATGAATTGGAAGAAAAGGTCACGCTC
AGAGACAACTCCCAGTGCTTATTCCGCCCGATTCTGCCGGAAGATGAGCCACTGCTAAAACTGTTTATTGATCAAGTAAC
TAAAGAAGACCTTTATTATCGCTACTTCAGTGAAATCAATGAATTCAGCCATGATGATTTGGCGAATATGACACAAATTG
ACTATGATCGGGAAATGGCTTTTGTTGCTGTGCGTCAAAATAGTGAAGGGCCAGAGATCATAGGTGTCACGCGGGCATTT
TCTGATCCTGACAACATTGATGCCGAATTTGCCGTACTGGTCCGCTCGGATCTAAAAGGGCTGGGCTTAGGCAGGGCATT
ACTTGAGAAGATGATCCGTTATGCCCGTAGCCATGGGCTATCCCGGCTCACCGCAGTCACCATGCCAAATAACCGCGGTA
TGATTGGTTTGGCACAAAAACTCGGTTTTACTATTGATGTGCAAATAGAAGATGGGATCGTAAATCTGGAGCTGACACTT
TGA

Upstream 100 bases:

>100_bases
CGGAAAACCTCATCACCCAGTGGGGCGAGTCACAGCAAGCCACGAAGAAAACGCGTAAAGTCAGAGAAGCTATCCTTATA
TCAACAACAGGAACCTGCAC

Downstream 100 bases:

>100_bases
TCTTTGTTACACCAATCAGCGTGATATAGGCCACAGAGCTAAGTGATTCAGCCATTGGCAGGCAAAACTTGCGCACAATC
CAGAAAAGTAATGGTATTAT

Product: putative acetyltransferase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 880; Mature: 879

Protein sequence:

>880_residues
MSQRGLEALLRPKSIAVIGASEKPERAGFLMMRNLLDGGFNGPILPVTPNHKAVCGVLAYANIASLPITPDLAILCTHDC
RNLTLLEDLGNRGCKAAIILSAAPEQFPELKACAQRHHMRLLGPNSLGLLAPWQGLNASFSPVPIKKGRLAFISQSAAVA
NTILDWAQQREVGFSYFIALGDSLDIDVDDLLDFLARDSKTSAIMLYIEHISDARRFLSASRSASRNKPVLVVKSGRSQR
AQQLLNGQQGLDAAYDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGERLLIVSNGSAPAAMALDELIRRNGKLATLSD
ATQSALSEALPPFVALRNPIDLRDDASAERYLAALKPLLDSSDYDTLLLIHSPSAAAPGAKTAELLISAIRQHPRGKRIT
LLTNWCGEYSSQDARRLFTEAGIPTYRTPEGAITAFMHMVEYRRNQKQLKETPALPIGLTANTAHVHQLIRQALAEGATQ
LDTHEVQPILEAYGLRTLPTWIASDSVEAVHIAERLGYPVAIKLRSPDIPHKSEVQGVMLYLRTAIEVQRAADDILDRVK
RTYPQARIHGLLVQSMANRAGAQELRIAVEQDAIFGPLIMLGEGGIEWHHETQAAVALPPLNMVLARYLIIQAVKGGKIR
SRGSLQPLDIPGLSRLLVQVSNLILDCPEITRLDIHPVLASGSEFTLLDVSMQLAPVTGDPQARLAIRPYPHELEEKVTL
RDNSQCLFRPILPEDEPLLKLFIDQVTKEDLYYRYFSEINEFSHDDLANMTQIDYDREMAFVAVRQNSEGPEIIGVTRAF
SDPDNIDAEFAVLVRSDLKGLGLGRALLEKMIRYARSHGLSRLTAVTMPNNRGMIGLAQKLGFTIDVQIEDGIVNLELTL
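
The residue counts follow from the gene length: 2,643 bases make 881 codons, the last of which (TGA) is a stop, leaving 880 translated residues. The mature form of 879 residues appears to be the same chain without the initiator methionine. The sketch below illustrates the translation step, assuming the standard genetic code; the helper names are hypothetical, and only the first 27 and last 12 bases of the gene are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGCCAACGTGGATTAGAAGCACTG"))  # MSQRGLEAL
print(translate("CTGACACTTTGA"))                 # LTL
print(2643 // 3 - 1)                             # 880
```

The two fragments reproduce the first nine and last three residues of the protein sequence shown above.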

Sequences:

>Mature_879_residues
SQRGLEALLRPKSIAVIGASEKPERAGFLMMRNLLDGGFNGPILPVTPNHKAVCGVLAYANIASLPITPDLAILCTHDCR
NLTLLEDLGNRGCKAAIILSAAPEQFPELKACAQRHHMRLLGPNSLGLLAPWQGLNASFSPVPIKKGRLAFISQSAAVAN
TILDWAQQREVGFSYFIALGDSLDIDVDDLLDFLARDSKTSAIMLYIEHISDARRFLSASRSASRNKPVLVVKSGRSQRA
QQLLNGQQGLDAAYDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGERLLIVSNGSAPAAMALDELIRRNGKLATLSDA
TQSALSEALPPFVALRNPIDLRDDASAERYLAALKPLLDSSDYDTLLLIHSPSAAAPGAKTAELLISAIRQHPRGKRITL
LTNWCGEYSSQDARRLFTEAGIPTYRTPEGAITAFMHMVEYRRNQKQLKETPALPIGLTANTAHVHQLIRQALAEGATQL
DTHEVQPILEAYGLRTLPTWIASDSVEAVHIAERLGYPVAIKLRSPDIPHKSEVQGVMLYLRTAIEVQRAADDILDRVKR
TYPQARIHGLLVQSMANRAGAQELRIAVEQDAIFGPLIMLGEGGIEWHHETQAAVALPPLNMVLARYLIIQAVKGGKIRS
RGSLQPLDIPGLSRLLVQVSNLILDCPEITRLDIHPVLASGSEFTLLDVSMQLAPVTGDPQARLAIRPYPHELEEKVTLR
DNSQCLFRPILPEDEPLLKLFIDQVTKEDLYYRYFSEINEFSHDDLANMTQIDYDREMAFVAVRQNSEGPEIIGVTRAFS
DPDNIDAEFAVLVRSDLKGLGLGRALLEKMIRYARSHGLSRLTAVTMPNNRGMIGLAQKLGFTIDVQIEDGIVNLELTL

Specific function: Unknown

COG id: COG1042

COG function: function code C; Acyl-CoA synthetase (NDP forming)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 N-acetyltransferase domain [H]

Homologues:

Organism=Escherichia coli, GI1788938, Length=880, Percent_Identity=75.4545454545455, Blast_Score=1377, Evalue=0.0
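
A homologue entry of this form can be split into named fields for downstream use. This is a rough sketch: the field layout is inferred from the single entry above, and the parse_hit helper is hypothetical rather than part of any published tool.

```python
def parse_hit(line: str) -> dict:
    """Split a comma-separated homologue entry into a field dictionary."""
    fields = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            # The GI number has no key; store it without its prefix.
            fields["GI"] = token[2:]
    return fields


entry = (
    "Organism=Escherichia coli, GI1788938, Length=880, "
    "Percent_Identity=75.4545454545455, Blast_Score=1377, Evalue=0.0,"
)
hit = parse_hit(entry)
print(hit["Organism"], hit["GI"])                 # Escherichia coli 1788938
print(round(float(hit["Percent_Identity"]), 1))   # 75.5
```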

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000182
- InterPro:   IPR016181
- InterPro:   IPR011761
- InterPro:   IPR003781
- InterPro:   IPR016040
- InterPro:   IPR016102 [H]

Pfam domain/function: PF00583 Acetyltransf_1 [H]

EC number: NA

Molecular weight: Translated: 96785; Mature: 96654

Theoretical pI: Translated: 6.74; Mature: 6.74

Prosite motif: PS50975 ATP_GRASP

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
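
The percentages above are residue counts divided by chain length, rounded to one decimal place. The sketch below assumes 8 Cys and 19 Met in the 880-residue translated chain (tallied by hand from the sequence in this record, so treat the counts as an assumption) and one fewer Met in the mature chain. The function name is illustrative.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = seq.upper()
    hits = sum(seq.count(r) for r in residues)
    return 100.0 * hits / len(seq)


# Counts assumed from a manual tally of the translated sequence.
N_CYS, N_MET, LENGTH = 8, 19, 880

translated = round(100 * (N_CYS + N_MET) / LENGTH, 1)
mature = round(100 * (N_CYS + N_MET - 1) / (LENGTH - 1), 1)
print(translated)  # 3.1
print(mature)      # 3.0
```

Rounding each value independently explains why the mature Cys and Met figures (0.9 and 2.0) do not sum exactly to the listed 3.0.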

Secondary structure:

>Translated Secondary Structure
MSQRGLEALLRPKSIAVIGASEKPERAGFLMMRNLLDGGFNGPILPVTPNHKAVCGVLAY
CCCCHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCCCEEEECCCCHHHHHHHHH
ANIASLPITPDLAILCTHDCRNLTLLEDLGNRGCKAAIILSAAPEQFPELKACAQRHHMR
HHHCCCCCCCCEEEEEECCCCCEEHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHCCEE
LLGPNSLGLLAPWQGLNASFSPVPIKKGRLAFISQSAAVANTILDWAQQREVGFSYFIAL
EECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECHHHHHHHHHHHHHHHCCCEEEEEEE
GDSLDIDVDDLLDFLARDSKTSAIMLYIEHISDARRFLSASRSASRNKPVLVVKSGRSQR
CCCCCCCHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCEEEEECCCHHH
AQQLLNGQQGLDAAYDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGERLLIVSNGSAP
HHHHHCCCCCCCHHHHHHHHHCCCEEECHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCC
AAMALDELIRRNGKLATLSDATQSALSEALPPFVALRNPIDLRDDASAERYLAALKPLLD
HHHHHHHHHHCCCCEEEECHHHHHHHHHHCCCCEECCCCCCCCCCCCHHHHHHHHHHHHC
SSDYDTLLLIHSPSAAAPGAKTAELLISAIRQHPRGKRITLLTNWCGEYSSQDARRLFTE
CCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCHHHHHHHHH
AGIPTYRTPEGAITAFMHMVEYRRNQKQLKETPALPIGLTANTAHVHQLIRQALAEGATQ
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCHH
LDTHEVQPILEAYGLRTLPTWIASDSVEAVHIAERLGYPVAIKLRSPDIPHKSEVQGVML
CCHHHHHHHHHHCCCCCCCHHHCCCCCHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHH
YLRTAIEVQRAADDILDRVKRTYPQARIHGLLVQSMANRAGAQELRIAVEQDAIFGPLIM
HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCHHHHEEEECCCCCCCCEEE
LGEGGIEWHHETQAAVALPPLNMVLARYLIIQAVKGGKIRSRGSLQPLDIPGLSRLLVQV
ECCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHH
SNLILDCPEITRLDIHPVLASGSEFTLLDVSMQLAPVTGDPQARLAIRPYPHELEEKVTL
HHHHCCCCCCCEEECCCEECCCCCEEEEEEEEEEEECCCCCCCEEEECCCCHHHHCEEEE
RDNSQCLFRPILPEDEPLLKLFIDQVTKEDLYYRYFSEINEFSHDDLANMTQIDYDREMA
CCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCEE
FVAVRQNSEGPEIIGVTRAFSDPDNIDAEFAVLVRSDLKGLGLGRALLEKMIRYARSHGL
EEEEECCCCCCCEEEEEECCCCCCCCCCCEEEEEHHCCCCCCCCHHHHHHHHHHHHHCCC
SRLTAVTMPNNRGMIGLAQKLGFTIDVQIEDGIVNLELTL
CEEEEEECCCCCCCEEEHHHCCCEEEEEEECCEEEEEEEC
>Mature Secondary Structure 
SQRGLEALLRPKSIAVIGASEKPERAGFLMMRNLLDGGFNGPILPVTPNHKAVCGVLAY
CCCHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCCCEEEECCCCHHHHHHHHH
ANIASLPITPDLAILCTHDCRNLTLLEDLGNRGCKAAIILSAAPEQFPELKACAQRHHMR
HHHCCCCCCCCEEEEEECCCCCEEHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHCCEE
LLGPNSLGLLAPWQGLNASFSPVPIKKGRLAFISQSAAVANTILDWAQQREVGFSYFIAL
EECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECHHHHHHHHHHHHHHHCCCEEEEEEE
GDSLDIDVDDLLDFLARDSKTSAIMLYIEHISDARRFLSASRSASRNKPVLVVKSGRSQR
CCCCCCCHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCEEEEECCCHHH
AQQLLNGQQGLDAAYDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGERLLIVSNGSAP
HHHHHCCCCCCCHHHHHHHHHCCCEEECHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCC
AAMALDELIRRNGKLATLSDATQSALSEALPPFVALRNPIDLRDDASAERYLAALKPLLD
HHHHHHHHHHCCCCEEEECHHHHHHHHHHCCCCEECCCCCCCCCCCCHHHHHHHHHHHHC
SSDYDTLLLIHSPSAAAPGAKTAELLISAIRQHPRGKRITLLTNWCGEYSSQDARRLFTE
CCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCHHHHHHHHH
AGIPTYRTPEGAITAFMHMVEYRRNQKQLKETPALPIGLTANTAHVHQLIRQALAEGATQ
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCHH
LDTHEVQPILEAYGLRTLPTWIASDSVEAVHIAERLGYPVAIKLRSPDIPHKSEVQGVML
CCHHHHHHHHHHCCCCCCCHHHCCCCCHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHH
YLRTAIEVQRAADDILDRVKRTYPQARIHGLLVQSMANRAGAQELRIAVEQDAIFGPLIM
HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCHHHHEEEECCCCCCCCEEE
LGEGGIEWHHETQAAVALPPLNMVLARYLIIQAVKGGKIRSRGSLQPLDIPGLSRLLVQV
ECCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHH
SNLILDCPEITRLDIHPVLASGSEFTLLDVSMQLAPVTGDPQARLAIRPYPHELEEKVTL
HHHHCCCCCCCEEECCCEECCCCCEEEEEEEEEEEECCCCCCCEEEECCCCHHHHCEEEE
RDNSQCLFRPILPEDEPLLKLFIDQVTKEDLYYRYFSEINEFSHDDLANMTQIDYDREMA
CCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCEE
FVAVRQNSEGPEIIGVTRAFSDPDNIDAEFAVLVRSDLKGLGLGRALLEKMIRYARSHGL
EEEEECCCCCCCEEEEEECCCCCCCCCCCEEEEEHHCCCCCCCCHHHHHHHHHHHHHCCC
SRLTAVTMPNNRGMIGLAQKLGFTIDVQIEDGIVNLELTL
CEEEEEECCCCCCCEEEHHHCCCEEEEEEECCEEEEEEEC
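
The prediction strings can be summarised as per-state percentages. The sketch below assumes the conventional three-state code (H for helix, E for strand, C for coil), which the record does not spell out, and uses only the first prediction line as sample input.

```python
from collections import Counter


def state_percentages(prediction: str) -> dict:
    """Percentage of positions assigned to each secondary-structure state."""
    counts = Counter(prediction.strip().upper())
    total = sum(counts.values())
    return {state: 100.0 * n / total for state, n in counts.items()}


# First line of the translated secondary-structure prediction.
sample = "CCCCHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCCCEEEECCCCHHHHHHHHH"
for state, pct in sorted(state_percentages(sample).items()):
    print(f"{state}: {pct:.1f}%")
```

Concatenating all prediction lines before calling the function gives the composition for the whole chain.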

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]