Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

Click here to switch to the map view.

The map label for this gene is ykoW [H]

Identifier: 49183543

GI number: 49183543

Start: 546135

End: 548204

Strand: Direct

Name: ykoW [H]

Synonym: BAS0517

Alternate gene names: 49183543

Gene position: 546135-548204 (Clockwise)

Preceding gene: 49183542

Following gene: 49183544

Centisome position: 10.45

GC content: 30.14

Gene sequence:

>2070_bases
ATGAAGGAACAATATAGTAATCAAAATACCTTTTTAAGTATGATAGATATGGATTTAATAAGAAGAGGATTATTACATGC
TATTCAAGATTTAGTATTTATTGTGAAAGTTATTGATGATGAAACATTTAAATATATTTATGTAAATAAATTAGGTATGG
ATTATGCGAAGCTAAGTGAAGAATGTTATGGAAAAACTTTTGCAGAAGTATTACCGGAAGATACGGCGGAAATATTGCAA
GTGCAATATGCAAAAGTAGCGAGCGAAGCGAAAGCACATACCTTTTGTGATGTAGTTAGTTTACCAAAGGGTGAGCTACA
TTATGAATCTTCACTGAATCCTGTATTTGACGAAGAAGGAGTATGTCAGTTTATTATTTGTATTACAAGGGATATCACAG
CTCAAATAGAGGAGAAGATGGAGATAGAGGAAAAACAAATGCTGTTTAAGTCATTACTAGAATATAATAATGACTCCATC
ATATCTATAGATTCTATAGGAAGAATTACATATGCAAATCCAGCAACGTATGAAATATTTGGATACCGATTTGAAGAATT
AAATAATAAGTTTATTTTTGAATTTATTAATAAAGAATATGAAAAAGATTTTCAAATTATATTTAAAGAAGCCATGCAAG
GAAGAGCAAAGCAAATTGTTTCAAAGAAATATGTTCATAAAGAAGGGTACGAGCTCTATATTTCCTTGAGAACTATCCCG
ATTATTGTAAATGGTGAAATTGTTGGAGTATATATTGTTACAAGGGATGTTACAAAGCAAGTATTAAATGAAATGAGAAC
GGAATATTTAGCTTATTATGATCAGCTAACAGGATTAATGAATAGAATTTCATGTACAAATAAGTTAAATGATTTTTTAA
ATGAGAAGGTAGCTTTTGCGTTTATCTTTATAGATTTAGATGAATTTCACCTTATTAATGATACGTTTGGTCATAAAGAA
GGAGATAAAGTGTTACAAAAAGTTACAGAATGTTTAAGCAGTTTTCAAATACCCGATATGCACTTATTTAGAGAACACGA
TGATCAATTTGTTATGTTAATAGAAAATATAACGAAAGAACGTGTAGAAGTATTTGCAAAAAGCATACAAAAAAAGATTA
GTGAACATTTTGTAATCGAAGAAGAGGACGTGTATTTAAGTGCGTCAATTGGAATTGTAATGGCCCCAACAGATGGAGAA
GATGAAAAAATATTATTTCAAAGAGTTGATGCTGCTTTGGAAAAAGCAAAAGAAAAAGGAAAAGGGCATTATCATTTTTA
TGGCAATGGATTAGATTGTGAGCGTGAAAAAAGATTTATAATAGAAAACCAATTACATCGTGCTATAGAAAAAAATGAAT
TCTTCTTATATTATCAGCCGCAAATGAATATTGAAACGAAAAAAATAGCTAGTATGGAGGCCTTAATAAGGTGGGAGAAT
AAAGAATTAGGATTTGTCTCTCCCAATCAATTTATTCCGCTAGCAGAAAGAACAGGATTTATTATTAAGCTTGATGAATG
GGTAATAAATGAAGTGTGTCGGCAAATACGTGAATGGTTAAATAAAGGATATGAAGTTGTCCCGATTGCAGTTAATATTT
CAGCCAGGCATTTTCGCTCTATTACATTAATAGAGATGATTACACGAGCTTTACATAAGTATAATGTATCAGCTCATTTA
TTAGCAATAGAGGTTACAGAAGGAGCACTTATACATAAAGATATATCGAAAAGAGTATTGCTACAATTAAAAGAACAAAA
TTTAAAGATTCATTTAGATGATTTTGGGACAGGGTATTCATCTTTAAGTTATTTAAAAACGTATCCAATTGATACTTTGA
AGATTGATCGTTCTTTTATGGAAGGTATACATGTAGATGAACGAGATACGAATATTACGGCTGCAATTATTCATTTAGCT
CATACGTTAGAATTAGATGTAATTGCAGAAGGGGTAGAAAAGGCGGAACAAATACAATTTTTGAAGGAAAAGAATGTGAA
GTTTGTACAAGGTTATTATTATAGCCGCCCTTTATCAAAATATGATGTGGAAAATGTGTATTATAAATAA

Upstream 100 bases:

>100_bases
AAAAATAAAAATAATATATAATCATACATTTAAAAAAGGGTACATATTCATATAAAATGGGATATATGAATAAAAATAGA
TTAGACGGGTTGAGATAAAT

Downstream 100 bases:

>100_bases
TTGATAAAAAGTGATGTGCTAAGAAGCACATCACTTTTTATGTGTACCGGGTTTATCATTGTAACCAGATGTAAAAATAG
CTGTAAGAAATGTGAGTGAT

Product: sensory box/GGDEF family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 689; Mature: 689

Protein sequence:

>689_residues
MKEQYSNQNTFLSMIDMDLIRRGLLHAIQDLVFIVKVIDDETFKYIYVNKLGMDYAKLSEECYGKTFAEVLPEDTAEILQ
VQYAKVASEAKAHTFCDVVSLPKGELHYESSLNPVFDEEGVCQFIICITRDITAQIEEKMEIEEKQMLFKSLLEYNNDSI
ISIDSIGRITYANPATYEIFGYRFEELNNKFIFEFINKEYEKDFQIIFKEAMQGRAKQIVSKKYVHKEGYELYISLRTIP
IIVNGEIVGVYIVTRDVTKQVLNEMRTEYLAYYDQLTGLMNRISCTNKLNDFLNEKVAFAFIFIDLDEFHLINDTFGHKE
GDKVLQKVTECLSSFQIPDMHLFREHDDQFVMLIENITKERVEVFAKSIQKKISEHFVIEEEDVYLSASIGIVMAPTDGE
DEKILFQRVDAALEKAKEKGKGHYHFYGNGLDCEREKRFIIENQLHRAIEKNEFFLYYQPQMNIETKKIASMEALIRWEN
KELGFVSPNQFIPLAERTGFIIKLDEWVINEVCRQIREWLNKGYEVVPIAVNISARHFRSITLIEMITRALHKYNVSAHL
LAIEVTEGALIHKDISKRVLLQLKEQNLKIHLDDFGTGYSSLSYLKTYPIDTLKIDRSFMEGIHVDERDTNITAAIIHLA
HTLELDVIAEGVEKAEQIQFLKEKNVKFVQGYYYSRPLSKYDVENVYYK

Sequences:

>Translated_689_residues
MKEQYSNQNTFLSMIDMDLIRRGLLHAIQDLVFIVKVIDDETFKYIYVNKLGMDYAKLSEECYGKTFAEVLPEDTAEILQ
VQYAKVASEAKAHTFCDVVSLPKGELHYESSLNPVFDEEGVCQFIICITRDITAQIEEKMEIEEKQMLFKSLLEYNNDSI
ISIDSIGRITYANPATYEIFGYRFEELNNKFIFEFINKEYEKDFQIIFKEAMQGRAKQIVSKKYVHKEGYELYISLRTIP
IIVNGEIVGVYIVTRDVTKQVLNEMRTEYLAYYDQLTGLMNRISCTNKLNDFLNEKVAFAFIFIDLDEFHLINDTFGHKE
GDKVLQKVTECLSSFQIPDMHLFREHDDQFVMLIENITKERVEVFAKSIQKKISEHFVIEEEDVYLSASIGIVMAPTDGE
DEKILFQRVDAALEKAKEKGKGHYHFYGNGLDCEREKRFIIENQLHRAIEKNEFFLYYQPQMNIETKKIASMEALIRWEN
KELGFVSPNQFIPLAERTGFIIKLDEWVINEVCRQIREWLNKGYEVVPIAVNISARHFRSITLIEMITRALHKYNVSAHL
LAIEVTEGALIHKDISKRVLLQLKEQNLKIHLDDFGTGYSSLSYLKTYPIDTLKIDRSFMEGIHVDERDTNITAAIIHLA
HTLELDVIAEGVEKAEQIQFLKEKNVKFVQGYYYSRPLSKYDVENVYYK
>Mature_689_residues
MKEQYSNQNTFLSMIDMDLIRRGLLHAIQDLVFIVKVIDDETFKYIYVNKLGMDYAKLSEECYGKTFAEVLPEDTAEILQ
VQYAKVASEAKAHTFCDVVSLPKGELHYESSLNPVFDEEGVCQFIICITRDITAQIEEKMEIEEKQMLFKSLLEYNNDSI
ISIDSIGRITYANPATYEIFGYRFEELNNKFIFEFINKEYEKDFQIIFKEAMQGRAKQIVSKKYVHKEGYELYISLRTIP
IIVNGEIVGVYIVTRDVTKQVLNEMRTEYLAYYDQLTGLMNRISCTNKLNDFLNEKVAFAFIFIDLDEFHLINDTFGHKE
GDKVLQKVTECLSSFQIPDMHLFREHDDQFVMLIENITKERVEVFAKSIQKKISEHFVIEEEDVYLSASIGIVMAPTDGE
DEKILFQRVDAALEKAKEKGKGHYHFYGNGLDCEREKRFIIENQLHRAIEKNEFFLYYQPQMNIETKKIASMEALIRWEN
KELGFVSPNQFIPLAERTGFIIKLDEWVINEVCRQIREWLNKGYEVVPIAVNISARHFRSITLIEMITRALHKYNVSAHL
LAIEVTEGALIHKDISKRVLLQLKEQNLKIHLDDFGTGYSSLSYLKTYPIDTLKIDRSFMEGIHVDERDTNITAAIIHLA
HTLELDVIAEGVEKAEQIQFLKEKNVKFVQGYYYSRPLSKYDVENVYYK

Specific function: Probable signaling protein whose physiological role is not yet known [H]

COG id: COG5001

COG function: function code T; Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

Organism=Escherichia coli, GI1787541, Length=441, Percent_Identity=33.1065759637188, Blast_Score=267, Evalue=1e-72,
Organism=Escherichia coli, GI87081921, Length=423, Percent_Identity=32.387706855792, Blast_Score=247, Evalue=2e-66,
Organism=Escherichia coli, GI226510982, Length=411, Percent_Identity=27.9805352798054, Blast_Score=171, Evalue=2e-43,
Organism=Escherichia coli, GI1790496, Length=246, Percent_Identity=35.3658536585366, Blast_Score=168, Evalue=9e-43,
Organism=Escherichia coli, GI87081845, Length=254, Percent_Identity=37.007874015748, Blast_Score=160, Evalue=3e-40,
Organism=Escherichia coli, GI87081980, Length=255, Percent_Identity=35.2941176470588, Blast_Score=155, Evalue=8e-39,
Organism=Escherichia coli, GI87081743, Length=240, Percent_Identity=34.5833333333333, Blast_Score=154, Evalue=3e-38,
Organism=Escherichia coli, GI1786507, Length=256, Percent_Identity=33.203125, Blast_Score=143, Evalue=4e-35,
Organism=Escherichia coli, GI1788502, Length=240, Percent_Identity=31.6666666666667, Blast_Score=136, Evalue=4e-33,
Organism=Escherichia coli, GI1787055, Length=424, Percent_Identity=26.4150943396226, Blast_Score=119, Evalue=6e-28,
Organism=Escherichia coli, GI1788849, Length=442, Percent_Identity=26.0180995475113, Blast_Score=117, Evalue=2e-27,
Organism=Escherichia coli, GI87082096, Length=313, Percent_Identity=26.517571884984, Blast_Score=99, Evalue=7e-22,
Organism=Escherichia coli, GI1788381, Length=368, Percent_Identity=23.3695652173913, Blast_Score=96, Evalue=9e-21,
Organism=Escherichia coli, GI87081881, Length=289, Percent_Identity=28.719723183391, Blast_Score=85, Evalue=2e-17,
Organism=Escherichia coli, GI1789650, Length=427, Percent_Identity=22.7166276346604, Blast_Score=70, Evalue=5e-13,
Organism=Escherichia coli, GI1787262, Length=206, Percent_Identity=26.2135922330097, Blast_Score=69, Evalue=8e-13,
Organism=Escherichia coli, GI87081977, Length=155, Percent_Identity=32.9032258064516, Blast_Score=69, Evalue=1e-12,
Organism=Escherichia coli, GI1788085, Length=159, Percent_Identity=27.0440251572327, Blast_Score=65, Evalue=2e-11,
Organism=Escherichia coli, GI87082007, Length=182, Percent_Identity=25.8241758241758, Blast_Score=64, Evalue=3e-11,
Organism=Escherichia coli, GI145693134, Length=170, Percent_Identity=27.6470588235294, Blast_Score=63, Evalue=7e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001054
- InterPro:   IPR000160
- InterPro:   IPR001633
- InterPro:   IPR005330
- InterPro:   IPR000014 [H]

Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF03707 MHYT [H]

EC number: NA

Molecular weight: Translated: 80409; Mature: 80409

Theoretical pI: Translated: 5.10; Mature: 5.10

Prosite motif: PS50112 PAS ; PS50883 EAL ; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKEQYSNQNTFLSMIDMDLIRRGLLHAIQDLVFIVKVIDDETFKYIYVNKLGMDYAKLSE
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHH
ECYGKTFAEVLPEDTAEILQVQYAKVASEAKAHTFCDVVSLPKGELHYESSLNPVFDEEG
HHHCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCC
VCQFIICITRDITAQIEEKMEIEEKQMLFKSLLEYNNDSIISIDSIGRITYANPATYEIF
CEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCEEEECCCCEEEEE
GYRFEELNNKFIFEFINKEYEKDFQIIFKEAMQGRAKQIVSKKYVHKEGYELYISLRTIP
CEEHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEE
IIVNGEIVGVYIVTRDVTKQVLNEMRTEYLAYYDQLTGLMNRISCTNKLNDFLNEKVAFA
EEECCEEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE
FIFIDLDEFHLINDTFGHKEGDKVLQKVTECLSSFQIPDMHLFREHDDQFVMLIENITKE
EEEEECCCEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHCCCCCEEEHHHHHHHH
RVEVFAKSIQKKISEHFVIEEEDVYLSASIGIVMAPTDGEDEKILFQRVDAALEKAKEKG
HHHHHHHHHHHHHHHCCEEECCCEEEEEECCEEECCCCCCCHHHHHHHHHHHHHHHHHHC
KGHYHFYGNGLDCEREKRFIIENQLHRAIEKNEFFLYYQPQMNIETKKIASMEALIRWEN
CCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHEECC
KELGFVSPNQFIPLAERTGFIIKLDEWVINEVCRQIREWLNKGYEVVPIAVNISARHFRS
CCCCCCCCCCCEEHHHCCCEEEEEHHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHHHHH
ITLIEMITRALHKYNVSAHLLAIEVTEGALIHKDISKRVLLQLKEQNLKIHLDDFGTGYS
HHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCH
SLSYLKTYPIDTLKIDRSFMEGIHVDERDTNITAAIIHLAHTLELDVIAEGVEKAEQIQF
HHHHHHHCCCCCEEECHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LKEKNVKFVQGYYYSRPLSKYDVENVYYK
HHHCCCEEEECCHHCCCCCCCCCCCCCCC
>Mature Secondary Structure
MKEQYSNQNTFLSMIDMDLIRRGLLHAIQDLVFIVKVIDDETFKYIYVNKLGMDYAKLSE
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHH
ECYGKTFAEVLPEDTAEILQVQYAKVASEAKAHTFCDVVSLPKGELHYESSLNPVFDEEG
HHHCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCC
VCQFIICITRDITAQIEEKMEIEEKQMLFKSLLEYNNDSIISIDSIGRITYANPATYEIF
CEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCEEEECCCCEEEEE
GYRFEELNNKFIFEFINKEYEKDFQIIFKEAMQGRAKQIVSKKYVHKEGYELYISLRTIP
CEEHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEE
IIVNGEIVGVYIVTRDVTKQVLNEMRTEYLAYYDQLTGLMNRISCTNKLNDFLNEKVAFA
EEECCEEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE
FIFIDLDEFHLINDTFGHKEGDKVLQKVTECLSSFQIPDMHLFREHDDQFVMLIENITKE
EEEEECCCEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHCCCCCEEEHHHHHHHH
RVEVFAKSIQKKISEHFVIEEEDVYLSASIGIVMAPTDGEDEKILFQRVDAALEKAKEKG
HHHHHHHHHHHHHHHCCEEECCCEEEEEECCEEECCCCCCCHHHHHHHHHHHHHHHHHHC
KGHYHFYGNGLDCEREKRFIIENQLHRAIEKNEFFLYYQPQMNIETKKIASMEALIRWEN
CCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHEECC
KELGFVSPNQFIPLAERTGFIIKLDEWVINEVCRQIREWLNKGYEVVPIAVNISARHFRS
CCCCCCCCCCCEEHHHCCCEEEEEHHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHHHHH
ITLIEMITRALHKYNVSAHLLAIEVTEGALIHKDISKRVLLQLKEQNLKIHLDDFGTGYS
HHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCH
SLSYLKTYPIDTLKIDRSFMEGIHVDERDTNITAAIIHLAHTLELDVIAEGVEKAEQIQF
HHHHHHHCCCCCEEECHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LKEKNVKFVQGYYYSRPLSKYDVENVYYK
HHHCCCEEEECCHHCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9384377; 11728710 [H]