Definition | Bacillus cereus Q1 chromosome, complete genome. |
---|---|
Accession | NC_011969 |
Length | 5,214,195 |
The map label for this gene is yrkA [H]
Identifier: 222094359
GI number: 222094359
Start: 669298
End: 670575
Strand: Direct
Name: yrkA [H]
Synonym: BCQ_0673
Alternate gene names: 222094359
Gene position: 669298-670575 (Clockwise)
Preceding gene: 222094358
Following gene: 222094360
Centisome position: 12.84
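The centisome value appears to be the gene's start coordinate expressed as a percentage of the chromosome length. The sketch below reproduces the figure under that assumption; the function name `centisome` is illustrative and is not part of the database.

```python
def centisome(start: int, genome_length: int) -> float:
    """Return a gene's start coordinate as a percentage of the genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Start and Length are taken from this record.
position = centisome(669298, 5214195)
print(round(position, 2))  # prints 12.84
```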
GC content: 35.45
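GC content is conventionally the percentage of G and C bases in a nucleotide sequence. A minimal sketch, assuming that is the definition used here; `gc_percent` is a hypothetical helper, and the short input is only the first ten bases of the gene sequence in this record, not the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Ignore whitespace and any non-ACGT characters so pasted,
    # space-separated sequence blocks can be used directly.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene: 3 G + 1 C out of 10 bases.
print(gc_percent("ATGGTAGTCA"))  # prints 40.0
```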
Gene sequence:
>1278_bases ATGGTAGTCATTCTTATTGCATTAACAGGTTTTTTTGTAGCTGTTGAGTTTGCAATTATTAAGGTGCGTAGCAGTCGTAT TGACCAACTCGTTAGTGAGAAACGACGAGGAGCATTGGCAGCAAAAAAAGTAACTTCAAATTTAGATGAATATTTATCAG CATGTCAGTTAGGTATTACAATTACCGCTTTAGGGCTTGGGTGGTTAGGAGAGCCGACCATAAAACATTTACTCGAGCCG TTGTTTTTAAAACTGCATTTATCTCCTGCAATTTCCAGCACAGTTTCATTTATTATTGCCTTTGCAGTGATTACGTTTTT ACATGTTGTCATTGGTGAACTTGCTCCTAAGACGTTCGCCATACAAAGGGCAGAGCAAGTTAGCTTATTATTATCGAAGC CACTTATTTATTTTTACCGAATAATGTATCCGTTTATTTGGGCTTTAAATGGTTCAGCAAGGCTTGTAACAGGGTTATTC GGATTACATCCGGCTTCTGAACATGAAGTTGCTCATTCGGAAGAAGAATTACGGTTAATCTTATCTGAGAGTTATGAGAG CGGAGAGATTAATCAAAGGGAATTTAAATATGTAAATAATATTTTTGAATTTGATAATAGAGTGGCAAAGGAAATTATGG TACCTCGTACGGAAGTTGTAGGTTTATATGAGGACGAACCATTTGAAACACATATTAAAATAATCGCACAAGAAAAATAT ACGAGGTATCCTGTATTTGGTGAAGATAAAGATGAAATCATTGGGATGGTTAATGTAAAGGATTTATTTATTCGTTATAT GGATGGTAATCGGGACGAGGATTGCTCGATTATGCCATATACAAGGCCGGTCATTGAAGTACTAGAAAATATTCCAATTC ATGATTTACTATTACAAATGCAAAGAAAACACATTCCATTAGCTGTGTTGTATGATGAATATGGTGGTACAGCTGGGATT GTTACATTAGAAGATATTTTAGAAGAAATTGTTGGGGAAATTCGAGATGAATACGATGAAGATGAACACCCGCCTATAGA GCATATAAGTGAAGGGTGTAAAATCGTAGAGGGGAAAGTGCTTATTAGTGAAGTAAATGATTTATTTGGCATACACTTAA TCGCTGATGATGTAGATACAATTGGTGGATGGATTATGGTACAAAAGCAAATCGTTGCTGAAGGAGATATTATTGAAAAA CACGGCTTTTCTTTTAAAGTTCTGGAAAAAGATATGCATCAAATTAAACGAGTGGAAATAAAGAAGGGAGAAGAATGA
Upstream 100 bases:
>100_bases TTGAGTAAATGAAGAAACTAAAGTGAAACTTCTTTTTTAAAGGAGGTGTCACTTTAGCTAAGAAGGCTAAAGGATATATG TGGATATTTTAAAATTACTG
Downstream 100 bases:
>100_bases TTTTCTATAAAAGGTATATATAGAAAATTAAATGTTTGCGCTTTCAAAAAAGGTGCTATATAGTGAAGGTGGACACTGAA AAATTCTTCTGAATGATACT
Product: CBS domain protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 425; Mature: 425
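The residue count is consistent with the gene coordinates if the coordinates are inclusive and the last codon is an untranslated stop codon. The sketch below checks that arithmetic; both helper names are illustrative assumptions.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive start and end coordinates."""
    return end - start + 1


def residue_count(length_nt: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if length_nt % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_nt // 3 - 1


# Start and End are taken from this record.
length_nt = gene_length(669298, 670575)
print(length_nt, residue_count(length_nt))  # prints 1278 425
```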
Protein sequence:
>425_residues MVVILIALTGFFVAVEFAIIKVRSSRIDQLVSEKRRGALAAKKVTSNLDEYLSACQLGITITALGLGWLGEPTIKHLLEP LFLKLHLSPAISSTVSFIIAFAVITFLHVVIGELAPKTFAIQRAEQVSLLLSKPLIYFYRIMYPFIWALNGSARLVTGLF GLHPASEHEVAHSEEELRLILSESYESGEINQREFKYVNNIFEFDNRVAKEIMVPRTEVVGLYEDEPFETHIKIIAQEKY TRYPVFGEDKDEIIGMVNVKDLFIRYMDGNRDEDCSIMPYTRPVIEVLENIPIHDLLLQMQRKHIPLAVLYDEYGGTAGI VTLEDILEEIVGEIRDEYDEDEHPPIEHISEGCKIVEGKVLISEVNDLFGIHLIADDVDTIGGWIMVQKQIVAEGDIIEK HGFSFKVLEKDMHQIKRVEIKKGEE
Sequences:
>Translated_425_residues MVVILIALTGFFVAVEFAIIKVRSSRIDQLVSEKRRGALAAKKVTSNLDEYLSACQLGITITALGLGWLGEPTIKHLLEP LFLKLHLSPAISSTVSFIIAFAVITFLHVVIGELAPKTFAIQRAEQVSLLLSKPLIYFYRIMYPFIWALNGSARLVTGLF GLHPASEHEVAHSEEELRLILSESYESGEINQREFKYVNNIFEFDNRVAKEIMVPRTEVVGLYEDEPFETHIKIIAQEKY TRYPVFGEDKDEIIGMVNVKDLFIRYMDGNRDEDCSIMPYTRPVIEVLENIPIHDLLLQMQRKHIPLAVLYDEYGGTAGI VTLEDILEEIVGEIRDEYDEDEHPPIEHISEGCKIVEGKVLISEVNDLFGIHLIADDVDTIGGWIMVQKQIVAEGDIIEK HGFSFKVLEKDMHQIKRVEIKKGEE
>Mature_425_residues MVVILIALTGFFVAVEFAIIKVRSSRIDQLVSEKRRGALAAKKVTSNLDEYLSACQLGITITALGLGWLGEPTIKHLLEP LFLKLHLSPAISSTVSFIIAFAVITFLHVVIGELAPKTFAIQRAEQVSLLLSKPLIYFYRIMYPFIWALNGSARLVTGLF GLHPASEHEVAHSEEELRLILSESYESGEINQREFKYVNNIFEFDNRVAKEIMVPRTEVVGLYEDEPFETHIKIIAQEKY TRYPVFGEDKDEIIGMVNVKDLFIRYMDGNRDEDCSIMPYTRPVIEVLENIPIHDLLLQMQRKHIPLAVLYDEYGGTAGI VTLEDILEEIVGEIRDEYDEDEHPPIEHISEGCKIVEGKVLISEVNDLFGIHLIADDVDTIGGWIMVQKQIVAEGDIIEK HGFSFKVLEKDMHQIKRVEIKKGEE
Specific function: Unknown
COG id: COG1253
COG function: function code R; Hemolysins and related proteins containing CBS domains
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 CBS domains [H]
Homologues:
- Organism=Homo sapiens, GI310128564, Length=387, Percent_Identity=49.6124031007752, Blast_Score=381, Evalue=1e-106
- Organism=Escherichia coli, GI1790664, Length=427, Percent_Identity=23.1850117096019, Blast_Score=139, Evalue=3e-34
- Organism=Escherichia coli, GI145693175, Length=416, Percent_Identity=24.0384615384615, Blast_Score=131, Evalue=9e-32
- Organism=Escherichia coli, GI1786879, Length=257, Percent_Identity=30.7392996108949, Blast_Score=111, Evalue=8e-26
- Organism=Escherichia coli, GI87082033, Length=242, Percent_Identity=24.3801652892562, Blast_Score=72, Evalue=7e-14
- Organism=Escherichia coli, GI1788119, Length=238, Percent_Identity=25.2100840336134, Blast_Score=65, Evalue=6e-12
- Organism=Caenorhabditis elegans, GI17539402, Length=243, Percent_Identity=25.9259259259259, Blast_Score=65, Evalue=5e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016169
- InterPro: IPR000644
- InterPro: IPR002550
- InterPro: IPR005170 [H]
Pfam domain/function: PF00571 CBS; PF03471 CorC_HlyC; PF01595 DUF21 [H]
EC number: NA
Molecular weight: Translated: 48252; Mature: 48252
Theoretical pI: Translated: 4.70; Mature: 4.70
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.7 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
2.8 %Cys+Met (Mature Protein)
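These percentages read as residue counts divided by the protein length. A minimal sketch under that assumption follows; `residue_percent` is a hypothetical helper, and the input is only the first 60 residues of the protein in this record, used as a short demonstration.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    # Drop whitespace so space-separated sequence blocks can be pasted in.
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("protein sequence is empty")
    hits = sum(1 for aa in protein if aa in residues.upper())
    return round(100.0 * hits / len(protein), 1)


# First 60 residues of the protein (one Cys and one Met in this fragment).
fragment = "MVVILIALTGFFVAVEFAIIKVRSSRIDQLVSEKRRGALAAKKVTSNLDEYLSACQLGIT"
print(residue_percent(fragment, "C"))   # prints 1.7
print(residue_percent(fragment, "M"))   # prints 1.7
print(residue_percent(fragment, "CM"))  # prints 3.3
```

For the full 425-residue sequence, the listed values correspond to 3 Cys and 9 Met: 3/425 gives 0.7 %, 9/425 gives 2.1 %, and 12/425 gives 2.8 %.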
Secondary structure:
>Translated Secondary Structure
MVVILIALTGFFVAVEFAIIKVRSSRIDQLVSEKRRGALAAKKVTSNLDEYLSACQLGIT
CEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCE
ITALGLGWLGEPTIKHLLEPLFLKLHLSPAISSTVSFIIAFAVITFLHVVIGELAPKTFA
EEEECCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH
IQRAEQVSLLLSKPLIYFYRIMYPFIWALNGSARLVTGLFGLHPASEHEVAHSEEELRLI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCHHCCHHHHHHH
LSESYESGEINQREFKYVNNIFEFDNRVAKEIMVPRTEVVGLYEDEPFETHIKIIAQEKY
HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHEEEEECCCCHHHHHHEEECCCC
TRYPVFGEDKDEIIGMVNVKDLFIRYMDGNRDEDCSIMPYTRPVIEVLENIPIHDLLLQM
CCCCCCCCCHHHEEEEEEHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHH
QRKHIPLAVLYDEYGGTAGIVTLEDILEEIVGEIRDEYDEDEHPPIEHISEGCKIVEGKV
HHHCCCEEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCEEEECHH
LISEVNDLFGIHLIADDVDTIGGWIMVQKQIVAEGDIIEKHGFSFKVLEKDMHQIKRVEI
HHHHHHHHHEEEEEECCHHHHCCHHEEHHHHHCCCCEEHHCCCCHHHHHHHHHHHHHHHH
KKGEE
CCCCC
>Mature Secondary Structure
MVVILIALTGFFVAVEFAIIKVRSSRIDQLVSEKRRGALAAKKVTSNLDEYLSACQLGIT
CEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCE
ITALGLGWLGEPTIKHLLEPLFLKLHLSPAISSTVSFIIAFAVITFLHVVIGELAPKTFA
EEEECCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH
IQRAEQVSLLLSKPLIYFYRIMYPFIWALNGSARLVTGLFGLHPASEHEVAHSEEELRLI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCHHCCHHHHHHH
LSESYESGEINQREFKYVNNIFEFDNRVAKEIMVPRTEVVGLYEDEPFETHIKIIAQEKY
HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHEEEEECCCCHHHHHHEEECCCC
TRYPVFGEDKDEIIGMVNVKDLFIRYMDGNRDEDCSIMPYTRPVIEVLENIPIHDLLLQM
CCCCCCCCCHHHEEEEEEHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHH
QRKHIPLAVLYDEYGGTAGIVTLEDILEEIVGEIRDEYDEDEHPPIEHISEGCKIVEGKV
HHHCCCEEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCEEEECHH
LISEVNDLFGIHLIADDVDTIGGWIMVQKQIVAEGDIIEKHGFSFKVLEKDMHQIKRVEI
HHHHHHHHHEEEEEECCHHHHCCHHEEHHHHHCCCCEEHHCCCCHHHHHHHHHHHHHHHH
KKGEE
CCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9308178; 9384377; 8969508; 7608059 [H]