Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

The map label for this gene is yqiR [H]

Identifier: 49187070

GI number: 49187070

Start: 4004042

End: 4006114

Strand: Reverse

Name: yqiR [H]

Synonym: BAS4072

Alternate gene names: 49187070

Gene position: 4006114-4004042 (Counterclockwise)
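For a reverse-strand feature such as this one, the gene sequence corresponds to the reverse complement of the chromosome interval between Start and End. A minimal sketch in Python, assuming 1-based inclusive coordinates (consistent with End - Start + 1 = 2073 bases); the helper name `extract_gene` and the toy chromosome string are hypothetical and are not part of the record:

```python
# Complement table for unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(chromosome: str, start: int, end: int, strand: str) -> str:
    """Slice a feature using 1-based inclusive coordinates (assumed convention).

    `strand` is "Forward" or "Reverse", matching the Strand field of the record.
    """
    interval = chromosome[start - 1:end]
    return reverse_complement(interval) if strand == "Reverse" else interval


# Toy example (not real genome data): positions 3..8 of a 10-base string.
toy = "AACATTTGGG"
gene = extract_gene(toy, 3, 8, "Reverse")
```

With the real chromosome (NC_005945) loaded as a string, the call would be `extract_gene(chromosome, 4004042, 4006114, "Reverse")`.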

Preceding gene: 49187072

Following gene: 49187069

Centisome position: 76.62
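The gene length and centisome position can be checked from the coordinates above. The sketch below assumes the centisome value is the gene's 5' coordinate (4006114 on the reverse strand) expressed as a percentage of the chromosome length; that convention is an inference from the reported numbers, not something the record states.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 5_228_663            # Length field of the record
START, END = 4_004_042, 4_006_114    # Start and End fields of the record

length = gene_length(START, END)
# Assumption: the reverse-strand gene begins at END, so END is used here.
position_pct = centisome(END, GENOME_LENGTH)
```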

GC content: 36.81

Gene sequence:

>2073_bases
ATGAAACAAAAAGTTTTAATTGTTGGTGCAGGTGAAGGTGGCAGTACACTGCTGAATCTGTTGCAAAGTTCGAATATATT
TCAAATTATAGGGACTATTGATATAAATCCGGTAGCAAAAGGATTGCAAATGGCTAAGGAATATGGAATTACGATTGGAG
AGAGTGTAACGCCGTTTCTTTCTATGCATATTGATGTAATGTTTGATATGACAGGTGATGATAATTTACATAAAGAGTTA
CTAAAGAAAAAGCATAAAGATACTCTTCTTATACCGGGTGATATTGCAAAAATTGTTACGAGATTAGCGCATGAAAAGGA
AGATTTAATTGGAAGGTTAGAAGAACAGACGCAGCAAGGGGATTTAATTTTAAATTCTACGCATGACGGTATGATTGTCA
TTGATCGAGAGGGACAAGTTCGTCTATTTAATAAAAGTGCAGAGCGTATTATCGGGTATAAAAAAGAAGATGCGATAGGG
AAATATATTTTAGAAGTTATTCCGACTAGTAAGTTGCTTCGTATTATACGTACGAAACAAATAGAAGTGAATTATGAACT
GACGCTGGAGAATGAAAAGAAAATTATTACAACCCGTATTCCGATATTAAAAGAGGGAGGAGAGGTACAAGGGGCATTTG
CAATTTTTAAAGATATAACAGAAGTTGTAGATCTTGCGGAAGAAGTTACAGATTTAAAGGAGATTCAAACGTTACTTGAG
GCAATTATTAACTCGTCTGAGGAAGCGATTTCGGTCGTGGATGAAAAAGGAAGAGGATTAGTAATTAACCCTGCGTATAC
GAAATTAACAGGCTTAACAGAAGAGGACATTATTGGGAAGCCGGCTACAACTGATATTGTAGAAGGTGAAAGTATGCATA
TGAAAGTACTTCGAACACGTAGAGCGGTACGAGGTATACATATGAAAATTGGACAAAAAAAGCGAGATGTAATTGTAAAC
GTAGCACCAGTCATTGTGGATGGAATATTGAAAGGGAGCGTCGGTGTAATTCGCGACGTATCAGAAATTCAAAAATTAAC
AAATGAATTGAATAGAGCAAGGCAAATTATTCGAACGTTAGAAGCAAAATATTCATTTGATGACATTGTCGGAAATTCAG
ATGAAACAACGGCTGCTATTGAACAGGCGAAACTTGGGGCGAATACACCAGCAACAGTGTTGTTACGCGGGGAGTCTGGG
ACAGGTAAAGAATTGTTTGCGCATGCTATCCATAATGGAAGTAATCGAAAGTATAATAAGTTTGTTCGTGTAAACTGTGC
GGCTATTTCAGAGACGTTGTTAGAAAGTGAATTGTTCGGTTATGAGGAAGGTGCATTTTCTGGCGCGAAAAGAGGCGGAA
AACGTGGATTCTTTGAAGAAGCGAATAACGGCAGTATCTTTTTAGATGAAATAGGGGAACTATCTGCCAATACGCAAGCG
AAACTCCTTCGCGTTTTGCAAGAGAAAGAAATTGTAAAAGTTGGTGGAACGAAAGCAATCCCTATTAATGTTAGAGTAAT
TGCAGCGACGCACGTAAATTTAGAAAAAGCCATTTTAGAAGGAGAGTTTAGGGAGGATTTATATTATCGATTAAATAAAA
TTCCAATTCAAATTCCATCTCTTCGTCAGCGAAAAGGGGATATACCGGCAATCGCAGACAGATTAATTCAAAAAATTAAT
CAGGATTATGGTCGAAATATAGAGGGGCTCACCGATTCGGCTATTTCATATTTACAATCATATGAATGGCCAGGGAATGT
GAGGGAACTTGAAAATATTTTAGGGAGAGCTATTATCTTTATGAATTATAACGAGACCTATATTGATGTACAACATTTAC
CGCCATTACATAACGAAGAGCAGGTGGAGTCAAAGCAAGCTCATTTATTACCTGAGTTAGAAGAAAAGCCACTTGAGCAT
TTAGTGACGGAGTTTGAAGGGAATATCATTCATGAATATTTAGAGAGGTTTGGTGGAAATAAAACAAAAACAGCCAGAGC
GTTAGGAATTTCGGTTCGAAATTTATATTACAAGCTAGAAAAATATGAGTGTGCAAAAAAAAGCATGCAATAA
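
The GC content field can be recomputed from a FASTA-style block like the one above. This is a minimal sketch with hypothetical helper names; the toy record is illustrative and is not the gene sequence.

```python
def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of a single FASTA record, skipping the header."""
    parts = []
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith(">"):
            parts.append(line)
    return "".join(parts)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence, to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy record: 4 of the 8 bases are G or C.
toy_record = ">8_bases\nATGC\nGGAT\n"
toy_gc = gc_percent(read_fasta_sequence(toy_record))
```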

Upstream 100 bases:

>100_bases
ATGATGTTATATGAAAAGGGGGTTTGTCAAGAGGGGGCTGAACGAGTAAGATAAGGGTGTGAAAAATTTTGCAGGGATAC
TTCGAGAGGAAGTGTAATAT

Downstream 100 bases:

>100_bases
ATTGCGTACCGTGCAATTTATTGCATGGTATACAAAAACTGACAAAATAAGACAGAATCTTTTGTTATTTTCCAAAGTAT
TGATTTTACTGTGTTTTGAA

Product: sensory box sigma-54 dependent DNA-binding response regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 690; Mature: 690
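
The residue count follows from the gene length: 2073 bases give 691 codons, and the last codon (TAA) is a stop, leaving 690 residues. A small sketch of that arithmetic, with a hypothetical `residue_count` helper and a toy coding sequence:

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def residue_count(cds: str) -> int:
    """Number of residues encoded by an in-frame coding sequence.

    A terminal stop codon, if present, is not counted as a residue.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = len(cds) // 3
    return codons - 1 if cds[-3:] in STOP_CODONS else codons


# Toy sequence: ATG AAA CAA TAA (three sense codons plus a stop).
toy_count = residue_count("ATGAAACAATAA")

# Same arithmetic for this record: 2073 bases, ending in a stop codon.
expected_residues = 2073 // 3 - 1
```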

Protein sequence:

>690_residues
MKQKVLIVGAGEGGSTLLNLLQSSNIFQIIGTIDINPVAKGLQMAKEYGITIGESVTPFLSMHIDVMFDMTGDDNLHKEL
LKKKHKDTLLIPGDIAKIVTRLAHEKEDLIGRLEEQTQQGDLILNSTHDGMIVIDREGQVRLFNKSAERIIGYKKEDAIG
KYILEVIPTSKLLRIIRTKQIEVNYELTLENEKKIITTRIPILKEGGEVQGAFAIFKDITEVVDLAEEVTDLKEIQTLLE
AIINSSEEAISVVDEKGRGLVINPAYTKLTGLTEEDIIGKPATTDIVEGESMHMKVLRTRRAVRGIHMKIGQKKRDVIVN
VAPVIVDGILKGSVGVIRDVSEIQKLTNELNRARQIIRTLEAKYSFDDIVGNSDETTAAIEQAKLGANTPATVLLRGESG
TGKELFAHAIHNGSNRKYNKFVRVNCAAISETLLESELFGYEEGAFSGAKRGGKRGFFEEANNGSIFLDEIGELSANTQA
KLLRVLQEKEIVKVGGTKAIPINVRVIAATHVNLEKAILEGEFREDLYYRLNKIPIQIPSLRQRKGDIPAIADRLIQKIN
QDYGRNIEGLTDSAISYLQSYEWPGNVRELENILGRAIIFMNYNETYIDVQHLPPLHNEEQVESKQAHLLPELEEKPLEH
LVTEFEGNIIHEYLERFGGNKTKTARALGISVRNLYYKLEKYECAKKSMQ

Sequences:

>Translated_690_residues
MKQKVLIVGAGEGGSTLLNLLQSSNIFQIIGTIDINPVAKGLQMAKEYGITIGESVTPFLSMHIDVMFDMTGDDNLHKEL
LKKKHKDTLLIPGDIAKIVTRLAHEKEDLIGRLEEQTQQGDLILNSTHDGMIVIDREGQVRLFNKSAERIIGYKKEDAIG
KYILEVIPTSKLLRIIRTKQIEVNYELTLENEKKIITTRIPILKEGGEVQGAFAIFKDITEVVDLAEEVTDLKEIQTLLE
AIINSSEEAISVVDEKGRGLVINPAYTKLTGLTEEDIIGKPATTDIVEGESMHMKVLRTRRAVRGIHMKIGQKKRDVIVN
VAPVIVDGILKGSVGVIRDVSEIQKLTNELNRARQIIRTLEAKYSFDDIVGNSDETTAAIEQAKLGANTPATVLLRGESG
TGKELFAHAIHNGSNRKYNKFVRVNCAAISETLLESELFGYEEGAFSGAKRGGKRGFFEEANNGSIFLDEIGELSANTQA
KLLRVLQEKEIVKVGGTKAIPINVRVIAATHVNLEKAILEGEFREDLYYRLNKIPIQIPSLRQRKGDIPAIADRLIQKIN
QDYGRNIEGLTDSAISYLQSYEWPGNVRELENILGRAIIFMNYNETYIDVQHLPPLHNEEQVESKQAHLLPELEEKPLEH
LVTEFEGNIIHEYLERFGGNKTKTARALGISVRNLYYKLEKYECAKKSMQ
>Mature_690_residues
MKQKVLIVGAGEGGSTLLNLLQSSNIFQIIGTIDINPVAKGLQMAKEYGITIGESVTPFLSMHIDVMFDMTGDDNLHKEL
LKKKHKDTLLIPGDIAKIVTRLAHEKEDLIGRLEEQTQQGDLILNSTHDGMIVIDREGQVRLFNKSAERIIGYKKEDAIG
KYILEVIPTSKLLRIIRTKQIEVNYELTLENEKKIITTRIPILKEGGEVQGAFAIFKDITEVVDLAEEVTDLKEIQTLLE
AIINSSEEAISVVDEKGRGLVINPAYTKLTGLTEEDIIGKPATTDIVEGESMHMKVLRTRRAVRGIHMKIGQKKRDVIVN
VAPVIVDGILKGSVGVIRDVSEIQKLTNELNRARQIIRTLEAKYSFDDIVGNSDETTAAIEQAKLGANTPATVLLRGESG
TGKELFAHAIHNGSNRKYNKFVRVNCAAISETLLESELFGYEEGAFSGAKRGGKRGFFEEANNGSIFLDEIGELSANTQA
KLLRVLQEKEIVKVGGTKAIPINVRVIAATHVNLEKAILEGEFREDLYYRLNKIPIQIPSLRQRKGDIPAIADRLIQKIN
QDYGRNIEGLTDSAISYLQSYEWPGNVRELENILGRAIIFMNYNETYIDVQHLPPLHNEEQVESKQAHLLPELEEKPLEH
LVTEFEGNIIHEYLERFGGNKTKTARALGISVRNLYYKLEKYECAKKSMQ

Specific function: Member of the two-component regulatory system AtoS/AtoC, involved in transcriptional regulation of the ato genes for acetoacetate metabolism. Also an inhibitor of polyamine biosynthesis. [C]

COG id: COG3829

COG function: function code KT; Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1788550, Length=347, Percent_Identity=41.4985590778098, Blast_Score=265, Evalue=9e-72,
Organism=Escherichia coli, GI1790437, Length=313, Percent_Identity=43.4504792332268, Blast_Score=255, Evalue=6e-69,
Organism=Escherichia coli, GI1786524, Length=381, Percent_Identity=37.2703412073491, Blast_Score=245, Evalue=6e-66,
Organism=Escherichia coli, GI87082117, Length=324, Percent_Identity=41.0493827160494, Blast_Score=245, Evalue=8e-66,
Organism=Escherichia coli, GI1789233, Length=475, Percent_Identity=33.4736842105263, Blast_Score=238, Evalue=9e-64,
Organism=Escherichia coli, GI1789087, Length=239, Percent_Identity=47.6987447698745, Blast_Score=231, Evalue=1e-61,
Organism=Escherichia coli, GI1788905, Length=318, Percent_Identity=39.622641509434, Blast_Score=230, Evalue=2e-61,
Organism=Escherichia coli, GI1790299, Length=306, Percent_Identity=39.8692810457516, Blast_Score=205, Evalue=7e-54,
Organism=Escherichia coli, GI87082152, Length=314, Percent_Identity=38.5350318471338, Blast_Score=203, Evalue=3e-53,
Organism=Escherichia coli, GI1787583, Length=457, Percent_Identity=31.0722100656455, Blast_Score=192, Evalue=4e-50,
Organism=Escherichia coli, GI87081872, Length=333, Percent_Identity=34.8348348348348, Blast_Score=181, Evalue=1e-46,
Organism=Escherichia coli, GI1789828, Length=240, Percent_Identity=35, Blast_Score=133, Evalue=4e-32,
Organism=Escherichia coli, GI87081858, Length=505, Percent_Identity=23.7623762376238, Blast_Score=132, Evalue=7e-32,
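
The homologue lines above use a comma-separated key=value layout with a bare GI field and a trailing comma. A minimal parsing sketch, assuming every line follows that layout; the function name and the "GI" dictionary key are hypothetical:

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is stored without a key, e.g. "GI1788550".
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1788550, Length=347, "
        "Percent_Identity=41.4985590778098, Blast_Score=265, Evalue=9e-72,")
hit = parse_homologue(line)
```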

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR002197
- InterPro:   IPR016040
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR013767
- InterPro:   IPR002078 [H]

Pfam domain/function: PF02954 HTH_8; PF00989 PAS; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 77150; Mature: 77150

Theoretical pI: Translated: 6.17; Mature: 6.17

Prosite motif: PS50112 PAS; PS00675 SIGMA54_INTERACT_1; PS00676 SIGMA54_INTERACT_2; PS00688 SIGMA54_INTERACT_3; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)
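
The percentages above are residue counts divided by protein length. A sketch of that calculation, assuming one-letter amino acid codes and rounding to one decimal as in the record; the toy sequence is illustrative only:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy protein of 20 residues: one Cys and two Met.
toy = "MKC" + "A" * 16 + "M"
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
cys_met = residue_percent(toy, "CM")
```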

Secondary structure:

>Translated Secondary Structure
MKQKVLIVGAGEGGSTLLNLLQSSNIFQIIGTIDINPVAKGLQMAKEYGITIGESVTPFL
CCCEEEEEECCCCHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCHHH
SMHIDVMFDMTGDDNLHKELLKKKHKDTLLIPGDIAKIVTRLAHEKEDLIGRLEEQTQQG
HEEEEEEEECCCCCHHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCC
DLILNSTHDGMIVIDREGQVRLFNKSAERIIGYKKEDAIGKYILEVIPTSKLLRIIRTKQ
CEEEECCCCCEEEEECCCCEEEECCCHHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHHE
IEVNYELTLENEKKIITTRIPILKEGGEVQGAFAIFKDITEVVDLAEEVTDLKEIQTLLE
EEEEEEEEECCCCEEEEEECCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AIINSSEEAISVVDEKGRGLVINPAYTKLTGLTEEDIIGKPATTDIVEGESMHMKVLRTR
HHHCCCHHHHHHHHCCCCEEEECCCHHHHCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHH
RAVRGIHMKIGQKKRDVIVNVAPVIVDGILKGSVGVIRDVSEIQKLTNELNRARQIIRTL
HHHHHHHHHCCCCCCCEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
EAKYSFDDIVGNSDETTAAIEQAKLGANTPATVLLRGESGTGKELFAHAIHNGSNRKYNK
HHCCCHHHHCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHCCCCCCCCE
FVRVNCAAISETLLESELFGYEEGAFSGAKRGGKRGFFEEANNGSIFLDEIGELSANTQA
EEEEEHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCEEEECCCCCCCCHHH
KLLRVLQEKEIVKVGGTKAIPINVRVIAATHVNLEKAILEGEFREDLYYRLNKIPIQIPS
HHHHHHHHHHHEEECCCEEEEEEEEEEEEECCCHHHHHHCCHHHHHHHHHHHCCCEECCC
LRQRKGDIPAIADRLIQKINQDYGRNIEGLTDSAISYLQSYEWPGNVRELENILGRAIIF
HHHHCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHCCEEEE
MNYNETYIDVQHLPPLHNEEQVESKQAHLLPELEEKPLEHLVTEFEGNIIHEYLERFGGN
EECCCEEEEEECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCC
KTKTARALGISVRNLYYKLEKYECAKKSMQ
CCCHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKQKVLIVGAGEGGSTLLNLLQSSNIFQIIGTIDINPVAKGLQMAKEYGITIGESVTPFL
CCCEEEEEECCCCHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCHHH
SMHIDVMFDMTGDDNLHKELLKKKHKDTLLIPGDIAKIVTRLAHEKEDLIGRLEEQTQQG
HEEEEEEEECCCCCHHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCC
DLILNSTHDGMIVIDREGQVRLFNKSAERIIGYKKEDAIGKYILEVIPTSKLLRIIRTKQ
CEEEECCCCCEEEEECCCCEEEECCCHHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHHE
IEVNYELTLENEKKIITTRIPILKEGGEVQGAFAIFKDITEVVDLAEEVTDLKEIQTLLE
EEEEEEEEECCCCEEEEEECCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AIINSSEEAISVVDEKGRGLVINPAYTKLTGLTEEDIIGKPATTDIVEGESMHMKVLRTR
HHHCCCHHHHHHHHCCCCEEEECCCHHHHCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHH
RAVRGIHMKIGQKKRDVIVNVAPVIVDGILKGSVGVIRDVSEIQKLTNELNRARQIIRTL
HHHHHHHHHCCCCCCCEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
EAKYSFDDIVGNSDETTAAIEQAKLGANTPATVLLRGESGTGKELFAHAIHNGSNRKYNK
HHCCCHHHHCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHCCCCCCCCE
FVRVNCAAISETLLESELFGYEEGAFSGAKRGGKRGFFEEANNGSIFLDEIGELSANTQA
EEEEEHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCEEEECCCCCCCCHHH
KLLRVLQEKEIVKVGGTKAIPINVRVIAATHVNLEKAILEGEFREDLYYRLNKIPIQIPS
HHHHHHHHHHHEEECCCEEEEEEEEEEEEECCCHHHHHHCCHHHHHHHHHHHCCCEECCC
LRQRKGDIPAIADRLIQKINQDYGRNIEGLTDSAISYLQSYEWPGNVRELENILGRAIIF
HHHHCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHCCEEEE
MNYNETYIDVQHLPPLHNEEQVESKQAHLLPELEEKPLEHLVTEFEGNIIHEYLERFGGN
EECCCEEEEEECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCC
KTKTARALGISVRNLYYKLEKYECAKKSMQ
CCCHHHHHHHHHHHHHHHHHHHHHHHHCCC
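
The secondary structure blocks alternate a residue line with a state line of equal length. The sketch below summarises such a block, assuming the usual three-state reading (H = helix, E = strand, C = coil), which the record does not spell out; the function name and the toy block are hypothetical.

```python
from collections import Counter


def state_fractions(block: str) -> dict:
    """Percentage of H, E and C states in an alternating residue/state block."""
    lines = []
    for raw in block.splitlines():
        line = raw.strip()
        if line and not line.startswith(">"):
            lines.append(line)
    residues = "".join(lines[0::2])   # even lines: amino acid sequence
    states = "".join(lines[1::2])     # odd lines: predicted states
    if not states or len(residues) != len(states):
        raise ValueError("residue and state lines do not line up")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


toy_block = """>Toy Secondary Structure
MKQKVLIVGA
CCCEEEEEEC
GEGGSTLLNL
CCCHHHHHHH
"""
fractions = state_fractions(toy_block)
```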

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969508; 9384377 [H]