Definition | Bacillus anthracis str. Sterne chromosome, complete genome. |
---|---|
Accession | NC_005945 |
Length | 5,228,663 |
The map label for this gene is ykvZ [H]
Identifier: 49183241
GI number: 49183241
Start: 207750
End: 208727
Strand: Direct
Name: ykvZ [H]
Synonym: BAS0208
Alternate gene names: 49183241
Gene position: 207750-208727 (Clockwise)
Preceding gene: 49183240
Following gene: 49183242
Centisome position: 3.97
GC content: 36.4
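The position fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the centisome position is the start coordinate expressed as a percentage of the chromosome length (an assumption that reproduces the value 3.97 reported here) and that coordinates are 1-based and inclusive. The function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 5_228_663        # chromosome length from the record header
START, END = 207_750, 208_727    # gene coordinates (assumed 1-based, inclusive)


def gene_length(start, end):
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def centisome(start, genome_length):
    """Start position as a percentage of the chromosome (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string, ignoring whitespace."""
    seq = "".join(seq.split()).upper()
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)


length = gene_length(START, END)
position = centisome(START, GENOME_LENGTH)
print(length, position)
```

Passing the full gene sequence from this record to `gc_percent` should give a value comparable to the GC content field.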
Gene sequence:
>978_bases
ATGGCTAATATTAAAGATATTGCAAAAATGGCGGGAGTTTCAGTTACGACTGTTTCGAGAGTGTTGAATGATCATCCGTA
TGTAAGTGAAGAAAAAAGGAAAGCGGTTATAGAGATAGTTGAGAAGTTGAATTACTCACAAAACGCAAATGCTGTTCATT
TATCAAAAGGAAAGACGAATATTGTTGGTGTGATTCTCCCTTACATCAATCACCCGAGCTTCGATGCAATGGTAGGGGGA
ATGATGGAGGGAGCTTTAACTCATAACTACAGGGTGCTACTTTGCCAAACGAATTATAATAAAAAAGAAGAAATGAAAAG
TTTACATATGTTAAAAACGAAACAATTGGATGGTCTTATTATTTGTTCACGTGCAAATGATTGGGAAATAATAGAACCGT
ATGCTTCTTACGGTACAATCATTGCTTGTGAAGATAATGATATTTCAAACATCTCAAGTGTATATACAAATCATTCGGCA
GCTTTCCAGTTAGGAATGAATCACCTGATTGAAAAAGGTTATAAAAAAATTGGTTATTGTACGGGAAGAAAGCTAGGACC
GAGTAGTCAAAAGCGTTTTGATGTGTATAAACAGCAATTGCAATCTATAGATGAAGAAGTGAATGAAGAATGGATTTTCA
CAGAATGTTTTACATTAGAAGATGGTGTGAGAGTCGCTCATAAGTTAAAAGGTATGCAGAATCTCCCTGAAGCGTTAATA
GTAGCAGGAGATGAAGTTGCGATTGGGGTTATGACGGAAGTTGGGAAGTTGGGTATTCAAGTTCCTGAGGACTTAGCGAT
TATTGGTTTAGATAACCAACCTATTTCGCAAGTGTTGCAACTTACAACCATTGATCAAAATTTGAAGGAGATAGGGAAAA
CAGCTTTTGAAATGTTTTACCGGCATATAAGTGACAAGAGCTCTAAACAAGAAAAGGTGGAAATTCCATATGAACTTGTG
GAGCGATCTACAGTGTAA
Upstream 100 bases:
>100_bases
GAAAATTACACTTTTGTAAGAAAAACGAATGAAGATGTAATGATGAGAAGGTTTGACATATGTTCATGGAATTTATTAAT
TTAAAAGTAGAGGTGAGTGT
Downstream 100 bases:
>100_bases
TTTTAATCCGTTATGTGTATAAGTACATAACGGATTATTTTTTTGAAATATCTTTGACAGGAAACGCGTTTCATACTTTA
TAAAGGACTTAAACCGGTTT
Product: LacI family sugar-binding transcriptional regulator
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 325; Mature: 324
Protein sequence:
>325_residues
MANIKDIAKMAGVSVTTVSRVLNDHPYVSEEKRKAVIEIVEKLNYSQNANAVHLSKGKTNIVGVILPYINHPSFDAMVGG
MMEGALTHNYRVLLCQTNYNKKEEMKSLHMLKTKQLDGLIICSRANDWEIIEPYASYGTIIACEDNDISNISSVYTNHSA
AFQLGMNHLIEKGYKKIGYCTGRKLGPSSQKRFDVYKQQLQSIDEEVNEEWIFTECFTLEDGVRVAHKLKGMQNLPEALI
VAGDEVAIGVMTEVGKLGIQVPEDLAIIGLDNQPISQVLQLTTIDQNLKEIGKTAFEMFYRHISDKSSKQEKVEIPYELV
ERSTV
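The protein sequence is the conceptual translation of the 978-base gene sequence: 326 codons, of which the last (TAA) is a stop codon, giving 325 residues. The sketch below illustrates this with the standard codon assignments, which the bacterial genetic code (NCBI translation table 11) shares. The helper name `translate` is illustrative, and only the first and last few codons of the gene are used as examples.

```python
from itertools import product

# Standard codon table built in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGCTAATATTAAAGATATT"   # start of the gene sequence above
last_codons = "GAGCGATCTACAGTGTAA"       # end of the gene sequence above
print(translate(first_codons), translate(last_codons))
```

The two fragments translate to the N-terminal and C-terminal residues of the protein sequence shown in this record.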
Sequences:
>Translated_325_residues
MANIKDIAKMAGVSVTTVSRVLNDHPYVSEEKRKAVIEIVEKLNYSQNANAVHLSKGKTNIVGVILPYINHPSFDAMVGG
MMEGALTHNYRVLLCQTNYNKKEEMKSLHMLKTKQLDGLIICSRANDWEIIEPYASYGTIIACEDNDISNISSVYTNHSA
AFQLGMNHLIEKGYKKIGYCTGRKLGPSSQKRFDVYKQQLQSIDEEVNEEWIFTECFTLEDGVRVAHKLKGMQNLPEALI
VAGDEVAIGVMTEVGKLGIQVPEDLAIIGLDNQPISQVLQLTTIDQNLKEIGKTAFEMFYRHISDKSSKQEKVEIPYELV
ERSTV
>Mature_324_residues
ANIKDIAKMAGVSVTTVSRVLNDHPYVSEEKRKAVIEIVEKLNYSQNANAVHLSKGKTNIVGVILPYINHPSFDAMVGGM
MEGALTHNYRVLLCQTNYNKKEEMKSLHMLKTKQLDGLIICSRANDWEIIEPYASYGTIIACEDNDISNISSVYTNHSAA
FQLGMNHLIEKGYKKIGYCTGRKLGPSSQKRFDVYKQQLQSIDEEVNEEWIFTECFTLEDGVRVAHKLKGMQNLPEALIV
AGDEVAIGVMTEVGKLGIQVPEDLAIIGLDNQPISQVLQLTTIDQNLKEIGKTAFEMFYRHISDKSSKQEKVEIPYELVE
RSTV
Specific function: Repressor that binds to the purF operator and coregulates other genes for de novo purine nucleotide synthesis. It is involved in regulation of purB, purC, purEK, purHD, purL, purMN and guaBA expression. Binds hypoxanthine and guanine as inducers. [C]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lacI-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1787948, Length=333, Percent_Identity=28.2282282282282, Blast_Score=138, Evalue=4e-34
Organism=Escherichia coli, GI1790194, Length=333, Percent_Identity=27.027027027027, Blast_Score=137, Evalue=1e-33
Organism=Escherichia coli, GI1790369, Length=300, Percent_Identity=31.3333333333333, Blast_Score=130, Evalue=1e-31
Organism=Escherichia coli, GI1787580, Length=320, Percent_Identity=28.75, Blast_Score=112, Evalue=4e-26
Organism=Escherichia coli, GI1788474, Length=314, Percent_Identity=27.7070063694268, Blast_Score=106, Evalue=2e-24
Organism=Escherichia coli, GI1789202, Length=294, Percent_Identity=27.891156462585, Blast_Score=103, Evalue=2e-23
Organism=Escherichia coli, GI1786540, Length=334, Percent_Identity=26.0479041916168, Blast_Score=102, Evalue=5e-23
Organism=Escherichia coli, GI1789068, Length=281, Percent_Identity=27.7580071174377, Blast_Score=94, Evalue=2e-20
Organism=Escherichia coli, GI48994940, Length=322, Percent_Identity=22.6708074534161, Blast_Score=87, Evalue=1e-18
Organism=Escherichia coli, GI1790715, Length=339, Percent_Identity=22.1238938053097, Blast_Score=83, Evalue=3e-17
Organism=Escherichia coli, GI1787906, Length=217, Percent_Identity=23.963133640553, Blast_Score=80, Evalue=1e-16
Organism=Escherichia coli, GI1790689, Length=338, Percent_Identity=21.8934911242604, Blast_Score=70, Evalue=2e-13
Organism=Escherichia coli, GI1786268, Length=304, Percent_Identity=22.0394736842105, Blast_Score=67, Evalue=2e-12
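The homologue entries follow a regular `key=value` layout, so they can be loaded into structured records for sorting or filtering. The sketch below is one possible way to do that with a regular expression. The pattern and the `parse_hits` helper are assumptions based only on the layout visible in this record, and the sample string holds the first and last hits listed above.

```python
import re

# Pattern inferred from the layout of the homologue entries in this record.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*(?P<gi>GI\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text):
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": m.group("organism").strip(),
            "gi": m.group("gi"),
            "length": int(m.group("length")),
            "identity": round(float(m.group("identity")), 1),  # trim repeating decimals
            "score": int(m.group("score")),
            "evalue": float(m.group("evalue")),
        })
    return hits


sample = (
    "Organism=Escherichia coli, GI1787948, Length=333, "
    "Percent_Identity=28.2282282282282, Blast_Score=138, Evalue=4e-34, "
    "Organism=Escherichia coli, GI1786268, Length=304, "
    "Percent_Identity=22.0394736842105, Blast_Score=67, Evalue=2e-12,"
)
hits = parse_hits(sample)
for hit in sorted(hits, key=lambda h: h["evalue"]):
    print(hit["gi"], hit["identity"], hit["score"], hit["evalue"])
```

Rounding the identity to one decimal place is a presentation choice. The unrounded value remains available from the matched text if needed.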
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000843
- InterPro: IPR010982
- InterPro: IPR001761 [H]
Pfam domain/function: PF00356 LacI; PF00532 Peripla_BP_1 [H]
EC number: NA
Molecular weight: Translated: 36446; Mature: 36315
Theoretical pI: Translated: 6.06; Mature: 6.06
Prosite motif: PS00356 HTH_LACI_1 ; PS50932 HTH_LACI_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
3.4 %Met (Translated Protein)
4.9 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
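These percentages are residue counts divided by chain length. Counting by hand in the protein sequence of this record gives 5 Cys and 11 Met in the 325-residue translated chain. The mature chain lacks the initiator Met, leaving 10 Met in 324 residues. The sketch below reproduces the reported figures from those counts. The helper names are illustrative, and the counts are stated as assumptions in the code.

```python
def percent(count, length):
    """Residue count as a percentage of chain length, to one decimal place."""
    return round(100.0 * count / length, 1)


def cys_met_content(protein):
    """Return (%Cys, %Met, %Cys+Met) for a one-letter protein string."""
    protein = "".join(protein.split()).upper()
    cys = protein.count("C")
    met = protein.count("M")
    n = len(protein)
    return percent(cys, n), percent(met, n), percent(cys + met, n)


# Counts taken by hand from the sequence in this record (assumption).
CYS, MET_TRANSLATED, MET_MATURE = 5, 11, 10

translated = (
    percent(CYS, 325),
    percent(MET_TRANSLATED, 325),
    percent(CYS + MET_TRANSLATED, 325),
)
mature = (
    percent(CYS, 324),
    percent(MET_MATURE, 324),
    percent(CYS + MET_MATURE, 324),
)
print(translated, mature)
```

`cys_met_content` can be applied directly to the full translated or mature sequence to recompute the same values from the sequence itself.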
Secondary structure:
>Translated Secondary Structure
MANIKDIAKMAGVSVTTVSRVLNDHPYVSEEKRKAVIEIVEKLNYSQNANAVHLSKGKTN
CCCHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCC
IVGVILPYINHPSFDAMVGGMMEGALTHNYRVLLCQTNYNKKEEMKSLHMLKTKQLDGLI
EEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCEE
ICSRANDWEIIEPYASYGTIIACEDNDISNISSVYTNHSAAFQLGMNHLIEKGYKKIGYC
EEECCCCCCEECCHHCCCCEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCC
TGRKLGPSSQKRFDVYKQQLQSIDEEVNEEWIFTECFTLEDGVRVAHKLKGMQNLPEALI
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCHHHHHHHHHHHHHCCCHHEE
VAGDEVAIGVMTEVGKLGIQVPEDLAIIGLDNQPISQVLQLTTIDQNLKEIGKTAFEMFY
EECCCEEEHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
RHISDKSSKQEKVEIPYELVERSTV
HHHCCCCCCCHHCCCCHHHHCCCCC
>Mature Secondary Structure
ANIKDIAKMAGVSVTTVSRVLNDHPYVSEEKRKAVIEIVEKLNYSQNANAVHLSKGKTN
CCHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCC
IVGVILPYINHPSFDAMVGGMMEGALTHNYRVLLCQTNYNKKEEMKSLHMLKTKQLDGLI
EEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCEE
ICSRANDWEIIEPYASYGTIIACEDNDISNISSVYTNHSAAFQLGMNHLIEKGYKKIGYC
EEECCCCCCEECCHHCCCCEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCC
TGRKLGPSSQKRFDVYKQQLQSIDEEVNEEWIFTECFTLEDGVRVAHKLKGMQNLPEALI
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCHHHHHHHHHHHHHCCCHHEE
VAGDEVAIGVMTEVGKLGIQVPEDLAIIGLDNQPISQVLQLTTIDQNLKEIGKTAFEMFY
EECCCEEEHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
RHISDKSSKQEKVEIPYELVERSTV
HHHCCCCCCCHHCCCCHHHHCCCCC
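In the predicted secondary structure, each residue is paired with one state character. Reading H as helix, E as strand and C as coil is the usual three-state convention and is assumed here, since the record does not define the letters. The sketch below tallies the fraction of each state in a structure string. The `state_fractions` helper is illustrative, and the example string is the first structure block of the translated protein.

```python
from collections import Counter


def state_fractions(structure):
    """Fraction of each secondary-structure state in a string of H/E/C characters."""
    structure = "".join(structure.split()).upper()
    counts = Counter(structure)
    total = len(structure)
    return {state: counts.get(state, 0) / total for state in "HEC"}


# First structure block of the translated protein, as shown in this record.
first_block = "CCCHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCC"
fractions = state_fractions(first_block)
for state, value in fractions.items():
    print(state, round(value, 2))
```

Concatenating all structure blocks before calling the function gives the composition for the whole chain, which can be compared with the Alpha Beta structure class reported in this record.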
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9384377 [H]