Definition | Sulfolobus tokodaii str. 7 chromosome, complete genome. |
---|---|
Accession | NC_003106 |
Length | 2,694,756 |
Click here to switch to the map view.
The map label for this gene is 15921529
Identifier: 15921529
GI number: 15921529
Start: 1265411
End: 1266640
Strand: Direct
Name: 15921529
Synonym: ST1265
Alternate gene names: NA
Gene position: 1265411-1266640 (Clockwise)
Preceding gene: 15921526
Following gene: 15921530
Centisome position: 46.96
GC content: 33.98
Gene sequence:
>1230_bases ATGAAAGTATATATCGTGGAACATGCGATAGGGTCGTTTGCATATGATGAGCAAGGTAAGCTTATAGACTTTGTATTGAG TAGTAAGGACTTAGGAAAAGTTGTAGATTCTTTACTAGATAATGAAAAAGGTATTCCATTGCCAACTACCATAGAATTAA TTCAGAAGATAAAACCAGAAGAAGTAGTAGTGGAGAATGAAGCAGAAATACCAAATTTGCAACAGTTAGGTGTTAAAGCT TCTTATGAAATACATAATCTTGGAAGTAAAATATTTAGAGAATCATTACCCAAGATAGCAATAGAAACAAAGTTTGCATC CTCAGAGAACGATTTATACTCATTTTTATATGAAGTTTCTTTTGAATATACTAGAAGAAAATTAAGAACTGCAGCCAGTA AAAGAGACTTACTAGCAATCCAGGCTATTAGGGCAATTGATGATATTGATAAAACTATAAATCTATTTTCGGAAAGATTA AGAGAATGGTATAGTATTCATTTCCCAGAACTTAATAAGCTTGTAGAAGATCATGAACTTTACGCTTCTATTGTTTCAAA ATTTGGACATAGGGATGAAATAACAAATACGGGGTTAGATGAAATAGGAGTGAATAAAGATCTGAGCACTAAAATTTTAG ATGCATCGAAAAAGAGTATTGGAGCTGATATTACTGATGTAGATATAAGATCAATTAAGATGCTAAGCGATACCATATTA GAGCTTTTCAGAATAAGATCAGAACTTACAGATTATGTTGAATCAGTTATGAAAGAAGTAGCTCCTAATGTTACTGCTTT AGTAGGACCAACACTAGGTGCGCGATTATTAAGTTTAGCTGGTAGCCTAGAAGATTTAGCTAAAATGCCTGCTAGTACAA TTCAAGTCTTAGGTGCAGAGAAAGCTCTGTTTAGAGCTTTAAGAAAAGGAGGCAAACCGCCAAAACACGGTGTTATATTT CAATATCCAGCAATTCATACTTCTCCAAGGTGGCAAAGAGGTAAAATTGCAAGAGCATTAGCTGCTAAATTAGCAATAGC TGCCAGAATAGACGCTTTTAGTGGCAGATTTATAGGTGATAAATTGAATGAGGAGTTAAAGAAAAGGATTGAGGAGATTA AGACAAAATACGCTCAACCTCCACCAAGGAAGCCACAAGAACAAAAGAGAAAAGAAGAAGAAAGAAAAGGTAAAAAAGGA GGAAGAGAAAAAAGAAAAGGTAGGAGATGA
Upstream 100 bases:
>100_bases CTTTTCCAGCTTTTGTGAGCGAACCGTGGGATGGCATACTAAATTATATGTGATGAAGAAAAATTTAAACTTAAATGCTA ATTAGTTGTTTTGAGTACTG
Downstream 100 bases:
>100_bases AATTTATGTCAGAATTAGTTAAGATTTCAAAAACACAGTTTGAAAATGTGTTTCAATGTGAATTTAATGATGGCACAGTA AGGCTTTGTACTAAAAACTT
Product: C/D box methylation guide ribonucleoprotein complex aNOP56 subunit
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 409; Mature: 409
Protein sequence:
>409_residues MKVYIVEHAIGSFAYDEQGKLIDFVLSSKDLGKVVDSLLDNEKGIPLPTTIELIQKIKPEEVVVENEAEIPNLQQLGVKA SYEIHNLGSKIFRESLPKIAIETKFASSENDLYSFLYEVSFEYTRRKLRTAASKRDLLAIQAIRAIDDIDKTINLFSERL REWYSIHFPELNKLVEDHELYASIVSKFGHRDEITNTGLDEIGVNKDLSTKILDASKKSIGADITDVDIRSIKMLSDTIL ELFRIRSELTDYVESVMKEVAPNVTALVGPTLGARLLSLAGSLEDLAKMPASTIQVLGAEKALFRALRKGGKPPKHGVIF QYPAIHTSPRWQRGKIARALAAKLAIAARIDAFSGRFIGDKLNEELKKRIEEIKTKYAQPPPRKPQEQKRKEEERKGKKG GREKRKGRR
Sequences:
>Translated_409_residues MKVYIVEHAIGSFAYDEQGKLIDFVLSSKDLGKVVDSLLDNEKGIPLPTTIELIQKIKPEEVVVENEAEIPNLQQLGVKA SYEIHNLGSKIFRESLPKIAIETKFASSENDLYSFLYEVSFEYTRRKLRTAASKRDLLAIQAIRAIDDIDKTINLFSERL REWYSIHFPELNKLVEDHELYASIVSKFGHRDEITNTGLDEIGVNKDLSTKILDASKKSIGADITDVDIRSIKMLSDTIL ELFRIRSELTDYVESVMKEVAPNVTALVGPTLGARLLSLAGSLEDLAKMPASTIQVLGAEKALFRALRKGGKPPKHGVIF QYPAIHTSPRWQRGKIARALAAKLAIAARIDAFSGRFIGDKLNEELKKRIEEIKTKYAQPPPRKPQEQKRKEEERKGKKG GREKRKGRR >Mature_409_residues MKVYIVEHAIGSFAYDEQGKLIDFVLSSKDLGKVVDSLLDNEKGIPLPTTIELIQKIKPEEVVVENEAEIPNLQQLGVKA SYEIHNLGSKIFRESLPKIAIETKFASSENDLYSFLYEVSFEYTRRKLRTAASKRDLLAIQAIRAIDDIDKTINLFSERL REWYSIHFPELNKLVEDHELYASIVSKFGHRDEITNTGLDEIGVNKDLSTKILDASKKSIGADITDVDIRSIKMLSDTIL ELFRIRSELTDYVESVMKEVAPNVTALVGPTLGARLLSLAGSLEDLAKMPASTIQVLGAEKALFRALRKGGKPPKHGVIF QYPAIHTSPRWQRGKIARALAAKLAIAARIDAFSGRFIGDKLNEELKKRIEEIKTKYAQPPPRKPQEQKRKEEERKGKKG GREKRKGRR
Specific function: Unknown
COG id: COG1498
COG function: function code J; Protein implicated in ribosomal biogenesis, Nop56p homolog
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 1 Nop domain [H]
Homologues:
Organism=Homo sapiens, GI32483374, Length=254, Percent_Identity=42.5196850393701, Blast_Score=211, Evalue=1e-54, Organism=Homo sapiens, GI7706254, Length=244, Percent_Identity=39.344262295082, Blast_Score=188, Evalue=7e-48, Organism=Homo sapiens, GI221136939, Length=239, Percent_Identity=31.7991631799163, Blast_Score=114, Evalue=2e-25, Organism=Caenorhabditis elegans, GI17509449, Length=284, Percent_Identity=40.4929577464789, Blast_Score=218, Evalue=6e-57, Organism=Caenorhabditis elegans, GI17562296, Length=258, Percent_Identity=38.3720930232558, Blast_Score=195, Evalue=5e-50, Organism=Caenorhabditis elegans, GI17510923, Length=263, Percent_Identity=32.6996197718631, Blast_Score=108, Evalue=6e-24, Organism=Saccharomyces cerevisiae, GI6324886, Length=367, Percent_Identity=35.6948228882834, Blast_Score=206, Evalue=5e-54, Organism=Saccharomyces cerevisiae, GI6323226, Length=260, Percent_Identity=38.0769230769231, Blast_Score=190, Evalue=4e-49, Organism=Saccharomyces cerevisiae, GI6321528, Length=276, Percent_Identity=27.536231884058, Blast_Score=82, Evalue=1e-16, Organism=Drosophila melanogaster, GI28572126, Length=283, Percent_Identity=38.86925795053, Blast_Score=197, Evalue=9e-51, Organism=Drosophila melanogaster, GI17137636, Length=230, Percent_Identity=41.304347826087, Blast_Score=194, Evalue=9e-50, Organism=Drosophila melanogaster, GI21357435, Length=275, Percent_Identity=30.5454545454545, Blast_Score=113, Evalue=3e-25,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012976 - InterPro: IPR002687 [H]
Pfam domain/function: PF01798 Nop; PF08060 NOSIC [H]
EC number: NA
Molecular weight: Translated: 46121; Mature: 46121
Theoretical pI: Translated: 9.90; Mature: 9.90
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 1.0 %Met (Translated Protein) 1.0 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 1.0 %Met (Mature Protein) 1.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKVYIVEHAIGSFAYDEQGKLIDFVLSSKDLGKVVDSLLDNEKGIPLPTTIELIQKIKPE CEEEEEEHHHCCCCCCCCCCEEHHHHCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCC EVVVENEAEIPNLQQLGVKASYEIHNLGSKIFRESLPKIAIETKFASSENDLYSFLYEVS CEEECCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHH FEYTRRKLRTAASKRDLLAIQAIRAIDDIDKTINLFSERLREWYSIHFPELNKLVEDHEL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH YASIVSKFGHRDEITNTGLDEIGVNKDLSTKILDASKKSIGADITDVDIRSIKMLSDTIL HHHHHHHCCCCHHHCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH ELFRIRSELTDYVESVMKEVAPNVTALVGPTLGARLLSLAGSLEDLAKMPASTIQVLGAE HHHHHHHHHHHHHHHHHHHHCCCCEEHHCCHHHHHHHHHHCCHHHHHHCCHHHHHHHHHH KALFRALRKGGKPPKHGVIFQYPAIHTSPRWQRGKIARALAAKLAIAARIDAFSGRFIGD HHHHHHHHCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHH KLNEELKKRIEEIKTKYAQPPPRKPQEQKRKEEERKGKKGGREKRKGRR HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHCCCC >Mature Secondary Structure MKVYIVEHAIGSFAYDEQGKLIDFVLSSKDLGKVVDSLLDNEKGIPLPTTIELIQKIKPE CEEEEEEHHHCCCCCCCCCCEEHHHHCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCC EVVVENEAEIPNLQQLGVKASYEIHNLGSKIFRESLPKIAIETKFASSENDLYSFLYEVS CEEECCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHH FEYTRRKLRTAASKRDLLAIQAIRAIDDIDKTINLFSERLREWYSIHFPELNKLVEDHEL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH YASIVSKFGHRDEITNTGLDEIGVNKDLSTKILDASKKSIGADITDVDIRSIKMLSDTIL HHHHHHHCCCCHHHCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH ELFRIRSELTDYVESVMKEVAPNVTALVGPTLGARLLSLAGSLEDLAKMPASTIQVLGAE HHHHHHHHHHHHHHHHHHHHCCCCEEHHCCHHHHHHHHHHCCHHHHHHCCHHHHHHHHHH KALFRALRKGGKPPKHGVIFQYPAIHTSPRWQRGKIARALAAKLAIAARIDAFSGRFIGD HHHHHHHHCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHH KLNEELKKRIEEIKTKYAQPPPRKPQEQKRKEEERKGKKGGREKRKGRR HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8688087 [H]