Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is sun [H]
Identifier: 15887638
GI number: 15887638
Start: 283635
End: 284924
Strand: Reverse
Name: sun [H]
Synonym: Atu0289
Alternate gene names: 15887638
Gene position: 284924-283635 (Counterclockwise)
Preceding gene: 15887639
Following gene: 159184243
Centisome position: 10.03
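The coordinate fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes inclusive 1-based coordinates, and it assumes the centisome position is measured from the gene's 5' end, which for a reverse-strand gene is the larger coordinate (284924). The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_841_580
GENE_START = 283_635
GENE_END = 284_924


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(GENE_START, GENE_END))   # 1290
print(centisome(GENE_END, GENOME_LENGTH))  # 10.03
```

Under these assumptions the results agree with the 1290-base gene sequence and the reported centisome position of 10.03.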
GC content: 61.09
Gene sequence:
>1290_bases ATGCGTTTGGGCGGGCGTTTGGCCGGAGCAATCGAAGTATTGGCGGATATTGAGGGACGCAGGCGTCCCGTCGCCGATGC GCTGAAGGATTGGGGTCTCGCCCATCGTTTTGCCGGCTCCGGCGACAGGGCCGCCATCGGCAACATCGTTTATGACGCGC TTCGTATGAAACTGTCGCACGCCTGGCTGATGGATGACGATAGCGCCGCCTCGCTTGCCTATGCTGTTCTGCTGCGGCAA TGGGGCAAAAGCTTTGCGGAACTGACGGCGGAATTCGACGGCGATAAATTCGCCCCCGCCGCGCCCGACGCGGAAAGGCA GCAGGCCTTTCTTTCCCGTTCACTTTCCGACGCGCCGGCCTATATTCAGGGCGATGTTCCCGAATGGGTTCAATCTTCCC TCGAAACTGCCTTCGGGGAGCGTTGGCTCGCCGAGGCGCAGGCGCTCAACGAGCGACCGACCCTAGACCTGCGCGCCAAC ACGCTGAAGGCAACCCGGGACAAGGTTCTGAAGGCGCTTGAGGAAAGCGGCGCGGAAGCCACTCGGATTGCCCGGCAGGG TCTGCGTATACCCGCCGGTGAAGGCCCTTCCCGCCTGCCGAATGTCACCGCCGAACTTTCCTTCCAGAAGGGTTGGTTCG AGGTGCAGGATGAAGGCTCACAGATCGTCGCCGATCTTGCCGGTGCCCGGGAAGGTGAACAGGTCCTCGATTATTGCGCC GGTGGTGGCGGCAAGACGTTGGCCATGGCCGCCAGCATGAACAATAAAGGTCAGGTCCACGCTTTCGATGCCGATCGCAA GCGGCTCGCGCCGATCATCGAGCGGCTGAAGCGGGCCGGTACGCGCAATGTGCAGGTGCATGATCGCGCAGCGGGTCTTG CGCCGTTCCAGGAAAAATTCGACCGCGTTCTGGTCGATGCACCCTGCACCGGCACCGGAACCTGGCGCCGCCGCCCCGAC ACCAAATGGCGGCTGACGGCGCGCAATCTGGAAGAACGCGTGCAGCAGCAGGGCGAGGCGCTTTCGCAAGCCAAGGGTTT CGTGCGCCCGGGCGGCGAGCTACTTTATGTTACCTGTTCGGTTCTGCCCGAGGAAAATGAGCAGCAGGTCAGGCGCTTCT GCGAGGAAAATCCAGAATTTGCCATCGGTTCGGCTCTGGAGCGCTGGCAGTCGATTTTCAGTGGCAATGCGAATAAACCG CATTCTTCAGATGGCAAGACGGTGACGCTCACGCCTGCAACCACAGATACGGACGGTTTCTTCTTCTGCTTGATGAAACG CAAAGCATAA
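The GC content reported above is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper, and it assumes the spaces inside the sequence are line-wrap residue that should be ignored.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are skipped, so a sequence that
    was wrapped with spaces or newlines can be passed in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc_count / len(bases), 2)


# First ten bases of the gene, used only as a small worked input.
print(gc_percent("ATGCGTTTGG"))  # 50.0
```

Passing the full 1290-base sequence to the same function would give the gene-wide value to compare against the GC content field.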
Upstream 100 bases:
>100_bases ACTGCCAAGGTCTGCTACGAGATCGTCAAGGCCGATTGAGCCTTGCCCTCGCCGCCATTCGCGTCTACCAACCGGGCTTA TCAGCAAGAAGAGGTAGACA
Downstream 100 bases:
>100_bases AGCAGGTTTTGCACGCAGCGATTTTACTTGGGAAATACCGCGCGGGCTTGTTTCAAAGCATTCTAAACTAGAATGTTTGA CACAAGTTTGTGATTGGGTC
Product: Sun protein
Products: NA
Alternate protein names: 16S rRNA m5C967 methyltransferase; rRNA (cytosine-C(5)-)-methyltransferase rsmB [H]
Number of amino acids: Translated: 429; Mature: 429
Protein sequence:
>429_residues MRLGGRLAGAIEVLADIEGRRRPVADALKDWGLAHRFAGSGDRAAIGNIVYDALRMKLSHAWLMDDDSAASLAYAVLLRQ WGKSFAELTAEFDGDKFAPAAPDAERQQAFLSRSLSDAPAYIQGDVPEWVQSSLETAFGERWLAEAQALNERPTLDLRAN TLKATRDKVLKALEESGAEATRIARQGLRIPAGEGPSRLPNVTAELSFQKGWFEVQDEGSQIVADLAGAREGEQVLDYCA GGGGKTLAMAASMNNKGQVHAFDADRKRLAPIIERLKRAGTRNVQVHDRAAGLAPFQEKFDRVLVDAPCTGTGTWRRRPD TKWRLTARNLEERVQQQGEALSQAKGFVRPGGELLYVTCSVLPEENEQQVRRFCEENPEFAIGSALERWQSIFSGNANKP HSSDGKTVTLTPATTDTDGFFFCLMKRKA
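The protein sequence above corresponds to the conceptual translation of the 1290-base gene: 430 codons, of which the last (TAA) is a stop, leaving 429 residues. The sketch below illustrates this with the standard codon assignments; `translate` and `CODON_TABLE` are hypothetical names, and the sketch assumes the gene sequence is given 5' to 3' on its coding strand, as in the record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop wrap spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 15 bases of the gene sequence in this record.
print(translate("ATGCGTTTGGGCGGGCGTTTGGCC"))  # MRLGGRLA
print(translate("AAACGCAAAGCATAA"))           # KRKA
```

Both fragments match the start (MRLGGRLA) and end (KRKA) of the listed protein.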
Sequences:
>Translated_429_residues MRLGGRLAGAIEVLADIEGRRRPVADALKDWGLAHRFAGSGDRAAIGNIVYDALRMKLSHAWLMDDDSAASLAYAVLLRQ WGKSFAELTAEFDGDKFAPAAPDAERQQAFLSRSLSDAPAYIQGDVPEWVQSSLETAFGERWLAEAQALNERPTLDLRAN TLKATRDKVLKALEESGAEATRIARQGLRIPAGEGPSRLPNVTAELSFQKGWFEVQDEGSQIVADLAGAREGEQVLDYCA GGGGKTLAMAASMNNKGQVHAFDADRKRLAPIIERLKRAGTRNVQVHDRAAGLAPFQEKFDRVLVDAPCTGTGTWRRRPD TKWRLTARNLEERVQQQGEALSQAKGFVRPGGELLYVTCSVLPEENEQQVRRFCEENPEFAIGSALERWQSIFSGNANKP HSSDGKTVTLTPATTDTDGFFFCLMKRKA
>Mature_429_residues MRLGGRLAGAIEVLADIEGRRRPVADALKDWGLAHRFAGSGDRAAIGNIVYDALRMKLSHAWLMDDDSAASLAYAVLLRQ WGKSFAELTAEFDGDKFAPAAPDAERQQAFLSRSLSDAPAYIQGDVPEWVQSSLETAFGERWLAEAQALNERPTLDLRAN TLKATRDKVLKALEESGAEATRIARQGLRIPAGEGPSRLPNVTAELSFQKGWFEVQDEGSQIVADLAGAREGEQVLDYCA GGGGKTLAMAASMNNKGQVHAFDADRKRLAPIIERLKRAGTRNVQVHDRAAGLAPFQEKFDRVLVDAPCTGTGTWRRRPD TKWRLTARNLEERVQQQGEALSQAKGFVRPGGELLYVTCSVLPEENEQQVRRFCEENPEFAIGSALERWQSIFSGNANKP HSSDGKTVTLTPATTDTDGFFFCLMKRKA
Specific function: Specifically methylates the cytosine at position 967 (m5C967) of 16S rRNA [H]
COG id: COG0144
COG function: function code J; tRNA and rRNA cytosine-C5-methylases
Gene ontology:
Cell location: Cytoplasm (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the methyltransferase superfamily. RsmB/NOP family [H]
Homologues:
Organism=Homo sapiens, GI76150625, Length=329, Percent_Identity=30.0911854103343, Blast_Score=107, Evalue=3e-23
Organism=Homo sapiens, GI76150623, Length=329, Percent_Identity=30.0911854103343, Blast_Score=107, Evalue=3e-23
Organism=Homo sapiens, GI8922322, Length=232, Percent_Identity=31.4655172413793, Blast_Score=92, Evalue=1e-18
Organism=Homo sapiens, GI270288816, Length=232, Percent_Identity=31.4655172413793, Blast_Score=92, Evalue=1e-18
Organism=Homo sapiens, GI270288818, Length=232, Percent_Identity=31.4655172413793, Blast_Score=92, Evalue=1e-18
Organism=Homo sapiens, GI23199998, Length=232, Percent_Identity=31.4655172413793, Blast_Score=91, Evalue=1e-18
Organism=Homo sapiens, GI32698918, Length=173, Percent_Identity=31.2138728323699, Blast_Score=86, Evalue=5e-17
Organism=Homo sapiens, GI301336155, Length=193, Percent_Identity=29.5336787564767, Blast_Score=69, Evalue=8e-12
Organism=Homo sapiens, GI39995082, Length=193, Percent_Identity=29.5336787564767, Blast_Score=69, Evalue=9e-12
Organism=Escherichia coli, GI2367212, Length=361, Percent_Identity=31.3019390581717, Blast_Score=154, Evalue=8e-39
Organism=Escherichia coli, GI87081985, Length=278, Percent_Identity=29.8561151079137, Blast_Score=96, Evalue=7e-21
Organism=Caenorhabditis elegans, GI17536757, Length=296, Percent_Identity=31.4189189189189, Blast_Score=103, Evalue=2e-22
Organism=Saccharomyces cerevisiae, GI6324268, Length=325, Percent_Identity=29.2307692307692, Blast_Score=97, Evalue=4e-21
Organism=Saccharomyces cerevisiae, GI6319447, Length=174, Percent_Identity=30.4597701149425, Blast_Score=69, Evalue=2e-12
Organism=Drosophila melanogaster, GI22024126, Length=292, Percent_Identity=29.1095890410959, Blast_Score=98, Evalue=1e-20
Organism=Drosophila melanogaster, GI21356579, Length=221, Percent_Identity=30.316742081448, Blast_Score=95, Evalue=8e-20
Organism=Drosophila melanogaster, GI24668781, Length=205, Percent_Identity=30.2439024390244, Blast_Score=81, Evalue=1e-15
Organism=Drosophila melanogaster, GI21356513, Length=201, Percent_Identity=29.8507462686567, Blast_Score=80, Evalue=3e-15
Organism=Drosophila melanogaster, GI21355201, Length=191, Percent_Identity=28.7958115183246, Blast_Score=74, Evalue=2e-13
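The homologue entries above follow a regular `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be read into structured records for sorting or filtering. The sketch below is one possible way to do that; the regular expression and the `parse_homologues` helper are assumptions made for illustration, not part of the database.

```python
import re

# One homologue record; field names mirror the keys used in the listing.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per record found anywhere in the text."""
    return [
        {
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        }
        for m in RECORD.finditer(text)
    ]


# Two entries copied from the listing above.
sample = (
    "Organism=Homo sapiens, GI76150625, Length=329, "
    "Percent_Identity=30.0911854103343, Blast_Score=107, Evalue=3e-23, "
    "Organism=Escherichia coli, GI2367212, Length=361, "
    "Percent_Identity=31.3019390581717, Blast_Score=154, Evalue=8e-39, "
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"], best["score"])  # Escherichia coli 2367212 154
```

Because the pattern uses `finditer`, it works whether the records are separated by commas on one line or placed one per line.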
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001678
- InterPro: IPR018314
- InterPro: IPR006027
- InterPro: IPR004573 [H]
Pfam domain/function: PF01189 Nol1_Nop2_Fmu; PF01029 NusB [H]
EC number: 2.1.1.176 [H]
Molecular weight: Translated: 47087; Mature: 47087
Theoretical pI: Translated: 6.69; Mature: 6.69
Prosite motif: PS01153 NOL1_NOP2_SUN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2% Cys (Translated Protein)
1.4% Met (Translated Protein)
2.6% Cys+Met (Translated Protein)
1.2% Cys (Mature Protein)
1.4% Met (Mature Protein)
2.6% Cys+Met (Mature Protein)
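The Cys/Met percentages above are residue counts divided by the protein length. They are consistent with 5 Cys and 6 Met in 429 residues, counts read off the protein sequence in this record. A minimal sketch of the calculation, with `residue_percent` as a hypothetical helper name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()  # ignore wrap spaces and newlines
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Same arithmetic from the counts alone (5 Cys, 6 Met, 429 residues).
print(round(100.0 * 5 / 429, 1))   # 1.2
print(round(100.0 * 6 / 429, 1))   # 1.4
print(round(100.0 * 11 / 429, 1))  # 2.6
```

Calling `residue_percent(sequence, "C")`, `residue_percent(sequence, "M")` and `residue_percent(sequence, "CM")` on the full protein sequence would reproduce the three values per protein form.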
Secondary structure:
>Translated Secondary Structure
MRLGGRLAGAIEVLADIEGRRRPVADALKDWGLAHRFAGSGDRAAIGNIVYDALRMKLSH
CCCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHHHHH
AWLMDDDSAASLAYAVLLRQWGKSFAELTAEFDGDKFAPAAPDAERQQAFLSRSLSDAPA
HEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCCC
YIQGDVPEWVQSSLETAFGERWLAEAQALNERPTLDLRANTLKATRDKVLKALEESGAEA
EEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHHHHHHHCCHHH
TRIARQGLRIPAGEGPSRLPNVTAELSFQKGWFEVQDEGSQIVADLAGAREGEQVLDYCA
HHHHHHCCCCCCCCCCCCCCCCEEEEEECCCCEEECCCCHHHHHHHHCCCCHHHHHHHHC
GGGGKTLAMAASMNNKGQVHAFDADRKRLAPIIERLKRAGTRNVQVHDRAAGLAPFQEKF
CCCCCEEEEEECCCCCCCEEEECCCHHHHHHHHHHHHHCCCCCEEEEHHHCCCCHHHHHH
DRVLVDAPCTGTGTWRRRPDTKWRLTARNLEERVQQQGEALSQAKGFVRPGGELLYVTCS
CCEEECCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEE
VLPEENEQQVRRFCEENPEFAIGSALERWQSIFSGNANKPHSSDGKTVTLTPATTDTDGF
CCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCE
FFCLMKRKA
EEEEEECCC
>Mature Secondary Structure
MRLGGRLAGAIEVLADIEGRRRPVADALKDWGLAHRFAGSGDRAAIGNIVYDALRMKLSH
CCCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHHHHH
AWLMDDDSAASLAYAVLLRQWGKSFAELTAEFDGDKFAPAAPDAERQQAFLSRSLSDAPA
HEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCCC
YIQGDVPEWVQSSLETAFGERWLAEAQALNERPTLDLRANTLKATRDKVLKALEESGAEA
EEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHHHHHHHCCHHH
TRIARQGLRIPAGEGPSRLPNVTAELSFQKGWFEVQDEGSQIVADLAGAREGEQVLDYCA
HHHHHHCCCCCCCCCCCCCCCCEEEEEECCCCEEECCCCHHHHHHHHCCCCHHHHHHHHC
GGGGKTLAMAASMNNKGQVHAFDADRKRLAPIIERLKRAGTRNVQVHDRAAGLAPFQEKF
CCCCCEEEEEECCCCCCCEEEECCCHHHHHHHHHHHHHCCCCCEEEEHHHCCCCHHHHHH
DRVLVDAPCTGTGTWRRRPDTKWRLTARNLEERVQQQGEALSQAKGFVRPGGELLYVTCS
CCEEECCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEE
VLPEENEQQVRRFCEENPEFAIGSALERWQSIFSGNANKPHSSDGKTVTLTPATTDTDGF
CCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCE
FFCLMKRKA
EEEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA