| Definition | Methanosarcina mazei Go1 chromosome, complete genome. |
|---|---|
| Accession | NC_003901 |
| Length | 4,096,345 |
The map label for this gene is lhr [C]
Identifier: 21226527
GI number: 21226527
Start: 539592
End: 541784
Strand: Reverse
Name: lhr [C]
Synonym: MM_0425
Alternate gene names: 21226527
Gene position: 541784-539592 (Counterclockwise)
Preceding gene: 21226528
Following gene: 21226526
Centisome position: 13.23
GC content: 48.52
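The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the assumption that the centisome position is taken at the gene's 5' end (the End coordinate on the reverse strand) is inferred from the numbers in this record, not from any documented definition.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_096_345   # Length
START, END = 539_592, 541_784
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# Assumption: the 5' end of a reverse-strand gene is its higher coordinate.
five_prime = END if STRAND == "Reverse" else START
position_pct = round(centisome(five_prime, GENOME_LENGTH), 2)
# Codons minus one stop codon gives the expected protein length.
residues = length // 3 - 1

print(length, position_pct, residues)
```

With these inputs the script gives 2193 bases, a centisome position of 13.23, and 730 residues, matching the gene sequence header, the Centisome position field, and the Number of amino acids field.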
Gene sequence:
>2193_bases ATGAAGATAGAAAGCCTCGACCTGCCCGATGAAATAAAGCGCTTTTACGAAAACTCCGGGATTCTTGAACTTTATCCGCC ACAGGCAGAAGCTGTTGAGAAAGGCCTGCTTGAAGGAAAAAACCTGCTTGCTGCAATTCCTACGGCTTCGGGAAAGACTC TTCTTGCCGAGCTTGCGATGTTGAAGTCTGTGCTTAATGGAGGAAAGGCACTCTATATCGTGCCCCTGAGAGCTCTTGCT TCCGAGAAATTCAGGCGGTTTCAGGAATTTTCCGTACTGGGCATGAGGGTCGGGATTTCAACAGGAGACTACGACAGGCG AGACGAAGGGCTCGGCATAAATGATATTATTGTTGCAACCTCTGAAAAAACAGATTCCCTGCTCAGGAACGAGACTGCCT GGATGCAGGAGATCTCTGTTGTTGTGGCAGATGAAGTTCACCTTATTGATTCTCCAGACAGGGGTCCGACGCTTGAAATC ACCCTTTCAAAGCTCAGGCGAATGAACCCTTCCTGTCAGGTCCTTGCACTTTCGGCAACAGTAGGAAATGCCGATGAGCT TGCAGCCTGGCTTGATGCCGAACTTGTATTAAGCGAGTGGAGGCCCACCGACCTTATGGAAGGGGTGTTTTATAACGGGA TTTTTTACTGCAAGGATAAGGAGAAGCCTGTTGGGCAGCCAACAAAAGATGAGGCAGTAAACCTGGTTCTCGATACTATA AAGGAAGGAGGACAGTGCCTTGTTTTCGAGAGCAGCAGGAAGAACTGCATGGGTTTTGCAAAAAAAGCAGTTTCTGCAGT AAAAAAGACTCTTTCTAATGAAGACCGTGAAACTCTTGCAGGCATTGCTGACGAGATCATTGAAAACAGTGAAACCGATG TTTCATCAGTTCTTGCCACCTGTGTCCGTTCAGGGACAGCATTCCACCACGCAGGCCTTACAACGCCTTTGAGGGAGCTT GTAGAAAACGGCTTTCGTGAAGGGCGTATAAAAATTATTTCAAGCACTCCTACCCTTGCAGCAGGGTTGAACCTTCCTGC CAGGCGTGTGATAATAAGGAGTTACCGGCGTTACTCTTCAGATTCGGGCATGCAGCCAATCCCGGTACTTGAATACAAGC AGATGGCAGGAAGGGCAGGGAGGCCGAGGCTTGACCCTTACGGAGAAGCTGTCCTGCTTGCAAAGTCTTACGAGGAGTTT GTCTTCCTTTTCGAAAAATACATCGAAGCCGGAGCTGAAGATATATGGTCCAAGCTGGGTACGGAAAATGCCCTCAGGAC GCATATACTTTCTACGATCTCGAACGGGTTTGCCCGGACAAGGGAAGAACTCATGGATTTCCTCGAGGCAACATTTTTCG CTTTCCAGTACTCAAACTTCGGGCTCTCCGCGGTTGTGGACGAATGCCTGGACTTTTTGAGGCGGGAAGGTATGCTTGAA AAAGACCCCGATGCCCTTGTTTCCACGGTTTTCGGAAAACTCGTATCAAGGCTCTACATCGACCCTCTCTCTGCAGCTCT TATTGCAAAGGGCTTGAGGGAAGCAGGTACCCTTACCGAGCTTACCCTTCTGCACCTGATATGCAGCACTCCGGATATGC GTCTTATGTATATGCGGAGCCAGGACTACCAGGAAGTCAACGACTACGTAATGGCTCACGCAGGCGAGTTTTCAAAGGTT CCGAATCCCTTCAATATCGCCGAGTACGAGTGGTTCCTTGGTGAGGTAAAGACCTCTCTCCTTCTGATGGACTGGATACA TGAGAAGCCTGAAAACGAGATCTGTCTGAAGTTCGGGATAGGGGAAGGTGATATCCATGCAACTGCAGATATTGCCGAAT GGATAATGCATGTTACAGCCCAGCTTGCAGGACTTCTCGACCTTAAAGGTGCAAAAGAAGCCTCAGAACTGGAAAAGAGG 
ATCAGGTATGGAGCAGCCCCGGAACTTATGGACCTGCTTGATATCAGAAGTGTGGGGCGCGTGAGAGCAAGAAAACTTTA TGAGGCAGGCTTTAAGTCCACAGCTGAACTCGCAGCAGCTTCTCCTGAACATATAGCCGTACTTGTAGGACCAAAGATTA CTGAAAGGATTTTCAAGCAGATCGGACGTAGAGAAGCGGTATCTGAATTTTCTGATATTGAGCCTCTGGAAAAAGGTTCT TCCGATGGGCAGAGGACAATTAGTGATTATTGA
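The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper, and it strips whitespace first because the sequence in this record is broken into space-separated blocks.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Toy input, not the gene itself: 4 of 6 bases are G or C.
print(round(gc_percent("GG CC\nAT"), 2))
```

Passing the full 2193-base sequence (without its `>2193_bases` header) to the same function should give a value comparable to the 48.52 reported above, assuming the record uses this definition.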
Upstream 100 bases:
>100_bases TTTACTCAATGAAAACCTCAACCTTTTCAGGGGAACAGCTCCGGCATTTGCACTGCGGTTTCAAAATAGTTTTATCTTAT GTGTGAGTTTCTGATTTTCA
Downstream 100 bases:
>100_bases TTCTGACTAATGATTACCGGTTACTTTGAGCTTTCTTTCATGTTTTTTAATTTATTGCTTTTTGGGATTAATTGTTTTCT CTATCCCTGAGTTCCGAGAG
Product: ski2-like helicase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 730; Mature: 730
Protein sequence:
>730_residues MKIESLDLPDEIKRFYENSGILELYPPQAEAVEKGLLEGKNLLAAIPTASGKTLLAELAMLKSVLNGGKALYIVPLRALA SEKFRRFQEFSVLGMRVGISTGDYDRRDEGLGINDIIVATSEKTDSLLRNETAWMQEISVVVADEVHLIDSPDRGPTLEI TLSKLRRMNPSCQVLALSATVGNADELAAWLDAELVLSEWRPTDLMEGVFYNGIFYCKDKEKPVGQPTKDEAVNLVLDTI KEGGQCLVFESSRKNCMGFAKKAVSAVKKTLSNEDRETLAGIADEIIENSETDVSSVLATCVRSGTAFHHAGLTTPLREL VENGFREGRIKIISSTPTLAAGLNLPARRVIIRSYRRYSSDSGMQPIPVLEYKQMAGRAGRPRLDPYGEAVLLAKSYEEF VFLFEKYIEAGAEDIWSKLGTENALRTHILSTISNGFARTREELMDFLEATFFAFQYSNFGLSAVVDECLDFLRREGMLE KDPDALVSTVFGKLVSRLYIDPLSAALIAKGLREAGTLTELTLLHLICSTPDMRLMYMRSQDYQEVNDYVMAHAGEFSKV PNPFNIAEYEWFLGEVKTSLLLMDWIHEKPENEICLKFGIGEGDIHATADIAEWIMHVTAQLAGLLDLKGAKEASELEKR IRYGAAPELMDLLDIRSVGRVRARKLYEAGFKSTAELAAASPEHIAVLVGPKITERIFKQIGRREAVSEFSDIEPLEKGS SDGQRTISDY
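The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments (which bacterial/archaeal translation table 11 shares) and stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this record.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven and last four codons of the gene sequence in this record.
print(translate("ATGAAGATAGAAAGCCTCGAC"))  # MKIESLD
print(translate("AGTGATTATTGA"))           # SDY
```

The two excerpts reproduce the start (MKIESLD) and end (SDY) of the 730-residue protein listed above.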
Sequences:
>Translated_730_residues MKIESLDLPDEIKRFYENSGILELYPPQAEAVEKGLLEGKNLLAAIPTASGKTLLAELAMLKSVLNGGKALYIVPLRALA SEKFRRFQEFSVLGMRVGISTGDYDRRDEGLGINDIIVATSEKTDSLLRNETAWMQEISVVVADEVHLIDSPDRGPTLEI TLSKLRRMNPSCQVLALSATVGNADELAAWLDAELVLSEWRPTDLMEGVFYNGIFYCKDKEKPVGQPTKDEAVNLVLDTI KEGGQCLVFESSRKNCMGFAKKAVSAVKKTLSNEDRETLAGIADEIIENSETDVSSVLATCVRSGTAFHHAGLTTPLREL VENGFREGRIKIISSTPTLAAGLNLPARRVIIRSYRRYSSDSGMQPIPVLEYKQMAGRAGRPRLDPYGEAVLLAKSYEEF VFLFEKYIEAGAEDIWSKLGTENALRTHILSTISNGFARTREELMDFLEATFFAFQYSNFGLSAVVDECLDFLRREGMLE KDPDALVSTVFGKLVSRLYIDPLSAALIAKGLREAGTLTELTLLHLICSTPDMRLMYMRSQDYQEVNDYVMAHAGEFSKV PNPFNIAEYEWFLGEVKTSLLLMDWIHEKPENEICLKFGIGEGDIHATADIAEWIMHVTAQLAGLLDLKGAKEASELEKR IRYGAAPELMDLLDIRSVGRVRARKLYEAGFKSTAELAAASPEHIAVLVGPKITERIFKQIGRREAVSEFSDIEPLEKGS SDGQRTISDY >Mature_730_residues MKIESLDLPDEIKRFYENSGILELYPPQAEAVEKGLLEGKNLLAAIPTASGKTLLAELAMLKSVLNGGKALYIVPLRALA SEKFRRFQEFSVLGMRVGISTGDYDRRDEGLGINDIIVATSEKTDSLLRNETAWMQEISVVVADEVHLIDSPDRGPTLEI TLSKLRRMNPSCQVLALSATVGNADELAAWLDAELVLSEWRPTDLMEGVFYNGIFYCKDKEKPVGQPTKDEAVNLVLDTI KEGGQCLVFESSRKNCMGFAKKAVSAVKKTLSNEDRETLAGIADEIIENSETDVSSVLATCVRSGTAFHHAGLTTPLREL VENGFREGRIKIISSTPTLAAGLNLPARRVIIRSYRRYSSDSGMQPIPVLEYKQMAGRAGRPRLDPYGEAVLLAKSYEEF VFLFEKYIEAGAEDIWSKLGTENALRTHILSTISNGFARTREELMDFLEATFFAFQYSNFGLSAVVDECLDFLRREGMLE KDPDALVSTVFGKLVSRLYIDPLSAALIAKGLREAGTLTELTLLHLICSTPDMRLMYMRSQDYQEVNDYVMAHAGEFSKV PNPFNIAEYEWFLGEVKTSLLLMDWIHEKPENEICLKFGIGEGDIHATADIAEWIMHVTAQLAGLLDLKGAKEASELEKR IRYGAAPELMDLLDIRSVGRVRARKLYEAGFKSTAELAAASPEHIAVLVGPKITERIFKQIGRREAVSEFSDIEPLEKGS SDGQRTISDY
Specific function: Unknown
COG id: COG1204
COG function: function code R; Superfamily II helicase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 helicase C-terminal domain
Homologues:
Organism=Homo sapiens, GI110556640, Length=782, Percent_Identity=29.2838874680307, Blast_Score=236, Evalue=9e-62
Organism=Homo sapiens, GI139394648, Length=492, Percent_Identity=32.7235772357724, Blast_Score=206, Evalue=6e-53
Organism=Homo sapiens, GI76880486, Length=530, Percent_Identity=27.7358490566038, Blast_Score=153, Evalue=6e-37
Organism=Homo sapiens, GI130484567, Length=399, Percent_Identity=29.5739348370927, Blast_Score=152, Evalue=1e-36
Organism=Homo sapiens, GI169218225, Length=405, Percent_Identity=31.358024691358, Blast_Score=145, Evalue=2e-34
Organism=Homo sapiens, GI40217847, Length=410, Percent_Identity=30.9756097560976, Blast_Score=144, Evalue=3e-34
Organism=Homo sapiens, GI67782311, Length=406, Percent_Identity=29.064039408867, Blast_Score=124, Evalue=4e-28
Organism=Homo sapiens, GI193211480, Length=418, Percent_Identity=26.7942583732057, Blast_Score=122, Evalue=2e-27
Organism=Homo sapiens, GI222831595, Length=276, Percent_Identity=24.2753623188406, Blast_Score=71, Evalue=3e-12
Organism=Escherichia coli, GI1787942, Length=389, Percent_Identity=23.6503856041131, Blast_Score=69, Evalue=7e-13
Organism=Caenorhabditis elegans, GI71995032, Length=745, Percent_Identity=26.8456375838926, Blast_Score=177, Evalue=2e-44
Organism=Caenorhabditis elegans, GI86563272, Length=548, Percent_Identity=29.7445255474453, Blast_Score=162, Evalue=6e-40
Organism=Caenorhabditis elegans, GI17537127, Length=614, Percent_Identity=27.6872964169381, Blast_Score=157, Evalue=1e-38
Organism=Caenorhabditis elegans, GI17542826, Length=441, Percent_Identity=27.6643990929705, Blast_Score=142, Evalue=7e-34
Organism=Caenorhabditis elegans, GI17537519, Length=562, Percent_Identity=26.6903914590747, Blast_Score=139, Evalue=8e-33
Organism=Caenorhabditis elegans, GI133930973, Length=461, Percent_Identity=27.9826464208243, Blast_Score=125, Evalue=1e-28
Organism=Caenorhabditis elegans, GI71995036, Length=432, Percent_Identity=27.7777777777778, Blast_Score=108, Evalue=7e-24
Organism=Saccharomyces cerevisiae, GI6321020, Length=596, Percent_Identity=26.8456375838926, Blast_Score=166, Evalue=2e-41
Organism=Saccharomyces cerevisiae, GI6321710, Length=527, Percent_Identity=27.3244781783681, Blast_Score=164, Evalue=6e-41
Organism=Saccharomyces cerevisiae, GI9755332, Length=584, Percent_Identity=28.5958904109589, Blast_Score=161, Evalue=4e-40
Organism=Saccharomyces cerevisiae, GI6322411, Length=413, Percent_Identity=27.8450363196126, Blast_Score=129, Evalue=2e-30
Organism=Saccharomyces cerevisiae, GI6323430, Length=192, Percent_Identity=31.25, Blast_Score=80, Evalue=8e-16
Organism=Drosophila melanogaster, GI17933644, Length=754, Percent_Identity=28.5145888594164, Blast_Score=230, Evalue=3e-60
Organism=Drosophila melanogaster, GI24660651, Length=781, Percent_Identity=28.169014084507, Blast_Score=196, Evalue=6e-50
Organism=Drosophila melanogaster, GI24647182, Length=436, Percent_Identity=30.7339449541284, Blast_Score=157, Evalue=3e-38
Organism=Drosophila melanogaster, GI28574898, Length=435, Percent_Identity=29.1954022988506, Blast_Score=137, Evalue=3e-32
Organism=Drosophila melanogaster, GI17864608, Length=414, Percent_Identity=29.7101449275362, Blast_Score=130, Evalue=3e-30
Organism=Drosophila melanogaster, GI17933658, Length=99, Percent_Identity=38.3838383838384, Blast_Score=72, Evalue=9e-13
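The homologue entries above follow a fixed `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be pulled into structured records with a regular expression. This is a sketch under that assumption; `Hit`, `HIT_RE`, and `parse_hits` are hypothetical names, and the sample text holds just two entries copied from the list.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    identity: float
    score: int
    evalue: float


# Matches one homologue entry; tolerant of spaces or newlines between fields.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_hits(text: str) -> list[Hit]:
    """Extract every homologue entry found in the text, in order."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["identity"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI110556640, Length=782, "
    "Percent_Identity=29.2838874680307, Blast_Score=236, Evalue=9e-62, "
    "Organism=Escherichia coli, GI1787942, Length=389, "
    "Percent_Identity=23.6503856041131, Blast_Score=69, Evalue=7e-13,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.score)
print(best.organism, best.gi, best.score)
```

Sorting or filtering the parsed hits (for example by `evalue` or by `organism`) then becomes ordinary list processing.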
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): HELS_METMA (Q8PZR7)
Other databases:
- EMBL: AE008384
- RefSeq: NP_632449.1
- ProteinModelPortal: Q8PZR7
- SMR: Q8PZR7
- GeneID: 1478767
- GenomeReviews: AE008384_GR
- KEGG: mma:MM_0425
- NMPDR: fig|192952.1.peg.425
- HOGENOM: HBG496908
- OMA: FLERTFY
- ProtClustDB: PRK02362
- BioCyc: MMAZ192952:MM0425-MONOMER
- HAMAP: MF_00442
- InterPro: IPR014001
- InterPro: IPR011545
- InterPro: IPR001650
- InterPro: IPR014021
- InterPro: IPR022965
- InterPro: IPR010994
- SMART: SM00487
- SMART: SM00490
Pfam domain/function: PF00270 DEAD; PF00271 Helicase_C; SSF47781 RuvA_2_like
EC number: 3.6.1.-
Molecular weight: Translated: 81075; Mature: 81075
Theoretical pI: Translated: 4.83; Mature: 4.83
Prosite motif: PS51192 HELICASE_ATP_BIND_1; PS51194 HELICASE_CTER
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
3.6 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
2.5 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MKIESLDLPDEIKRFYENSGILELYPPQAEAVEKGLLEGKNLLAAIPTASGKTLLAELAM
CCCCCCCCHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHH
LKSVLNGGKALYIVPLRALASEKFRRFQEFSVLGMRVGISTGDYDRRDEGLGINDIIVAT
HHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCEEEEEE
SEKTDSLLRNETAWMQEISVVVADEVHLIDSPDRGPTLEITLSKLRRMNPSCQVLALSAT
CCHHHHHHHHHHHHHHHHHHHEECCEEEECCCCCCCEEEEEHHHHHCCCCCCEEEEEEEC
VGNADELAAWLDAELVLSEWRPTDLMEGVFYNGIFYCKDKEKPVGQPTKDEAVNLVLDTI
CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCEEEECCCCCCCCCCCHHHHHHHHHHHH
KEGGQCLVFESSRKNCMGFAKKAVSAVKKTLSNEDRETLAGIADEIIENSETDVSSVLAT
HCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHH
CVRSGTAFHHAGLTTPLRELVENGFREGRIKIISSTPTLAAGLNLPARRVIIRSYRRYSS
HHHCCCCHHHCCCCHHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCHHHHHHHHHHHHCC
DSGMQPIPVLEYKQMAGRAGRPRLDPYGEAVLLAKSYEEFVFLFEKYIEAGAEDIWSKLG
CCCCCCCCHHHHHHHCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHCHHHHHHHHC
TENALRTHILSTISNGFARTREELMDFLEATFFAFQYSNFGLSAVVDECLDFLRREGMLE
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCC
KDPDALVSTVFGKLVSRLYIDPLSAALIAKGLREAGTLTELTLLHLICSTPDMRLMYMRS
CCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEEECC
QDYQEVNDYVMAHAGEFSKVPNPFNIAEYEWFLGEVKTSLLLMDWIHEKPENEICLKFGI
CCHHHHHHHHHHCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEC
GEGDIHATADIAEWIMHVTAQLAGLLDLKGAKEASELEKRIRYGAAPELMDLLDIRSVGR
CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH
VRARKLYEAGFKSTAELAAASPEHIAVLVGPKITERIFKQIGRREAVSEFSDIEPLEKGS
HHHHHHHHHCCCHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHCCCHHHCCC
SDGQRTISDY
CCCCCCCCCC
>Mature Secondary Structure
MKIESLDLPDEIKRFYENSGILELYPPQAEAVEKGLLEGKNLLAAIPTASGKTLLAELAM
CCCCCCCCHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHH
LKSVLNGGKALYIVPLRALASEKFRRFQEFSVLGMRVGISTGDYDRRDEGLGINDIIVAT
HHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCEEEEEE
SEKTDSLLRNETAWMQEISVVVADEVHLIDSPDRGPTLEITLSKLRRMNPSCQVLALSAT
CCHHHHHHHHHHHHHHHHHHHEECCEEEECCCCCCCEEEEEHHHHHCCCCCCEEEEEEEC
VGNADELAAWLDAELVLSEWRPTDLMEGVFYNGIFYCKDKEKPVGQPTKDEAVNLVLDTI
CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCEEEECCCCCCCCCCCHHHHHHHHHHHH
KEGGQCLVFESSRKNCMGFAKKAVSAVKKTLSNEDRETLAGIADEIIENSETDVSSVLAT
HCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHH
CVRSGTAFHHAGLTTPLRELVENGFREGRIKIISSTPTLAAGLNLPARRVIIRSYRRYSS
HHHCCCCHHHCCCCHHHHHHHHCCCCCCCEEEEECCCCHHHCCCCCHHHHHHHHHHHHCC
DSGMQPIPVLEYKQMAGRAGRPRLDPYGEAVLLAKSYEEFVFLFEKYIEAGAEDIWSKLG
CCCCCCCCHHHHHHHCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHCHHHHHHHHC
TENALRTHILSTISNGFARTREELMDFLEATFFAFQYSNFGLSAVVDECLDFLRREGMLE
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCC
KDPDALVSTVFGKLVSRLYIDPLSAALIAKGLREAGTLTELTLLHLICSTPDMRLMYMRS
CCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEEECC
QDYQEVNDYVMAHAGEFSKVPNPFNIAEYEWFLGEVKTSLLLMDWIHEKPENEICLKFGI
CCHHHHHHHHHHCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEC
GEGDIHATADIAEWIMHVTAQLAGLLDLKGAKEASELEKRIRYGAAPELMDLLDIRSVGR
CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH
VRARKLYEAGFKSTAELAAASPEHIAVLVGPKITERIFKQIGRREAVSEFSDIEPLEKGS
HHHHHHHHHCCCHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHCCCHHHCCC
SDGQRTISDY
CCCCCCCCCC
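In the secondary structure listing above, each block of residues is followed by an equally long block of predicted states. The sketch below pairs the alternating blocks back together and summarises the state composition. It assumes the usual three-state reading (H = helix, E = strand, C = coil), which the record does not state, and `pair_blocks` and `state_fractions` are hypothetical helpers.

```python
from collections import Counter


def pair_blocks(listing: str) -> tuple[str, str]:
    """Rejoin alternating residue/state blocks (header line already removed)."""
    tokens = listing.split()
    if len(tokens) % 2:
        raise ValueError("expected an even number of blocks")
    residues = "".join(tokens[0::2])
    states = "".join(tokens[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states: str) -> dict[str, float]:
    """Fraction of positions assigned to each of the states H, E and C."""
    counts = Counter(states)
    return {s: counts[s] / len(states) for s in "HEC"}


# First 16 positions of the listing, re-chunked into blocks of 8 for brevity.
excerpt = "MKIESLDL CCCCCCCC PDEIKRFY HHHHHHHH"
residues, states = pair_blocks(excerpt)
fractions = state_fractions(states)
print(residues, states, fractions)
```

Because the function splits on any whitespace, it accepts the blocks whether they are separated by spaces or by line breaks.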
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on acid anhydrides [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12125824