| Definition | Methanopyrus kandleri AV19, complete genome. |
|---|---|
| Accession | NC_003551 |
| Length | 1,694,969 |
The map label for this gene is lhr [C]
Identifier: 20094271
GI number: 20094271
Start: 794560
End: 797016
Strand: Direct
Name: lhr [C]
Synonym: MK0835
Alternate gene names: 20094271
Gene position: 794560-797016 (Clockwise)
Preceding gene: 20094270
Following gene: 20094274
Centisome position: 46.88
GC content: 60.77
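The coordinate-derived fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length. The function names are illustrative and are not part of the database.

```python
GENOME_LENGTH = 1_694_969   # "Length" field of the genome record
START, END = 794_560, 797_016  # "Start" and "End" fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


print(gene_length(START, END))          # 2457
print(centisome(START, GENOME_LENGTH))  # 46.88
```

Under these assumptions the results agree with the 2457-base gene sequence header and the listed centisome position. `gc_content` can be applied to the gene sequence once its whitespace is removed.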
Gene sequence:
>2457_bases GTGATAACCGTAGTGTTCACGGGGTACAAGGGTGGGAAGCTCAGGTTCGTGCTGGCAGAGGGTGACCCGGAGGACGGTGC CGAGATTTCGATCGAGGGTCACCTGGAGCTCTCCGGAGGCGGTGAACCCGTTCGAGGGGTTCTGGGCGGAGAACCGACGC CACCCGACCGCGTGTTAAGCGAGCTACAGAAAGCGGACAGGAAGATAGCCCTCCCCGACGTTGCGGAGGTGGTCGATCGA GTTCTGGGGGAGAAAATCGAGGTTAAGGAGCTCTGTCGGAGATGCCTCGCGTCCGATCGAGTGACCGTGTTGAAGCACGG GTACCGGTTCGGTGAAGTCGAAGTGTGCGGGCGCTGCGCCCGTGAGATTCTGGAGGAGGAGTTGAGGTTCCGAGTACCCG GCTTCTCCCAGACTCTTCTCGAGAAGCTCGAGAGACTCCTACATGAGCTCCGCGACATAGATCGCGTCGTGGAGATGGTG GATCCGGCGTTCGATCCCGCCGAAGAAGAGGAGAAGACGCGGTGGGAGATTGTAGAGGCCGAGGACGAGGAGGAACACCG TCTCCCACTGACCGAGCTCGACATCCCGGAGGAATTGCGTCGAGTTCTCGAAAGGATGGGGTATCAGGAGCTGACTCCTG TCCAAACGAAGTGCGTCGAGCGAGGACTCCTCGAGGGCCGGAATCTGCTGGTCGTCTCTCCCACGGGTTCCGGTAAAACG CTGGTAGCGGAGCTCGCCGGACTCACCGAAGTCCTCCGAGAAGGTCGTAAGATGGTTTACCTGGTACCGCTCGTGGCCCT GGCGAACCAGAAGCACCGGGAATTCATGGAAAAGTACGGACGACCGTTAGGGATAGGTGTCAGGCTACAGGTCGGCGCGG CGAGATTGAAGGAGTTCTCTGGACCCGAACGTGGTCCGTCGCCGCGTGACGCCGACATCATAGTCGGCACCTACGAGGGG TTCGACCTCCTCCTTCGAACTGGGGCCGTAGATCCCGACGATATCGGTGTCGTAGTAATCGACGAGGTACATACGCTGGC TGACGAGAGAGGTCCACGCTTGGACGGTCTCGTATGCAGGCTTAAGACGCTAACCGGTGCGCAGTTGTTGGGTCTCTCGG CCACCGTGGGGAACCCCGAAGAACTCGCCGAGTACTTGGATGCTGAACCGATAGTTCACAACCGCCGCCCCGTACCCCTA GAGTACCACCTCGTGATCAATCAAGATCGCCGACAGAAGTGGGATAGAATCGCCAGACTCGTGGAGTCGGAGTGGGAGAC CGAGTACTCCACGGGGTATAGGGGTCAGACTATTGTCTTCACGTATTCGCGCAGAAACACACATCGATTGGCCGATCTCC TCAATGAGAGAACGGGACTAGATGTCGCCCCTTATCACGCGGGACTCCCCTACGACCGGCGCCGCTCCATCGAACGAGCC TTCGAACGCGGTGAACTGGCCGCGGTCGTAACCACGGCAGCCCTCGGGGCCGGCGTGGATTTCCCGGCCTCGCAGGTTAT CTTCGAGAGTCTGGCTATGGGTATCGAATGGCTCACACCGCGCGAATTCCAGCAGATGGCCGGTCGTGCGGGCCGACCCG GGTATCATGACCGTGGTAAAGTCGTTCTGATGGTCGAGCCGGGCCGCCGTTACCACCGTTCTCAAAGTGAGACCGAGGAC AAGGTGGCTTTCACGCTCCTCGAGTCCGAACCGGAACCCGTAGAAGTCGAGTACGATGACGAGGACGAGAGAGAGCAAGT GCTTGCACACCTCGTCAGCGGGGCGGCGAAGTCTCCGGGCGAGCTCGAACGCGTATGTGACGAATCACTAGGCTTCGCTG GGGATCCGATGCGTAGGGTGAAAGAGCTCCGGGAGATGGGTTTCGTCAAAGGACTCGAGCCTACGGAGAAAGGCCGTGTC 
GCGGCTCGCTACTTCACGGGTCCCAGAACCGTCCACGAGCTGTCCGCCAGGGCGGGCTCGGATCCTCTGAGGGCGGTCGC CTCGGTACGTCCGTTCGAGAGGTTCCAGCTGTCACCGTCGATCAAGCGTGCGGTAGAAAGGGTCACCCGCATGTCGGTTC CATCGAGACTGGACGACGCTCTGAGCGTGATACACTCGGAGCGGAAGGAGGTGATCGAGAGGCTTCCGCCCAAGGAGAAG CAGAAGCTGGTCTCGCTGATGAAGGAGTTAGACTGTGGGTGTGATGCCTTCCCTCACTGCGAGCACGTCAGCCAGCGCGC GTCGGAGCTCGCGCTGAAGGTGCGGCTCGAGGGGAAATCCGTGTACGCCATCCCTAGGATCTTGGAGGGCAGGTACGGGA TAACGGCGTACCCTATCGACATCGCGAACTGGTTGGAGGAGGTCGTGCGGCTCTTGGAATGTGCCGGTGAGATCGCGGAG GAACCGGAGATGGCGGCGGCCGCTGAGGCGCTGGCCGATCCGTGGTCAGAACGGTAG
Upstream 100 bases:
>100_bases TCGAGATCGAGCGTGATCGTGTATACGTACTGAAACCCGAGTACTACCCGGATGGTATCTTAATGAAAGTCGAGAAAATC CGGGTGTAGGGGAAACGAGC
Downstream 100 bases:
>100_bases CGGTGGTGGTCCGACTCTACCCTGCTGCAGTACTCCGAGTGCCCGTTTCAACTCTCCTTCCGGATTCGTCAGGGTTGGTA CCTGAACACCTAAACCTCGA
Product: helicase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 818; Mature: 818
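The residue count is consistent with the gene length if the final codon is a stop codon (the gene sequence ends in TAG). A minimal sketch of that check follows; the helper name and its `has_stop_codon` parameter are hypothetical.

```python
def protein_length(nt_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a coding sequence of nt_length bases."""
    if nt_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = nt_length // 3
    # The stop codon occupies one codon but contributes no residue.
    return codons - 1 if has_stop_codon else codons


print(protein_length(2457))  # 818
```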
Protein sequence:
>818_residues MITVVFTGYKGGKLRFVLAEGDPEDGAEISIEGHLELSGGGEPVRGVLGGEPTPPDRVLSELQKADRKIALPDVAEVVDR VLGEKIEVKELCRRCLASDRVTVLKHGYRFGEVEVCGRCAREILEEELRFRVPGFSQTLLEKLERLLHELRDIDRVVEMV DPAFDPAEEEEKTRWEIVEAEDEEEHRLPLTELDIPEELRRVLERMGYQELTPVQTKCVERGLLEGRNLLVVSPTGSGKT LVAELAGLTEVLREGRKMVYLVPLVALANQKHREFMEKYGRPLGIGVRLQVGAARLKEFSGPERGPSPRDADIIVGTYEG FDLLLRTGAVDPDDIGVVVIDEVHTLADERGPRLDGLVCRLKTLTGAQLLGLSATVGNPEELAEYLDAEPIVHNRRPVPL EYHLVINQDRRQKWDRIARLVESEWETEYSTGYRGQTIVFTYSRRNTHRLADLLNERTGLDVAPYHAGLPYDRRRSIERA FERGELAAVVTTAALGAGVDFPASQVIFESLAMGIEWLTPREFQQMAGRAGRPGYHDRGKVVLMVEPGRRYHRSQSETED KVAFTLLESEPEPVEVEYDDEDEREQVLAHLVSGAAKSPGELERVCDESLGFAGDPMRRVKELREMGFVKGLEPTEKGRV AARYFTGPRTVHELSARAGSDPLRAVASVRPFERFQLSPSIKRAVERVTRMSVPSRLDDALSVIHSERKEVIERLPPKEK QKLVSLMKELDCGCDAFPHCEHVSQRASELALKVRLEGKSVYAIPRILEGRYGITAYPIDIANWLEEVVRLLECAGEIAE EPEMAAAAEALADPWSER
Sequences:
>Translated_818_residues MITVVFTGYKGGKLRFVLAEGDPEDGAEISIEGHLELSGGGEPVRGVLGGEPTPPDRVLSELQKADRKIALPDVAEVVDR VLGEKIEVKELCRRCLASDRVTVLKHGYRFGEVEVCGRCAREILEEELRFRVPGFSQTLLEKLERLLHELRDIDRVVEMV DPAFDPAEEEEKTRWEIVEAEDEEEHRLPLTELDIPEELRRVLERMGYQELTPVQTKCVERGLLEGRNLLVVSPTGSGKT LVAELAGLTEVLREGRKMVYLVPLVALANQKHREFMEKYGRPLGIGVRLQVGAARLKEFSGPERGPSPRDADIIVGTYEG FDLLLRTGAVDPDDIGVVVIDEVHTLADERGPRLDGLVCRLKTLTGAQLLGLSATVGNPEELAEYLDAEPIVHNRRPVPL EYHLVINQDRRQKWDRIARLVESEWETEYSTGYRGQTIVFTYSRRNTHRLADLLNERTGLDVAPYHAGLPYDRRRSIERA FERGELAAVVTTAALGAGVDFPASQVIFESLAMGIEWLTPREFQQMAGRAGRPGYHDRGKVVLMVEPGRRYHRSQSETED KVAFTLLESEPEPVEVEYDDEDEREQVLAHLVSGAAKSPGELERVCDESLGFAGDPMRRVKELREMGFVKGLEPTEKGRV AARYFTGPRTVHELSARAGSDPLRAVASVRPFERFQLSPSIKRAVERVTRMSVPSRLDDALSVIHSERKEVIERLPPKEK QKLVSLMKELDCGCDAFPHCEHVSQRASELALKVRLEGKSVYAIPRILEGRYGITAYPIDIANWLEEVVRLLECAGEIAE EPEMAAAAEALADPWSER >Mature_818_residues MITVVFTGYKGGKLRFVLAEGDPEDGAEISIEGHLELSGGGEPVRGVLGGEPTPPDRVLSELQKADRKIALPDVAEVVDR VLGEKIEVKELCRRCLASDRVTVLKHGYRFGEVEVCGRCAREILEEELRFRVPGFSQTLLEKLERLLHELRDIDRVVEMV DPAFDPAEEEEKTRWEIVEAEDEEEHRLPLTELDIPEELRRVLERMGYQELTPVQTKCVERGLLEGRNLLVVSPTGSGKT LVAELAGLTEVLREGRKMVYLVPLVALANQKHREFMEKYGRPLGIGVRLQVGAARLKEFSGPERGPSPRDADIIVGTYEG FDLLLRTGAVDPDDIGVVVIDEVHTLADERGPRLDGLVCRLKTLTGAQLLGLSATVGNPEELAEYLDAEPIVHNRRPVPL EYHLVINQDRRQKWDRIARLVESEWETEYSTGYRGQTIVFTYSRRNTHRLADLLNERTGLDVAPYHAGLPYDRRRSIERA FERGELAAVVTTAALGAGVDFPASQVIFESLAMGIEWLTPREFQQMAGRAGRPGYHDRGKVVLMVEPGRRYHRSQSETED KVAFTLLESEPEPVEVEYDDEDEREQVLAHLVSGAAKSPGELERVCDESLGFAGDPMRRVKELREMGFVKGLEPTEKGRV AARYFTGPRTVHELSARAGSDPLRAVASVRPFERFQLSPSIKRAVERVTRMSVPSRLDDALSVIHSERKEVIERLPPKEK QKLVSLMKELDCGCDAFPHCEHVSQRASELALKVRLEGKSVYAIPRILEGRYGITAYPIDIANWLEEVVRLLECAGEIAE EPEMAAAAEALADPWSER
Specific function: Unknown
COG id: COG1202
COG function: function code R; Superfamily II helicase, archaea-specific
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 helicase C-terminal domain [H]
Homologues:
Organism=Homo sapiens, GI130484567, Length=602, Percent_Identity=25.9136212624585, Blast_Score=133, Evalue=8e-31
Organism=Homo sapiens, GI40217847, Length=602, Percent_Identity=25.9136212624585, Blast_Score=130, Evalue=5e-30
Organism=Homo sapiens, GI169218225, Length=602, Percent_Identity=26.0797342192691, Blast_Score=129, Evalue=8e-30
Organism=Homo sapiens, GI110556640, Length=420, Percent_Identity=26.9047619047619, Blast_Score=125, Evalue=2e-28
Organism=Homo sapiens, GI139394648, Length=472, Percent_Identity=26.271186440678, Blast_Score=123, Evalue=8e-28
Organism=Homo sapiens, GI76880486, Length=430, Percent_Identity=27.2093023255814, Blast_Score=117, Evalue=6e-26
Organism=Escherichia coli, GI1787942, Length=387, Percent_Identity=27.1317829457364, Blast_Score=73, Evalue=7e-14
Organism=Caenorhabditis elegans, GI17537127, Length=406, Percent_Identity=27.8325123152709, Blast_Score=127, Evalue=3e-29
Organism=Caenorhabditis elegans, GI17537519, Length=411, Percent_Identity=26.7639902676399, Blast_Score=126, Evalue=6e-29
Organism=Caenorhabditis elegans, GI71995032, Length=390, Percent_Identity=26.1538461538462, Blast_Score=122, Evalue=9e-28
Organism=Caenorhabditis elegans, GI86563272, Length=382, Percent_Identity=27.2251308900524, Blast_Score=112, Evalue=9e-25
Organism=Caenorhabditis elegans, GI133930973, Length=277, Percent_Identity=29.6028880866426, Blast_Score=80, Evalue=4e-15
Organism=Caenorhabditis elegans, GI71995036, Length=94, Percent_Identity=38.2978723404255, Blast_Score=71, Evalue=2e-12
Organism=Saccharomyces cerevisiae, GI6321020, Length=417, Percent_Identity=28.0575539568345, Blast_Score=120, Evalue=8e-28
Organism=Saccharomyces cerevisiae, GI9755332, Length=410, Percent_Identity=25.3658536585366, Blast_Score=116, Evalue=2e-26
Organism=Saccharomyces cerevisiae, GI6321710, Length=400, Percent_Identity=25, Blast_Score=113, Evalue=1e-25
Organism=Saccharomyces cerevisiae, GI6322411, Length=448, Percent_Identity=25.4464285714286, Blast_Score=108, Evalue=4e-24
Organism=Saccharomyces cerevisiae, GI6321267, Length=336, Percent_Identity=27.3809523809524, Blast_Score=75, Evalue=3e-14
Organism=Saccharomyces cerevisiae, GI6323430, Length=215, Percent_Identity=30.2325581395349, Blast_Score=72, Evalue=5e-13
Organism=Drosophila melanogaster, GI24660651, Length=425, Percent_Identity=26.5882352941176, Blast_Score=131, Evalue=2e-30
Organism=Drosophila melanogaster, GI28574898, Length=432, Percent_Identity=27.5462962962963, Blast_Score=122, Evalue=7e-28
Organism=Drosophila melanogaster, GI24647182, Length=458, Percent_Identity=26.6375545851528, Blast_Score=117, Evalue=4e-26
Organism=Drosophila melanogaster, GI17933644, Length=225, Percent_Identity=28, Blast_Score=81, Evalue=3e-15
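The homologue entries above follow a regular `key=value` pattern, so they can be parsed into structured records. The following is a minimal sketch, assuming every entry has exactly the six fields shown; the `Homologue` type and `parse_homologues` function are hypothetical names, and the sample text is copied from two of the entries above.

```python
import re
from typing import List, NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI<digits>, Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...
_ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_homologues(text: str) -> List[Homologue]:
    """Extract all homologue entries from the text, tolerating line breaks between fields."""
    return [
        Homologue(
            m["organism"].strip(),
            m["gi"],
            int(m["length"]),
            float(m["pid"]),
            int(m["score"]),
            float(m["evalue"]),
        )
        for m in _ENTRY.finditer(text)
    ]


SAMPLE = (
    "Organism=Homo sapiens, GI130484567, Length=602, "
    "Percent_Identity=25.9136212624585, Blast_Score=133, Evalue=8e-31, "
    "Organism=Escherichia coli, GI1787942, Length=387, "
    "Percent_Identity=27.1317829457364, Blast_Score=73, Evalue=7e-14,"
)

hits = parse_homologues(SAMPLE)
best = min(hits, key=lambda h: h.evalue)  # lowest E-value = most significant hit
print(best.organism, best.gi)
```

Sorting by E-value rather than by percent identity is a design choice here: the listed identities all fall in a narrow 25 to 38 percent range, while the E-values span many orders of magnitude.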
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014001
- InterPro: IPR011545
- InterPro: IPR001650
- InterPro: IPR014021
- InterPro: IPR014014 [H]
Pfam domain/function: PF00270 DEAD; PF00271 Helicase_C [H]
EC number: 3.6.1.- [C]
Molecular weight: Translated: 91717; Mature: 91717
Theoretical pI: Translated: 4.92; Mature: 4.92
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
2.9 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
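Percentages of this kind are residue counts divided by the protein length. The following is a minimal sketch of that calculation, assuming rounding to one decimal place; the function name is hypothetical and the short peptide in the usage line is made up for illustration, not taken from this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        return 0.0
    count = sum(1 for aa in protein if aa in residues.upper())
    return round(100.0 * count / len(protein), 1)


toy = "MCAAAAAAAA"  # illustrative 10-residue peptide
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

Applying `residue_percent` to the 818-residue sequence above (with whitespace removed) would reproduce the Cys, Met, and Cys+Met figures if the database uses the same definition.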
Secondary structure:
>Translated Secondary Structure MITVVFTGYKGGKLRFVLAEGDPEDGAEISIEGHLELSGGGEPVRGVLGGEPTPPDRVLS CEEEEEECCCCCEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCHHCCCCCCCCHHHHHH ELQKADRKIALPDVAEVVDRVLGEKIEVKELCRRCLASDRVTVLKHGYRFGEVEVCGRCA HHHHHCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHH REILEEELRFRVPGFSQTLLEKLERLLHELRDIDRVVEMVDPAFDPAEEEEKTRWEIVEA HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCEEEECC EDEEEHRLPLTELDIPEELRRVLERMGYQELTPVQTKCVERGLLEGRNLLVVSPTGSGKT CCCHHHCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCHH LVAELAGLTEVLREGRKMVYLVPLVALANQKHREFMEKYGRPLGIGVRLQVGAARLKEFS HHHHHHHHHHHHHCCCCEEEEHHHHHHHCHHHHHHHHHCCCCCCEEEEEEHHHHHHHHCC GPERGPSPRDADIIVGTYEGFDLLLRTGAVDPDDIGVVVIDEVHTLADERGPRLDGLVCR CCCCCCCCCCCCEEEEECCCHHHHHHCCCCCCCCCCEEEEHHHHHHHHCCCCCCHHHHHH LKTLTGAQLLGLSATVGNPEELAEYLDAEPIVHNRRPVPLEYHLVINQDRRQKWDRIARL HHHHCCHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEEEEEECCHHHHHHHHHHHH VESEWETEYSTGYRGQTIVFTYSRRNTHRLADLLNERTGLDVAPYHAGLPYDRRRSIERA HHHHHCCHHCCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHH FERGELAAVVTTAALGAGVDFPASQVIFESLAMGIEWLTPREFQQMAGRAGRPGYHDRGK HHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCHHCCCHHHHHHHHCCCCCCCCCCCCC VVLMVEPGRRYHRSQSETEDKVAFTLLESEPEPVEVEYDDEDEREQVLAHLVSGAAKSPG EEEEECCCHHHHCCCCCCCHHEEEEEECCCCCCEEEECCCCHHHHHHHHHHHHCCCCCCH ELERVCDESLGFAGDPMRRVKELREMGFVKGLEPTEKGRVAARYFTGPRTVHELSARAGS HHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHCCC DPLRAVASVRPFERFQLSPSIKRAVERVTRMSVPSRLDDALSVIHSERKEVIERLPPKEK CHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHH QKLVSLMKELDCGCDAFPHCEHVSQRASELALKVRLEGKSVYAIPRILEGRYGITAYPID HHHHHHHHHHCCCCCCCCCHHHHHHHHHHEEEEEEECCCCEEECHHHHCCCCCCEEECHH IANWLEEVVRLLECAGEIAEEPEMAAAAEALADPWSER HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCC >Mature Secondary Structure MITVVFTGYKGGKLRFVLAEGDPEDGAEISIEGHLELSGGGEPVRGVLGGEPTPPDRVLS CEEEEEECCCCCEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCHHCCCCCCCCHHHHHH ELQKADRKIALPDVAEVVDRVLGEKIEVKELCRRCLASDRVTVLKHGYRFGEVEVCGRCA HHHHHCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHH 
REILEEELRFRVPGFSQTLLEKLERLLHELRDIDRVVEMVDPAFDPAEEEEKTRWEIVEA HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCEEEECC EDEEEHRLPLTELDIPEELRRVLERMGYQELTPVQTKCVERGLLEGRNLLVVSPTGSGKT CCCHHHCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCHH LVAELAGLTEVLREGRKMVYLVPLVALANQKHREFMEKYGRPLGIGVRLQVGAARLKEFS HHHHHHHHHHHHHCCCCEEEEHHHHHHHCHHHHHHHHHCCCCCCEEEEEEHHHHHHHHCC GPERGPSPRDADIIVGTYEGFDLLLRTGAVDPDDIGVVVIDEVHTLADERGPRLDGLVCR CCCCCCCCCCCCEEEEECCCHHHHHHCCCCCCCCCCEEEEHHHHHHHHCCCCCCHHHHHH LKTLTGAQLLGLSATVGNPEELAEYLDAEPIVHNRRPVPLEYHLVINQDRRQKWDRIARL HHHHCCHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEEEEEECCHHHHHHHHHHHH VESEWETEYSTGYRGQTIVFTYSRRNTHRLADLLNERTGLDVAPYHAGLPYDRRRSIERA HHHHHCCHHCCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHH FERGELAAVVTTAALGAGVDFPASQVIFESLAMGIEWLTPREFQQMAGRAGRPGYHDRGK HHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCHHCCCHHHHHHHHCCCCCCCCCCCCC VVLMVEPGRRYHRSQSETEDKVAFTLLESEPEPVEVEYDDEDEREQVLAHLVSGAAKSPG EEEEECCCHHHHCCCCCCCHHEEEEEECCCCCCEEEECCCCHHHHHHHHHHHHCCCCCCH ELERVCDESLGFAGDPMRRVKELREMGFVKGLEPTEKGRVAARYFTGPRTVHELSARAGS HHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHCCC DPLRAVASVRPFERFQLSPSIKRAVERVTRMSVPSRLDDALSVIHSERKEVIERLPPKEK CHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHH QKLVSLMKELDCGCDAFPHCEHVSQRASELALKVRLEGKSVYAIPRILEGRYGITAYPID HHHHHHHHHHCCCCCCCCCHHHHHHHHHHEEEEEEECCCCEEECHHHHCCCCCCEEECHH IANWLEEVVRLLECAGEIAEEPEMAAAAEALADPWSER HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on acid anhydrides [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8688087 [H]