Definition Methanosarcina mazei Go1 chromosome, complete genome.
Accession NC_003901
Length 4,096,345


The map label for this gene is helD [H]

Identifier: 21226130

GI number: 21226130

Start: 30530

End: 33127

Strand: Direct

Name: helD [H]

Synonym: MM_0028

Alternate gene names: 21226130

Gene position: 30530-33127 (Clockwise)

Preceding gene: 21226127

Following gene: 21226131

Centisome position: 0.75
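
Note: the centisome value appears to be the gene start coordinate expressed as a percentage of the chromosome length. The record does not state this definition, so the Python sketch below is an assumption that happens to reproduce the listed value; the function name `centisome` is illustrative only.

```python
GENOME_LENGTH = 4_096_345  # from the Length field of this record
GENE_START = 30_530        # from the Start field of this record


def centisome(position, genome_length):
    """Return a coordinate as a percentage of the chromosome length.

    Assumption: centisome position = 100 * start / genome length.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * position / genome_length


print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # prints 0.75
```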

GC content: 33.6%
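
Note: GC content is the share of G and C bases in the gene sequence. The sketch below is one way to recompute it from the FASTA-wrapped sequence listed under "Gene sequence"; `gc_percent` is a hypothetical helper, and the record does not say how its own value was derived or rounded.

```python
def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (for example FASTA line wrapping) is ignored.
    """
    dna = "".join(dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# Toy input, not taken from this record:
print(gc_percent("ATGC"))  # prints 50.0
```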

Gene sequence:

>2598_bases
ATGGTTTTAGGCAATAGAAATGAAAATTTCTCAGATTTTAAAAAAGGTAAGCTAAGGTCATTTATTTTATCTATTATCTC
TCCAATTGCACATTCATTTAAAAGAATTTACGAAATATGCATAACTGAAGTTCTGGAAGTCAGAAATAAAATTTTAAAAA
TTGAAAAAGGAACCATAAGCGAAGAAGTAAAAAAAAATATTGAAAAGGATATATCTGAACTGTATTTAAAGCATGAATTA
TACAATATAAGAAAGGAGTCTTATCTGACTTATCATGAACAGAAGGACCTAATTGATATTTGCGCAAAATGCCTAACTGC
CTGTTCTTCTTTTAAAAATCGAAGTAAACTTTTCGATGAGCCATCCAATGATTTTGTTAATAAATCGATAACAGAATTGT
CTTATTTGTCGAGTGAGTTTAAGAGATATAACAATAAAGAGTTTATTCAACAAAGAATCCAAAAGTATGATTACTTGTTC
CAGAAATCTTCTTTTCCTCTAGATAACTCACAGAAAAGAGCAATAGTGACTGACGATACACATAATTTAGTTGTAGCAGG
GGCTGGTTCTGGAAAAACCGAAGTTTTGATCACAAGAACTGCTTATCTAACGGAAAGAGCACCAGATAAAGTTGACGCAA
AAAAGATACTCATTCTTGCTTATCAGAATAAAGCCGCTGAAGAGGTTAAAAAAAGACTTAAAGATAGGTTTAGTATTGAA
GAGGCGGAAGTCAGAACTTTTCATTCATTGGGAAAGAAAATCTTGGAAGAAGGGAGCAAAATTTCTGGAAAAGAAACTCC
CAAATTAAAGTTTTCAGGTTCTAATTTTGACAAAGAATTTTCAAATTATATTGATCATTTATTCAACCTTAGAAAGACAA
ACGGAGATTTCCAGAAGAAAATTGTAGACTATATGAGATTATACCATGACAATGAGGTAATAAAGAGCCAGAAAGACTTT
CAGAAAAAAGAAGAGTTCTACAAATATATGGCGAATTTAACATATACGGCTCTTGACGGAACTGAGGTCAAAAGTGAGGC
TGAAAGGGCGATATTGAACTTTTTCATTAGTCATAACCTGAATGGAGAAAGAGCTAAAATACGTTATGAAGACCCCGCTA
GATGGATGACGTATACTGATTCTAACGGTAATAAAAAGTCTCCAAAGCCCGATTTTTTCTTTCCAGATTATGATATTTAC
CTTGAGCATTGGGCAATAGACAAAAATGGAAGAGTTCCAGACTGGTTTGAAGGTAAAAACGCGTCAGAAGAATACAAACT
TGGGATGAAGAGAAAAAAAGAGAAGTTTGCACAGCAAGATAAATATTCTCTTGTAGAAATCGCAAGTTATGAATTCAAAG
GGGAGAAATTTGAGGAAATTTTAATTGAAAGAATGACAAAGGCATTAAAAATAAAGTATCCCGATAAAGATTTTGAATTC
ACCCCTGTTCCTTATGATCAACTTATCAGTAGAGTGAATTACGGATGCAAAGAATCTCTCAGAGGCCTTTCATTCAATGT
TAGCAAATTCATAACAATTGCTAAAACATACAATCTGCCTCCAGAGAATATCAATCAAAGGTTACGAAGTGAAAGATGGT
CTGCAAAACAAGAATCTTTTGCCAGGATCGTATTAGATATCTATGAGCTTTATGAAAACGAATTGAGAACTGAAAACAGG
ATTGATTTTGCTGATATGATCAATCTTGCTGTAAAGGAACTGAAAGAGAATCAGGAACTATACAGAAATTCTTTTGCCCA
AATTTTAATTGATGAATATCAGGATATAAGTTCTCAAAGATATGAGCTTATACGCGAACTTATGAAGAAGAATGATGGGT
GTAAACTGTTCTGCGTAGGCGACGATTGGCAGAGTATAATGGGTTTCTCAGGTTCCGACCTTGACTTTTTTGTCAATTTC
CATGAGTACTTCGACCATCCGGCCAGGACTGACCTTTCCACAAATTATAGAAGCTGTAAATCTATAGTGGACACAGGTGC
TGAAATTATAAAATATAATAGAGACTCACAGATAGAAAAAGACACTTTTGCTAAGAATACTACCGAGAATCCTATAAGAA
TATACGTCTCAAAACATAATCGAAGATCAACAAATATGTACTATTCTCAAATTGCCAGTCATTGCGTTTATTCAATAAAG
CATTATCTTGATGACGGTTATGAACCGAAAGATATTCTACTTCTATCAAGAATTGGAAAAAATCTTAAGATGAAGAATAC
GTTAATGGAATACGCTAAAACTCTAGATGTTCCTATTTCTTTTGATGGGAGTAAAAACCCGAATAAAATACCTTTTATGA
CAGTACATAAAAGTAAGGGTTTACAGGCAAAAGCTGTTTTCCTGCTGGACGTGGTGGAAGACTTGTACGGTTTTCCCTGC
GAAATAGAAAACCCAGATATTTTCGAGCCAGCAATTCTTTCACGTAAAAGAGATAGATATGAGGAAGAAAGGAGACTCTT
TTACGTTGCTGTAACAAGAGCTATGAATGACCTGATAATTTACACCCGAAAAGATTCGGTAAGCAAATTTATACATGAAA
TTGAAAACAAGGTTACTTTTTATGAACTCAATAACTGA

Upstream 100 bases:

>100_bases
TTCAGTTTCCGATTTACCTACATTTTTATAGAATGTATGTATAAATATTCAGATCATACATATACAGAAATTTTTCTTCC
AGCTACTAGAAGGCACTAGA

Downstream 100 bases:

>100_bases
ATTAAAGTATCAGGGTTTTAATTCTCAAATATCCAAAACCGAATCCTCGAATTCTAAATAAAAAATTCTCCCTCAAACCA
ATTCAATTGAAATATATTTT

Product: superfamily I DNA/RNA helicase

Products: NA

Alternate protein names: 75 kDa helicase [H]

Number of amino acids: Translated: 865; Mature: 865
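
Note: the residue count is consistent with the coordinates above. The span 30530-33127 is inclusive, giving 2598 bases, or 866 codons, the last of which is the TGA stop codon. A short Python check of that arithmetic follows (helper names are illustrative; it assumes an uninterrupted coding sequence with one terminal stop codon).

```python
def gene_length(start, end):
    """Length of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(n_bases):
    """Residue count for a coding sequence that ends in one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bases // 3 - 1  # subtract the stop codon


bases = gene_length(30530, 33127)
print(bases, protein_length(bases))  # prints 2598 865
```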

Protein sequence:

>865_residues
MVLGNRNENFSDFKKGKLRSFILSIISPIAHSFKRIYEICITEVLEVRNKILKIEKGTISEEVKKNIEKDISELYLKHEL
YNIRKESYLTYHEQKDLIDICAKCLTACSSFKNRSKLFDEPSNDFVNKSITELSYLSSEFKRYNNKEFIQQRIQKYDYLF
QKSSFPLDNSQKRAIVTDDTHNLVVAGAGSGKTEVLITRTAYLTERAPDKVDAKKILILAYQNKAAEEVKKRLKDRFSIE
EAEVRTFHSLGKKILEEGSKISGKETPKLKFSGSNFDKEFSNYIDHLFNLRKTNGDFQKKIVDYMRLYHDNEVIKSQKDF
QKKEEFYKYMANLTYTALDGTEVKSEAERAILNFFISHNLNGERAKIRYEDPARWMTYTDSNGNKKSPKPDFFFPDYDIY
LEHWAIDKNGRVPDWFEGKNASEEYKLGMKRKKEKFAQQDKYSLVEIASYEFKGEKFEEILIERMTKALKIKYPDKDFEF
TPVPYDQLISRVNYGCKESLRGLSFNVSKFITIAKTYNLPPENINQRLRSERWSAKQESFARIVLDIYELYENELRTENR
IDFADMINLAVKELKENQELYRNSFAQILIDEYQDISSQRYELIRELMKKNDGCKLFCVGDDWQSIMGFSGSDLDFFVNF
HEYFDHPARTDLSTNYRSCKSIVDTGAEIIKYNRDSQIEKDTFAKNTTENPIRIYVSKHNRRSTNMYYSQIASHCVYSIK
HYLDDGYEPKDILLLSRIGKNLKMKNTLMEYAKTLDVPISFDGSKNPNKIPFMTVHKSKGLQAKAVFLLDVVEDLYGFPC
EIENPDIFEPAILSRKRDRYEEERRLFYVAVTRAMNDLIIYTRKDSVSKFIHEIENKVTFYELNN

Sequences:

>Translated_865_residues
MVLGNRNENFSDFKKGKLRSFILSIISPIAHSFKRIYEICITEVLEVRNKILKIEKGTISEEVKKNIEKDISELYLKHEL
YNIRKESYLTYHEQKDLIDICAKCLTACSSFKNRSKLFDEPSNDFVNKSITELSYLSSEFKRYNNKEFIQQRIQKYDYLF
QKSSFPLDNSQKRAIVTDDTHNLVVAGAGSGKTEVLITRTAYLTERAPDKVDAKKILILAYQNKAAEEVKKRLKDRFSIE
EAEVRTFHSLGKKILEEGSKISGKETPKLKFSGSNFDKEFSNYIDHLFNLRKTNGDFQKKIVDYMRLYHDNEVIKSQKDF
QKKEEFYKYMANLTYTALDGTEVKSEAERAILNFFISHNLNGERAKIRYEDPARWMTYTDSNGNKKSPKPDFFFPDYDIY
LEHWAIDKNGRVPDWFEGKNASEEYKLGMKRKKEKFAQQDKYSLVEIASYEFKGEKFEEILIERMTKALKIKYPDKDFEF
TPVPYDQLISRVNYGCKESLRGLSFNVSKFITIAKTYNLPPENINQRLRSERWSAKQESFARIVLDIYELYENELRTENR
IDFADMINLAVKELKENQELYRNSFAQILIDEYQDISSQRYELIRELMKKNDGCKLFCVGDDWQSIMGFSGSDLDFFVNF
HEYFDHPARTDLSTNYRSCKSIVDTGAEIIKYNRDSQIEKDTFAKNTTENPIRIYVSKHNRRSTNMYYSQIASHCVYSIK
HYLDDGYEPKDILLLSRIGKNLKMKNTLMEYAKTLDVPISFDGSKNPNKIPFMTVHKSKGLQAKAVFLLDVVEDLYGFPC
EIENPDIFEPAILSRKRDRYEEERRLFYVAVTRAMNDLIIYTRKDSVSKFIHEIENKVTFYELNN
>Mature_865_residues
MVLGNRNENFSDFKKGKLRSFILSIISPIAHSFKRIYEICITEVLEVRNKILKIEKGTISEEVKKNIEKDISELYLKHEL
YNIRKESYLTYHEQKDLIDICAKCLTACSSFKNRSKLFDEPSNDFVNKSITELSYLSSEFKRYNNKEFIQQRIQKYDYLF
QKSSFPLDNSQKRAIVTDDTHNLVVAGAGSGKTEVLITRTAYLTERAPDKVDAKKILILAYQNKAAEEVKKRLKDRFSIE
EAEVRTFHSLGKKILEEGSKISGKETPKLKFSGSNFDKEFSNYIDHLFNLRKTNGDFQKKIVDYMRLYHDNEVIKSQKDF
QKKEEFYKYMANLTYTALDGTEVKSEAERAILNFFISHNLNGERAKIRYEDPARWMTYTDSNGNKKSPKPDFFFPDYDIY
LEHWAIDKNGRVPDWFEGKNASEEYKLGMKRKKEKFAQQDKYSLVEIASYEFKGEKFEEILIERMTKALKIKYPDKDFEF
TPVPYDQLISRVNYGCKESLRGLSFNVSKFITIAKTYNLPPENINQRLRSERWSAKQESFARIVLDIYELYENELRTENR
IDFADMINLAVKELKENQELYRNSFAQILIDEYQDISSQRYELIRELMKKNDGCKLFCVGDDWQSIMGFSGSDLDFFVNF
HEYFDHPARTDLSTNYRSCKSIVDTGAEIIKYNRDSQIEKDTFAKNTTENPIRIYVSKHNRRSTNMYYSQIASHCVYSIK
HYLDDGYEPKDILLLSRIGKNLKMKNTLMEYAKTLDVPISFDGSKNPNKIPFMTVHKSKGLQAKAVFLLDVVEDLYGFPC
EIENPDIFEPAILSRKRDRYEEERRLFYVAVTRAMNDLIIYTRKDSVSKFIHEIENKVTFYELNN

Specific function: Helicase IV catalyzes the unwinding of duplex DNA in the 3' to 5' direction with respect to the bound single strand in a reaction that is dependent upon the hydrolysis of ATP [H]

COG id: COG0210

COG function: function code L; Superfamily I DNA and RNA helicases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 uvrD-like helicase ATP-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1787196, Length=320, Percent_Identity=31.5625, Blast_Score=131, Evalue=2e-31,
Organism=Escherichia coli, GI2367296, Length=155, Percent_Identity=30.3225806451613, Blast_Score=70, Evalue=5e-13,
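
Note: each homologue line is a comma-separated list of key=value pairs, with the GI number given as a bare token and a trailing comma at the end. A minimal parsing sketch in Python, assuming every line follows exactly this layout (`parse_homologue` is a hypothetical helper, not a tool supplied with this record):

```python
def parse_homologue(line):
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        key, sep, value = token.partition("=")
        if sep:
            record[key] = value
        elif token.startswith("GI"):
            # The GI number appears without a key, e.g. "GI1787196".
            record["GI"] = token[2:]
    record["Length"] = int(record["Length"])
    for field in ("Percent_Identity", "Blast_Score", "Evalue"):
        record[field] = float(record[field])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787196, Length=320, "
    "Percent_Identity=31.5625, Blast_Score=131, Evalue=2e-31,"
)
print(hit["Organism"], hit["GI"], hit["Length"])
```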

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000212
- InterPro:   IPR014016
- InterPro:   IPR022161
- InterPro:   IPR003661 [H]

Pfam domain/function: PF12462 Nucleolin_N; PF00580 UvrD-helicase [H]

EC number: 3.6.4.12 [H]

Molecular weight: Translated: 101763; Mature: 101763

Theoretical pI: Translated: 8.51; Mature: 8.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
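
Note: these percentages are residue counts divided by the protein length. The Python sketch below shows the calculation; `residue_percent` is an illustrative name, and rounding to one decimal place is an assumption based on how the values are printed here.

```python
def residue_percent(protein, residues):
    """Percentage of the protein made up of the given one-letter residues.

    Whitespace (for example FASTA line wrapping) is ignored.
    """
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# Toy 10-residue peptide, not taken from this record:
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"), residue_percent(toy, "CM"))  # prints 10.0 20.0
```

Passing "C", "M" and "CM" with the 865-residue sequence listed above would give the three figures for comparison with this table.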

Secondary structure:

>Translated Secondary Structure
MVLGNRNENFSDFKKGKLRSFILSIISPIAHSFKRIYEICITEVLEVRNKILKIEKGTIS
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCHH
EEVKKNIEKDISELYLKHELYNIRKESYLTYHEQKDLIDICAKCLTACSSFKNRSKLFDE
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PSNDFVNKSITELSYLSSEFKRYNNKEFIQQRIQKYDYLFQKSSFPLDNSQKRAIVTDDT
CCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCC
HNLVVAGAGSGKTEVLITRTAYLTERAPDKVDAKKILILAYQNKAAEEVKKRLKDRFSIE
CCEEEEECCCCCCEEEEEEHHHHHHCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCH
EAEVRTFHSLGKKILEEGSKISGKETPKLKFSGSNFDKEFSNYIDHLFNLRKTNGDFQKK
HHHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCHHHHH
IVDYMRLYHDNEVIKSQKDFQKKEEFYKYMANLTYTALDGTEVKSEAERAILNFFISHNL
HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCEEEECCCCHHHHHHHHHHHHHHHHCCC
NGERAKIRYEDPARWMTYTDSNGNKKSPKPDFFFPDYDIYLEHWAIDKNGRVPDWFEGKN
CCCEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCHHEEEEEEEECCCCCCCCCCCCCC
ASEEYKLGMKRKKEKFAQQDKYSLVEIASYEFKGEKFEEILIERMTKALKIKYPDKDFEF
CCHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHEEECCCCCCCC
TPVPYDQLISRVNYGCKESLRGLSFNVSKFITIAKTYNLPPENINQRLRSERWSAKQESF
CCCCHHHHHHHHCCCHHHHHCCCCCCHHHHEEEEHCCCCCHHHHHHHHHHHHCCHHHHHH
ARIVLDIYELYENELRTENRIDFADMINLAVKELKENQELYRNSFAQILIDEYQDISSQR
HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YELIRELMKKNDGCKLFCVGDDWQSIMGFSGSDLDFFVNFHEYFDHPARTDLSTNYRSCK
HHHHHHHHHCCCCCEEEEECCCHHHHHCCCCCCHHEEEEHHHHCCCCCCCCCCCCHHHHH
SIVDTGAEIIKYNRDSQIEKDTFAKNTTENPIRIYVSKHNRRSTNMYYSQIASHCVYSIK
HHHHCCHHHHCCCCCCCCCHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH
HYLDDGYEPKDILLLSRIGKNLKMKNTLMEYAKTLDVPISFDGSKNPNKIPFMTVHKSKG
HHHCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEECCCCCCCCCCCEEEEECCCC
LQAKAVFLLDVVEDLYGFPCEIENPDIFEPAILSRKRDRYEEERRLFYVAVTRAMNDLII
CCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEEHHHHHHHHCCEEE
YTRKDSVSKFIHEIENKVTFYELNN
EECCHHHHHHHHHHHCCEEEEEECC
>Mature Secondary Structure
MVLGNRNENFSDFKKGKLRSFILSIISPIAHSFKRIYEICITEVLEVRNKILKIEKGTIS
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCHH
EEVKKNIEKDISELYLKHELYNIRKESYLTYHEQKDLIDICAKCLTACSSFKNRSKLFDE
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PSNDFVNKSITELSYLSSEFKRYNNKEFIQQRIQKYDYLFQKSSFPLDNSQKRAIVTDDT
CCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCC
HNLVVAGAGSGKTEVLITRTAYLTERAPDKVDAKKILILAYQNKAAEEVKKRLKDRFSIE
CCEEEEECCCCCCEEEEEEHHHHHHCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCH
EAEVRTFHSLGKKILEEGSKISGKETPKLKFSGSNFDKEFSNYIDHLFNLRKTNGDFQKK
HHHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCHHHHH
IVDYMRLYHDNEVIKSQKDFQKKEEFYKYMANLTYTALDGTEVKSEAERAILNFFISHNL
HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCEEEECCCCHHHHHHHHHHHHHHHHCCC
NGERAKIRYEDPARWMTYTDSNGNKKSPKPDFFFPDYDIYLEHWAIDKNGRVPDWFEGKN
CCCEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCHHEEEEEEEECCCCCCCCCCCCCC
ASEEYKLGMKRKKEKFAQQDKYSLVEIASYEFKGEKFEEILIERMTKALKIKYPDKDFEF
CCHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHEEECCCCCCCC
TPVPYDQLISRVNYGCKESLRGLSFNVSKFITIAKTYNLPPENINQRLRSERWSAKQESF
CCCCHHHHHHHHCCCHHHHHCCCCCCHHHHEEEEHCCCCCHHHHHHHHHHHHCCHHHHHH
ARIVLDIYELYENELRTENRIDFADMINLAVKELKENQELYRNSFAQILIDEYQDISSQR
HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YELIRELMKKNDGCKLFCVGDDWQSIMGFSGSDLDFFVNFHEYFDHPARTDLSTNYRSCK
HHHHHHHHHCCCCCEEEEECCCHHHHHCCCCCCHHEEEEHHHHCCCCCCCCCCCCHHHHH
SIVDTGAEIIKYNRDSQIEKDTFAKNTTENPIRIYVSKHNRRSTNMYYSQIASHCVYSIK
HHHHCCHHHHCCCCCCCCCHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH
HYLDDGYEPKDILLLSRIGKNLKMKNTLMEYAKTLDVPISFDGSKNPNKIPFMTVHKSKG
HHHCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEECCCCCCCCCCCEEEEECCCC
LQAKAVFLLDVVEDLYGFPCEIENPDIFEPAILSRKRDRYEEERRLFYVAVTRAMNDLII
CCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEEHHHHHHHHCCEEE
YTRKDSVSKFIHEIENKVTFYELNN
EECCHHHHHHHHHHHCCEEEEEECC
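
Note: in the secondary structure blocks, residue lines alternate with state lines of equal length. The state letters are read here as H = helix, E = strand, C = coil, which is the usual convention but is not stated in this record. The Python sketch below rejoins the interleaved lines and lists each run of identical states with 1-based residue positions; both helper names are hypothetical.

```python
from itertools import groupby


def pair_lines(lines):
    """Join alternating residue/state lines into two full-length strings."""
    sequence = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(sequence) != len(states):
        raise ValueError("residue and state strings differ in length")
    return sequence, states


def segments(states):
    """Return (state, first, last) for each run, using 1-based positions."""
    result = []
    position = 0
    for state, run in groupby(states):
        length = len(list(run))
        result.append((state, position + 1, position + length))
        position += length
    return result


# Toy input, not taken from this record:
seq, ss = pair_lines(["MVL", "CCH", "GN", "HE"])
print(seq, segments(ss))
```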

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2542273; 8905232; 9278503 [H]