Definition Methanosarcina mazei Go1 chromosome, complete genome.
Accession NC_003901
Length 4,096,345

The map label for this gene is hflX [C]

Identifier: 21226612

GI number: 21226612

Start: 634643

End: 635710

Strand: Reverse

Name: hflX [C]

Synonym: MM_0510

Alternate gene names: 21226612

Gene position: 635710-634643 (Counterclockwise)

Preceding gene: 21226613

Following gene: 21226611

Centisome position: 15.52
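
The coordinate fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes that the centisome position is the gene's 5' coordinate (635710 for this reverse-strand gene) expressed as a percentage of the chromosome length. That definition is inferred from the numbers in this record, not stated by it, and the function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 4_096_345
START, END = 634_643, 635_710  # reverse strand, so the gene begins at END


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * position / genome_length


print(gene_length(START, END))                  # 1068
print(round(centisome(END, GENOME_LENGTH), 2))  # 15.52
```

Using the lower coordinate (634643) instead gives 15.49, so the reported 15.52 is consistent with measuring from the 5' end of the gene.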

GC content: 45.69%

Gene sequence:

>1068_bases
GTGAGGCAGAAGAGAGCCGGAAAGGTAATTTTCTATAACAGACTTTCCACAATTCAGCTTTTCAATATCTCGGAAATCTG
CGGGTGCCAGGTAATGGACAAATTCCAGCTTATCCTGGAGATTTTTGCAAAAAGAGCTACTACACGCCGTTCCAAACTTC
AGGTTGAACTTGCCAGGCTGAAATATGAAATTCCAAGAGCGCGAGCTATTGTTTCCCTTGTGAAAAAAGAGGAAAGAGCA
GGTTTTATGGGGCTTGGGGACTATGAAGACGCCTACGAGCAGGACCTGAAGAAAAGGATAGCCAGGATCGAAGATGAACT
CGAGTCTGCAGGAAAAGATGACGAGTCTCTCCGGGCTTTTCGGCACAGGAAAGGCTTTTCCCTTGTCGCTCTTGCAGGTT
ACACCAATGCCGGAAAAAGCACACTTTTCAATGCGATTGTTAATGAAAGCGTTGAAGCCAGGAACATGCTTTTTACAACC
CTTGTTCCCACAACCAGGGCGCTTGACCTGGGAGGAAGAAAAGCACTTTTGACGGATACCGTGGGGTTTATAGAGGAGCT
TCCTCACTGGCTTGTCGATGCATTCAAGTCGACCCTTGATGAAATCTTTCTTTCCGACCTCATCCTGCTGGTGGTTGATG
CAGGTGAAAAACCCGAAACAATCCTGCAAAAGCTCTCCACGTCCCACGACACTCTCTGGGACCGCATACAGGGAGTTCCG
ATAATCACAGTGCTTAACAAAATCGACCTGCTTGAAGAAGCCCAGCTTGAAGCTCTTATGGAAGAAATTGGTTATATGGC
TCCGAATCCTGTATTTGTTTCCGCAAAAAAGAAGATCGGAATGCAGGAGTTAAAAGATGAAATAATCAAACATCTGCCTG
CCTGGTCTTTCTATTCTTTTTCTCTCCCCAATTCGGAAAAAGGGATGTCAGTCCTCTCCTGGCTCTACGATGAAGGCATT
GTGCACAGGGTCGAGTACGGAGAAAGGATTTCAGTGGATTATGAAGCCAGAGAAGACATAATTAACAGGATAAAATCTCT
GGAACTTAATCCTGATGAGCAGGGTTAA
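
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper name, and the demonstration uses only the first 18 bases of the gene, not the whole 1068-base sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 18 bases of the gene sequence in this record (8 G, 3 C).
first_bases = "GTGAGGCAGAAGAGAGCC"
print(round(gc_percent(first_bases), 2))  # 61.11
```

Passing the complete sequence (with line breaks removed) to the same function is how the record's value could be cross-checked.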

Upstream 100 bases:

>100_bases
AGCCGCAGGCTATATCCCTGTCGGTGAGCTTACCCAGACAAGGTTTCCTGACTCAAGGTACCAGCTCGGGAAAGGAAAGA
TAGAGGAACTTGCCGAACTT

Downstream 100 bases:

>100_bases
ATTTGTTTTTAATTCCCTTTCAGGCAGCACGGAATTATGATAAACTTTTCCAATTATATAACCTACTCTCGGCAGGTAAA
CATATAGATTAATATTATTA

Product: GTP-binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 355; Mature: 355
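
The residue count follows from the gene length: 1068 bases make 356 codons, and the final codon (TAA) is a stop, leaving 355 residues. The sketch below illustrates this with the standard genetic code. It assumes the GTG start codon is read as methionine at the first position, as the protein sequence in this record implies; `translate` is an illustrative helper, not part of any tool used to build the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[cds[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon (here GTG) is always read as Met.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First six codons and last six codons of the gene sequence in this record.
print(translate("GTGAGGCAGAAGAGAGCC"))  # MRQKRA
print(translate("CCTGATGAGCAGGGTTAA"))  # MDEQG (first codon forced to M)
print(1068 // 3 - 1)                    # 355
```

The second call shows the side effect of the start-codon assumption: the helper is only meaningful when the input begins at the true start of the coding sequence.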

Protein sequence:

>355_residues
MRQKRAGKVIFYNRLSTIQLFNISEICGCQVMDKFQLILEIFAKRATTRRSKLQVELARLKYEIPRARAIVSLVKKEERA
GFMGLGDYEDAYEQDLKKRIARIEDELESAGKDDESLRAFRHRKGFSLVALAGYTNAGKSTLFNAIVNESVEARNMLFTT
LVPTTRALDLGGRKALLTDTVGFIEELPHWLVDAFKSTLDEIFLSDLILLVVDAGEKPETILQKLSTSHDTLWDRIQGVP
IITVLNKIDLLEEAQLEALMEEIGYMAPNPVFVSAKKKIGMQELKDEIIKHLPAWSFYSFSLPNSEKGMSVLSWLYDEGI
VHRVEYGERISVDYEAREDIINRIKSLELNPDEQG

Sequences:

>Translated_355_residues
MRQKRAGKVIFYNRLSTIQLFNISEICGCQVMDKFQLILEIFAKRATTRRSKLQVELARLKYEIPRARAIVSLVKKEERA
GFMGLGDYEDAYEQDLKKRIARIEDELESAGKDDESLRAFRHRKGFSLVALAGYTNAGKSTLFNAIVNESVEARNMLFTT
LVPTTRALDLGGRKALLTDTVGFIEELPHWLVDAFKSTLDEIFLSDLILLVVDAGEKPETILQKLSTSHDTLWDRIQGVP
IITVLNKIDLLEEAQLEALMEEIGYMAPNPVFVSAKKKIGMQELKDEIIKHLPAWSFYSFSLPNSEKGMSVLSWLYDEGI
VHRVEYGERISVDYEAREDIINRIKSLELNPDEQG
>Mature_355_residues
MRQKRAGKVIFYNRLSTIQLFNISEICGCQVMDKFQLILEIFAKRATTRRSKLQVELARLKYEIPRARAIVSLVKKEERA
GFMGLGDYEDAYEQDLKKRIARIEDELESAGKDDESLRAFRHRKGFSLVALAGYTNAGKSTLFNAIVNESVEARNMLFTT
LVPTTRALDLGGRKALLTDTVGFIEELPHWLVDAFKSTLDEIFLSDLILLVVDAGEKPETILQKLSTSHDTLWDRIQGVP
IITVLNKIDLLEEAQLEALMEEIGYMAPNPVFVSAKKKIGMQELKDEIIKHLPAWSFYSFSLPNSEKGMSVLSWLYDEGI
VHRVEYGERISVDYEAREDIINRIKSLELNPDEQG

Specific function: Unknown

COG id: COG2262

COG function: function code R; GTPases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 G (guanine nucleotide-binding) domain [H]

Homologues:

Organism=Homo sapiens, GI310124269, Length=253, Percent_Identity=35.5731225296443, Blast_Score=110, Evalue=3e-24
Organism=Homo sapiens, GI310124267, Length=253, Percent_Identity=35.5731225296443, Blast_Score=110, Evalue=3e-24
Organism=Homo sapiens, GI310124171, Length=253, Percent_Identity=35.5731225296443, Blast_Score=110, Evalue=3e-24
Organism=Homo sapiens, GI310124169, Length=253, Percent_Identity=35.5731225296443, Blast_Score=110, Evalue=3e-24
Organism=Homo sapiens, GI310133118, Length=254, Percent_Identity=35.0393700787402, Blast_Score=107, Evalue=2e-23
Organism=Homo sapiens, GI310133116, Length=254, Percent_Identity=35.0393700787402, Blast_Score=107, Evalue=2e-23
Organism=Escherichia coli, GI1790615, Length=309, Percent_Identity=38.1877022653722, Blast_Score=167, Evalue=8e-43
Organism=Caenorhabditis elegans, GI17561038, Length=286, Percent_Identity=28.6713286713287, Blast_Score=86, Evalue=2e-17
Organism=Drosophila melanogaster, GI24650079, Length=306, Percent_Identity=30.3921568627451, Blast_Score=101, Evalue=6e-22

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR016496
- InterPro:   IPR006073
- InterPro:   IPR002917
- InterPro:   IPR005225 [H]

Pfam domain/function: PF01926 MMR_HSR1 [H]

EC number: NA

Molecular weight: Translated: 40422 Da; Mature: 40422 Da

Theoretical pI: Translated: 5.38; Mature: 5.38

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
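
The percentages above can be reproduced by counting residues in the 355-residue protein sequence from this record. This is a minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place; `residue_percent` is a hypothetical helper name.

```python
# Protein sequence copied from this record (355 residues).
PROTEIN = (
    "MRQKRAGKVIFYNRLSTIQLFNISEICGCQVMDKFQLILEIFAKRATTRRSKLQVELARLKYEIPRARAIVSLVKKEERA"
    "GFMGLGDYEDAYEQDLKKRIARIEDELESAGKDDESLRAFRHRKGFSLVALAGYTNAGKSTLFNAIVNESVEARNMLFTT"
    "LVPTTRALDLGGRKALLTDTVGFIEELPHWLVDAFKSTLDEIFLSDLILLVVDAGEKPETILQKLSTSHDTLWDRIQGVP"
    "IITVLNKIDLLEEAQLEALMEEIGYMAPNPVFVSAKKKIGMQELKDEIIKHLPAWSFYSFSLPNSEKGMSVLSWLYDEGI"
    "VHRVEYGERISVDYEAREDIINRIKSLELNPDEQG"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(residue) for residue in residues)
    return round(100.0 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 0.6  (2 of 355)
print(residue_percent(PROTEIN, "M"))   # 2.3  (8 of 355)
print(residue_percent(PROTEIN, "CM"))  # 2.8  (10 of 355)
```

Because the translated and mature sequences in this record are identical, both sets of figures come out the same.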

Secondary structure:

>Translated Secondary Structure
MRQKRAGKVIFYNRLSTIQLFNISEICGCQVMDKFQLILEIFAKRATTRRSKLQVELARL
CCCCCCCCEEEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KYEIPRARAIVSLVKKEERAGFMGLGDYEDAYEQDLKKRIARIEDELESAGKDDESLRAF
HHHCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
RHRKGFSLVALAGYTNAGKSTLFNAIVNESVEARNMLFTTLVPTTRALDLGGRKALLTDT
HHCCCCEEEEEECCCCCCHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHCCCCCCHHHHHH
VGFIEELPHWLVDAFKSTLDEIFLSDLILLVVDAGEKPETILQKLSTSHDTLWDRIQGVP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCHHHHHHHHHCCHHHHHHHHCCCH
IITVLNKIDLLEEAQLEALMEEIGYMAPNPVFVSAKKKIGMQELKDEIIKHLPAWSFYSF
HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECHHHCCHHHHHHHHHHHCCCCCEEEE
SLPNSEKGMSVLSWLYDEGIVHRVEYGERISVDYEAREDIINRIKSLELNPDEQG
CCCCCHHHHHHHHHHHHCCCEEHCCCCCEEECCHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
MRQKRAGKVIFYNRLSTIQLFNISEICGCQVMDKFQLILEIFAKRATTRRSKLQVELARL
CCCCCCCCEEEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KYEIPRARAIVSLVKKEERAGFMGLGDYEDAYEQDLKKRIARIEDELESAGKDDESLRAF
HHHCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
RHRKGFSLVALAGYTNAGKSTLFNAIVNESVEARNMLFTTLVPTTRALDLGGRKALLTDT
HHCCCCEEEEEECCCCCCHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHCCCCCCHHHHHH
VGFIEELPHWLVDAFKSTLDEIFLSDLILLVVDAGEKPETILQKLSTSHDTLWDRIQGVP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCHHHHHHHHHCCHHHHHHHHCCCH
IITVLNKIDLLEEAQLEALMEEIGYMAPNPVFVSAKKKIGMQELKDEIIKHLPAWSFYSF
HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECHHHCCHHHHHHHHHHHCCCCCEEEE
SLPNSEKGMSVLSWLYDEGIVHRVEYGERISVDYEAREDIINRIKSLELNPDEQG
CCCCCHHHHHHHHHHHHCCCEEHCCCCCEEECCHHHHHHHHHHHHCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8688087 [H]