Definition Rhizobium leguminosarum bv. trifolii WSM2304 chromosome, complete genome.
Accession NC_011369
Length 4,537,948

The map label for this gene is hmuS [H]

Identifier: 209550594

GI number: 209550594

Start: 3078114

End: 3079163

Strand: Direct

Name: hmuS [H]

Synonym: Rleg2_3018

Alternate gene names: 209550594

Gene position: 3078114-3079163 (Clockwise)

Preceding gene: 209550593

Following gene: 209550595

Centisome position: 67.83
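
Note: the gene length and centisome position can be cross-checked from the coordinates above. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from this record; the centisome formula is an assumption.
GENOME_LENGTH = 4_537_948
START, END = 3_078_114, 3_079_163


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1050
print(round(centisome(START, GENOME_LENGTH), 2))  # 67.83
```

Both results agree with the 1050-base gene sequence and the centisome position listed in this record.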

GC content: 61.33
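
Note: GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch follows, assuming that definition applies here; `gc_content` is an illustrative helper, not part of any tool used to build this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop line breaks and spaces so wrapped FASTA text can be pasted in directly.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))  # 50.0
```

For a 1050-base gene, a value of 61.33 corresponds to 644 G or C bases (644 / 1050 = 61.33 %).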

Gene sequence:

>1050_bases
ATGACTGAACAGACAAGACCGGCGCCAGCCGAAATCCGGGCGTTTCGCGCCGAAAATCCGAAGATGCGCGAGCGCGATAT
CGCCGCCCAGTTGAAGATTTCCGAGGCAGCCCTCGTCGCCGCCGAAACCGGCATCAGCGTGACCCGCATCGATGGCAGCG
CGCTGAAGCTTCTCGAACGCGTGGCGAGCCTCGGCGAAGTGATGGCGCTGTCGCGCAACGAAAGTGCCGTGCACGAAAAG
ATCGGCGTCTTCGAAAACATCAAAAGCGGCGTACAGGCCGCAATCGTTCTCGGCGAGAATATCGACCTGCGCATCTTCCC
GAGCCGATGGGAACATGGCTTCGCCGTATCCAAGAAGGATGGCGACCAGCTGCGCCTCAGCCTGCAATATTTCGACAAGG
CGGGCAACGCCGTGCACAAGGTGCACCTGCGCCCGAATTCGAATGTCGAGGCCTATCACGCGCTGGTTGCCGAGTTGAAG
CTGGAAGACCAGTCGCAGGACTTCGTCGAGGCCGAGACCGCAGATACCGTCGATGAAACCGCCGACGTCAGCCGCGACGA
GCTGCGCGACAACTGGAGCAGGCTCACCGACACGCATGAGTTCTTCGGCATGCTGAAGCGCCTGAAGATCGGCCGCCAGG
CGGCCGTGCGCAGCGTCGGCGACGACTATGCCTGGAAGCTCGACAGCAGCGCCACGGCGGAGATGATGCATGCCTCGGTG
AAATCCGGCCTGCCGATCATGTGCTTCGTCGCCAGTGACGGTGTCGTTCAGATCCATTCCGGCCCGATCTTCAACGTCCA
GACCATGGGCCCATGGATTAATATCATGGACCCAACCTTCCATCTGCATCTGCGGCAGGATCACATCGCCGAGACCTGGG
CGGTGCGCAAGCCGACCAAAGACGGCCACGTCACCTCGCTGGAGGCTTACAATGCGCAAGGCGAGATGATCATCCAGTTC
TTCGGCAAGCGGAAGGAAGGGTCCGACGAACGCACCGAGTGGCGCGAGATCATGGAAAACCTGCCGCGGGCAGCCAGTGT
CGCCGCATAA
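
Note: the protein sequence listed further down can be reproduced by translating this gene sequence. The sketch below assumes the standard genetic code (whose codon assignments are shared by the bacterial table) and a reading frame starting at the first base; `translate` is an illustrative helper written for this note.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the gene sequence above.
print(translate("ATGACTGAACAGACAAGACCGGCGCCAGCC"))  # MTEQTRPAPA
print(translate("GTCGCCGCATAA"))                    # VAA
```

The 1050 bases form 350 codons: 349 amino acids plus the terminating TAA, which matches the translated length given in this record.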

Upstream 100 bases:

>100_bases
AACGAGATCATGATCAGACACGAGGGCGTGACCTATCGCCTGAAGATCACCCGTCAGGGCAAGCTCATTCTCAATAAGTA
GGGCCAACAGCAGGTAAAAC

Downstream 100 bases:

>100_bases
GGATTGCAACGATGACGATGCGTAACAATCCGCGCCGGATTCGCCCCTGGCAACTGGCCGTGACGGCGGCCGTCATGGCA
CTGCCGCTGATCCCGTCGGC

Product: Haemin-degrading family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 349; Mature: 348

Protein sequence:

>349_residues
MTEQTRPAPAEIRAFRAENPKMRERDIAAQLKISEAALVAAETGISVTRIDGSALKLLERVASLGEVMALSRNESAVHEK
IGVFENIKSGVQAAIVLGENIDLRIFPSRWEHGFAVSKKDGDQLRLSLQYFDKAGNAVHKVHLRPNSNVEAYHALVAELK
LEDQSQDFVEAETADTVDETADVSRDELRDNWSRLTDTHEFFGMLKRLKIGRQAAVRSVGDDYAWKLDSSATAEMMHASV
KSGLPIMCFVASDGVVQIHSGPIFNVQTMGPWINIMDPTFHLHLRQDHIAETWAVRKPTKDGHVTSLEAYNAQGEMIIQF
FGKRKEGSDERTEWREIMENLPRAASVAA

Sequences:

>Translated_349_residues
MTEQTRPAPAEIRAFRAENPKMRERDIAAQLKISEAALVAAETGISVTRIDGSALKLLERVASLGEVMALSRNESAVHEK
IGVFENIKSGVQAAIVLGENIDLRIFPSRWEHGFAVSKKDGDQLRLSLQYFDKAGNAVHKVHLRPNSNVEAYHALVAELK
LEDQSQDFVEAETADTVDETADVSRDELRDNWSRLTDTHEFFGMLKRLKIGRQAAVRSVGDDYAWKLDSSATAEMMHASV
KSGLPIMCFVASDGVVQIHSGPIFNVQTMGPWINIMDPTFHLHLRQDHIAETWAVRKPTKDGHVTSLEAYNAQGEMIIQF
FGKRKEGSDERTEWREIMENLPRAASVAA
>Mature_348_residues
TEQTRPAPAEIRAFRAENPKMRERDIAAQLKISEAALVAAETGISVTRIDGSALKLLERVASLGEVMALSRNESAVHEKI
GVFENIKSGVQAAIVLGENIDLRIFPSRWEHGFAVSKKDGDQLRLSLQYFDKAGNAVHKVHLRPNSNVEAYHALVAELKL
EDQSQDFVEAETADTVDETADVSRDELRDNWSRLTDTHEFFGMLKRLKIGRQAAVRSVGDDYAWKLDSSATAEMMHASVK
SGLPIMCFVASDGVVQIHSGPIFNVQTMGPWINIMDPTFHLHLRQDHIAETWAVRKPTKDGHVTSLEAYNAQGEMIIQFF
GKRKEGSDERTEWREIMENLPRAASVAA

Specific function: Part of the binding-protein-dependent transport system for hemin [H]

COG id: COG3720

COG function: function code P; Putative heme degradation protein

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: To Y.enterocolitica hemS [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR007845 [H]

Pfam domain/function: PF05171 HemS [H]

EC number: NA

Molecular weight: Translated: 38932; Mature: 38801
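
Note: the two values differ by 131, which is the average mass of one methionine residue and is consistent with the mature form lacking the initiator Met. The sketch below shows one common way to estimate molecular weight, by summing average residue masses and adding one water; whether this record was computed the same way is not stated, so treat the method and the `average_mass` helper as assumptions.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def average_mass(protein: str) -> float:
    """Estimated average molecular mass of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Removing an N-terminal Met lowers the mass by one Met residue (about 131 Da).
print(round(average_mass("MTEQ") - average_mass("TEQ"), 1))  # 131.2
```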

Theoretical pI: Translated: 5.95; Mature: 5.95

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
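
Note: these percentages are residue counts divided by sequence length. They are consistent with 1 Cys and 11 Met among the 349 translated residues (1/349 = 0.3 %, 11/349 = 3.2 %, 12/349 = 3.4 %), with the mature form having one fewer Met. A minimal sketch of the calculation, using an illustrative helper name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return 100.0 * hits / len(protein)


toy = "MCAA"  # toy peptide, not taken from this record
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```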

Secondary structure:

>Translated Secondary Structure
MTEQTRPAPAEIRAFRAENPKMRERDIAAQLKISEAALVAAETGISVTRIDGSALKLLER
CCCCCCCCHHHHHEECCCCCCCHHHHHHHHEEECHHHEEEECCCCEEEEECCHHHHHHHH
VASLGEVMALSRNESAVHEKIGVFENIKSGVQAAIVLGENIDLRIFPSRWEHGFAVSKKD
HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCEEEECCCCCCCCEEECCCC
GDQLRLSLQYFDKAGNAVHKVHLRPNSNVEAYHALVAELKLEDQSQDFVEAETADTVDET
CCEEEEEEEHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCCHHHHHHCHHHHHHH
ADVSRDELRDNWSRLTDTHEFFGMLKRLKIGRQAAVRSVGDDYAWKLDSSATAEMMHASV
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHH
KSGLPIMCFVASDGVVQIHSGPIFNVQTMGPWINIMDPTFHLHLRQDHIAETWAVRKPTK
HCCCCEEEEEECCCEEEECCCCEEEEEECCCEEEEECCEEEEEECHHHHHHHHHCCCCCC
DGHVTSLEAYNAQGEMIIQFFGKRKEGSDERTEWREIMENLPRAASVAA
CCCEEEEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCHHCCCC
>Mature Secondary Structure 
TEQTRPAPAEIRAFRAENPKMRERDIAAQLKISEAALVAAETGISVTRIDGSALKLLER
CCCCCCCHHHHHEECCCCCCCHHHHHHHHEEECHHHEEEECCCCEEEEECCHHHHHHHH
VASLGEVMALSRNESAVHEKIGVFENIKSGVQAAIVLGENIDLRIFPSRWEHGFAVSKKD
HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCEEEECCCCCCCCEEECCCC
GDQLRLSLQYFDKAGNAVHKVHLRPNSNVEAYHALVAELKLEDQSQDFVEAETADTVDET
CCEEEEEEEHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCCHHHHHHCHHHHHHH
ADVSRDELRDNWSRLTDTHEFFGMLKRLKIGRQAAVRSVGDDYAWKLDSSATAEMMHASV
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHH
KSGLPIMCFVASDGVVQIHSGPIFNVQTMGPWINIMDPTFHLHLRQDHIAETWAVRKPTK
HCCCCEEEEEECCCEEEECCCCEEEEEECCCEEEEECCEEEEEECHHHHHHHHHCCCCCC
DGHVTSLEAYNAQGEMIIQFFGKRKEGSDERTEWREIMENLPRAASVAA
CCCEEEEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCHHCCCC
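
Note: each prediction line above is aligned residue by residue with the sequence line before it. The sketch below summarises such a string, assuming the usual convention that H marks helix, E marks strand and C marks coil; the record does not define the letters, and the helper names are illustrative.

```python
from collections import Counter
from itertools import groupby


def segments(ss: str) -> list[tuple[str, int]]:
    """Run-length encode a secondary-structure string, e.g. CCHHH -> C2, H3."""
    return [(state, len(list(run))) for state, run in groupby(ss)]


def state_fractions(ss: str) -> dict[str, float]:
    """Fraction of residues assigned to each state."""
    counts = Counter(ss)
    return {state: n / len(ss) for state, n in counts.items()}


toy = "CCHHHEE"  # toy string, not taken from this record
print(segments(toy))  # [('C', 2), ('H', 3), ('E', 2)]
```

Joining the prediction lines of one block into a single string before calling these helpers gives per-state totals that can be compared with the "Alpha Beta" structure class listed below.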

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9026634; 11586360; 12142430 [H]