Definition: Mesorhizobium sp. BNC1, complete genome.
Accession: NC_008254
Length: 4,412,446

The map label for this gene is yghZ [H]

Identifier: 110632809

GI number: 110632809

Start: 494093

End: 495139

Strand: Direct

Name: yghZ [H]

Synonym: Meso_0448

Alternate gene names: 110632809

Gene position: 494093-495139 (Clockwise)

Preceding gene: 110632808

Following gene: 110632810

Centisome position: 11.2
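
The gene length and centisome position can be cross-checked against the coordinates in this record. The sketch below assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state the formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Coordinates and genome length as listed in this record.
length = gene_length(494093, 495139)
position = centisome(494093, 4412446)
print(length, round(position, 1))  # prints: 1047 11.2
```

Both values agree with the 1047-base gene sequence and the centisome position of 11.2 given in the record.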

GC content: 60.65

Gene sequence:

>1047_bases
ATGAATTGGGAGCCTTCGCCTGACCGTTACGAGGGAATGCAGTATCGCCGTTGCGGCCGCTCCGGCCTCAAACTGCCCCC
GATTTCTCTCGGCTTGTGGCATAATTTCGGGGAGGATCGGCCGCACGAGATGAAACAGGCGATCTGCCGCAGGGCTTTCG
ACCTCGGCATCACCCATTTCGACCTAGCCAACAATTACGGCCCGCCACCGGGCTCGGCCGAGGAAGCCTTTGGGGAAATC
CTGCGCACAGACTTCGCCGGTTATCGCGATCAGCTCATCATTTCCTCGAAGGCGGGCTATCTGATGTGGCCGGGGCCCTA
TGGCGAGTGGGGCAGCCGCAAATATCTCATCGCGAGCTGCGACCAGAGCCTGAAGCGGATGGGGCTCGATTATGTCGATA
TTTTCTATTCCCACCGCTTCGATCCGCAGACGCCCCTTGAAGAGACGATGATGGCCCTCGATCATATCGCTCGGTCGGGC
CGCGCGCTTTATGTAGGCATCTCGTCCTATAATTCGCGCCGCACACGGGAGGCCGTTGCCATCCTCGAGGATCTCGGGAC
GCCCTGCCTTATTCATCAGCCCAGCTACTCGATGATCAATCGCTGGGTGGAGGATGATGGCCTGCTCGACACGCTTGAAG
AGCTGGGTGTTGGCTGCATCGCCTTTTCGCCGCTAGCTCAGGGCATGCTCACCGACAGATATCTGGGCGGCATTCCTCAA
GACAGCCGCGCTGCGCAGGGCAAGTCGCTCCGTAAGGAGTTCATAAACGAGAAGACGCTCGGAAACATCCGCCACTTGAA
CGAGATCGCGGCGCGCCGTGGCCAGAGCTTGGCGCAGATGGCGGTCGCCTGGGTACTGCGCGGCGGGCGGGTCACCTCCG
CGCTCATCGGAGCTAGCCGTCCCGAGCAGGTCGAAGAGATAGTCGCAGGCCTGGAAAAGGCGGACTTCACCGCCGACGAA
CTGGCGGAAATCGAATCCTATGCCCGCGAGGCCGATATCAACCTCTGGGCAGCTTCCGCAGAGAGAACCGGCCCACAGCG
GAAATAA
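
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; the helper name gc_content is hypothetical and not part of the database.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as found in FASTA-style blocks) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# The first 20 bases of the gene sequence, used only as a small example.
print(gc_content("ATGAATTGGGAGCCTTCGCC"))  # prints: 55.0
```

Passing the complete 1047-base sequence to the same function gives the value to compare with the GC content field.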

Upstream 100 bases:

>100_bases
GCCGGTGCCGGCACTGGCACGTTGAATGGGGAAGAAATCCTGTTCAACTGGCGGCTGCGCTCGTCGCGGCGCGCCGATTG
CTGGCAGAAAGGGAATTTGC

Downstream 100 bases:

>100_bases
TAAGCGGATTGCTCATGCCCGTTCAATTTGACGGCTGCAAGCCGTAGGGTTTCTTTCGGCGATCGATGGACACGACGATG
AGGGGCGGGGGCCTATATTT

Product: aldo/keto reductase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 348; Mature: 348

Protein sequence:

>348_residues
MNWEPSPDRYEGMQYRRCGRSGLKLPPISLGLWHNFGEDRPHEMKQAICRRAFDLGITHFDLANNYGPPPGSAEEAFGEI
LRTDFAGYRDQLIISSKAGYLMWPGPYGEWGSRKYLIASCDQSLKRMGLDYVDIFYSHRFDPQTPLEETMMALDHIARSG
RALYVGISSYNSRRTREAVAILEDLGTPCLIHQPSYSMINRWVEDDGLLDTLEELGVGCIAFSPLAQGMLTDRYLGGIPQ
DSRAAQGKSLRKEFINEKTLGNIRHLNEIAARRGQSLAQMAVAWVLRGGRVTSALIGASRPEQVEEIVAGLEKADFTADE
LAEIESYAREADINLWAASAERTGPQRK
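
The 1047-base gene sequence corresponds to 349 codons: 348 residues plus one stop codon. The sketch below illustrates the translation step using the standard codon assignments (the amino acid assignments are the same in the bacterial table); the function name translate and the short test fragment are illustrative assumptions, not the database's own tooling.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the gene sequence followed by its TAA stop codon.
print(translate("ATGAATTGGGAGCCTTCGTAA"))  # prints: MNWEPS
```

The output matches the first six residues of the protein sequence listed above.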

Sequences:

>Translated_348_residues
MNWEPSPDRYEGMQYRRCGRSGLKLPPISLGLWHNFGEDRPHEMKQAICRRAFDLGITHFDLANNYGPPPGSAEEAFGEI
LRTDFAGYRDQLIISSKAGYLMWPGPYGEWGSRKYLIASCDQSLKRMGLDYVDIFYSHRFDPQTPLEETMMALDHIARSG
RALYVGISSYNSRRTREAVAILEDLGTPCLIHQPSYSMINRWVEDDGLLDTLEELGVGCIAFSPLAQGMLTDRYLGGIPQ
DSRAAQGKSLRKEFINEKTLGNIRHLNEIAARRGQSLAQMAVAWVLRGGRVTSALIGASRPEQVEEIVAGLEKADFTADE
LAEIESYAREADINLWAASAERTGPQRK
>Mature_348_residues
MNWEPSPDRYEGMQYRRCGRSGLKLPPISLGLWHNFGEDRPHEMKQAICRRAFDLGITHFDLANNYGPPPGSAEEAFGEI
LRTDFAGYRDQLIISSKAGYLMWPGPYGEWGSRKYLIASCDQSLKRMGLDYVDIFYSHRFDPQTPLEETMMALDHIARSG
RALYVGISSYNSRRTREAVAILEDLGTPCLIHQPSYSMINRWVEDDGLLDTLEELGVGCIAFSPLAQGMLTDRYLGGIPQ
DSRAAQGKSLRKEFINEKTLGNIRHLNEIAARRGQSLAQMAVAWVLRGGRVTSALIGASRPEQVEEIVAGLEKADFTADE
LAEIESYAREADINLWAASAERTGPQRK

Specific function: Unknown

COG id: COG0667

COG function: function code C; Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI27436969, Length=324, Percent_Identity=34.5679012345679, Blast_Score=174, Evalue=8e-44,
Organism=Homo sapiens, GI4504825, Length=321, Percent_Identity=34.2679127725857, Blast_Score=171, Evalue=9e-43,
Organism=Homo sapiens, GI27436964, Length=333, Percent_Identity=32.7327327327327, Blast_Score=168, Evalue=6e-42,
Organism=Homo sapiens, GI27436962, Length=327, Percent_Identity=33.0275229357798, Blast_Score=166, Evalue=2e-41,
Organism=Homo sapiens, GI27436966, Length=327, Percent_Identity=32.7217125382263, Blast_Score=164, Evalue=1e-40,
Organism=Homo sapiens, GI27436971, Length=329, Percent_Identity=31.6109422492401, Blast_Score=153, Evalue=2e-37,
Organism=Homo sapiens, GI41327764, Length=212, Percent_Identity=27.8301886792453, Blast_Score=70, Evalue=3e-12,
Organism=Homo sapiens, GI41152114, Length=214, Percent_Identity=28.5046728971963, Blast_Score=69, Evalue=5e-12,
Organism=Homo sapiens, GI223718702, Length=212, Percent_Identity=27.3584905660377, Blast_Score=69, Evalue=5e-12,
Organism=Escherichia coli, GI1789375, Length=346, Percent_Identity=61.5606936416185, Blast_Score=449, Evalue=1e-127,
Organism=Escherichia coli, GI87081735, Length=323, Percent_Identity=33.7461300309598, Blast_Score=149, Evalue=2e-37,
Organism=Escherichia coli, GI1789199, Length=345, Percent_Identity=28.9855072463768, Blast_Score=99, Evalue=4e-22,
Organism=Escherichia coli, GI1788070, Length=315, Percent_Identity=27.6190476190476, Blast_Score=89, Evalue=4e-19,
Organism=Escherichia coli, GI1788081, Length=317, Percent_Identity=27.1293375394322, Blast_Score=78, Evalue=9e-16,
Organism=Escherichia coli, GI48994888, Length=262, Percent_Identity=27.8625954198473, Blast_Score=71, Evalue=1e-13,
Organism=Escherichia coli, GI1787674, Length=304, Percent_Identity=24.0131578947368, Blast_Score=66, Evalue=3e-12,
Organism=Saccharomyces cerevisiae, GI6325169, Length=334, Percent_Identity=26.3473053892216, Blast_Score=100, Evalue=4e-22,
Organism=Saccharomyces cerevisiae, GI6319951, Length=327, Percent_Identity=24.4648318042813, Blast_Score=70, Evalue=5e-13,
Organism=Saccharomyces cerevisiae, GI6323998, Length=331, Percent_Identity=22.9607250755287, Blast_Score=69, Evalue=2e-12,
Organism=Saccharomyces cerevisiae, GI6322615, Length=251, Percent_Identity=24.3027888446215, Blast_Score=68, Evalue=2e-12,
Organism=Saccharomyces cerevisiae, GI6319958, Length=292, Percent_Identity=22.9452054794521, Blast_Score=64, Evalue=3e-11,
Organism=Drosophila melanogaster, GI24640980, Length=334, Percent_Identity=27.2455089820359, Blast_Score=108, Evalue=7e-24,
Organism=Drosophila melanogaster, GI45549126, Length=334, Percent_Identity=27.2455089820359, Blast_Score=107, Evalue=1e-23,
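
Each homologue line above is a comma-separated list of key=value fields plus a bare GI identifier. A minimal parsing sketch follows; the Homologue class and parse_homologue function are hypothetical names introduced for illustration, and the field layout is assumed from the lines in this record only.

```python
from dataclasses import dataclass


@dataclass
class Homologue:
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: float
    evalue: float


def parse_homologue(line: str) -> Homologue:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a record."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare identifier without a key
    return Homologue(
        organism=fields["Organism"],
        gi=fields["GI"],
        length=int(fields["Length"]),
        percent_identity=float(fields["Percent_Identity"]),
        blast_score=float(fields["Blast_Score"]),
        evalue=float(fields["Evalue"]),
    )


best = parse_homologue(
    "Organism=Escherichia coli, GI1789375, Length=346, "
    "Percent_Identity=61.5606936416185, Blast_Score=449, Evalue=1e-127,"
)
print(best.organism, best.gi, round(best.percent_identity, 1))
# prints: Escherichia coli 1789375 61.6
```

Parsed records can then be sorted by evalue or percent_identity to rank the hits.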

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001395
- InterPro:   IPR005399
- InterPro:   IPR023210 [H]

Pfam domain/function: PF00248 Aldo_ket_red [H]

EC number: NA

Molecular weight: Translated: 38979; Mature: 38979

Theoretical pI: Translated: 5.70; Mature: 5.70

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
4.3 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
4.3 %Cys+Met (Mature Protein)
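
The percentages above are consistent with 5 Cys and 10 Met residues in the 348-residue protein (5/348 is about 1.4 % and 10/348 is about 2.9 %). A minimal sketch of the calculation follows, assuming the values are simple residue counts divided by protein length; residue_percent is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return 100.0 * count / len(seq)


# First 20 residues of the protein sequence, used only as a small example.
fragment = "MNWEPSPDRYEGMQYRRCGR"
print(residue_percent(fragment, "C"))   # prints: 5.0
print(residue_percent(fragment, "M"))   # prints: 10.0
print(residue_percent(fragment, "CM"))  # prints: 15.0
```

Applying the same function to the full translated or mature sequence gives the values to compare with the fields above.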

Secondary structure:

>Translated Secondary Structure
MNWEPSPDRYEGMQYRRCGRSGLKLPPISLGLWHNFGEDRPHEMKQAICRRAFDLGITHF
CCCCCCCCHHHCHHHHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEE
DLANNYGPPPGSAEEAFGEILRTDFAGYRDQLIISSKAGYLMWPGPYGEWGSRKYLIASC
EHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCEEECCCCCCCCCCCEEEHHHH
DQSLKRMGLDYVDIFYSHRFDPQTPLEETMMALDHIARSGRALYVGISSYNSRRTREAVA
HHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHH
ILEDLGTPCLIHQPSYSMINRWVEDDGLLDTLEELGVGCIAFSPLAQGMLTDRYLGGIPQ
HHHHCCCCEEEECCCHHHHHHHHCCCCHHHHHHHCCCCCEEHHHHHHHHHHHHHHCCCCC
DSRAAQGKSLRKEFINEKTLGNIRHLNEIAARRGQSLAQMAVAWVLRGGRVTSALIGASR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCHHHHHHCCCC
PEQVEEIVAGLEKADFTADELAEIESYAREADINLWAASAERTGPQRK
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEEECCHHCCCCCCC
>Mature Secondary Structure
MNWEPSPDRYEGMQYRRCGRSGLKLPPISLGLWHNFGEDRPHEMKQAICRRAFDLGITHF
CCCCCCCCHHHCHHHHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEE
DLANNYGPPPGSAEEAFGEILRTDFAGYRDQLIISSKAGYLMWPGPYGEWGSRKYLIASC
EHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCEEECCCCCCCCCCCEEEHHHH
DQSLKRMGLDYVDIFYSHRFDPQTPLEETMMALDHIARSGRALYVGISSYNSRRTREAVA
HHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHH
ILEDLGTPCLIHQPSYSMINRWVEDDGLLDTLEELGVGCIAFSPLAQGMLTDRYLGGIPQ
HHHHCCCCEEEECCCHHHHHHHHCCCCHHHHHHHCCCCCEEHHHHHHHHHHHHHHCCCCC
DSRAAQGKSLRKEFINEKTLGNIRHLNEIAARRGQSLAQMAVAWVLRGGRVTSALIGASR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCHHHHHHCCCC
PEQVEEIVAGLEKADFTADELAEIESYAREADINLWAASAERTGPQRK
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEEECCHHCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]