| Definition | Mesorhizobium sp. BNC1, complete genome. |
|---|---|
| Accession | NC_008254 |
| Length | 4,412,446 |
The map label for this gene is edd [H]
Identifier: 110632519
GI number: 110632519
Start: 191456
End: 193270
Strand: Reverse
Name: edd [H]
Synonym: Meso_0157
Alternate gene names: 110632519
Gene position: 193270-191456 (Counterclockwise)
Preceding gene: 110632520
Following gene: 110632516
Centisome position: 4.38
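The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes the centisome value is the gene's start position expressed as a percentage of the genome length. For this reverse-strand gene that start is coordinate 193270, which appears consistent with the 4.38 reported here. The function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 4_412_446          # from the Length field of the record
LOW, HIGH = 191_456, 193_270       # Start and End fields of the record


def gene_length(low: int, high: int) -> int:
    """Number of bases spanned by inclusive coordinates."""
    return abs(high - low) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


print(gene_length(LOW, HIGH))                      # 1815
print(round(centisome(HIGH, GENOME_LENGTH), 2))    # 4.38
```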
GC content: 62.98
Gene sequence:
>1815_bases ATGACCGCAAGGCGAGAAATCGAAGCCGTCACAGCCCGCATCCGCGAGCGCTCGCGCCAGACACGGGAAACCTACCTTGC CCGCGTGGACGAAGCGGCGGCCAAGGGAGCTCATCGCAGCACGCTCTCCTGCGGAAACCTGGCGCATGGCTTTGCCGCCT GCGGCCCGCAGGAAAAGCAGGCCCTAGCGGGTGACACCGTTCCCAATCTTGGCATCATCACCGCCTATAACGATATGCTG TCGGCGCACCAACCCTATGAGACGTTTCCGGCGCTCATCAAGGAGGCAGCGCGGGAAGCAGGTGGTGTGGCGCAAGTGGC GGCAGGGGTGCCCGCCATGTGCGACGGCGTTACCCAAGGCCAGCCGGGAATGCAGCTTTCGCTTTTCTCGCGCGACGTCA TCGCGCTCGCCACGGCGGTCGGTCTCTCGCACAACATGTTCGATGCTGCCGTCTATCTCGGCATTTGCGACAAGATCGTT CCGGGGCTGACCATCGCGGCGCTTACCTTCGGTCACCTGCCGGCCATATTCATCCCCGCTGGCCCCATGACCAGCGGTCT GCCGAATGACGAGAAGTCGAAGATCCGCCAGCTTTTCGCCGAAGGAAAGATCGGGCGCGAAGCGCTGCTGGATGCCGAAT CCAAGTCCTATCACGGACCGGGCACCTGCACCTTTTACGGAACGGCCAACTCCAACCAGATGCTGATGGAGATCATGGGC CTGCACATTCCGGGCTCCTCCTTCGTCAATCCGAACACGCCGCTGCGCGATGCACTTACCAAAGAGGCGGTGAAGCGTGC GCTCGCCATTACGGCGCTTGGCAACGAGTTCACGCCGGTAGGCCGCATGATCGACGAGCGCTCGGTCGTGAACGGCGTGG TGGGGCTCAACGCAACGGGCGGCTCCACCAATCACACGATGCATCTCGTGGCCATGGCGGCGGCCGCCGGCATCAAGCTC ACCTGGGCGGACATCGCCGAGATTTCGGAGCATGTGCCCCTTCTGGCGCGGGTTTATCCGAACGGCCTTGCGGACGTGAA CCATTTCCATGCCGCGGGCGGGATCGGCTTTCTTGTCCGCGAGCTCCTCGATGCCGGTCTTCTGCATGAGGATGTGCAGA CCGTCTGGGGAGCGGGCCTGCGGCAATATGCGATTGAAGCAAAGCTCGGCCCTGACGGTTCAGTGACGCGCGAAATAGCG CCTAACGAAAGCGGGGATGAGAAGGTGCTCGTGCCTGTCGCGAAAGCGTTTCAGCCGACCGGCGGCATACGCGTGCTCAA GGGCAATCTCGGGCAGGCCATCATCAAGACCTCTGCCGTAAAGCCCCAGCACAGGATTATAGAGGCGCCGGCTATTATCT TCCACGCACAGGGAGAATTGCAAGCGGCTTTCAAGGCGGGCGAGCTCGACCGCGATTTTATCGCCGTCATCCGTTTTCAA GGGCCGAAGGCGAATGGCATGCCGGAGCTGCACAAGCTTACGCCGGTGCTCGGCGTGCTGCAGGACCGGGGTTACAAGGT CGCGTTGCTTACGGACGGGCGGATGTCCGGCGCCTCTGGCAAGGTGCCTGCCGCAATCCATGTCACGCCGGAGGCCGCCG ACGGCGGCGCAATCGCAAAGCTCAGGGATGGAGACCTGGTGCGGCTGGACGCGGAAGCCGGAACGGTCGAGGCTCTGGTC GACGCGGAGGAATTTTCCGCCCGCGCTCCCGCATCGGCCGATCTCTCCTATGAACATTACGGCATGGGGCGCGAGCTCTT CGCAAGCTTCCGTCAGATCGTTTCACCGGCAGATCAGGGCGCGGCGGTGTTTTGA
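The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper, and the caller is expected to drop the `>1815_bases` header token before passing the sequence in. The example call uses only the first 20 bases of the gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, split as in the record.
print(gc_content("ATGACCGCAA GGCGAGAAAT"))  # 50.0
```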
Upstream 100 bases:
>100_bases GGCACCGGGAGCGGCAAGCGCCTCGGGCAAGCGCCCCTGACATCTTGAATATTATCGAGACCGAAGCCACTCGGACTCGA CCATGACAGGATATGAAGCC
Downstream 100 bases:
>100_bases GGCACACTCCCCTAGGAAGCGCTATTTCGCGGTCGCGAAATAGATGTGCCGCTTACCTCCCTTGAGCGGTTCCTCCGCTT GGATGTCGAAGTGAGCCGAC
Product: phosphogluconate dehydratase
Products: NA
Alternate protein names: 6-phosphogluconate dehydratase [H]
Number of amino acids: Translated: 604; Mature: 603
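The two amino acid counts follow from the gene length. The sketch below assumes that the 1815-base gene sequence includes the terminal stop codon (it ends in TGA) and that the mature form differs from the translated form only by removal of the initiator methionine, as the sequences in this record suggest. `protein_lengths` is an illustrative name.

```python
def protein_lengths(cds_length: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS length includes one stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    translated = cds_length // 3 - 1   # drop the stop codon
    mature = translated - 1            # drop the initiator Met
    return translated, mature


print(protein_lengths(1815))  # (604, 603)
```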
Protein sequence:
>604_residues MTARREIEAVTARIRERSRQTRETYLARVDEAAAKGAHRSTLSCGNLAHGFAACGPQEKQALAGDTVPNLGIITAYNDML SAHQPYETFPALIKEAAREAGGVAQVAAGVPAMCDGVTQGQPGMQLSLFSRDVIALATAVGLSHNMFDAAVYLGICDKIV PGLTIAALTFGHLPAIFIPAGPMTSGLPNDEKSKIRQLFAEGKIGREALLDAESKSYHGPGTCTFYGTANSNQMLMEIMG LHIPGSSFVNPNTPLRDALTKEAVKRALAITALGNEFTPVGRMIDERSVVNGVVGLNATGGSTNHTMHLVAMAAAAGIKL TWADIAEISEHVPLLARVYPNGLADVNHFHAAGGIGFLVRELLDAGLLHEDVQTVWGAGLRQYAIEAKLGPDGSVTREIA PNESGDEKVLVPVAKAFQPTGGIRVLKGNLGQAIIKTSAVKPQHRIIEAPAIIFHAQGELQAAFKAGELDRDFIAVIRFQ GPKANGMPELHKLTPVLGVLQDRGYKVALLTDGRMSGASGKVPAAIHVTPEAADGGAIAKLRDGDLVRLDAEAGTVEALV DAEEFSARAPASADLSYEHYGMGRELFASFRQIVSPADQGAAVF
Sequences:
>Translated_604_residues MTARREIEAVTARIRERSRQTRETYLARVDEAAAKGAHRSTLSCGNLAHGFAACGPQEKQALAGDTVPNLGIITAYNDML SAHQPYETFPALIKEAAREAGGVAQVAAGVPAMCDGVTQGQPGMQLSLFSRDVIALATAVGLSHNMFDAAVYLGICDKIV PGLTIAALTFGHLPAIFIPAGPMTSGLPNDEKSKIRQLFAEGKIGREALLDAESKSYHGPGTCTFYGTANSNQMLMEIMG LHIPGSSFVNPNTPLRDALTKEAVKRALAITALGNEFTPVGRMIDERSVVNGVVGLNATGGSTNHTMHLVAMAAAAGIKL TWADIAEISEHVPLLARVYPNGLADVNHFHAAGGIGFLVRELLDAGLLHEDVQTVWGAGLRQYAIEAKLGPDGSVTREIA PNESGDEKVLVPVAKAFQPTGGIRVLKGNLGQAIIKTSAVKPQHRIIEAPAIIFHAQGELQAAFKAGELDRDFIAVIRFQ GPKANGMPELHKLTPVLGVLQDRGYKVALLTDGRMSGASGKVPAAIHVTPEAADGGAIAKLRDGDLVRLDAEAGTVEALV DAEEFSARAPASADLSYEHYGMGRELFASFRQIVSPADQGAAVF >Mature_603_residues TARREIEAVTARIRERSRQTRETYLARVDEAAAKGAHRSTLSCGNLAHGFAACGPQEKQALAGDTVPNLGIITAYNDMLS AHQPYETFPALIKEAAREAGGVAQVAAGVPAMCDGVTQGQPGMQLSLFSRDVIALATAVGLSHNMFDAAVYLGICDKIVP GLTIAALTFGHLPAIFIPAGPMTSGLPNDEKSKIRQLFAEGKIGREALLDAESKSYHGPGTCTFYGTANSNQMLMEIMGL HIPGSSFVNPNTPLRDALTKEAVKRALAITALGNEFTPVGRMIDERSVVNGVVGLNATGGSTNHTMHLVAMAAAAGIKLT WADIAEISEHVPLLARVYPNGLADVNHFHAAGGIGFLVRELLDAGLLHEDVQTVWGAGLRQYAIEAKLGPDGSVTREIAP NESGDEKVLVPVAKAFQPTGGIRVLKGNLGQAIIKTSAVKPQHRIIEAPAIIFHAQGELQAAFKAGELDRDFIAVIRFQG PKANGMPELHKLTPVLGVLQDRGYKVALLTDGRMSGASGKVPAAIHVTPEAADGGAIAKLRDGDLVRLDAEAGTVEALVD AEEFSARAPASADLSYEHYGMGRELFASFRQIVSPADQGAAVF
Specific function: Key enzyme in the Entner-Doudoroff pathway. [C]
COG id: COG0129
COG function: function code EG; Dihydroxyacid dehydratase/phosphogluconate dehydratase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ilvD/edd family [H]
Homologues:
Organism=Escherichia coli, GI1788157, Length=595, Percent_Identity=61.1764705882353, Blast_Score=768, Evalue=0.0
Organism=Escherichia coli, GI48994964, Length=577, Percent_Identity=29.2894280762565, Blast_Score=192, Evalue=4e-50
Organism=Escherichia coli, GI2367371, Length=497, Percent_Identity=30.784708249497, Blast_Score=149, Evalue=7e-37
Organism=Escherichia coli, GI1786464, Length=501, Percent_Identity=29.5409181636727, Blast_Score=135, Evalue=6e-33
Organism=Saccharomyces cerevisiae, GI6322476, Length=491, Percent_Identity=29.5315682281059, Blast_Score=177, Evalue=3e-45
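Each homologue entry uses the same `key=value` layout, so the hits can be pulled into structured rows. This is a rough sketch that assumes the field order shown in this record; the `Homologue` type and `parse_homologues` function are hypothetical names, not part of any database API. The pattern does not depend on whether entries are separated by commas or line breaks.

```python
import re
from typing import NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...
_ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"(?P<gi>GI\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[Homologue]:
    """Extract every homologue entry found in the text."""
    return [
        Homologue(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["identity"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in _ENTRY.finditer(text)
    ]


sample = (
    "Organism=Escherichia coli, GI1788157, Length=595, "
    "Percent_Identity=61.1764705882353, Blast_Score=768, Evalue=0.0, "
    "Organism=Escherichia coli, GI48994964, Length=577, "
    "Percent_Identity=29.2894280762565, Blast_Score=192, Evalue=4e-50,"
)
hits = parse_homologues(sample)
print(hits[0].gi, round(hits[0].percent_identity, 1))  # GI1788157 61.2
```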
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004786
- InterPro: IPR015928
- InterPro: IPR000581
- InterPro: IPR020558 [H]
Pfam domain/function: PF00920 ILVD_EDD [H]
EC number: 4.2.1.12 [H]
Molecular weight: Translated: 63670; Mature: 63538
Theoretical pI: Translated: 6.51; Mature: 6.51
Prosite motif: PS00886 ILVD_EDD_1 ; PS00887 ILVD_EDD_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
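These percentages can be reproduced by counting residues in the protein sequence. The sketch below assumes each value is the count of the named residues divided by the total sequence length, rounded to one decimal place. `residue_percent` is an illustrative helper, and the example uses a toy peptide rather than the full protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in sequence if aa in residues.upper())
    return round(100.0 * hits / len(sequence), 1)


toy = "MCAAAAAAAA"  # toy 10-residue peptide, not from the record
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Because each figure is rounded independently, a combined Cys+Met value need not equal the sum of the two rounded single-residue values.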
Secondary structure:
>Translated Secondary Structure MTARREIEAVTARIRERSRQTRETYLARVDEAAAKGAHRSTLSCGNLAHGFAACGPQEKQ CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCHHH ALAGDTVPNLGIITAYNDMLSAHQPYETFPALIKEAAREAGGVAQVAAGVPAMCDGVTQG HHCCCCCCCCEEEEEEHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCHHHCCCCCC QPGMQLSLFSRDVIALATAVGLSHNMFDAAVYLGICDKIVPGLTIAALTFGHLPAIFIPA CCCCEEEHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHEEHHHHCCCCEEEECC GPMTSGLPNDEKSKIRQLFAEGKIGREALLDAESKSYHGPGTCTFYGTANSNQMLMEIMG CCCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHH LHIPGSSFVNPNTPLRDALTKEAVKRALAITALGNEFTPVGRMIDERSVVNGVVGLNATG CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHHHHEEEECCCC GSTNHTMHLVAMAAAAGIKLTWADIAEISEHVPLLARVYPNGLADVNHFHAAGGIGFLVR CCCCCHHHHHHHHHHCCCEEEHHHHHHHHHCCCCHHHHCCCCCCCCCHHHHCCCHHHHHH ELLDAGLLHEDVQTVWGAGLRQYAIEAKLGPDGSVTREIAPNESGDEKVLVPVAKAFQPT HHHHHHHHHHHHHHHHHCCHHHHEEEEECCCCCCCEEEECCCCCCCCEEEEEEHHHCCCC GGIRVLKGNLGQAIIKTSAVKPQHRIIEAPAIIFHAQGELQAAFKAGELDRDFIAVIRFQ CCEEEEECCCCHHHHHHCCCCCHHHHEECCEEEEECCCCHHHHHHCCCCCCCEEEEEEEC GPKANGMPELHKLTPVLGVLQDRGYKVALLTDGRMSGASGKVPAAIHVTPEAADGGAIAK CCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCEEE LRDGDLVRLDAEAGTVEALVDAEEFSARAPASADLSYEHYGMGRELFASFRQIVSPADQG ECCCCEEEEECCCCCCHHHCCHHHHCCCCCCCCCCCHHHHCCCHHHHHHHHHHHCCCCCC AAVF CCCC >Mature Secondary Structure TARREIEAVTARIRERSRQTRETYLARVDEAAAKGAHRSTLSCGNLAHGFAACGPQEKQ CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCHHH ALAGDTVPNLGIITAYNDMLSAHQPYETFPALIKEAAREAGGVAQVAAGVPAMCDGVTQG HHCCCCCCCCEEEEEEHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCHHHCCCCCC QPGMQLSLFSRDVIALATAVGLSHNMFDAAVYLGICDKIVPGLTIAALTFGHLPAIFIPA CCCCEEEHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHEEHHHHCCCCEEEECC GPMTSGLPNDEKSKIRQLFAEGKIGREALLDAESKSYHGPGTCTFYGTANSNQMLMEIMG CCCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHH LHIPGSSFVNPNTPLRDALTKEAVKRALAITALGNEFTPVGRMIDERSVVNGVVGLNATG CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHHHHEEEECCCC GSTNHTMHLVAMAAAAGIKLTWADIAEISEHVPLLARVYPNGLADVNHFHAAGGIGFLVR 
CCCCCHHHHHHHHHHCCCEEEHHHHHHHHHCCCCHHHHCCCCCCCCCHHHHCCCHHHHHH ELLDAGLLHEDVQTVWGAGLRQYAIEAKLGPDGSVTREIAPNESGDEKVLVPVAKAFQPT HHHHHHHHHHHHHHHHHCCHHHHEEEEECCCCCCCEEEECCCCCCCCEEEEEEHHHCCCC GGIRVLKGNLGQAIIKTSAVKPQHRIIEAPAIIFHAQGELQAAFKAGELDRDFIAVIRFQ CCEEEEECCCCHHHHHHCCCCCHHHHEECCEEEEECCCCHHHHHHCCCCCCCEEEEEEEC GPKANGMPELHKLTPVLGVLQDRGYKVALLTDGRMSGASGKVPAAIHVTPEAADGGAIAK CCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCEEE LRDGDLVRLDAEAGTVEALVDAEEFSARAPASADLSYEHYGMGRELFASFRQIVSPADQG ECCCCEEEEECCCCCCHHHCCHHHHCCCCCCCCCCCHHHHCCCHHHHHHHHHHHCCCCCC AAVF CCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 10400573; 11481430 [H]