| Definition | Rhodopseudomonas palustris HaA2, complete genome. |
|---|---|
| Accession | NC_007778 |
| Length | 5,331,656 |
The map label for this gene is ilvD [H]
Identifier: 86749216
GI number: 86749216
Start: 2380537
End: 2382345
Strand: Reverse
Name: ilvD [H]
Synonym: RPB_2095
Alternate gene names: 86749216
Gene position: 2382345-2380537 (Counterclockwise)
Preceding gene: 86749224
Following gene: 86749215
Centisome position: 44.68
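The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the centisome position is the gene's first coordinate on its own strand (2382345 for this reverse-strand gene) expressed as a percentage of the genome length. That assumption reproduces the listed 44.68, but the record itself does not state the formula.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_331_656   # "Length" field of NC_007778
GENE_START = 2_380_537      # "Start" field
GENE_END = 2_382_345        # "End" field


def gene_length(start: int, end: int) -> int:
    """Length of a gene given inclusive, 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


# The reverse-strand gene begins at the higher coordinate (2382345).
print(gene_length(GENE_START, GENE_END))       # 1809
print(centisome(GENE_END, GENOME_LENGTH))      # 44.68
```

The computed length of 1809 agrees with the `>1809_bases` header of the gene sequence.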
GC content: 66.72
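GC content is conventionally the percentage of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, not part of the database, and it expects the bare sequence with the `>1809_bases` header already removed.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the record wraps the sequence in space-separated chunks)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# Toy input, not the gene itself: 4 of 6 bases are G or C.
print(gc_content("GGCC AT"))   # 66.67
```

Applied to the gene sequence in this record, the result should be close to the listed 66.72, assuming the database used the same definition.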
Gene sequence:
>1809_bases ATGACCAAGATCACCCCGGGGACCGCCCGACGCAAACTGCGTTCCAGCGAATGGTTCAACGATCCGCACAATCCGGCGAT GACCGCGCTGTATCTCGAGCGCTATCTGAACTACGGACTGACGCGCGGCGAACTGCAGTCCGGCAAGCCGATCATCGGCA TCGCCCAGACCGGCAACGACCTGTCGCCGTGCAACCGCCATCATCTGGAACTGGCGCAGCGCGTGCGCGAAGGCATCCGC GCCGCCGGCGGCATCGCGATGGAATTCCCGGTGCACCCGATCCAGGAAACCGGCAAGCGGCCGACCGCGGCGCTGGATCG CAATCTGGCTTATCTCGGTCTCGTCGAGGTGCTGTTCAGCTATCCGCTCGACGGCGTGGTGCTCACCACCGGCTGCGACA AGACCACGCCGGCCTGCCTGATGGCGGCGGCGACCGTGAACATCCCGGCGATCGTGCTGTCCGGCGGGCCGATGCTGAAC GGCTGGCACAATGGCGAGCGCTCCGGCTCCGGCACGGTGGTGTGGAAATCACGCGAGCGTCTCGCCGCCGGCGAGATCGA CTACGAGGAATTCATGGAGATCGTGGCGTCGTCGGCGCCGTCGGTCGGCCATTGCAACACCATGGGCACCGCGTCGACGA TGAACTCGCTGGCCGAAGCGCTCGGCATGTCGCTGCCCGGCTGCGCCGCGATCCCGGCGCCGTATCGCGAGCGCGGCCAG ATCGCCTACGCCACCGGCGTGCGCGCGGTCGAGATGGTGTGGGAGGATCTGAAGCCGTCCGACATCCTGACGCGCGAGGC GTTCGAAAACGCCATCGTGGTCAATTCGGCGATCGGCGGCTCGACCAACGCGCCGATCCATCTCAACGCGCTGGCGCGCC ACATCGGCGTCGAGCTTTCGATCGACGACTGGCAGAGCATCGGCCACAAAATTCCGCTGCTGGTCAACATGCAGCCGGCG GGATTCTATCTCGGCGAGGAATATCACCGCGCCGGCGGCGTGCCGGCGGTGGTGCGCGAGTTGATGCAGCACGGCAAGAT TCACACGGACGCGCTGACGGTGAACGGCCTCACCATGGGCCAGAACTGTGCCGGCGCGCCCGCGCCCGACGGCGACGTGA TCAAGTCCTACGACGGACCGCTGGTGCAGGACGCAGGCTTCCTGGTGCTGCGCGGCAATCTGTTCGAGTCCGCGATCATG AAGACATCCGTGATCAGCCTCGAATTCCGCGAGCGCTATCTGTCCAGCCCGAACGACCCCAACGCGTTCGAGGGCCGCGC CATCGTGTTCGAGGGGCCGGAGGATTATCACGACCGGATCGACGATCCGGCGCTGAAGATCGACGAGCACTGCATCCTGT TCGTGCGCGGCACCGGGCCGATCGGCTATCCGGGTGGCGCCGAAGTCGTGAACATGCAGCCGCCGGCGGCGCTGATCAAG CGCGGCATTCACTCCCTGCCCTGCATCGGCGACGGACGCCAGAGCGGCACCTCGGGCTCGCCGTCGATCCTCAACGCGAC GCCGGAGGCCGCCGCCAATGGCGGGCTCGCGATCCTGAAGACCGGCGACCGCGTCCGCATCGATCTGAACACGGGCAGCG CCAATATTCTGATCAGCGACGACGAGTTGAAGCAGCGCCGCGCCGAACTCGAGGCGCATGGCGGCTTCGCCTACCCGAAG CACCAGACGCCGTGGCAGGAACTGTATCGCCAGACCGTGGGCCAACAGGCCACCGGCGCCTGCCTCGAACTCGCGACGCG CTACCACGACATCGCCGGCACGGTCGGCGTGGCGCGGCATAATCATTGA
Upstream 100 bases:
>100_bases GAGCCGCACACGGCATGTCAGCATGCGCCGCCGCCGAATGAATCACCATCAATAAGCTCGAGACGCCCGAACAACGGGCG TGTGCGCTGCGGGAGAAACG
Downstream 100 bases:
>100_bases TGCGCCCGTGCTGACGAAGGACCTCTCCTCGCGTAGCAGCCGAACCTCTCCCCGCGTGCGGGGAGAGGTCGGAAGTTGCG CGCAGCGCGACTTCCGGGTG
Product: dihydroxy-acid dehydratase
Products: NA
Alternate protein names: DAD [H]
Number of amino acids: Translated: 602; Mature: 601
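The two residue counts follow from the gene length. The sketch below assumes a standard coding sequence with one terminal stop codon, and it assumes the "Mature" form simply lacks the initiator methionine; that is consistent with the mature sequence in this record starting at TKIT rather than MTKIT. The function name is hypothetical.

```python
def protein_lengths(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    translated: number of codons minus the stop codon
    mature:     translated minus the initiator Met (assumed processing)
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = n_bases // 3 - 1
    mature = translated - 1
    return translated, mature


print(protein_lengths(1809))   # (602, 601)
```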
Protein sequence:
>602_residues MTKITPGTARRKLRSSEWFNDPHNPAMTALYLERYLNYGLTRGELQSGKPIIGIAQTGNDLSPCNRHHLELAQRVREGIR AAGGIAMEFPVHPIQETGKRPTAALDRNLAYLGLVEVLFSYPLDGVVLTTGCDKTTPACLMAAATVNIPAIVLSGGPMLN GWHNGERSGSGTVVWKSRERLAAGEIDYEEFMEIVASSAPSVGHCNTMGTASTMNSLAEALGMSLPGCAAIPAPYRERGQ IAYATGVRAVEMVWEDLKPSDILTREAFENAIVVNSAIGGSTNAPIHLNALARHIGVELSIDDWQSIGHKIPLLVNMQPA GFYLGEEYHRAGGVPAVVRELMQHGKIHTDALTVNGLTMGQNCAGAPAPDGDVIKSYDGPLVQDAGFLVLRGNLFESAIM KTSVISLEFRERYLSSPNDPNAFEGRAIVFEGPEDYHDRIDDPALKIDEHCILFVRGTGPIGYPGGAEVVNMQPPAALIK RGIHSLPCIGDGRQSGTSGSPSILNATPEAAANGGLAILKTGDRVRIDLNTGSANILISDDELKQRRAELEAHGGFAYPK HQTPWQELYRQTVGQQATGACLELATRYHDIAGTVGVARHNH
Sequences:
>Translated_602_residues MTKITPGTARRKLRSSEWFNDPHNPAMTALYLERYLNYGLTRGELQSGKPIIGIAQTGNDLSPCNRHHLELAQRVREGIR AAGGIAMEFPVHPIQETGKRPTAALDRNLAYLGLVEVLFSYPLDGVVLTTGCDKTTPACLMAAATVNIPAIVLSGGPMLN GWHNGERSGSGTVVWKSRERLAAGEIDYEEFMEIVASSAPSVGHCNTMGTASTMNSLAEALGMSLPGCAAIPAPYRERGQ IAYATGVRAVEMVWEDLKPSDILTREAFENAIVVNSAIGGSTNAPIHLNALARHIGVELSIDDWQSIGHKIPLLVNMQPA GFYLGEEYHRAGGVPAVVRELMQHGKIHTDALTVNGLTMGQNCAGAPAPDGDVIKSYDGPLVQDAGFLVLRGNLFESAIM KTSVISLEFRERYLSSPNDPNAFEGRAIVFEGPEDYHDRIDDPALKIDEHCILFVRGTGPIGYPGGAEVVNMQPPAALIK RGIHSLPCIGDGRQSGTSGSPSILNATPEAAANGGLAILKTGDRVRIDLNTGSANILISDDELKQRRAELEAHGGFAYPK HQTPWQELYRQTVGQQATGACLELATRYHDIAGTVGVARHNH >Mature_601_residues TKITPGTARRKLRSSEWFNDPHNPAMTALYLERYLNYGLTRGELQSGKPIIGIAQTGNDLSPCNRHHLELAQRVREGIRA AGGIAMEFPVHPIQETGKRPTAALDRNLAYLGLVEVLFSYPLDGVVLTTGCDKTTPACLMAAATVNIPAIVLSGGPMLNG WHNGERSGSGTVVWKSRERLAAGEIDYEEFMEIVASSAPSVGHCNTMGTASTMNSLAEALGMSLPGCAAIPAPYRERGQI AYATGVRAVEMVWEDLKPSDILTREAFENAIVVNSAIGGSTNAPIHLNALARHIGVELSIDDWQSIGHKIPLLVNMQPAG FYLGEEYHRAGGVPAVVRELMQHGKIHTDALTVNGLTMGQNCAGAPAPDGDVIKSYDGPLVQDAGFLVLRGNLFESAIMK TSVISLEFRERYLSSPNDPNAFEGRAIVFEGPEDYHDRIDDPALKIDEHCILFVRGTGPIGYPGGAEVVNMQPPAALIKR GIHSLPCIGDGRQSGTSGSPSILNATPEAAANGGLAILKTGDRVRIDLNTGSANILISDDELKQRRAELEAHGGFAYPKH QTPWQELYRQTVGQQATGACLELATRYHDIAGTVGVARHNH
Specific function: Valine and isoleucine biosynthesis; fourth step. [C]
COG id: COG0129
COG function: function code EG; Dihydroxyacid dehydratase/phosphogluconate dehydratase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ilvD/edd family [H]
Homologues:
Organism=Escherichia coli, GI48994964, Length=577, Percent_Identity=29.1161178509532, Blast_Score=202, Evalue=7e-53
Organism=Escherichia coli, GI1788157, Length=456, Percent_Identity=30.9210526315789, Blast_Score=186, Evalue=5e-48
Organism=Escherichia coli, GI2367371, Length=517, Percent_Identity=28.0464216634429, Blast_Score=130, Evalue=2e-31
Organism=Escherichia coli, GI1786464, Length=526, Percent_Identity=28.1368821292776, Blast_Score=124, Evalue=1e-29
Organism=Saccharomyces cerevisiae, GI6322476, Length=512, Percent_Identity=30.2734375, Blast_Score=218, Evalue=2e-57
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR015928
- InterPro: IPR004404
- InterPro: IPR000581
- InterPro: IPR020558 [H]
Pfam domain/function: PF00920 ILVD_EDD [H]
EC number: 4.2.1.9 [H]
Molecular weight: Translated: 64635; Mature: 64504
Theoretical pI: Translated: 6.26; Mature: 6.26
Prosite motif: PS00886 ILVD_EDD_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
4.0 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
3.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTKITPGTARRKLRSSEWFNDPHNPAMTALYLERYLNYGLTRGELQSGKPIIGIAQTGND CCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCC LSPCNRHHLELAQRVREGIRAAGGIAMEFPVHPIQETGKRPTAALDRNLAYLGLVEVLFS CCCCHHHHHHHHHHHHHHHHHHCCEEEECCCCCHHHCCCCCCHHHHCCHHHHHHHHHHHH YPLDGVVLTTGCDKTTPACLMAAATVNIPAIVLSGGPMLNGWHNGERSGSGTVVWKSRER CCCCCEEEECCCCCCCHHHHHHHHHCCCCEEEEECCCEECCCCCCCCCCCCEEEECCCCC LAAGEIDYEEFMEIVASSAPSVGHCNTMGTASTMNSLAEALGMSLPGCAAIPAPYRERGQ CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCHHHCCC IAYATGVRAVEMVWEDLKPSDILTREAFENAIVVNSAIGGSTNAPIHLNALARHIGVELS EEEECCCHHHHHHHHHCCCHHHHHHHHHCCEEEEEECCCCCCCCCEEHHHHHHHHCCEEE IDDWQSIGHKIPLLVNMQPAGFYLGEEYHRAGGVPAVVRELMQHGKIHTDALTVNGLTMG EHHHHHCCCCCCEEEEECCCCCCCCHHHHHCCCCHHHHHHHHHCCCEEEEEEEECCEEEC QNCAGAPAPDGDVIKSYDGPLVQDAGFLVLRGNLFESAIMKTSVISLEFRERYLSSPNDP CCCCCCCCCCCCCEECCCCCEEECCCEEEEECCHHHHHHHHHHHEEHHHHHHHCCCCCCC NAFEGRAIVFEGPEDYHDRIDDPALKIDEHCILFVRGTGPIGYPGGAEVVNMQPPAALIK CCCCCCEEEEECCHHHHHCCCCCCEEECCEEEEEEECCCCCCCCCCCCEEECCCCHHHHH RGIHSLPCIGDGRQSGTSGSPSILNATPEAAANGGLAILKTGDRVRIDLNTGSANILISD CCHHCCCCCCCCCCCCCCCCCCEEECCCCHHCCCCEEEEECCCEEEEEEECCCEEEEECC DELKQRRAELEAHGGFAYPKHQTPWQELYRQTVGQQATGACLELATRYHDIAGTVGVARH HHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCHHCC NH CC >Mature Secondary Structure TKITPGTARRKLRSSEWFNDPHNPAMTALYLERYLNYGLTRGELQSGKPIIGIAQTGND CCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCC LSPCNRHHLELAQRVREGIRAAGGIAMEFPVHPIQETGKRPTAALDRNLAYLGLVEVLFS CCCCHHHHHHHHHHHHHHHHHHCCEEEECCCCCHHHCCCCCCHHHHCCHHHHHHHHHHHH YPLDGVVLTTGCDKTTPACLMAAATVNIPAIVLSGGPMLNGWHNGERSGSGTVVWKSRER CCCCCEEEECCCCCCCHHHHHHHHHCCCCEEEEECCCEECCCCCCCCCCCCEEEECCCCC LAAGEIDYEEFMEIVASSAPSVGHCNTMGTASTMNSLAEALGMSLPGCAAIPAPYRERGQ CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCHHHCCC IAYATGVRAVEMVWEDLKPSDILTREAFENAIVVNSAIGGSTNAPIHLNALARHIGVELS EEEECCCHHHHHHHHHCCCHHHHHHHHHCCEEEEEECCCCCCCCCEEHHHHHHHHCCEEE IDDWQSIGHKIPLLVNMQPAGFYLGEEYHRAGGVPAVVRELMQHGKIHTDALTVNGLTMG 
EHHHHHCCCCCCEEEEECCCCCCCCHHHHHCCCCHHHHHHHHHCCCEEEEEEEECCEEEC QNCAGAPAPDGDVIKSYDGPLVQDAGFLVLRGNLFESAIMKTSVISLEFRERYLSSPNDP CCCCCCCCCCCCCEECCCCCEEECCCEEEEECCHHHHHHHHHHHEEHHHHHHHCCCCCCC NAFEGRAIVFEGPEDYHDRIDDPALKIDEHCILFVRGTGPIGYPGGAEVVNMQPPAALIK CCCCCCEEEEECCHHHHHCCCCCCEEECCEEEEEEECCCCCCCCCCCCEEECCCCHHHHH RGIHSLPCIGDGRQSGTSGSPSILNATPEAAANGGLAILKTGDRVRIDLNTGSANILISD CCHHCCCCCCCCCCCCCCCCCCEEECCCCHHCCCCEEEEECCCEEEEEEECCCEEEEECC DELKQRRAELEAHGGFAYPKHQTPWQELYRQTVGQQATGACLELATRYHDIAGTVGVARH HHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCHHCC NH CC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA