Definition | Ruegeria sp. TM1040, complete genome. |
---|---|
Accession | NC_008044 |
Length | 3,200,938 |
The map label for this gene is ilvI [H]
Identifier: 99080716
GI number: 99080716
Start: 931943
End: 933694
Strand: Reverse
Name: ilvI [H]
Synonym: TM1040_0875
Alternate gene names: 99080716
Gene position: 933694-931943 (Counterclockwise)
Preceding gene: 99080717
Following gene: 99080715
Centisome position: 29.17
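The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own code: the function names are hypothetical, and it assumes 1-based inclusive coordinates and that the centisome value is taken from the gene's 5' coordinate (the End value for this reverse-strand gene), which is the reading that reproduces the listed 29.17.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 3_200_938          # Length field of the record
START, END = 931_943, 933_694      # Start and End fields of the record

print(gene_length(START, END))         # 1752, matching the >1752_bases header
print(centisome(END, GENOME_LENGTH))   # 29.17 (assumes the 5' end is used)
```

Using the Start coordinate instead gives roughly 29.11, so the listed value appears to be derived from the first base of the reverse-strand gene.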
GC content: 60.56
Gene sequence:
>1752_bases ATGACACGTGAAATGACCGGCGCAAAAATGGTTGTCCAAGCCCTCAAGGAGCAGGGCGTGGACACGGTATTCGGATATCC CGGAGGCGCTGTCCTACCCATTTACGATGAAATCTTTCAGCAAAATGACATTCGCCATATTCTGGTCCGTCACGAGCAGG GCGCAGTGCATGCCGCCGAAGGGTATGCGCGCTCGACCGGCAAACCCGGCGTGGTGCTTGTGACCTCTGGCCCCGGTGCC ACCAACGCGGTGACCGGCCTCACGGATGCGCTGCTGGACTCGATCCCGCTGGTGGTTCTCACCGGGCAGGTCCCGACCTT CATGATCGGCTCTGACGCCTTTCAGGAGGCCGACACCGTCGGCATCACCCGCCCCTGCACCAAGCACAACTGGCTGGTGA AGGACACCGATAAACTCGCCTCCACCATCCACGAAGGGTTCCATGTCGCAACCTCTGGCCGCCCTGGCCCGGTGCTGATC GACATTCCCAAGGACGTGCAGTTTGCCACCGGCACCTATGAGCCCAAGAAACCCTCGGCTTCGCATTACCAGCCGGTCGT CAAGGGCGACATGGAAGAAATCACCGAGCTGGTCGCCGCGATGGAGACCGCCAAACGCCCGGTGTTTTATACCGGCGGCG GCGTGATCAACTCGGGCCCGGCCGCAAGCCAGCTTCTGCGCGAACTGGTGGAGGCCACCGGCTTTCCGATCACCTCGACC CTGATGGGTCTCGGTGCCTATCCCGCGTCGGGCAAGCAGTGGCTTGGGATGCTTGGCATGCATGGTCTCTACGAGGCCAA TATGGCGATGCATGACTGCGACCTGATGATCAACATCGGCGCGCGCTTTGATGACCGGATCACCGGTCGCATCGACGCCT TCAGCCCGAAATCCATCAAGGCCCATATCGACATCGACCCCTCCTCGATCAACAAGGTGATCAAGGCGGACATCCCGATT GTCGGGGACGTTGGCCATGTGCTTGAGGACATTCTCAAGGTCTGGAAGAGCCGCGGGCGCAAGACCAACGCCGAAGCACT GGCAAAATGGCAGGGCCAGATCGACGAATGGCGCGCGGTGAAATGCCTGACCTATGAGATGTCCGAAACCACCATCAAGC CGCAATATGCGCTTGAGCGTCTCGAGGCGCTGACCAAGGGTCGGGATCGCTATATCACCACCGAAGTGGGCCAGCACCAG ATGTGGGCAGCACAGTTCCTGGGCTTTGAAGACCCCAACCGCTGGATGACCTCCGGGGGGCTTGGCACCATGGGCTATGG TACGCCTGCCTCTATCGGCGCGCAGATCGCGCATCCCGATGCGCTGGTGATCAACGTCGCGGGCGAGGCCTCTTGGCTGA TGAACATGCAGGAAATGGGCACTGCGACCCAGTACCGCCTGCCAGTGAAACAGTTCATCCTCAACAACGAACGCCTTGGC ATGGTGCGCCAGTGGCAGGAGCTCTTGCATGGTGAGCGCTACTCGCACAGCTGGTCCGAAGCGCTGCCCGATTTTGTCAA ACTCGCCGAAGCCTTTGGCGCCAAGGGCATCATCTGCTCGGACCCCAAGGATCTGGATGACGCGATCATGGAGATGATCG AATATGACGGGCCGGTGATCTTTGACTGTCTGGTGGAAAAGCACGAGAACTGCTTCCCGATGATCCCCTCGGGCAAGGCT CACAACGAGATGCTGTTGGGCGCGGCTGAAACGCAGGGCGTGATCCAGTCCGGCGGCGCGGTTCTGGTCTGA
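The GC content field can be recomputed from the gene sequence. This is a small sketch with a hypothetical helper name; it assumes GC content means the percentage of G and C among the A, C, G and T bases, and it skips the spaces that wrap the sequence in this record. The example uses only the first 12 bases, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence.

    Whitespace and any non-ACGT characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 12 bases of the gene sequence, with a space as in the wrapped record.
fragment = "ATGACA CGTGAA"
print(round(gc_content(fragment), 2))  # 41.67 (5 of 12 bases are G or C)
```

Passing the full 1752-base sequence to the same function should give a value comparable to the GC content field.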
Upstream 100 bases:
>100_bases ATGCGCCTCCGATCCTCAAAATCGGGGGCTTTTTTATGCGATACACACTGACGTTAGACTGACGCAAAGAAAGACGATGT CGTGAACTGGAGCAAAGCAG
Downstream 100 bases:
>100_bases TCATCTGAGGTTTGATTGCAAGGCGTATCGCGCCTTGCCCCGATATTTCGAGGAAAGGGACTGACATGTCTGCCCTACAC ATCAAAAAAGGTGCTACCAA
Product: acetolactate synthase 3 catalytic subunit
Products: NA
Alternate protein names: AHAS-III; ALS-III; Acetohydroxy-acid synthase III large subunit [H]
Number of amino acids: Translated: 583; Mature: 582
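The two amino acid counts follow from the gene length. The sketch below uses a hypothetical function name and assumes that the coding sequence ends in a single stop codon (the gene sequence ends in TGA) and that the mature form differs only by loss of the initiator methionine, as the Mature sequence in this record suggests.

```python
def protein_lengths(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes one terminal stop codon and removal of the initiator Met
    in the mature protein.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator methionine removed
    return translated, mature


print(protein_lengths(1752))  # (583, 582)
```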
Protein sequence:
>583_residues MTREMTGAKMVVQALKEQGVDTVFGYPGGAVLPIYDEIFQQNDIRHILVRHEQGAVHAAEGYARSTGKPGVVLVTSGPGA TNAVTGLTDALLDSIPLVVLTGQVPTFMIGSDAFQEADTVGITRPCTKHNWLVKDTDKLASTIHEGFHVATSGRPGPVLI DIPKDVQFATGTYEPKKPSASHYQPVVKGDMEEITELVAAMETAKRPVFYTGGGVINSGPAASQLLRELVEATGFPITST LMGLGAYPASGKQWLGMLGMHGLYEANMAMHDCDLMINIGARFDDRITGRIDAFSPKSIKAHIDIDPSSINKVIKADIPI VGDVGHVLEDILKVWKSRGRKTNAEALAKWQGQIDEWRAVKCLTYEMSETTIKPQYALERLEALTKGRDRYITTEVGQHQ MWAAQFLGFEDPNRWMTSGGLGTMGYGTPASIGAQIAHPDALVINVAGEASWLMNMQEMGTATQYRLPVKQFILNNERLG MVRQWQELLHGERYSHSWSEALPDFVKLAEAFGAKGIICSDPKDLDDAIMEMIEYDGPVIFDCLVEKHENCFPMIPSGKA HNEMLLGAAETQGVIQSGGAVLV
Sequences:
>Translated_583_residues MTREMTGAKMVVQALKEQGVDTVFGYPGGAVLPIYDEIFQQNDIRHILVRHEQGAVHAAEGYARSTGKPGVVLVTSGPGA TNAVTGLTDALLDSIPLVVLTGQVPTFMIGSDAFQEADTVGITRPCTKHNWLVKDTDKLASTIHEGFHVATSGRPGPVLI DIPKDVQFATGTYEPKKPSASHYQPVVKGDMEEITELVAAMETAKRPVFYTGGGVINSGPAASQLLRELVEATGFPITST LMGLGAYPASGKQWLGMLGMHGLYEANMAMHDCDLMINIGARFDDRITGRIDAFSPKSIKAHIDIDPSSINKVIKADIPI VGDVGHVLEDILKVWKSRGRKTNAEALAKWQGQIDEWRAVKCLTYEMSETTIKPQYALERLEALTKGRDRYITTEVGQHQ MWAAQFLGFEDPNRWMTSGGLGTMGYGTPASIGAQIAHPDALVINVAGEASWLMNMQEMGTATQYRLPVKQFILNNERLG MVRQWQELLHGERYSHSWSEALPDFVKLAEAFGAKGIICSDPKDLDDAIMEMIEYDGPVIFDCLVEKHENCFPMIPSGKA HNEMLLGAAETQGVIQSGGAVLV >Mature_582_residues TREMTGAKMVVQALKEQGVDTVFGYPGGAVLPIYDEIFQQNDIRHILVRHEQGAVHAAEGYARSTGKPGVVLVTSGPGAT NAVTGLTDALLDSIPLVVLTGQVPTFMIGSDAFQEADTVGITRPCTKHNWLVKDTDKLASTIHEGFHVATSGRPGPVLID IPKDVQFATGTYEPKKPSASHYQPVVKGDMEEITELVAAMETAKRPVFYTGGGVINSGPAASQLLRELVEATGFPITSTL MGLGAYPASGKQWLGMLGMHGLYEANMAMHDCDLMINIGARFDDRITGRIDAFSPKSIKAHIDIDPSSINKVIKADIPIV GDVGHVLEDILKVWKSRGRKTNAEALAKWQGQIDEWRAVKCLTYEMSETTIKPQYALERLEALTKGRDRYITTEVGQHQM WAAQFLGFEDPNRWMTSGGLGTMGYGTPASIGAQIAHPDALVINVAGEASWLMNMQEMGTATQYRLPVKQFILNNERLGM VRQWQELLHGERYSHSWSEALPDFVKLAEAFGAKGIICSDPKDLDDAIMEMIEYDGPVIFDCLVEKHENCFPMIPSGKAH NEMLLGAAETQGVIQSGGAVLV
Specific function: Valine and isoleucine biosynthesis; first step. [C]
COG id: COG0028
COG function: function code EH; Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the TPP enzyme family [H]
Homologues:
- Organism=Homo sapiens, GI93004078, Length=566, Percent_Identity=24.3816254416961, Blast_Score=145, Evalue=1e-34
- Organism=Homo sapiens, GI21361361, Length=595, Percent_Identity=24.7058823529412, Blast_Score=144, Evalue=2e-34
- Organism=Escherichia coli, GI87081685, Length=572, Percent_Identity=48.951048951049, Blast_Score=562, Evalue=1e-161
- Organism=Escherichia coli, GI1790104, Length=564, Percent_Identity=42.1985815602837, Blast_Score=449, Evalue=1e-127
- Organism=Escherichia coli, GI1786717, Length=570, Percent_Identity=31.5789473684211, Blast_Score=296, Evalue=2e-81
- Organism=Escherichia coli, GI1787096, Length=558, Percent_Identity=27.0609318996416, Blast_Score=180, Evalue=3e-46
- Organism=Escherichia coli, GI1788716, Length=570, Percent_Identity=24.5614035087719, Blast_Score=137, Evalue=1e-33
- Organism=Caenorhabditis elegans, GI17531299, Length=562, Percent_Identity=24.7330960854093, Blast_Score=150, Evalue=1e-36
- Organism=Caenorhabditis elegans, GI17531301, Length=562, Percent_Identity=24.7330960854093, Blast_Score=150, Evalue=3e-36
- Organism=Caenorhabditis elegans, GI17542570, Length=518, Percent_Identity=26.6409266409266, Blast_Score=132, Evalue=6e-31
- Organism=Saccharomyces cerevisiae, GI6323755, Length=585, Percent_Identity=45.2991452991453, Blast_Score=490, Evalue=1e-139
- Organism=Saccharomyces cerevisiae, GI6320816, Length=500, Percent_Identity=23.6, Blast_Score=103, Evalue=7e-23
- Organism=Saccharomyces cerevisiae, GI6321524, Length=588, Percent_Identity=22.108843537415, Blast_Score=79, Evalue=2e-15
- Organism=Drosophila melanogaster, GI19922626, Length=570, Percent_Identity=25.7894736842105, Blast_Score=153, Evalue=4e-37
Paralogues:
None
Copy number: 340 Molecules/Cell In: Growth-Phase, Minimal-Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012846
- InterPro: IPR012000
- InterPro: IPR012001
- InterPro: IPR000399
- InterPro: IPR011766 [H]
Pfam domain/function: PF02775 TPP_enzyme_C; PF00205 TPP_enzyme_M; PF02776 TPP_enzyme_N [H]
EC number: 2.2.1.6 [H]
Molecular weight: Translated: 63495; Mature: 63364
Theoretical pI: Translated: 5.35; Mature: 5.35
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
4.1 %Met (Translated Protein)
5.1 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
4.0 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
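The Cys/Met percentages are simple residue frequencies. The following sketch uses a hypothetical helper and assumes each value is the count of the named residues divided by the protein length, rounded to one decimal place. The example runs on only the first 20 residues of the translated protein, so its numbers are not the whole-protein values listed above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of a protein made up of any of the given one-letter residues."""
    seq = "".join(protein.split()).upper()  # drop the spaces used for wrapping
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(seq.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(seq), 1)


# First 20 residues of the translated protein sequence in this record.
fragment = "MTREMTGAKMVVQALKEQGV"
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 15.0 (3 of 20 residues)
print(residue_percent(fragment, "CM"))  # 15.0
```

Running the same function on the full Translated and Mature sequences should give values comparable to the listed percentages, if the database uses the same definition.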
Secondary structure:
>Translated Secondary Structure MTREMTGAKMVVQALKEQGVDTVFGYPGGAVLPIYDEIFQQNDIRHILVRHEQGAVHAAE CCCCCHHHHHHHHHHHHCCCCEEECCCCCCEEHHHHHHHCCCCCEEEEEECCCCCEEHHC GYARSTGKPGVVLVTSGPGATNAVTGLTDALLDSIPLVVLTGQVPTFMIGSDAFQEADTV CCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCEEEECCHHHHHHCCC GITRPCTKHNWLVKDTDKLASTIHEGFHVATSGRPGPVLIDIPKDVQFATGTYEPKKPSA CCCCCCCCCCCEEECHHHHHHHHHHCEEEEECCCCCCEEEECCCCCEECCCCCCCCCCCC SHYQPVVKGDMEEITELVAAMETAKRPVFYTGGGVINSGPAASQLLRELVEATGFPITST CCCCCHHCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCHHHH LMGLGAYPASGKQWLGMLGMHGLYEANMAMHDCDLMINIGARFDDRITGRIDAFSPKSIK HHHCCCCCCCCHHHHHHHCCCCHHHCCCEEECCEEEEEECCCCCCCCCCEEECCCCCCEE AHIDIDPSSINKVIKADIPIVGDVGHVLEDILKVWKSRGRKTNAEALAKWQGQIDEWRAV EEEECCHHHHHHHHHCCCCEECCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHHHEE KCLTYEMSETTIKPQYALERLEALTKGRDRYITTEVGQHQMWAAQFLGFEDPNRWMTSGG EEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCCCCCCEEECCC LGTMGYGTPASIGAQIAHPDALVINVAGEASWLMNMQEMGTATQYRLPVKQFILNNERLG CCCCCCCCCHHCCCEECCCCEEEEEECCCHHHHHHHHHHCCCCEECCCHHHHHHCCCCCH MVRQWQELLHGERYSHSWSEALPDFVKLAEAFGAKGIICSDPKDLDDAIMEMIEYDGPVI HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHCCCCEE FDCLVEKHENCFPMIPSGKAHNEMLLGAAETQGVIQSGGAVLV EHHHHHHCCCCCCCCCCCCCCCCEEEECHHHHHHHHCCCEEEC >Mature Secondary Structure TREMTGAKMVVQALKEQGVDTVFGYPGGAVLPIYDEIFQQNDIRHILVRHEQGAVHAAE CCCCHHHHHHHHHHHHCCCCEEECCCCCCEEHHHHHHHCCCCCEEEEEECCCCCEEHHC GYARSTGKPGVVLVTSGPGATNAVTGLTDALLDSIPLVVLTGQVPTFMIGSDAFQEADTV CCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCEEEECCHHHHHHCCC GITRPCTKHNWLVKDTDKLASTIHEGFHVATSGRPGPVLIDIPKDVQFATGTYEPKKPSA CCCCCCCCCCCEEECHHHHHHHHHHCEEEEECCCCCCEEEECCCCCEECCCCCCCCCCCC SHYQPVVKGDMEEITELVAAMETAKRPVFYTGGGVINSGPAASQLLRELVEATGFPITST CCCCCHHCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCHHHH LMGLGAYPASGKQWLGMLGMHGLYEANMAMHDCDLMINIGARFDDRITGRIDAFSPKSIK HHHCCCCCCCCHHHHHHHCCCCHHHCCCEEECCEEEEEECCCCCCCCCCEEECCCCCCEE AHIDIDPSSINKVIKADIPIVGDVGHVLEDILKVWKSRGRKTNAEALAKWQGQIDEWRAV EEEECCHHHHHHHHHCCCCEECCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHHHEE 
KCLTYEMSETTIKPQYALERLEALTKGRDRYITTEVGQHQMWAAQFLGFEDPNRWMTSGG EEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCCCCCCEEECCC LGTMGYGTPASIGAQIAHPDALVINVAGEASWLMNMQEMGTATQYRLPVKQFILNNERLG CCCCCCCCCHHCCCEECCCCEEEEEECCCHHHHHHHHHHCCCCEECCCHHHHHHCCCCCH MVRQWQELLHGERYSHSWSEALPDFVKLAEAFGAKGIICSDPKDLDDAIMEMIEYDGPVI HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHCCCCEE FDCLVEKHENCFPMIPSGKAHNEMLLGAAETQGVIQSGGAVLV EHHHHHHCCCCCCCCCCCCCCCCEEEECHHHHHHHHCCCEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 6308579; 1630901; 9278503; 3891724; 9298646 [H]