Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
---|---|
Accession | NC_011353 |
Length | 5,572,075 |
The map label for this gene is leuA [H]
Identifier: 209398496
GI number: 209398496
Start: 86546
End: 88117
Strand: Reverse
Name: leuA [H]
Synonym: ECH74115_0081
Alternate gene names: 209398496
Gene position: 88117-86546 (Counterclockwise)
Preceding gene: 209396438
Following gene: 209399607
Centisome position: 1.58
GC content: 53.12
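The positional fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the centisome value is the gene's first base on its own strand (88117 for this reverse-strand gene) expressed as a percentage of the genome length. That reading is an inference from the numbers in this record, not a documented definition.

```python
# Values copied from the record; the interpretation of "centisome" is an assumption.
genome_length = 5_572_075      # Length field
start, end = 86546, 88117      # Start / End fields (assumed 1-based, inclusive)

# Inclusive span of the gene on the chromosome.
gene_length = end - start + 1

# On the reverse strand the gene begins at the larger coordinate.
gene_start_on_strand = end
centisome = 100 * gene_start_on_strand / genome_length

print(gene_length)             # 1572
print(round(centisome, 2))     # 1.58
```

Using the smaller coordinate (86546) instead gives about 1.55, which is why the larger coordinate is assumed here.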
Gene sequence:
>1572_bases ATGAGCCAGCAAGTCATTATTTTCGATACCACATTGCGCGACGGTGAACAGGCGTTACAGGCAAGCTTGAGTGTGAAAGA AAAACTGCAAATTGCGCTGGCCCTTGAGCGTATGGGTGTTGACGTGATGGAAGTCGGGTTCCCCGTCTCTTCGCCGGGTG ATTTTGAATCAGTGCAAACCATCGCTCGCCAGGTTAAAAACAGCCGTGTATGCGCGTTAGCTCGCTGCGTGGAGAAAGAT ATCGACGTGGCGGCCGAATCCCTGAAAGTCGCCGAAGCCTTCCGTATTCATACCTTTATTGCCACTTCGCCAATGCACAT CGCCACCAAGCTGCGCAGCACGCTGGACGAGGTGATCGAACGCGCTATCTATATGGTGAAACGCGCCCGTAATTACACCG ATGATGTTGAATTTTCTTGCGAAGATGCCGGGCGTACACCCATTGCCGATCTGGCGCGAGTGGTCGAAGCGGCGATTAAT GCCGGTGCCACCACCATCAACATTCCGGACACCGTGGGCTACACCATGCCGTTTGAGTTCGCCGGAATCATTAGCGGCCT GTATGAACGCGTGCCTAACATCGACAAAGCCATTATTTCCGTACATACCCACGACGATTTGGGCCTGGCGGTCGGCAACT CACTGGCGGCGGTACATGCCGGTGCACGCCAGGTGGAAGGTGCAATGAACGGGATCGGCGAGCGTGCCGGTAACTGTTCC CTGGAAGAAGTCATCATGGCGATTAAAGTTCGTAAGGATATTCTCAACGTCCACACCGCCATTAATCACCAGGAGATATG GCGCACCAGCCAGTTAGTTAGCCAGATTTGTAATATGCCGATCCCGGCAAACAAAGCCATTGTTGGCAGCGGCGCATTCG CACACTCCTCCGGTATCCACCAGGATGGCGTGCTGAAAAACCGCGAAAACTACGAAATCATGACACCAGAATCTATTGGT CTGAACCAAATCCAGCTGAATCTGACCTCTCGTTCGGGGCGTGCGGCGGTGAAACATCGCATGGATGAGATGGGGTATAA AGAAAGTGAATATAATTTAGACAATTTGTACGACGCCTTCCTCAAGCTGGCGGACAAAAAAGGTCAGGTGTTTGATTACG ATCTGGAGGCGCTGGCCTTCATCGGTAAGCAGCAAGAAGAGCCGGAGCATTTCCGTCTGGATTACTTCAGCGTGCAGTCT GGCTCTAACGATATCGCCACCGCCGCCGTCAAACTGGCCTGCGGCGAAGAAGTCAAAGCAGAAGCCGCCAACGGTAACGG TCCGGTCGATGCCGTCTATCAGGCGATTAACCGCATCACTGACTATAACGTCGAACTGGTGAAATACAGCCTGACCGCCA AAGGTCACGGTAAAGATGCGCTGGGTCAGGTGGATATTGTCGCCAACTACAACGGTCGCCGCTTCCACGGCGTCGGCCTG GCCACCGATATTGTCGAGTCCTCCGCCAAAGCCATGGTGCACGTACTTAACAATATCTGGCGTGCCACAGAAGTCGAAAA AGAGTTGCAACGCAAAGCTCAACACAACGAAAACAACAAGGAAACCGTGTGA
Upstream 100 bases:
>100_bases AGCATTAAGCCAGCACGCAGTCAAACAAAAAACCCGCGCCATTGCGCGGGTTTTTTTATGCCCGAAGCGAGGCGCTCTAA AAGAGACAAGGACCCAAACC
Downstream 100 bases:
>100_bases TGTCGAAGAATTACCATATTGCCGTATTGCCGGGGGATGGCATTGGTCCGGAAGTGATGACCCAGGCGCTGAAAGTGCTG GATGCCGTGCGCAACCGCTT
Product: 2-isopropylmalate synthase
Products: NA
Alternate protein names: Alpha-IPM synthase; Alpha-isopropylmalate synthase [H]
Number of amino acids: Translated: 523; Mature: 522
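The two residue counts follow from the gene length. As a rough check, assuming the final codon (TGA in the gene sequence above) is the stop codon and that the mature form differs only by removal of the initiator methionine (the mature sequence in this record begins with S rather than MS):

```python
# Gene length taken from the ">1572_bases" header in this record.
gene_bases = 1572

codons = gene_bases // 3           # 524 codons, including the terminal stop
translated_len = codons - 1        # the stop codon is not translated
mature_len = translated_len - 1    # initiator Met assumed to be removed

print(codons, translated_len, mature_len)  # 524 523 522
```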
Protein sequence:
>523_residues MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQTIARQVKNSRVCALARCVEKD IDVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAIN AGATTINIPDTVGYTMPFEFAGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCS LEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIG LNQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQS GSNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITDYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGL ATDIVESSAKAMVHVLNNIWRATEVEKELQRKAQHNENNKETV
Sequences:
>Translated_523_residues MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQTIARQVKNSRVCALARCVEKD IDVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAIN AGATTINIPDTVGYTMPFEFAGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCS LEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIG LNQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQS GSNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITDYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGL ATDIVESSAKAMVHVLNNIWRATEVEKELQRKAQHNENNKETV >Mature_522_residues SQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQTIARQVKNSRVCALARCVEKDI DVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAINA GATTINIPDTVGYTMPFEFAGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCSL EEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIGL NQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQSG SNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITDYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGLA TDIVESSAKAMVHVLNNIWRATEVEKELQRKAQHNENNKETV
Specific function: Catalyzes the condensation of the acetyl group of acetyl-CoA with 3-methyl-2-oxobutanoate (2-oxoisovalerate) to form 3-carboxy-3-hydroxy-4-methylpentanoate (2-isopropylmalate) [H]
COG id: COG0119
COG function: function code E; Isopropylmalate/homocitrate/citramalate synthases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the alpha-IPM synthase/homocitrate synthase family. LeuA type 1 subfamily [H]
Homologues:
Organism=Escherichia coli, GI1786261, Length=523, Percent_Identity=99.6175908221797, Blast_Score=1077, Evalue=0.0
Organism=Saccharomyces cerevisiae, GI6320019, Length=359, Percent_Identity=27.5766016713092, Blast_Score=135, Evalue=2e-32
Organism=Saccharomyces cerevisiae, GI6320071, Length=359, Percent_Identity=27.2980501392758, Blast_Score=135, Evalue=2e-32
Organism=Saccharomyces cerevisiae, GI6324682, Length=544, Percent_Identity=24.8161764705882, Blast_Score=123, Evalue=7e-29
Organism=Saccharomyces cerevisiae, GI6324225, Length=541, Percent_Identity=24.2144177449168, Blast_Score=120, Evalue=7e-28
Paralogues:
None
Copy number: 160 molecules/cell in growth phase, minimal media (based on E. coli) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013709
- InterPro: IPR002034
- InterPro: IPR013785
- InterPro: IPR005671
- InterPro: IPR000891 [H]
Pfam domain/function: PF00682 HMGL-like; PF08502 LeuA_dimer [H]
EC number: 2.3.3.13 [H]
Molecular weight: Translated: 57315; Mature: 57183
Theoretical pI: Translated: 5.49; Mature: 5.49
Prosite motif: PS00815 AIPM_HOMOCIT_SYNTH_1 ; PS00816 AIPM_HOMOCIT_SYNTH_2 ; PS50991 PYR_CT
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
3.6 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
3.4 %Cys+Met (Mature Protein)
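The percentages above are simple residue frequencies. The sketch below recomputes them from the translated protein sequence listed earlier in this record, assuming the mature form is the same sequence without its first residue. The helper name `percent` is illustrative only and is not part of the source database.

```python
# Translated protein sequence copied from the record (523 residues).
translated = (
    "MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQTIARQVKNSRVCALARCVEKD"
    "IDVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAIN"
    "AGATTINIPDTVGYTMPFEFAGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCS"
    "LEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIG"
    "LNQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQS"
    "GSNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITDYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGL"
    "ATDIVESSAKAMVHVLNNIWRATEVEKELQRKAQHNENNKETV"
)
mature = translated[1:]  # assumption: only the initiator Met is removed


def percent(seq, residues):
    """Percentage of positions in seq occupied by any of the given residue letters."""
    return 100 * sum(seq.count(r) for r in residues) / len(seq)


for label, seq in (("Translated", translated), ("Mature", mature)):
    print(
        label,
        f"{percent(seq, 'C'):.1f} %Cys",
        f"{percent(seq, 'M'):.1f} %Met",
        f"{percent(seq, 'CM'):.1f} %Cys+Met",
    )
```

Only the methionine figure changes between the two forms, which is what one would expect if the two sequences differ by a single N-terminal Met.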
Secondary structure:
>Translated Secondary Structure
MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQT
CCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCHHHHHH

IARQVKNSRVCALARCVEKDIDVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIE
HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH

RAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAINAGATTINIPDTVGYTMPFEF
HHHHHHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHH

AGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCS
HHHHHHHHHHCCCCCCEEEEEECCCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCC

LEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIH
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCC

QDGVLKNRENYEIMTPESIGLNQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAF
CCCCCCCCCCCEEECCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCCHHHHHHHH

LKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQSGSNDIATAAVKLACGEEVKA
HHHHCCCCCEEECCHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHCCCHHHC

EAANGNGPVDAVYQAINRITDYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGL
CCCCCCCCHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCH

ATDIVESSAKAMVHVLNNIWRATEVEKELQRKAQHNENNKETV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC

>Mature Secondary Structure
SQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQT
CCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCHHHHHH

IARQVKNSRVCALARCVEKDIDVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIE
HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH

RAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAINAGATTINIPDTVGYTMPFEF
HHHHHHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHH

AGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCS
HHHHHHHHHHCCCCCCEEEEEECCCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCC

LEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIH
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCC

QDGVLKNRENYEIMTPESIGLNQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAF
CCCCCCCCCCCEEECCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCCHHHHHHHH

LKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQSGSNDIATAAVKLACGEEVKA
HHHHCCCCCEEECCHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHCCCHHHC

EAANGNGPVDAVYQAINRITDYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGL
CCCCCCCCHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCH

ATDIVESSAKAMVHVLNNIWRATEVEKELQRKAQHNENNKETV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA