Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is aidB [H]
Identifier: 157163652
GI number: 157163652
Start: 4439011
End: 4440636
Strand: Direct
Name: aidB [H]
Synonym: EcHS_A4431
Alternate gene names: 157163652
Gene position: 4439011-4440636 (Clockwise)
Preceding gene: 157163651
Following gene: 157163655
Centisome position: 95.6
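The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the genome length. The function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 4_643_538            # Length field of NC_009800
START, END = 4_439_011, 4_440_636    # Start and End fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1626
print(round(centisome(START, GENOME_LENGTH), 1))  # 95.6
```

Both results agree with the record: the gene sequence is listed as 1626 bases and the centisome position as 95.6.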
GC content: 55.04
Gene sequence:
>1626_bases GTGCACTGGCAAACTCACACCGTTTTTAATCAACCTATACCATTAAATAACAGCAACTTATACCTGTCTGATGGCGCGCT CTGCGAAGCGGTAACGCGTGAAGGTGCTGGCTGGGATAGCGATTTTCTAGCCAGTATTGGTCAGCAGTTAGGAACGGCTG AATCCCTTGAACTGGGGCGGCTGGCGAATGTGAATCCGCCTGAATTATTGCGCTACGATGCGCAAGGACGCCGTCTGGAC GATGTGCGTTTTCACCCCGCCTGGCACCTGCTGATGCAGGCGCTATGTACCAATCGGGTGCACAATCTTGCCTGGGAAGA AGACGCTCGCTCCGGCGCATTTGTGGCGCGCGCGGCGCGTTTTATGTTGCACGCACAGGTTGAGGCAGGGTCGTTATGTC CGATAACCATGACCTTTGCCGCCACGCCATTGCTGTTACAGATGTTACCCGCGCCGTTTCAGGACTGGACCACGCCGCTG CTGAGCGATCGCTACGATTCTCACTTATTGCCAGGTGGGCAAAAACGCGGTTTGTTGATTGGCATGGGAATGACGGAAAA GCAGGGCGGTTCCGATGTCATGAGCAACACCACCCGCGCAGAGCGCCTGGAAGATGGCTCTTATCGGCTGGTGGGGCATA AATGGTTTTTCTCGGTGCCGCAAAGCGATGCGCATCTGGTGCTGGCGCAGACTACGGGCGGTTTGTCCTGCTTTTTTGTG CCGCGCTTTTTGCCTGACGGGCAACGCAACGCGATTCGCCTCGAGCGGCTGAAAGATAAGCTGGGTAATCGCTCTAACGC CAGCTGCGAAGTGGAGTTTCAGGATGCCATTGGCTGGTTGTTGGGGCAGGAAGGGGAAGGAATTCGTCTGATCCTGAAAA TGGGTGGGATGACGCGTTTTGATTGCGCCCTGGGTAGCCATGCCATGATGCGCCGTGCATTTTCGCTGGCGATTTATCAT GCACATCAACGCCATGTTTTTGGTAATCCATTGATCCAACAGCCCCTTATGCGTCATGTCTTAAGTCGCATGGCACTTCA GCTTGAAGGGCAAACGGCGTTGCTGTTTCGTCTTGCGCGAGCGTGGGACCGGCGTGCCGATGCCAAAGAAGCCCTGTGGG CGCGTTTATTTACGCCTGCGGCAAAATTTGTGATCTGCAAAAGAGGTATGACGTTTGTGGCCGAAGCGATGGAGGTGCTG GGCGGCATTGGTTATTGCGAGGAGAGCGAGCTGCCGCGGCTTTACCGGGAGATGCCGGTAAACAGTATTTGGGAAGGTTC CGGCAATATTATGTGCCTGGATGTGCTGCGCGTTCTCAATAAGCAAGCGGGCGTATACGACTTATTGTCGGAAGCGTTTG TGGAAGTGAAAGGGCAGGATCGCTATTTTGATCGCGCGGTTCGTCGTTTACAGCAGCAGCTGCGCAAGCCAGCTGAAGAA CTGGGGCGAGAGATTACTCATCAGCTATTCCTGCTGGGCTGCGGTGCGCAAATGTTGAAATATGCCTCTCCGCCAATGGC GCAGGCGTGGTGTCAGGTGATGTTAGATACGCGCGGCGGCGTACGGTTGTCAGAGCAGATCCAGAATGATTTATTGCTGC GGGCGACGGGGGGAGTGTGTGTGTAA
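The GC content field is the percentage of G and C bases in the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as a plain string with the FASTA header removed. The `gc_content` helper is hypothetical, and the example input is only the first 20 bases of the gene, not the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace from line wrapping is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence, split by a space as in wrapped output.
print(gc_content("GTGCACTGGC AAACTCACAC"))  # 55.0
```

Running the same function on the full 1626-base sequence should reproduce the GC content value reported in the record, provided the same definition was used there.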
Upstream 100 bases:
>100_bases ATTACATTGCTGGATAAGAATGTTTTAGCAATCTCTTTCTGTCATGAATCCATGGCAGTGACCATACTAATGGTGACTGC CATTGATGGAGGGAGACACA
Downstream 100 bases:
>100_bases GCGTATACGACTGATGCGACGCTGGTTTCGATTAACTCAATGAAATATGTGAAAATTGTAGGCCGGACAAGGCGCTCGCG CCGCATCCGGCATTGTTCAT
Product: isovaleryl CoA dehydrogenase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 541; Mature: 541
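The residue count follows from the gene length: 1626 bases form 542 codons, and the final TAA stop codon is not translated, which leaves 541 residues. The gene begins with GTG, an alternative bacterial start codon that is read as methionine at initiation, so the protein sequence still begins with M. A sketch of the arithmetic, assuming a coding sequence with exactly one terminal stop codon (the function name is illustrative):

```python
def protein_length(cds_bases: int) -> int:
    """Residue count for a coding sequence that ends in a single stop codon."""
    if cds_bases <= 0 or cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    return cds_bases // 3 - 1  # drop the stop codon


print(protein_length(1626))  # 541
```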
Protein sequence:
>541_residues MHWQTHTVFNQPIPLNNSNLYLSDGALCEAVTREGAGWDSDFLASIGQQLGTAESLELGRLANVNPPELLRYDAQGRRLD DVRFHPAWHLLMQALCTNRVHNLAWEEDARSGAFVARAARFMLHAQVEAGSLCPITMTFAATPLLLQMLPAPFQDWTTPL LSDRYDSHLLPGGQKRGLLIGMGMTEKQGGSDVMSNTTRAERLEDGSYRLVGHKWFFSVPQSDAHLVLAQTTGGLSCFFV PRFLPDGQRNAIRLERLKDKLGNRSNASCEVEFQDAIGWLLGQEGEGIRLILKMGGMTRFDCALGSHAMMRRAFSLAIYH AHQRHVFGNPLIQQPLMRHVLSRMALQLEGQTALLFRLARAWDRRADAKEALWARLFTPAAKFVICKRGMTFVAEAMEVL GGIGYCEESELPRLYREMPVNSIWEGSGNIMCLDVLRVLNKQAGVYDLLSEAFVEVKGQDRYFDRAVRRLQQQLRKPAEE LGREITHQLFLLGCGAQMLKYASPPMAQAWCQVMLDTRGGVRLSEQIQNDLLLRATGGVCV
Sequences:
>Translated_541_residues MHWQTHTVFNQPIPLNNSNLYLSDGALCEAVTREGAGWDSDFLASIGQQLGTAESLELGRLANVNPPELLRYDAQGRRLD DVRFHPAWHLLMQALCTNRVHNLAWEEDARSGAFVARAARFMLHAQVEAGSLCPITMTFAATPLLLQMLPAPFQDWTTPL LSDRYDSHLLPGGQKRGLLIGMGMTEKQGGSDVMSNTTRAERLEDGSYRLVGHKWFFSVPQSDAHLVLAQTTGGLSCFFV PRFLPDGQRNAIRLERLKDKLGNRSNASCEVEFQDAIGWLLGQEGEGIRLILKMGGMTRFDCALGSHAMMRRAFSLAIYH AHQRHVFGNPLIQQPLMRHVLSRMALQLEGQTALLFRLARAWDRRADAKEALWARLFTPAAKFVICKRGMTFVAEAMEVL GGIGYCEESELPRLYREMPVNSIWEGSGNIMCLDVLRVLNKQAGVYDLLSEAFVEVKGQDRYFDRAVRRLQQQLRKPAEE LGREITHQLFLLGCGAQMLKYASPPMAQAWCQVMLDTRGGVRLSEQIQNDLLLRATGGVCV >Mature_541_residues MHWQTHTVFNQPIPLNNSNLYLSDGALCEAVTREGAGWDSDFLASIGQQLGTAESLELGRLANVNPPELLRYDAQGRRLD DVRFHPAWHLLMQALCTNRVHNLAWEEDARSGAFVARAARFMLHAQVEAGSLCPITMTFAATPLLLQMLPAPFQDWTTPL LSDRYDSHLLPGGQKRGLLIGMGMTEKQGGSDVMSNTTRAERLEDGSYRLVGHKWFFSVPQSDAHLVLAQTTGGLSCFFV PRFLPDGQRNAIRLERLKDKLGNRSNASCEVEFQDAIGWLLGQEGEGIRLILKMGGMTRFDCALGSHAMMRRAFSLAIYH AHQRHVFGNPLIQQPLMRHVLSRMALQLEGQTALLFRLARAWDRRADAKEALWARLFTPAAKFVICKRGMTFVAEAMEVL GGIGYCEESELPRLYREMPVNSIWEGSGNIMCLDVLRVLNKQAGVYDLLSEAFVEVKGQDRYFDRAVRRLQQQLRKPAEE LGREITHQLFLLGCGAQMLKYASPPMAQAWCQVMLDTRGGVRLSEQIQNDLLLRATGGVCV
Specific function: May help to prevent alkylation damage by protecting DNA and destroying alkylating agents [H]
COG id: COG1960
COG function: function code I; Acyl-CoA dehydrogenases
Gene ontology:
Cell location: Cytoplasm (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the acyl-CoA dehydrogenase family [H]
Homologues:
Organism=Homo sapiens, GI4557233, Length=287, Percent_Identity=29.616724738676, Blast_Score=100, Evalue=7e-21
Organism=Homo sapiens, GI226958412, Length=271, Percent_Identity=24.7232472324723, Blast_Score=80, Evalue=4e-15
Organism=Homo sapiens, GI226958414, Length=271, Percent_Identity=24.7232472324723, Blast_Score=80, Evalue=4e-15
Organism=Homo sapiens, GI4501859, Length=252, Percent_Identity=25.3968253968254, Blast_Score=79, Evalue=8e-15
Organism=Homo sapiens, GI7656849, Length=266, Percent_Identity=27.8195488721804, Blast_Score=76, Evalue=7e-14
Organism=Homo sapiens, GI4557231, Length=274, Percent_Identity=25.1824817518248, Blast_Score=71, Evalue=2e-12
Organism=Homo sapiens, GI187960098, Length=274, Percent_Identity=25.1824817518248, Blast_Score=71, Evalue=3e-12
Organism=Homo sapiens, GI21361497, Length=260, Percent_Identity=24.2307692307692, Blast_Score=69, Evalue=1e-11
Organism=Escherichia coli, GI87082384, Length=541, Percent_Identity=99.4454713493531, Blast_Score=1107, Evalue=0.0
Organism=Escherichia coli, GI1786223, Length=329, Percent_Identity=25.531914893617, Blast_Score=82, Evalue=8e-17
Organism=Escherichia coli, GI87081958, Length=258, Percent_Identity=26.3565891472868, Blast_Score=76, Evalue=6e-15
Organism=Caenorhabditis elegans, GI86563383, Length=538, Percent_Identity=29.9256505576208, Blast_Score=189, Evalue=4e-48
Organism=Caenorhabditis elegans, GI86563381, Length=538, Percent_Identity=29.9256505576208, Blast_Score=188, Evalue=6e-48
Organism=Caenorhabditis elegans, GI17538396, Length=271, Percent_Identity=28.7822878228782, Blast_Score=100, Evalue=4e-21
Organism=Caenorhabditis elegans, GI17508101, Length=267, Percent_Identity=23.9700374531835, Blast_Score=86, Evalue=5e-17
Organism=Caenorhabditis elegans, GI17506239, Length=270, Percent_Identity=24.0740740740741, Blast_Score=84, Evalue=2e-16
Organism=Caenorhabditis elegans, GI71985184, Length=266, Percent_Identity=24.4360902255639, Blast_Score=75, Evalue=7e-14
Organism=Caenorhabditis elegans, GI17534899, Length=269, Percent_Identity=24.5353159851301, Blast_Score=73, Evalue=3e-13
Organism=Caenorhabditis elegans, GI17569725, Length=268, Percent_Identity=24.2537313432836, Blast_Score=72, Evalue=7e-13
Organism=Caenorhabditis elegans, GI17505929, Length=298, Percent_Identity=26.510067114094, Blast_Score=67, Evalue=2e-11
Organism=Drosophila melanogaster, GI21356377, Length=312, Percent_Identity=25.6410256410256, Blast_Score=92, Evalue=1e-18
Organism=Drosophila melanogaster, GI21355753, Length=297, Percent_Identity=28.2828282828283, Blast_Score=89, Evalue=9e-18
Organism=Drosophila melanogaster, GI281363737, Length=341, Percent_Identity=23.7536656891496, Blast_Score=84, Evalue=2e-16
Organism=Drosophila melanogaster, GI24646207, Length=274, Percent_Identity=25.5474452554745, Blast_Score=80, Evalue=5e-15
Organism=Drosophila melanogaster, GI24660351, Length=279, Percent_Identity=25.4480286738351, Blast_Score=73, Evalue=4e-13
Organism=Drosophila melanogaster, GI24666513, Length=269, Percent_Identity=22.3048327137546, Blast_Score=68, Evalue=1e-11
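The homologue hits follow a fixed `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be loaded into records for sorting or filtering. The following is a minimal sketch, assuming every hit carries all six fields in that order. The `Hit` type and `parse_hits` function are hypothetical names, and the sample holds two hits copied from the list above.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[^,\s]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Parse hits from text; line breaks inside a hit are collapsed first."""
    normalized = " ".join(text.split())
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["identity"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_PATTERN.finditer(normalized)
    ]


sample = (
    "Organism=Homo sapiens, GI4557233, Length=287, "
    "Percent_Identity=29.616724738676, Blast_Score=100, Evalue=7e-21, "
    "Organism=Escherichia coli, GI87082384, Length=541, "
    "Percent_Identity=99.4454713493531, Blast_Score=1107, Evalue=0.0,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi)  # Escherichia coli 87082384
```

Collapsing whitespace before matching means an organism name that was wrapped across two lines is still read as a single name.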
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006089
- InterPro: IPR006090
- InterPro: IPR006091
- InterPro: IPR009075
- InterPro: IPR009100 [H]
Pfam domain/function: PF00441 Acyl-CoA_dh_1; PF02770 Acyl-CoA_dh_M [H]
EC number: NA
Molecular weight: Translated: 60640; Mature: 60640
Theoretical pI: Translated: 7.89; Mature: 7.89
Prosite motif: PS00072 ACYL_COA_DH_1 ; PS00073 ACYL_COA_DH_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.2 %Cys (Translated Protein)
3.9 %Met (Translated Protein)
6.1 %Cys+Met (Translated Protein)
2.2 %Cys (Mature Protein)
3.9 %Met (Mature Protein)
6.1 %Cys+Met (Mature Protein)
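The Cys/Met figures are residue counts expressed as a percentage of protein length. The following is a minimal sketch, assuming one-letter amino acid codes. The `residue_percent` helper is hypothetical, and the input is a toy 10-residue sequence rather than the protein from this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    count = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * count / len(seq), 1)


toy = "MCAAAAAAAA"  # toy sequence, not taken from the record
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applied to the 541-residue protein sequence, the same calls would be expected to give values close to those listed, allowing for rounding.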
Secondary structure:
>Translated Secondary Structure MHWQTHTVFNQPIPLNNSNLYLSDGALCEAVTREGAGWDSDFLASIGQQLGTAESLELGR CCCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHCCCCC LANVNPPELLRYDAQGRRLDDVRFHPAWHLLMQALCTNRVHNLAWEEDARSGAFVARAAR CCCCCCHHHHEECCCCCCCCCCEECHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH FMLHAQVEAGSLCPITMTFAATPLLLQMLPAPFQDWTTPLLSDRYDSHLLPGGQKRGLLI HHHHEECCCCCCCCEEHHHHHHHHHHHHCCCCHHHHCCHHHHCCCCCCCCCCCCCCCEEE GMGMTEKQGGSDVMSNTTRAERLEDGSYRLVGHKWFFSVPQSDAHLVLAQTTGGLSCFFV ECCCCCCCCCCHHHHCCHHHHHCCCCCEEEEEEEEEEECCCCCCEEEEEECCCCEEEEEE PRFLPDGQRNAIRLERLKDKLGNRSNASCEVEFQDAIGWLLGQEGEGIRLILKMGGMTRF HHHCCCCCCCHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHCCCCCCEEEEEEECCCCEE DCALGSHAMMRRAFSLAIYHAHQRHVFGNPLIQQPLMRHVLSRMALQLEGQTALLFRLAR HHHHCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH AWDRRADAKEALWARLFTPAAKFVICKRGMTFVAEAMEVLGGIGYCEESELPRLYREMPV HHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCC NSIWEGSGNIMCLDVLRVLNKQAGVYDLLSEAFVEVKGQDRYFDRAVRRLQQQLRKPAEE CCEECCCCCEEHHHHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH LGREITHQLFLLGCGAQMLKYASPPMAQAWCQVMLDTRGGVRLSEQIQNDLLLRATGGVC HHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHCCCCCCEEHHHHHCCEEEEECCCCC V C >Mature Secondary Structure MHWQTHTVFNQPIPLNNSNLYLSDGALCEAVTREGAGWDSDFLASIGQQLGTAESLELGR CCCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHCCCCC LANVNPPELLRYDAQGRRLDDVRFHPAWHLLMQALCTNRVHNLAWEEDARSGAFVARAAR CCCCCCHHHHEECCCCCCCCCCEECHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH FMLHAQVEAGSLCPITMTFAATPLLLQMLPAPFQDWTTPLLSDRYDSHLLPGGQKRGLLI HHHHEECCCCCCCCEEHHHHHHHHHHHHCCCCHHHHCCHHHHCCCCCCCCCCCCCCCEEE GMGMTEKQGGSDVMSNTTRAERLEDGSYRLVGHKWFFSVPQSDAHLVLAQTTGGLSCFFV ECCCCCCCCCCHHHHCCHHHHHCCCCCEEEEEEEEEEECCCCCCEEEEEECCCCEEEEEE PRFLPDGQRNAIRLERLKDKLGNRSNASCEVEFQDAIGWLLGQEGEGIRLILKMGGMTRF HHHCCCCCCCHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHCCCCCCEEEEEEECCCCEE DCALGSHAMMRRAFSLAIYHAHQRHVFGNPLIQQPLMRHVLSRMALQLEGQTALLFRLAR HHHHCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH AWDRRADAKEALWARLFTPAAKFVICKRGMTFVAEAMEVLGGIGYCEESELPRLYREMPV 
HHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCC NSIWEGSGNIMCLDVLRVLNKQAGVYDLLSEAFVEVKGQDRYFDRAVRRLQQQLRKPAEE CCEECCCCCEEHHHHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH LGREITHQLFLLGCGAQMLKYASPPMAQAWCQVMLDTRGGVRLSEQIQNDLLLRATGGVC HHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHCCCCCCEEHHHHHCCEEEEECCCCC V C
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7961409; 7610040; 9278503 [H]