Definition | Escherichia coli 55989, complete genome. |
---|---|
Accession | NC_011748 |
Length | 5,154,862 |
The map label for this gene is araH
Identifier: 218695462
GI number: 218695462
Start: 2132835
End: 2133824
Strand: Reverse
Name: araH
Synonym: EC55989_2075
Alternate gene names: 218695462
Gene position: 2133824-2132835 (Counterclockwise)
Preceding gene: 218695463
Following gene: 218695461
Centisome position: 41.39
GC content: 51.11
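The coordinate fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not the database's own method. It assumes that the centisome position is the gene's first transcribed base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. That definition is inferred from the numbers in this record and is not stated by it. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_154_862
START, END = 2_132_835, 2_133_824
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome.

    Assumption: for a reverse-strand gene the first transcribed base is
    the higher coordinate (End).
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length


print(gene_length(START, END))                                  # 990
print(round(centisome(START, END, STRAND, GENOME_LENGTH), 2))   # 41.39
```

Both results agree with the record: 990 bases (the length given in the gene sequence header) and a centisome position of 41.39.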
Gene sequence:
>990_bases ATGATGTCTTCTGTTTCTACATCGGGGTCTGGCGCACCTAAGTCGTCATTCAGCTTCGGGCGTATCTGGGATCAGTACGG CATGCTGGTGGTGTTTGCGGTGCTCTTTATCGCCTGTGCCATTTTTGTCCCAAATTTTGCCACCTTCATTAATATGAAAG GGTTGGGCCTGGCAATTTCCATGTCGGGGATGGTGGCTTGTGGCATGTTGTTCTGCCTCGCTTCCGGTGACTTTGACCTT TCTGTCGCCTCCGTAATTGCCTGTGCGGGTGTCACCACGGCGGTGGTTATTAACCTGACTGAAAGCCTGTGGATTGGCGT GGCAGCGGGGTTGTTGCTGGGCGTTCTCTGTGGCCTGGTCAATGGCTTTGTTATCGCCAAACTGAAAATAAATGCTCTGA TCACGACATTGGCAACGATGCAGATTGTTCGAGGTCTGGCGTACATCATTTCAGACGGTAAAGCGGTCGGTATCGAAGAT GAAAGCTTCTTTGCCCTTGGTTACGCCAACTGGTTCGGTCTGCCTGCGCCAATCTGGCTCACCGTCGCGTGTCTGATTAT CTTTGGTTTGCTGCTGAATAAAACCACCTTTGGTCGTAACACCCTGGCGATTGGCGGGAACGAAGAGGCCGCGCGTCTGG CGGGTGTACCGGTTGTTCGCACCAAAATTATTATCTTTGTTCTCTCAGGCCTGGTATCAGCGATAGCCGGAATTATTCTG GCTTCACGTATGACCAGTGGGCAGCCAATGACGTCGATTGGTTATGAGCTGATTGTTATCTCCGCCTGCGTTTTAGGTGG CGTTTCTCTGAAAGGTGGCATCGGAAAAATCTCATATGTGGTGGCGGGTATCTTAATTTTAGGCACCGTGGAAAACGCCA TGAACCTGCTAAATATTTCTCCTTTCGCGCAGTACGTGGTTCGCGGCTTAATCCTGCTGGCAGCGGTGATCTTCGACCGT TACAAGCAAAAAGCGAAACGCACTGTCTGA
Upstream 100 bases:
>100_bases AAATCGCCGGTGAATTGTTACACGAGCAGGCAGATGAGCGTCAGGCACTGAGCCTTGCGATGCCTAAAGTCAGCCAGGCT GTTGCCTGAGTAAGGAGAGT
Downstream 100 bases:
>100_bases TGCTTTTTTCTGCAACAATTTAGCGTTTTTTCCCACCATAGCCAACCGCCATAACGGTTGGCTGTTCTTCGTTGCAAATG GCGACCCCCGTCACACTGTC
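The GC content field can in principle be recomputed from the sequence lines above. The sketch below assumes the value is the percentage of G and C bases over the gene sequence alone; the record does not say which span it was computed over. `clean_sequence` and `gc_content` are hypothetical helper names, and the input shown is a toy string rather than the full gene.

```python
def clean_sequence(record: str) -> str:
    """Drop the '>N_bases' header token and the spaces between 80-base chunks."""
    tokens = record.split()
    return "".join(t for t in tokens if not t.startswith(">")).upper()


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sequence.count("G") + sequence.count("C")
    return 100.0 * gc / len(sequence)


# Toy record in the same layout as the sequence lines of this entry.
toy = ">8_bases ATGC GGCC"
print(gc_content(clean_sequence(toy)))  # 75.0
```

Passing the full "Gene sequence" line through `clean_sequence` and then `gc_content` is one way to compare against the reported figure.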
Product: L-arabinose transporter permease protein
Products: ADP; phosphate; arabinose [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 329; Mature: 329
Protein sequence:
>329_residues MMSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDL SVASVIACAGVTTAVVINLTESLWIGVAAGLLLGVLCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIED ESFFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIIL ASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDR YKQKAKRTV
Sequences:
>Translated_329_residues MMSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDL SVASVIACAGVTTAVVINLTESLWIGVAAGLLLGVLCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIED ESFFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIIL ASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDR YKQKAKRTV
>Mature_329_residues MMSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDL SVASVIACAGVTTAVVINLTESLWIGVAAGLLLGVLCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIED ESFFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIIL ASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDR YKQKAKRTV
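The relationship between the 990-base gene sequence and the 329-residue protein can be illustrated with a short translation sketch. It assumes the standard genetic code and that the gene sequence is already given in the coding (5' to 3') orientation, which is consistent with it starting with ATG and ending with TGA. The `translate` helper is hypothetical, and only the first and last codons of the gene are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard table applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 30 bases of the gene sequence in this record.
print(translate("ATGATGTCTTCTGTTTCT"))              # MMSSVS
print(translate("TACAAGCAAAAAGCGAAACGCACTGTCTGA"))  # YKQKAKRTV
```

The two fragments reproduce the N-terminus (MMSSVS) and C-terminus (YKQKAKRTV) of the protein sequence, and 990 / 3 = 330 codons accounts for 329 residues plus the TGA stop codon.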
Specific function: Part of the binding-protein-dependent transport system for L-arabinose. Probably responsible for the translocation of the substrate across the membrane [H]
COG id: COG1172
COG function: function code G; Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the binding-protein-dependent transport system permease family. AraH/rbsC subfamily [H]
Homologues:
Organism=Escherichia coli, GI145693152, Length=328, Percent_Identity=100, Blast_Score=640, Evalue=0.0
Organism=Escherichia coli, GI1790191, Length=310, Percent_Identity=35.8064516129032, Blast_Score=168, Evalue=5e-43
Organism=Escherichia coli, GI1790524, Length=318, Percent_Identity=30.188679245283, Blast_Score=144, Evalue=6e-36
Organism=Escherichia coli, GI1788896, Length=311, Percent_Identity=31.8327974276527, Blast_Score=139, Evalue=2e-34
Organism=Escherichia coli, GI1789992, Length=132, Percent_Identity=46.2121212121212, Blast_Score=122, Evalue=3e-29
Organism=Escherichia coli, GI87082395, Length=272, Percent_Identity=31.9852941176471, Blast_Score=107, Evalue=9e-25
Organism=Escherichia coli, GI1787793, Length=256, Percent_Identity=35.15625, Blast_Score=100, Evalue=2e-22
Organism=Escherichia coli, GI145693214, Length=242, Percent_Identity=38.0165289256198, Blast_Score=92, Evalue=3e-20
Organism=Escherichia coli, GI1788471, Length=321, Percent_Identity=30.8411214953271, Blast_Score=92, Evalue=3e-20
Organism=Escherichia coli, GI1787794, Length=274, Percent_Identity=28.8321167883212, Blast_Score=92, Evalue=6e-20
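The homologue entries above follow a regular `key=value` layout, so they can be loaded into structured records. The sketch below is one possible parser, written against the field order seen in this record only; the `Hit` type and `parse_hits` function are hypothetical names, and the sample string holds just the first two entries.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Field order is taken from the entries in this record (assumption: it is
# the same for every entry).
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the text."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_PATTERN.finditer(text)
    ]


sample = (
    "Organism=Escherichia coli, GI145693152, Length=328, Percent_Identity=100, "
    "Blast_Score=640, Evalue=0.0, "
    "Organism=Escherichia coli, GI1790191, Length=310, "
    "Percent_Identity=35.8064516129032, Blast_Score=168, Evalue=5e-43,"
)
hits = parse_hits(sample)
for hit in hits:
    print(hit.gi, round(hit.percent_identity, 2), hit.evalue)
```

Once parsed, the hits can be filtered or sorted, for example by keeping only those with a percent identity above a chosen threshold.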
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001851 [H]
Pfam domain/function: PF02653 BPD_transp_2 [H]
EC number: NA
Molecular weight: Translated: 34343; Mature: 34343
Theoretical pI: Translated: 9.26; Mature: 9.26
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein)
3.3 %Met (Translated Protein)
5.5 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
3.3 %Met (Mature Protein)
5.5 %Cys+Met (Mature Protein)
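The Cys/Met percentages are simple residue frequencies. The sketch below shows the calculation on a toy peptide; `residue_percent` is a hypothetical helper, and rounding to one decimal place is assumed from the way the values are printed in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


toy = "MCAAAAAAAA"  # 10 residues: 1 Met, 1 Cys
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

For the 329-residue protein in this record, 7 Cys and 11 Met residues give 2.1 %, 3.3 % and 5.5 % combined, matching the reported figures.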
Secondary structure:
>Translated Secondary Structure
MMSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAIS
CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCEEEE
MSGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINLTESLWIGVAAGLLLGVLCGLV
HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDESFFALGYANWFGLPAPIWL
CCCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCEEEEEEHHHCCCCHHHHH
TVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIIL
HHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
ASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNIS
HHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC
PFAQYVVRGLILLAAVIFDRYKQKAKRTV
HHHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MMSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAIS
CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCEEEE
MSGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINLTESLWIGVAAGLLLGVLCGLV
HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDESFFALGYANWFGLPAPIWL
CCCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCEEEEEEHHHCCCCHHHHH
TVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIIL
HHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
ASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNIS
HHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC
PFAQYVVRGLILLAAVIFDRYKQKAKRTV
HHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; arabinose [Periplasm]; H2O [C]
Specific reaction: ATP + arabinose [Periplasm] + H2O = ADP + phosphate + arabinose [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 2445996; 9097040; 9278503; 8045430 [H]