| Definition | Escherichia coli E24377A, complete genome. |
|---|---|
| Accession | NC_009801 |
| Length | 4,979,619 |
The map label for this gene is araH [H]
Identifier: 157156388
GI number: 157156388
Start: 2109375
End: 2110361
Strand: Reverse
Name: araH [H]
Synonym: EcE24377A_2130
Alternate gene names: 157156388
Gene position: 2110361-2109375 (Counterclockwise)
Preceding gene: 157156751
Following gene: 157158032
Centisome position: 42.38
GC content: 50.66
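The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and it assumes that the centisome position is the gene's 5' coordinate (the End value for a reverse-strand gene) expressed as a percentage of the genome length. The record states neither convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 4_979_619          # from the Length field
START, END = 2_109_375, 2_110_361  # from the Start and End fields


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    seq = "".join(seq.split()).upper()  # drop the spaces used in the record
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


print(gene_length(START, END))                   # 987
print(round(centisome(END, GENOME_LENGTH), 2))   # 42.38
```

Under these assumptions the results agree with the `>987_bases` header and the listed centisome position. `gc_percent` can be applied to the gene sequence that follows to compare against the GC content field.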
Gene sequence:
>987_bases ATGTCTTCTGTTTCTACATCGGGGTCTGGCGCACCTAAGTCGTCATTCAGCTTCGGGCGTATCTGGGATCAGTACGGCAT GCTGGTGGTGTTTGCGGTGCTCTTTATCGCCTGTGCCATTTTTGTCCCAAATTTTGCCACCTTCATTAATATGAAAGGGT TGGGCCTGGCAATTTCCATGTCGGGGATGGTGGCGTGTGGCATGTTGTTCTGCCTTGCTTCCGGTGACTTTGACCTTTCT GTCGCCTCCGTAATTGCCTGTGCGGGTGTCACCACGGCGGTGGTTATCAACCTGACTGAAAGCCTGTGGATTGGCGTGGC AGCGGGGTTGCTGCTTGGCATTCTCTGTGGCCTGGTCAATGGCTTTGTTATCGCCAAACTGAAAATAAATGCTCTGATCA CAACACTGGCAACGATGCAGATTGTTCGAGGTCTGGCGTACATCATTTCAGACGGTAAAGCGGTCGGTATCGAAGATGAA AGCTTCTTTGCCCTTGGTTACGCTAACTGGTTCGGTCTGCCTGCGCCAATCTGGCTCACCGTCGCGTGTCTGATTATCTT TGGTTTGTTGCTGAATAAAACCACCTTTGGTCGTAACACCCTGGCGATTGGCGGGAACGAAGAGGCTGCGCGTCTGGCGG GTGTACCGGTTGTTCGCACCAAAATTATTATCTTTGTTCTCTCTGGCCTGGTATCTGCGATAGCCGGAATTATTCTGGCT TCACGTATGACTAGTGGGCAGCCAATGACGTCGATTGGTTATGAGCTTATTGTTATCTCCGCCTGCGTTTTAGGTGGCGT TTCTCTGAAAGGTGGCATCGGAAAAATCTCATATGTGGTGGCGGGTATCTTAATTTTAGGCACCGTGGAAAACGCCATGA ACCTGCTTAATATTTCTCCTTTCGCGCAGTACGTGGTTCGCGGCTTAATCCTGCTGGCAGCGGTGATCTTCGACCGTTAC AAGCAAAAAGCGAAACGCACTGTCTGA
Upstream 100 bases:
>100_bases TCGCCGGTGAATTGTTACACGAGCAGGCAGATGAGCGTCAGGCACTGAGCCTTGCGATGCCTAAAGTCAGCCAGGCAGTT GCCTGAGTAAGGAGAGAATG
Downstream 100 bases:
>100_bases TGCTTTTTTCCACAACAATTTAACGTTTTTTCCCACCACAGCCAGCCGCCACAACGGTTGGCTGTTCTTCATTGCAAATG GCGACCCCCGTCACACTGTC
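Because the gene lies on the reverse strand, the gene sequence shown above is presumably the reverse complement of forward-strand positions 2109375 to 2110361. A minimal sketch of that extraction, assuming 1-based inclusive coordinates and a genome held as a plain string; `extract_gene` and the toy genome are hypothetical and not part of this record.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def extract_gene(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a gene from a forward-strand genome string.

    start and end are 1-based inclusive with start <= end;
    strand is "Forward" or "Reverse", as in the Strand field.
    """
    forward = genome[start - 1:end]
    return reverse_complement(forward) if strand == "Reverse" else forward


# Toy example: a reverse-strand gene at positions 3..10 of a 12-base genome.
toy_genome = "CCTCAGACATGG"
print(extract_gene(toy_genome, 3, 10, "Reverse"))  # ATGTCTGA
```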
Product: L-arabinose transporter permease protein
Products: ADP; phosphate; arabinose [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 328; Mature: 327
Protein sequence:
>328_residues MSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDLS VASVIACAGVTTAVVINLTESLWIGVAAGLLLGILCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDE SFFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIILA SRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDRY KQKAKRTV
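The relationship between the 987-base gene and the 328-residue translated product can be illustrated by translating codons with the standard codon table. This is a sketch under assumptions: the table is built from the conventional TCAG ordering, and the helper name `translate` is illustrative. Only the first and last few codons of the gene sequence are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used in the record
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTTCTGTTTCTACATCG"))  # MSSVSTS (start of the gene)
print(translate("CGCACTGTCTGA"))           # RTV (end of the gene, TGA stop)

codons = 987 // 3          # 329 codons including the stop codon
print(codons - 1)          # 328 residues in the translated product
```

The mature count of 327 is one less than the translated count, and the mature sequence listed in this record is the translated sequence without its first methionine.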
Sequences:
>Translated_328_residues MSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDLS VASVIACAGVTTAVVINLTESLWIGVAAGLLLGILCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDE SFFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIILA SRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDRY KQKAKRTV >Mature_327_residues SSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDLSV ASVIACAGVTTAVVINLTESLWIGVAAGLLLGILCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDES FFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIILAS RMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDRYK QKAKRTV
Specific function: Part of the binding-protein-dependent transport system for L-arabinose. Probably responsible for the translocation of the substrate across the membrane [H]
COG id: COG1172
COG function: function code G; Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the binding-protein-dependent transport system permease family. AraH/rbsC subfamily [H]
Homologues:
Organism=Escherichia coli, GI145693152, Length=328, Percent_Identity=99.6951219512195, Blast_Score=639, Evalue=0.0
Organism=Escherichia coli, GI1790191, Length=310, Percent_Identity=35.8064516129032, Blast_Score=167, Evalue=6e-43
Organism=Escherichia coli, GI1790524, Length=318, Percent_Identity=30.188679245283, Blast_Score=144, Evalue=5e-36
Organism=Escherichia coli, GI1788896, Length=311, Percent_Identity=31.8327974276527, Blast_Score=139, Evalue=2e-34
Organism=Escherichia coli, GI1789992, Length=132, Percent_Identity=46.2121212121212, Blast_Score=122, Evalue=3e-29
Organism=Escherichia coli, GI87082395, Length=272, Percent_Identity=31.9852941176471, Blast_Score=107, Evalue=1e-24
Organism=Escherichia coli, GI1787793, Length=256, Percent_Identity=35.15625, Blast_Score=100, Evalue=2e-22
Organism=Escherichia coli, GI145693214, Length=242, Percent_Identity=38.4297520661157, Blast_Score=93, Evalue=3e-20
Organism=Escherichia coli, GI1788471, Length=321, Percent_Identity=30.8411214953271, Blast_Score=92, Evalue=5e-20
Organism=Escherichia coli, GI1787794, Length=274, Percent_Identity=28.8321167883212, Blast_Score=92, Evalue=6e-20
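The homologue entries are comma-separated `key=value` fields, with each hit starting at an `Organism=` field and carrying a bare `GI…` identifier. A minimal parsing sketch, assuming exactly that layout; `parse_homologues` and the `GI` dictionary key are illustrative names, not part of any database API.

```python
import re

# Type conversions for the numeric fields seen in the record.
CONVERTERS = {
    "Length": int,
    "Percent_Identity": float,
    "Blast_Score": int,
    "Evalue": float,
}


def parse_homologues(text: str) -> list:
    """Parse a homologue listing into one dict per hit."""
    hits = []
    current = None
    for field in re.split(r"[,\n]", text):
        field = field.strip()
        if not field:
            continue  # tolerate trailing commas and blank lines
        if "=" in field:
            key, value = (part.strip() for part in field.split("=", 1))
            if key == "Organism":
                current = {"Organism": value}  # a new hit starts here
                hits.append(current)
            elif current is not None:
                current[key] = CONVERTERS.get(key, str)(value)
        elif current is not None and field.startswith("GI"):
            current["GI"] = field[2:]  # bare identifier such as GI1790191
    return hits


sample = (
    "Organism=Escherichia coli, GI145693152, Length=328, "
    "Percent_Identity=99.6951219512195, Blast_Score=639, Evalue=0.0, "
    "Organism=Escherichia coli, GI1790191, Length=310, "
    "Percent_Identity=35.8064516129032, Blast_Score=167, Evalue=6e-43,"
)
for hit in parse_homologues(sample):
    print(hit["GI"], hit["Length"], hit["Evalue"])
```

Once parsed, the hits can be filtered or sorted, for example by `Percent_Identity` or `Evalue`.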
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001851 [H]
Pfam domain/function: PF02653 BPD_transp_2 [H]
EC number: NA
Molecular weight: Translated: 34225; Mature: 34094
Theoretical pI: Translated: 9.26; Mature: 9.26
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
5.2 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
4.9 %Cys+Met (Mature Protein)
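The Cys/Met percentages are residue counts divided by the protein length. The sketch below shows the calculation; the counts of 7 Cys and 10 Met in the translated protein were tallied by hand from the sequence in this record and should be treated as an assumption, as should the helper name `residue_percent`.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = "".join(seq.split()).upper()  # drop the spaces used in the record
    if not seq:
        return 0.0
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


# Hand-tallied counts (assumption); the mature protein lacks the first Met.
CYS, MET = 7, 10
TRANSLATED_LEN, MATURE_LEN = 328, 327

translated = [round(100 * n / TRANSLATED_LEN, 1) for n in (CYS, MET, CYS + MET)]
mature = [round(100 * n / MATURE_LEN, 1) for n in (CYS, MET - 1, CYS + MET - 1)]
print(translated)
print(mature)
```

With those counts the computed values agree with the figures listed above. `residue_percent(protein, "CM")` gives the combined value directly when the full sequence is available as a string.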
Secondary structure:
>Translated Secondary Structure MSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISM CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCEEEEH SGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINLTESLWIGVAAGLLLGILCGLVN HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC GFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDESFFALGYANWFGLPAPIWLT CCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCEEEEEEHHHCCCCHHHHHH VACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIILA HHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH SRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISP HHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCH FAQYVVRGLILLAAVIFDRYKQKAKRTV HHHHHHHHHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure SSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISM CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCEEEEH SGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINLTESLWIGVAAGLLLGILCGLVN HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC GFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDESFFALGYANWFGLPAPIWLT CCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCEEEEEEHHHCCCCHHHHHH VACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIILA HHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH SRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISP HHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCH FAQYVVRGLILLAAVIFDRYKQKAKRTV HHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; arabinose [Periplasm]; H2O [C]
Specific reaction: ATP + arabinose [Periplasm] + H2O = ADP + phosphate + arabinose [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 2445996; 9097040; 9278503; 8045430 [H]