Definition: Yersinia pseudotuberculosis YPIII chromosome, complete genome.
Accession: NC_010465
Length: 4,689,441 bp

The map label for this gene is araH [H]

Identifier: 170024232

GI number: 170024232

Start: 2219463

End: 2220512

Strand: Reverse

Name: araH [H]

Synonym: YPK_1997

Alternate gene names: 170024232

Gene position: 2220512-2219463 (Counterclockwise)

Preceding gene: 170024233

Following gene: 170024231

Centisome position: 47.35
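
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's first coding base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative and are not part of the database.

```python
GENOME_LENGTH = 4_689_441  # Length field of NC_010465


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the first coding base as a percentage of the genome.

    Assumption: for a reverse-strand gene the first coding base is `end`.
    """
    first_base = end if strand == "Reverse" else start
    return round(100 * first_base / genome_length, 2)


length = gene_length(2219463, 2220512)
position = centisome(2219463, 2220512, "Reverse", GENOME_LENGTH)
print(length, position)  # 1050 47.35
```

Under these assumptions the 1050 bases correspond to 350 codons, that is, 349 residues plus the stop codon, which agrees with the translated length reported in this record.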

GC content: 49.05

Gene sequence:

>1050_bases
ATGTCTAGCGTTACTTTGAGTTCGGATAAAAAGAACCCGGTATCAACTGAATCAAAAGGTGGGATCCCACAACCTCAGCA
ACCACAGAATGCGCCGACTAAAAGTGGCTTAGGATTATCCCGTATCTGGGACAGCTACGGCATGCTGGTGGTCTTCGCCG
TGGTCTTTATTGGTTGCGTGATTTTTGTGCCTAATTTTGGCTCCTTTATCAATATGAAAGGGTTGGGGCTGGCTATCTCA
ATGTCTGGGATGGTAGCGTGCGGTATGCTTTTTTGTTTGGCATCCGGCGATTTTGACCTCTCTGTGGCCTCGGTCATTGC
CTGCGCGGGTGTTACCACCGCGGTGGTGATCAATATGACAGAGAGCCTATGGATAGGGGTGGGCGCAGGTTTGTTATTAG
GGGCGGCATGTGGGCTAATTAATGGTTTTGTGATTGCCCGGCTAAAAATCAATGCGCTGATCACCACATTGGCGACGATG
CAAATTGTTCGTGGCCTGGCGTATATCATTTCTGATGGTAAAGCCGTGGGTATCGAAGATGAGCGCTTCTTCGCCTTGGG
TTATACCAACTGGTTTGGTCTGCCAGCACCCATCTGGATCACCGTTGCCTGTTTGGTGCTGTTTGGTTTCTTACTGAATA
AAACCACCTTTGGCCGTAACACATTGGCGATTGGGGGGAATGAAGATGCTGCGCGTCTGGCCGGTGTGCCGGTCGTGCGG
ACCAAAATCATTATTTTTGTGCTGTCAGGTTTGGTGTCTGCCGCCGCAGGGATTATTTTGGCTTCGCGCATGACGAGCGG
CCAGCCAATGACCTCGATTGGTTATGAGCTCATTGTTATCTCTGCTTGCGTATTAGGCGGGGTATCACTAAAAGGCGGCA
TTGGTAAAATTTCTTACGTCATTGCGGGGATCTTGATTCTGGGAACAGTAGAAAATGCCATGAACCTATTAAATATCTCC
CCGTTCTCACAATATGTGGTTCGTGGTTTGATCCTGCTGGCGGCGGTTATCTTCGACCGTTACAAACAGTTAGCTAAACG
GACGATATAA
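
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content is defined as the percentage of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, and the toy input is not taken from this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring FASTA header lines and line breaks."""
    bases = "".join(
        line.strip() for line in sequence.splitlines() if not line.startswith(">")
    ).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(base) for base in "GC")
    return round(100 * gc / len(bases), 2)


# Toy FASTA-style input: 6 of the 8 bases are G or C.
example = ">toy_8_bases\nATGC\nGGCC"
print(gc_content(example))  # 75.0
```

Passing the `>1050_bases` block above, header included, should give a value comparable to the reported GC content if the database uses the same definition.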

Upstream 100 bases:

>100_bases
ATGCCACTGAAGAGCAAGCCTTAAGTCTGGCAATGTTACGCACCCCGAATATTGCCACCAATACCGCGTCTGCGGTTGCC
TGACTGTGAAGGAGTTAATC

Downstream 100 bases:

>100_bases
GGTTACGGTGACTGAAACGTCCATAAAAATCTCTCTAGTTCACAGTTTTAGCATCACGCTCGGGCAGCATAACACCTGCC
CGATAAACTGAAAGAAGTTA

Product: L-arabinose transporter permease

Products: ADP; phosphate; arabinose [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 349; Mature: 348

Protein sequence:

>349_residues
MSSVTLSSDKKNPVSTESKGGIPQPQQPQNAPTKSGLGLSRIWDSYGMLVVFAVVFIGCVIFVPNFGSFINMKGLGLAIS
MSGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINMTESLWIGVGAGLLLGAACGLINGFVIARLKINALITTLATM
QIVRGLAYIISDGKAVGIEDERFFALGYTNWFGLPAPIWITVACLVLFGFLLNKTTFGRNTLAIGGNEDAARLAGVPVVR
TKIIIFVLSGLVSAAAGIILASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVIAGILILGTVENAMNLLNIS
PFSQYVVRGLILLAAVIFDRYKQLAKRTI
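
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the standard genetic code's codon assignments and that the gene sequence is already given in coding orientation; `translate` is an illustrative helper, not the tool used to build this record. The two inputs are the first 30 and last 24 bases of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTAGCGTTACTTTGAGTTCGGATAAA"))  # MSSVTLSSDK
print(translate("CAGTTAGCTAAACGGACGATATAA"))        # QLAKRTI
```

Both fragments match the N-terminal and C-terminal ends of the 349-residue sequence shown above.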

Sequences:

>Translated_349_residues
MSSVTLSSDKKNPVSTESKGGIPQPQQPQNAPTKSGLGLSRIWDSYGMLVVFAVVFIGCVIFVPNFGSFINMKGLGLAIS
MSGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINMTESLWIGVGAGLLLGAACGLINGFVIARLKINALITTLATM
QIVRGLAYIISDGKAVGIEDERFFALGYTNWFGLPAPIWITVACLVLFGFLLNKTTFGRNTLAIGGNEDAARLAGVPVVR
TKIIIFVLSGLVSAAAGIILASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVIAGILILGTVENAMNLLNIS
PFSQYVVRGLILLAAVIFDRYKQLAKRTI
>Mature_348_residues
SSVTLSSDKKNPVSTESKGGIPQPQQPQNAPTKSGLGLSRIWDSYGMLVVFAVVFIGCVIFVPNFGSFINMKGLGLAISM
SGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINMTESLWIGVGAGLLLGAACGLINGFVIARLKINALITTLATMQ
IVRGLAYIISDGKAVGIEDERFFALGYTNWFGLPAPIWITVACLVLFGFLLNKTTFGRNTLAIGGNEDAARLAGVPVVRT
KIIIFVLSGLVSAAAGIILASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVIAGILILGTVENAMNLLNISP
FSQYVVRGLILLAAVIFDRYKQLAKRTI

Specific function: Part of the binding-protein-dependent transport system for L-arabinose. Probably responsible for the translocation of the substrate across the membrane [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the binding-protein-dependent transport system permease family. AraH/rbsC subfamily [H]

Homologues:

Organism=Escherichia coli, GI145693152, Length=319, Percent_Identity=90.5956112852665, Blast_Score=552, Evalue=1e-158,
Organism=Escherichia coli, GI1790191, Length=289, Percent_Identity=36.6782006920415, Blast_Score=162, Evalue=2e-41,
Organism=Escherichia coli, GI1788896, Length=334, Percent_Identity=31.7365269461078, Blast_Score=137, Evalue=7e-34,
Organism=Escherichia coli, GI1790524, Length=329, Percent_Identity=29.1793313069909, Blast_Score=135, Evalue=5e-33,
Organism=Escherichia coli, GI1789992, Length=130, Percent_Identity=45.3846153846154, Blast_Score=119, Evalue=3e-28,
Organism=Escherichia coli, GI87082395, Length=272, Percent_Identity=33.0882352941176, Blast_Score=108, Evalue=4e-25,
Organism=Escherichia coli, GI1787793, Length=256, Percent_Identity=33.984375, Blast_Score=100, Evalue=3e-22,
Organism=Escherichia coli, GI1787794, Length=273, Percent_Identity=30.03663003663, Blast_Score=89, Evalue=5e-19,
Organism=Escherichia coli, GI1788471, Length=322, Percent_Identity=31.9875776397516, Blast_Score=89, Evalue=5e-19,
Organism=Escherichia coli, GI145693214, Length=260, Percent_Identity=35.7692307692308, Blast_Score=88, Evalue=1e-18,
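
Each homologue line is a comma-separated list of `key=value` fields, except for the GI number, which has no `=`. A minimal parsing sketch, assuming that layout holds for every line; `parse_homologue` is a hypothetical helper:

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Homologues' line into a dict with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI field carries no '=' in this record format.
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI145693152, Length=319, "
        "Percent_Identity=90.5956112852665, Blast_Score=552, Evalue=1e-158,")
hit = parse_homologue(line)
print(hit["GI"], hit["Length"], hit["Blast_Score"])  # 145693152 319 552
```

The long decimals in Percent_Identity look like unrounded ratios: for the top hit, 90.5956112852665 percent of 319 positions is 289 positions, assuming Length is the length over which identity was computed.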

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR001851 [H]

Pfam domain/function: PF02653 BPD_transp_2 [H]

EC number: NA

Molecular weight: Translated: 36485; Mature: 36354

Theoretical pI: Translated: 9.45; Mature: 9.45

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0% Cys (Translated Protein)
3.2% Met (Translated Protein)
5.2% Cys+Met (Translated Protein)
2.0% Cys (Mature Protein)
2.9% Met (Mature Protein)
4.9% Cys+Met (Mature Protein)
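
The percentages above can be recomputed by counting residues. The sketch below assumes that the mature form is the translated sequence without its initiator methionine, as the 349 and 348 residue counts suggest, and that values are rounded to one decimal. The helper names and the toy sequence are hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


def cys_met_report(translated: str) -> dict:
    """Cys, Met and Cys+Met percentages for the translated and mature forms."""
    # Assumption: maturation only removes the initiator methionine.
    mature = translated[1:] if translated.startswith("M") else translated
    forms = {"Translated": translated, "Mature": mature}
    return {
        name: {
            "Cys": residue_percent(seq, "C"),
            "Met": residue_percent(seq, "M"),
            "Cys+Met": residue_percent(seq, "CM"),
        }
        for name, seq in forms.items()
    }


toy = "MACDMKLC" + "A" * 12  # hypothetical 20-residue sequence
report = cys_met_report(toy)
print(report["Translated"])  # {'Cys': 10.0, 'Met': 10.0, 'Cys+Met': 20.0}
print(report["Mature"])      # {'Cys': 10.5, 'Met': 5.3, 'Cys+Met': 15.8}
```

Only the Met and Cys+Met values can change between the two forms under this assumption, which is consistent with the Cys value being the same for both in this record.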

Secondary structure:

>Translated Secondary Structure
MSSVTLSSDKKNPVSTESKGGIPQPQQPQNAPTKSGLGLSRIWDSYGMLVVFAVVFIGCV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
IFVPNFGSFINMKGLGLAISMSGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINMT
HHHCCCCCEEECCCCCEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHEEECH
ESLWIGVGAGLLLGAACGLINGFVIARLKINALITTLATMQIVRGLAYIISDGKAVGIED
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCC
ERFFALGYTNWFGLPAPIWITVACLVLFGFLLNKTTFGRNTLAIGGNEDAARLAGVPVVR
CEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHCCCCHHH
TKIIIFVLSGLVSAAAGIILASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYV
HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH
IAGILILGTVENAMNLLNISPFSQYVVRGLILLAAVIFDRYKQLAKRTI
HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
SSVTLSSDKKNPVSTESKGGIPQPQQPQNAPTKSGLGLSRIWDSYGMLVVFAVVFIGCV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
IFVPNFGSFINMKGLGLAISMSGMVACGMLFCLASGDFDLSVASVIACAGVTTAVVINMT
HHHCCCCCEEECCCCCEEEEHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHEEECH
ESLWIGVGAGLLLGAACGLINGFVIARLKINALITTLATMQIVRGLAYIISDGKAVGIED
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCC
ERFFALGYTNWFGLPAPIWITVACLVLFGFLLNKTTFGRNTLAIGGNEDAARLAGVPVVR
CEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHCCCCHHH
TKIIIFVLSGLVSAAAGIILASRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYV
HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH
IAGILILGTVENAMNLLNISPFSQYVVRGLILLAAVIFDRYKQLAKRTI
HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
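
The prediction strings above can be summarised per state. The sketch below assumes the conventional three-state alphabet (H for helix, E for strand, C for coil), which this record does not define explicitly. The helper names, the `min_length` parameter, and the toy string are illustrative.

```python
from collections import Counter
from itertools import groupby


def structure_composition(states: str) -> dict:
    """Percentage of H, E and C states in a prediction string."""
    counts = Counter(states)
    return {s: round(100 * counts[s] / len(states), 1) for s in "HEC"}


def helix_segments(states: str, min_length: int = 1) -> list:
    """Return 1-based inclusive (start, end) spans of consecutive 'H' states."""
    segments, position = [], 1
    for state, run in groupby(states):
        length = len(list(run))
        if state == "H" and length >= min_length:
            segments.append((position, position + length - 1))
        position += length
    return segments


toy = "CCHHHHEECC"  # hypothetical 10-state prediction
print(structure_composition(toy))         # {'H': 40.0, 'E': 20.0, 'C': 40.0}
print(helix_segments(toy, min_length=3))  # [(3, 6)]
```

To apply this to the record, the state lines of one block would first be joined into a single string, skipping the interleaved residue lines.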

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: ATP; arabinose [Periplasm]; H2O [C]

Specific reaction: ATP + arabinose [Periplasm] + H2O = ADP + phosphate + arabinose [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 2445996; 9097040; 9278503; 8045430 [H]