Definition: Rhizobium leguminosarum bv. trifolii WSM2304 chromosome, complete genome.
Accession: NC_011369
Length: 4,537,948 bp


The map label for this gene is yhdX [H]

Identifier: 209549222

GI number: 209549222

Start: 1649637

End: 1651976

Strand: Reverse

Name: yhdX [H]

Synonym: Rleg2_1623

Alternate gene names: 209549222

Gene position: 1651976-1649637 (Counterclockwise)
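
The Start and End fields can be cross-checked against the sequence header (2340 bases) and the translated length (779 residues). A minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are illustrative and not part of the record:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene spanning start..end (1-based, inclusive)."""
    lo, hi = sorted((start, end))
    return hi - lo + 1


def coding_residues(n_bases: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_bases // 3 - 1


length = gene_length(1649637, 1651976)
print(length, coding_residues(length))  # 2340 779
```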

Preceding gene: 209549223

Following gene: 209549221

Centisome position: 36.4
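
The centisome value is the gene's position expressed as a percentage of the chromosome length. The sketch below assumes a simple position / genome length ratio; the record does not say which coordinate of a reverse-strand gene the database uses, so both ends are shown (they agree at one decimal place):

```python
GENOME_LENGTH = 4_537_948  # chromosome length from the record header


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


# Either end of the gene gives the listed value after rounding.
print(round(centisome(1649637), 1))  # 36.4
print(round(centisome(1651976), 1))  # 36.4
```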

GC content: 61.2

Gene sequence:

>2340_bases
ATGACAATCCACGAAACGAGTGCGATCCACGGCATGGTAGAGAGCGATAGCCTGAAGTCGCGGCGCCTGCGCAATCGCAT
CCGGCACGAGGCGATCCAGTTGGGCATCGTCGGTCTGGTCGTGGTGCTGCTCGGCATCCTGGCCTACGCGGTCAAGGCGG
GCCTTTCCGAGAAAGGCATACGGTTTAGCTTCTCGTTCCTGGCCAATACCGCCGGCTTCGACATCAGCGAAGGATGGACG
CTGACATCCGGTGAAGGCGTCGTCCCCGGCCTCTCCCAGTTCTCTTCGGACATGTCGGTCGCGTCGGCTTTCGTCACAGG
TATCTTCAACACCGCGAAGGTCGCGCTGCTCGCAATCGCCCTCAGCACGATCCTCGGAACGCTGCTCGGCGTCGGCCGTC
TCTCGACCAACTGGGTGGTGAGAAACCTCTGTTTCTGGATTGTGGAATTCGTACGCAACACGCCGCTGCTCATACAGCTC
GTCTTCTGGTACTTCGCGGTCGTACTGCGCTTTCCTCCAATGGCGTCCGCAGCGAAGCTTTATGGCGGTCTTATCGTCAG
CCAACAAGGCATATACGTTCCCACCTTGGCCTGGAGCGGCGGCGTATCCACCCTTTCGCTCATCCTCGTCACCGCCACAT
GGGTCTCGGTTCTAGGAGCAGTGTTCGACCCCATCCGTGGAATGCGGCGGATATGCATCGCTGTGGCGGTGGTCTGCTTC
GGGCTATTGATCGCAACCGGCGGAATGACCGTCGATTTTCCCGTCGCCAACAAGTTCAAGGCCAGCGGCGGAACCAGCAT
AAGCCCTGAGATGGCGGCTCTCCTGCTTGCCGTCGTCGTGAACAGCGCTTCCTTCATCGCAGAGATCGTCCGCGGCGCGA
TCGACGCTCTTCCCAAGTCCCAGTGGGAAGCAGCGGGGTCGCTGGGCTTCAGCCGGCGCGACACCGTCAACGACATCGTC
CTTCCTCAGGTATTCCGCGTCGTTCTCCCCTCTTTCGGGAACCAGTATATCAGCCTGACCAAGAATACCGCGCTCGGGAT
CGCGGTCGGATACCCGGACCTGTTCAACATCTATGGAACGATCGCCAATCAGACCGGCCACAGCCTCGAAGGCATCATCA
TCGTCATGGCGTCCTACCTGATCCTCAGCTGGATCATCAGTTCGGCCGTGTACTGGGCGAACCGCCGTCTCAACCCCAAT
GGGAGTGTCGCGATGATCACCACCTCGATGAAGTGGACCCGTCACAACCTCTTTGGAAAGCCTTTCGACGCCGTCCTGTC
GCTGCTGGTCATCCCAAGCGTTCTCTGGCTCGCCTATTCCATCTTTGGCTGGGCGCTCACCACGGCAAGGTGGGACATCA
TCGCAGCGAGCTTGCGGGTATTGATGATCGGCGTGTTCCCAGCCGATCAGGCCTGGCGCGCATGGACGGCTTCGGCGATC
ATAGGAGCGATACTTGGCGCCGGCCTCGGCTGCGTGTTTTCGTTCAAGCCGCACCATGGTCTCGTACTCGCTCTCCCAGC
GTTGCTGCTTTCGATCGTCGATCATGGCGATCTGGGCAATGCACTGCCTGCCTTGGGCGTCGTCGCTGCGACCATGCTGG
GATGGGTGCTGACGTCCTACGTGCCAGTGTCCAGGAGCGTGATGCCTGCCGCGGCATTCACGGGCTTCATCGCCGTTGTG
GCGGTCATGGCCCCTCCGGGCGTGGGGCTCTGGGGCGGACTTCTTCTCAGCGTCCTGCTGACGCTCGTGACTTCGGTCCT
GTCCCTGCCGATCGGCATCCTGCTCGCCTTCGGACGGCAGAGCCGGCTCTCAAGCCTGCGGATCATTTGCACCTGGTACA
TCGAGGTCATGCGGTCGGTCCCGCTCATCCTCGTCGTCTACTGGATCTGGATCCTCATGCCGGTCCTTGCGCCGCAGTGG
GGACTGGCCGACGTCGCGCGCGGCATGATCGGGTTCACGATCTTCTATGCCGCCTACGTCGCCGAATACGTTCGGAGCGG
CCTTCAGGCGGTTCCCCGCGCTCAGACCGAGGCGGCCAGATCTCTCGGCATGAGCGAATTCGACATCAACCGCTCGACCG
TTCTGCCGCAGGCGCTCCGGGTCGTGGTCGCTCCGCTGGTCGGCAATGTGCTCGACATCTTCAACTCCGCACCACTCGTC
TTCATCATCGGGCTGACGGATTTTCTCCGCGCCGGCCAGATGATCCTGGCGAACCCACAATATGGCGACCGGACCTTCGA
AGTCCTCTCGTTCCTGCTCATCACCTATTTCCTGGTCGGCTCGCTGATCACCTATGCCGCCAGAAAGCTCGAAGACCATA
TGGCAACGAGCACCCGGTGA
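
The GC content field can be recomputed from the gene sequence. A minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases in the sequence (the record does not state the exact definition used):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input; paste the 2340-base sequence above to check the listed value.
print(gc_content("ATGC"))  # 50.0
```

Applied to the full gene sequence, the result should be close to the listed 61.2 if the database uses the same definition.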

Upstream 100 bases:

>100_bases
TGGACGCAGGGTGGCGCCATCTACTCGCCGCTCTGGAACTGATGCTGGCCGGGGTCGAGACCGTGCAAAGGTTTCGACCC
CATGCACTTGGAGCCTGAAG

Downstream 100 bases:

>100_bases
GCCGATCCAACACGGAGACATCACGATGAACGCCATCGTCGAAATGAAAAGCATGAACAAGTTCTACGGCGCTCACCATG
TGCTGAAGGACATCGATCTC
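
Because the gene lies on the reverse strand, its upstream flank sits at higher chromosome coordinates than the gene and is read as the reverse complement of the forward strand. The sketch below assumes 1-based inclusive coordinates and does not handle wrap-around at the origin of a circular chromosome; the helper names are illustrative:

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def flank_coordinates(start: int, end: int, strand: str, size: int = 100) -> dict:
    """Forward-strand coordinates of the upstream and downstream flanks."""
    left = (start - size, start - 1)   # lower coordinates than the gene
    right = (end + 1, end + size)      # higher coordinates than the gene
    if strand.lower() == "reverse":
        return {"upstream": right, "downstream": left}
    return {"upstream": left, "downstream": right}


print(flank_coordinates(1649637, 1651976, "Reverse"))
# {'upstream': (1651977, 1652076), 'downstream': (1649537, 1649636)}
print(reverse_complement("ATGACA"))  # TGTCAT
```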

Product: polar amino acid ABC transporter inner membrane subunit

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 779; Mature: 778

Protein sequence:

>779_residues
MTIHETSAIHGMVESDSLKSRRLRNRIRHEAIQLGIVGLVVVLLGILAYAVKAGLSEKGIRFSFSFLANTAGFDISEGWT
LTSGEGVVPGLSQFSSDMSVASAFVTGIFNTAKVALLAIALSTILGTLLGVGRLSTNWVVRNLCFWIVEFVRNTPLLIQL
VFWYFAVVLRFPPMASAAKLYGGLIVSQQGIYVPTLAWSGGVSTLSLILVTATWVSVLGAVFDPIRGMRRICIAVAVVCF
GLLIATGGMTVDFPVANKFKASGGTSISPEMAALLLAVVVNSASFIAEIVRGAIDALPKSQWEAAGSLGFSRRDTVNDIV
LPQVFRVVLPSFGNQYISLTKNTALGIAVGYPDLFNIYGTIANQTGHSLEGIIIVMASYLILSWIISSAVYWANRRLNPN
GSVAMITTSMKWTRHNLFGKPFDAVLSLLVIPSVLWLAYSIFGWALTTARWDIIAASLRVLMIGVFPADQAWRAWTASAI
IGAILGAGLGCVFSFKPHHGLVLALPALLLSIVDHGDLGNALPALGVVAATMLGWVLTSYVPVSRSVMPAAAFTGFIAVV
AVMAPPGVGLWGGLLLSVLLTLVTSVLSLPIGILLAFGRQSRLSSLRIICTWYIEVMRSVPLILVVYWIWILMPVLAPQW
GLADVARGMIGFTIFYAAYVAEYVRSGLQAVPRAQTEAARSLGMSEFDINRSTVLPQALRVVVAPLVGNVLDIFNSAPLV
FIIGLTDFLRAGQMILANPQYGDRTFEVLSFLLITYFLVGSLITYAARKLEDHMATSTR

Sequences:

>Translated_779_residues
MTIHETSAIHGMVESDSLKSRRLRNRIRHEAIQLGIVGLVVVLLGILAYAVKAGLSEKGIRFSFSFLANTAGFDISEGWT
LTSGEGVVPGLSQFSSDMSVASAFVTGIFNTAKVALLAIALSTILGTLLGVGRLSTNWVVRNLCFWIVEFVRNTPLLIQL
VFWYFAVVLRFPPMASAAKLYGGLIVSQQGIYVPTLAWSGGVSTLSLILVTATWVSVLGAVFDPIRGMRRICIAVAVVCF
GLLIATGGMTVDFPVANKFKASGGTSISPEMAALLLAVVVNSASFIAEIVRGAIDALPKSQWEAAGSLGFSRRDTVNDIV
LPQVFRVVLPSFGNQYISLTKNTALGIAVGYPDLFNIYGTIANQTGHSLEGIIIVMASYLILSWIISSAVYWANRRLNPN
GSVAMITTSMKWTRHNLFGKPFDAVLSLLVIPSVLWLAYSIFGWALTTARWDIIAASLRVLMIGVFPADQAWRAWTASAI
IGAILGAGLGCVFSFKPHHGLVLALPALLLSIVDHGDLGNALPALGVVAATMLGWVLTSYVPVSRSVMPAAAFTGFIAVV
AVMAPPGVGLWGGLLLSVLLTLVTSVLSLPIGILLAFGRQSRLSSLRIICTWYIEVMRSVPLILVVYWIWILMPVLAPQW
GLADVARGMIGFTIFYAAYVAEYVRSGLQAVPRAQTEAARSLGMSEFDINRSTVLPQALRVVVAPLVGNVLDIFNSAPLV
FIIGLTDFLRAGQMILANPQYGDRTFEVLSFLLITYFLVGSLITYAARKLEDHMATSTR
>Mature_778_residues
TIHETSAIHGMVESDSLKSRRLRNRIRHEAIQLGIVGLVVVLLGILAYAVKAGLSEKGIRFSFSFLANTAGFDISEGWTL
TSGEGVVPGLSQFSSDMSVASAFVTGIFNTAKVALLAIALSTILGTLLGVGRLSTNWVVRNLCFWIVEFVRNTPLLIQLV
FWYFAVVLRFPPMASAAKLYGGLIVSQQGIYVPTLAWSGGVSTLSLILVTATWVSVLGAVFDPIRGMRRICIAVAVVCFG
LLIATGGMTVDFPVANKFKASGGTSISPEMAALLLAVVVNSASFIAEIVRGAIDALPKSQWEAAGSLGFSRRDTVNDIVL
PQVFRVVLPSFGNQYISLTKNTALGIAVGYPDLFNIYGTIANQTGHSLEGIIIVMASYLILSWIISSAVYWANRRLNPNG
SVAMITTSMKWTRHNLFGKPFDAVLSLLVIPSVLWLAYSIFGWALTTARWDIIAASLRVLMIGVFPADQAWRAWTASAII
GAILGAGLGCVFSFKPHHGLVLALPALLLSIVDHGDLGNALPALGVVAATMLGWVLTSYVPVSRSVMPAAAFTGFIAVVA
VMAPPGVGLWGGLLLSVLLTLVTSVLSLPIGILLAFGRQSRLSSLRIICTWYIEVMRSVPLILVVYWIWILMPVLAPQWG
LADVARGMIGFTIFYAAYVAEYVRSGLQAVPRAQTEAARSLGMSEFDINRSTVLPQALRVVVAPLVGNVLDIFNSAPLVF
IIGLTDFLRAGQMILANPQYGDRTFEVLSFLLITYFLVGSLITYAARKLEDHMATSTR
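
In this record the mature sequence differs from the translated sequence only by the absence of the N-terminal methionine (779 versus 778 residues). A minimal sketch of that relationship, assuming the mature form is obtained purely by removing the initiator Met:

```python
def mature_sequence(translated: str) -> str:
    """Drop the initiator methionine, if present, from a translated sequence."""
    translated = "".join(translated.split()).upper()
    if translated.startswith("M"):
        return translated[1:]
    return translated


# First residues of the translated sequence above.
print(mature_sequence("MTIHETSAIHG"))  # TIHETSAIHG
```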

Specific function: Probably part of the binding-protein-dependent transport system yhdWXYZ for an amino acid; probably responsible for the translocation of the substrate across the membrane [H]

COG id: COG4597

COG function: function code E; ABC-type amino acid transport system, permease component

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 ABC transmembrane type-1 domain [H]

Homologues:

Organism=Escherichia coli, GI48994930, Length=355, Percent_Identity=35.2112676056338, Blast_Score=200, Evalue=2e-52
Organism=Escherichia coli, GI87082239, Length=398, Percent_Identity=28.643216080402, Blast_Score=131, Evalue=2e-31
Organism=Escherichia coli, GI1786873, Length=214, Percent_Identity=28.9719626168224, Blast_Score=94, Evalue=2e-20
Organism=Escherichia coli, GI1788226, Length=194, Percent_Identity=28.3505154639175, Blast_Score=84, Evalue=3e-17
Organism=Escherichia coli, GI1786874, Length=218, Percent_Identity=25.6880733944954, Blast_Score=80, Evalue=4e-16
Organism=Escherichia coli, GI1787087, Length=231, Percent_Identity=28.1385281385281, Blast_Score=72, Evalue=2e-13
Organism=Escherichia coli, GI1788645, Length=122, Percent_Identity=32.7868852459016, Blast_Score=72, Evalue=2e-13
Organism=Escherichia coli, GI1787030, Length=132, Percent_Identity=34.8484848484849, Blast_Score=71, Evalue=2e-13
Organism=Escherichia coli, GI1788646, Length=173, Percent_Identity=28.9017341040462, Blast_Score=66, Evalue=1e-11
Organism=Escherichia coli, GI1787086, Length=128, Percent_Identity=31.25, Blast_Score=64, Evalue=3e-11
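
The homologue entries follow a fixed comma-separated key=value layout, with the GI number given as a bare `GI...` token. A minimal parsing sketch under that assumption; the output field names are illustrative, and a trailing comma on the line is tolerated:

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    raw = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        key, sep, value = field.partition("=")
        if sep:
            raw[key] = value
        elif field.startswith("GI"):
            raw["GI"] = field[2:]  # bare token such as GI48994930
    return {
        "organism": raw["Organism"],
        "gi": raw["GI"],
        "length": int(raw["Length"]),
        "percent_identity": float(raw["Percent_Identity"]),
        "blast_score": int(raw["Blast_Score"]),
        "evalue": float(raw["Evalue"]),
    }


hit = parse_homologue(
    "Organism=Escherichia coli, GI48994930, Length=355, "
    "Percent_Identity=35.2112676056338, Blast_Score=200, Evalue=2e-52"
)
print(hit["gi"], hit["length"], round(hit["percent_identity"], 1))
# 48994930 355 35.2
```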

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010065
- InterPro:   IPR000515 [H]

Pfam domain/function: PF00528 BPD_transp_1 [H]

EC number: NA

Molecular weight: Translated: 83766; Mature: 83635

Theoretical pI: Translated: 9.85; Mature: 9.85

Prosite motif: PS50928 ABC_TM1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
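
These percentages are residue counts divided by sequence length. A minimal sketch, assuming one-decimal rounding; the function name is illustrative:

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Toy input: 1 Cys and 1 Met in 10 residues.
print(residue_percent("MCAAAAAAAA", "C"))   # 10.0
print(residue_percent("MCAAAAAAAA", "CM"))  # 20.0
```

The listed values are arithmetically consistent with 5 Cys and 20 Met in the 779-residue translated protein (5/779 ≈ 0.6 %, 20/779 ≈ 2.6 %), dropping to 19 Met in the 778-residue mature form (19/778 ≈ 2.4 %).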

Secondary structure:

>Translated Secondary Structure
MTIHETSAIHGMVESDSLKSRRLRNRIRHEAIQLGIVGLVVVLLGILAYAVKAGLSEKGI
CCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
RFSFSFLANTAGFDISEGWTLTSGEGVVPGLSQFSSDMSVASAFVTGIFNTAKVALLAIA
EEEHHHHHCCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LSTILGTLLGVGRLSTNWVVRNLCFWIVEFVRNTPLLIQLVFWYFAVVLRFPPMASAAKL
HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHH
YGGLIVSQQGIYVPTLAWSGGVSTLSLILVTATWVSVLGAVFDPIRGMRRICIAVAVVCF
HCCCEEECCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GLLIATGGMTVDFPVANKFKASGGTSISPEMAALLLAVVVNSASFIAEIVRGAIDALPKS
HHHHHCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCH
QWEAAGSLGFSRRDTVNDIVLPQVFRVVLPSFGNQYISLTKNTALGIAVGYPDLFNIYGT
HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCEEEEEECCCHHHHHHHH
IANQTGHSLEGIIIVMASYLILSWIISSAVYWANRRLNPNGSVAMITTSMKWTRHNLFGK
HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCEEEEEECCHHHHHHCCCC
PFDAVLSLLVIPSVLWLAYSIFGWALTTARWDIIAASLRVLMIGVFPADQAWRAWTASAI
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH
IGAILGAGLGCVFSFKPHHGLVLALPALLLSIVDHGDLGNALPALGVVAATMLGWVLTSY
HHHHHHCCHHHEEEECCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHC
VPVSRSVMPAAAFTGFIAVVAVMAPPGVGLWGGLLLSVLLTLVTSVLSLPIGILLAFGRQ
CCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
SRLSSLRIICTWYIEVMRSVPLILVVYWIWILMPVLAPQWGLADVARGMIGFTIFYAAYV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH
AEYVRSGLQAVPRAQTEAARSLGMSEFDINRSTVLPQALRVVVAPLVGNVLDIFNSAPLV
HHHHHHHHHHCCCHHHHHHHHCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE
FIIGLTDFLRAGQMILANPQYGDRTFEVLSFLLITYFLVGSLITYAARKLEDHMATSTR
EEEEHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure 
TIHETSAIHGMVESDSLKSRRLRNRIRHEAIQLGIVGLVVVLLGILAYAVKAGLSEKGI
CCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
RFSFSFLANTAGFDISEGWTLTSGEGVVPGLSQFSSDMSVASAFVTGIFNTAKVALLAIA
EEEHHHHHCCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LSTILGTLLGVGRLSTNWVVRNLCFWIVEFVRNTPLLIQLVFWYFAVVLRFPPMASAAKL
HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHH
YGGLIVSQQGIYVPTLAWSGGVSTLSLILVTATWVSVLGAVFDPIRGMRRICIAVAVVCF
HCCCEEECCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GLLIATGGMTVDFPVANKFKASGGTSISPEMAALLLAVVVNSASFIAEIVRGAIDALPKS
HHHHHCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCH
QWEAAGSLGFSRRDTVNDIVLPQVFRVVLPSFGNQYISLTKNTALGIAVGYPDLFNIYGT
HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCEEEEEECCCHHHHHHHH
IANQTGHSLEGIIIVMASYLILSWIISSAVYWANRRLNPNGSVAMITTSMKWTRHNLFGK
HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCEEEEEECCHHHHHHCCCC
PFDAVLSLLVIPSVLWLAYSIFGWALTTARWDIIAASLRVLMIGVFPADQAWRAWTASAI
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH
IGAILGAGLGCVFSFKPHHGLVLALPALLLSIVDHGDLGNALPALGVVAATMLGWVLTSY
HHHHHHCCHHHEEEECCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHC
VPVSRSVMPAAAFTGFIAVVAVMAPPGVGLWGGLLLSVLLTLVTSVLSLPIGILLAFGRQ
CCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
SRLSSLRIICTWYIEVMRSVPLILVVYWIWILMPVLAPQWGLADVARGMIGFTIFYAAYV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH
AEYVRSGLQAVPRAQTEAARSLGMSEFDINRSTVLPQALRVVVAPLVGNVLDIFNSAPLV
HHHHHHHHHHCCCHHHHHHHHCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE
FIIGLTDFLRAGQMILANPQYGDRTFEVLSFLLITYFLVGSLITYAARKLEDHMATSTR
EEEEHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
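
The secondary-structure blocks alternate a sequence line with a line of per-residue states. Assuming the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out, the state lines can be summarised as follows:

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy input; concatenate the state lines above for the real composition.
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```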

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]