Definition: Escherichia coli IAI39 chromosome, complete genome.
Accession: NC_011750
Length: 5,132,068 bp

The map label for this gene is yheS

Identifier: 218702100

GI number: 218702100

Start: 3980560

End: 3982473

Strand: Direct

Name: yheS

Synonym: ECIAI39_3835

Alternate gene names: 218702100

Gene position: 3980560-3982473 (Clockwise)

Preceding gene: 218702096

Following gene: 218702101

Centisome position: 77.56
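
The coordinate fields above can be cross-checked with a short calculation. The sketch below assumes inclusive, 1-based coordinates and assumes that the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state that definition, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_132_068            # Length field of NC_011750
START, END = 3_980_560, 3_982_473    # Start and End fields for yheS


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


length_bp = gene_length(START, END)
codons = length_bp // 3              # includes the terminal stop codon

print(length_bp)                                   # 1914
print(codons - 1)                                  # 637 residues once the stop codon is excluded
print(round(centisome(START, GENOME_LENGTH), 2))   # 77.56
```

Under these assumptions the results agree with the 1914-base gene sequence, the 637-residue protein, and the centisome value listed in the record.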

GC content: 55.69

Gene sequence:

>1914_bases
ATGATTGTTTTCTCCTCGTTACAAATTCGTCGCGGCGTGCGCGTCCTGCTGGATAATGCCACCGCCACCATCAACCCCGG
GCAGAAAGTCGGCCTGGTGGGTAAAAACGGCTGTGGTAAATCTACCCTGCTGGCATTGCTGAAAAATGAAATCAGCGCCG
ACGGCGGCAGCTACACCTTTCCGGGAAGCTGGCAACTGGCGTGGGTGAATCAGGAAACGCCGGCGTTACCGCAAGCGGCG
CTGGAATATGTCATTGACGGCGACCGTGAATATCGTCAACTGGAAGCGCAGCTACACGACGCCAACGAACGTAACGACGG
GCACGCCATCGCCACTATTCATGGCAAGCTGGATGCTATTGACGCATGGAGTATTCGCTCCCGTGCCGCCAGCCTGCTGC
ACGGCCTCGGTTTCAGCAATGAACAACTGGAGCGCCCGGTAAGTGATTTCTCCGGTGGCTGGCGTATGCGTCTTAACCTT
GCCCAGGCGCTGATTTGCCGTTCAGACTTGCTGCTGCTCGACGAACCGACTAACCACCTCGATCTCGATGCCGTTATCTG
GCTGGAAAAATGGCTGAAGAGCTATCAGGGCACGCTGATCCTGATCTCTCACGACCGCGACTTCCTCGATCCGATCGTTG
AAAAAATTATTCATATCGAACAACAAAGCATGTTCGAGTACACCGGCAACTACAGTTCGTTTGAAGTACAGCGCGCCACC
CGTCTGGCGCAGCAACAAGCGATGTACGAAAGCCAGCAGGAACGCGTAGCGCATCTGCAAAGTTATATCGACCGTTTCCG
TGCCAAAGCCACCAAAGCGAAGCAGGCCCAGAGCCGTATTAAGATGCTGGAGCGTATGGAGCTGATTGCCCCGGCGCACG
TCGACAACCCGTTCCGCTTTAGCTTCCGCGCGCCGGAAAGCCTGCCTAATCCGTTATTAAAGATGGAAAAAGTCAGCGCA
GGCTATGGTGATCGCATTATTCTCGACTCGATTAAACTGAATCTGGTCCCCGGCTCGCGCATTGGTCTGCTAGGCCGCAA
CGGCGCGGGTAAATCGACATTAATCAAACTGTTAGCCGGTGAACTTGCGCCAGTCAGCGGTGAAATTGGTCTGGCGAAAG
GGATCAAGCTCGGCTACTTCGCCCAGCATCAACTGGAATACCTGCGCGCCGACGAATCGCCGATTCAACATCTGGCACGT
TTAGCGCCGCAGGAGCTGGAGCAAAAACTGCGTGACTACCTCGGCGGCTTTGGTTTCCAGGGCGATAAAGTAACCGAAGA
AACGCGCCGCTTCTCCGGTGGGGAAAAAGCCCGCCTGGTGCTGGCATTAATTGTCTGGCAGCGTCCGAATCTGCTGCTGC
TCGACGAACCGACCAACCACCTTGACCTCGACATGCGTCAGGCACTCACCGAAGCATTAATCGAGTTCGAAGGCGCGCTG
GTTGTCGTCTCGCACGACCGTCATTTGCTGCGTTCCACCACTGACGATCTCTACCTGGTTCACGATCGTAAAGTCGAACC
GTTCGACGGCGATCTGGAAGATTATCAACAGTGGTTGAGCGACGTACAAAAGCAGGAAAACCAGACCGACGAAGCGCCAA
AAGAGAACGCGAACAGCGCCCAGGCACGTAAAGATCAGAAGCGTCGGGAAGCGGAGCTGCGTGCGCAAACCCAGCCACTG
CGTAAAGAGATTGCCCGTCTGGAAAAAGAGATGGAGAAGCTGAACGCGCAACTGGCGCAGGCGGAAGAGAAACTCGGCGA
CAGCGAACTGTACGATCAAAGCCGTAAAGCGGAGTTGACCGCCTGCCTGCAACAGCAAGCCAGCGCCAAATCCGGCCTGG
AAGAGTGCGAAATGGCATGGCTGGAAGCCCAGGAGCAGCTTGAGCAGATGCTGCTGGAAGGCCAAAACAACTGA
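
A GC content figure such as the one reported above is conventionally the percentage of G and C bases in the sequence. The sketch below shows that calculation under this assumption; `gc_content` is a hypothetical helper, not a function used by the database, and how the record's value was actually derived is not stated.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First four codons of the gene sequence above, for illustration only.
first_codons = "ATGATTGTTTTC"
print(round(gc_content(first_codons), 2))  # 25.0
```

Passing the full 1914-base sequence (line breaks are tolerated) gives the gene-wide value to compare against the GC content field.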

Upstream 100 bases:

>100_bases
AGTCGCTGTTTTGGGCTACCATTGCGCCCGGTGCGGCAGCTCGCCCATACATTACATTATCATAATGATAAGTTAACATA
GTCTGAACATACGGCGCCTT

Downstream 100 bases:

>100_bases
TGGCGCAGATAACAACGACCGATGCCAATGAATTTAGCAGCAGTGCTGAATTCACCCCTATGCGCGGCTTTAGCAATTGT
CATCTACAAACCATGCTGCC

Product: putative ABC transporter ATP-binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 637; Mature: 637

Protein sequence:

>637_residues
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA
LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL
AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA
GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR
LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL
RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN

Sequences:

>Translated_637_residues
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA
LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL
AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA
GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR
LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL
RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN
>Mature_637_residues
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA
LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL
AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA
GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR
LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL
RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN

Specific function: Unknown

COG id: COG0488

COG function: function code R; ATPase components of ABC transporters with duplicated ATPase domains

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 ABC transporter domains [H]

Homologues:

Organism=Homo sapiens, GI148612853, Length=537, Percent_Identity=37.6163873370577, Blast_Score=357, Evalue=3e-98,
Organism=Homo sapiens, GI10947137, Length=513, Percent_Identity=36.2573099415205, Blast_Score=318, Evalue=1e-86,
Organism=Homo sapiens, GI27881506, Length=513, Percent_Identity=36.2573099415205, Blast_Score=318, Evalue=1e-86,
Organism=Homo sapiens, GI10947135, Length=525, Percent_Identity=35.4285714285714, Blast_Score=273, Evalue=4e-73,
Organism=Homo sapiens, GI69354671, Length=525, Percent_Identity=35.4285714285714, Blast_Score=273, Evalue=4e-73,
Organism=Homo sapiens, GI21536376, Length=185, Percent_Identity=28.6486486486486, Blast_Score=68, Evalue=2e-11,
Organism=Escherichia coli, GI1789751, Length=637, Percent_Identity=99.6860282574568, Blast_Score=1298, Evalue=0.0,
Organism=Escherichia coli, GI1787041, Length=529, Percent_Identity=33.0812854442344, Blast_Score=298, Evalue=1e-81,
Organism=Escherichia coli, GI2367384, Length=532, Percent_Identity=31.390977443609, Blast_Score=231, Evalue=1e-61,
Organism=Escherichia coli, GI1787182, Length=623, Percent_Identity=28.4109149277689, Blast_Score=231, Evalue=1e-61,
Organism=Escherichia coli, GI1788165, Length=190, Percent_Identity=33.1578947368421, Blast_Score=96, Evalue=8e-21,
Organism=Escherichia coli, GI87081782, Length=527, Percent_Identity=25.6166982922201, Blast_Score=88, Evalue=2e-18,
Organism=Escherichia coli, GI87081791, Length=204, Percent_Identity=28.4313725490196, Blast_Score=74, Evalue=4e-14,
Organism=Escherichia coli, GI1787164, Length=216, Percent_Identity=28.7037037037037, Blast_Score=73, Evalue=5e-14,
Organism=Escherichia coli, GI48994997, Length=221, Percent_Identity=25.7918552036199, Blast_Score=67, Evalue=4e-12,
Organism=Escherichia coli, GI48994883, Length=219, Percent_Identity=26.9406392694064, Blast_Score=67, Evalue=4e-12,
Organism=Escherichia coli, GI87081709, Length=209, Percent_Identity=23.9234449760766, Blast_Score=67, Evalue=5e-12,
Organism=Escherichia coli, GI1789891, Length=190, Percent_Identity=27.3684210526316, Blast_Score=65, Evalue=1e-11,
Organism=Escherichia coli, GI1789586, Length=218, Percent_Identity=27.9816513761468, Blast_Score=64, Evalue=4e-11,
Organism=Escherichia coli, GI1787500, Length=225, Percent_Identity=27.5555555555556, Blast_Score=63, Evalue=5e-11,
Organism=Escherichia coli, GI1787758, Length=205, Percent_Identity=23.9024390243902, Blast_Score=62, Evalue=9e-11,
Organism=Caenorhabditis elegans, GI17553372, Length=530, Percent_Identity=38.3018867924528, Blast_Score=353, Evalue=1e-97,
Organism=Caenorhabditis elegans, GI17555318, Length=524, Percent_Identity=35.1145038167939, Blast_Score=323, Evalue=2e-88,
Organism=Caenorhabditis elegans, GI17559834, Length=547, Percent_Identity=34.1864716636197, Blast_Score=310, Evalue=1e-84,
Organism=Caenorhabditis elegans, GI71996809, Length=204, Percent_Identity=32.3529411764706, Blast_Score=70, Evalue=3e-12,
Organism=Caenorhabditis elegans, GI193211017, Length=184, Percent_Identity=33.1521739130435, Blast_Score=69, Evalue=1e-11,
Organism=Caenorhabditis elegans, GI193211015, Length=184, Percent_Identity=33.1521739130435, Blast_Score=69, Evalue=1e-11,
Organism=Saccharomyces cerevisiae, GI6321121, Length=533, Percent_Identity=34.8968105065666, Blast_Score=335, Evalue=1e-92,
Organism=Saccharomyces cerevisiae, GI6320874, Length=534, Percent_Identity=35.3932584269663, Blast_Score=322, Evalue=1e-88,
Organism=Saccharomyces cerevisiae, GI6325030, Length=438, Percent_Identity=29.4520547945205, Blast_Score=160, Evalue=5e-40,
Organism=Saccharomyces cerevisiae, GI6324314, Length=391, Percent_Identity=28.3887468030691, Blast_Score=152, Evalue=2e-37,
Organism=Saccharomyces cerevisiae, GI6323278, Length=391, Percent_Identity=28.9002557544757, Blast_Score=150, Evalue=7e-37,
Organism=Drosophila melanogaster, GI24666836, Length=520, Percent_Identity=37.5, Blast_Score=367, Evalue=1e-101,
Organism=Drosophila melanogaster, GI24642252, Length=523, Percent_Identity=37.4760994263862, Blast_Score=352, Evalue=6e-97,
Organism=Drosophila melanogaster, GI18859989, Length=523, Percent_Identity=37.4760994263862, Blast_Score=352, Evalue=6e-97,
Organism=Drosophila melanogaster, GI24641342, Length=532, Percent_Identity=35.3383458646617, Blast_Score=311, Evalue=1e-84,
Organism=Drosophila melanogaster, GI116007184, Length=164, Percent_Identity=30.4878048780488, Blast_Score=68, Evalue=2e-11,
Organism=Drosophila melanogaster, GI221500365, Length=165, Percent_Identity=30.3030303030303, Blast_Score=68, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24661270, Length=191, Percent_Identity=28.7958115183246, Blast_Score=67, Evalue=3e-11,
Organism=Drosophila melanogaster, GI21355589, Length=191, Percent_Identity=28.7958115183246, Blast_Score=67, Evalue=3e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003439
- InterPro:   IPR017871
- InterPro:   IPR003593 [H]

Pfam domain/function: PF00005 ABC_tran [H]

EC number: NA

Molecular weight: Translated: 71885; Mature: 71885

Theoretical pI: Translated: 5.47; Mature: 5.47

Prosite motif: PS00211 ABC_TRANSPORTER_1; PS50893 ABC_TRANSPORTER_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
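
The percentages above read as residue counts divided by protein length. The sketch below assumes that interpretation and rounding to one decimal place; `residue_percent` is a hypothetical helper, and the 60-residue fragment is only the N-terminal stretch of the protein sequence given earlier, used to keep the example short.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any one-letter code in `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return round(100.0 * hits / len(protein), 1)


# N-terminal 60 residues of the yheS product (illustration only).
n_term = "MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTF"

print(residue_percent(n_term, "C"))   # Cys share of this fragment
print(residue_percent(n_term, "M"))   # Met share of this fragment
print(residue_percent(n_term, "CM"))  # combined Cys+Met share
```

Running the same function on the full 637-residue sequence gives values to compare against the Cys/Met fields.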

Secondary structure:

>Translated Secondary Structure
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTF
CEEECCHHHHCCCEEEEECCEEEECCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCCEEC
PGSWQLAWVNQETPALPQAALEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAI
CCCEEEEEECCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCC
DAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNLAQALICRSDLLLLDEPTNHL
CHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEHHHHHHHHHHCCEEEEECCCCCC
DLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT
CHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEE
SFRAPESLPNPLLKMEKVSAGYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAG
EEECCCCCCCHHHHHHHHCCCCCCEEEEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHH
ELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLARLAPQELEQKLRDYLGGFGFQ
HCCCCCCCCCHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCHHHHHHHHHHHHCCCCCC
GDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
CCCHHHHHHHCCCCCHHHHHHHHHEECCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCEE
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSA
EEEECCCHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHH
QARKDQKRREAELRAQTQPLRKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
ACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN
HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTF
CEEECCHHHHCCCEEEEECCEEEECCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCCEEC
PGSWQLAWVNQETPALPQAALEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAI
CCCEEEEEECCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCC
DAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNLAQALICRSDLLLLDEPTNHL
CHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEHHHHHHHHHHCCEEEEECCCCCC
DLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT
CHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEE
SFRAPESLPNPLLKMEKVSAGYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAG
EEECCCCCCCHHHHHHHHCCCCCCEEEEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHH
ELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLARLAPQELEQKLRDYLGGFGFQ
HCCCCCCCCCHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCHHHHHHHHHHHHCCCCCC
GDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
CCCHHHHHHHCCCCCHHHHHHHHHEECCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCEE
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSA
EEEECCCHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHH
QARKDQKRREAELRAQTQPLRKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
ACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN
HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]