Definition | Escherichia coli IAI39 chromosome, complete genome. |
---|---|
Accession | NC_011750 |
Length | 5,132,068 |
The map label for this gene is yheS
Identifier: 218702100
GI number: 218702100
Start: 3980560
End: 3982473
Strand: Direct
Name: yheS
Synonym: ECIAI39_3835
Alternate gene names: 218702100
Gene position: 3980560-3982473 (Clockwise)
Preceding gene: 218702096
Following gene: 218702101
Centisome position: 77.56
GC content: 55.69
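The coordinate fields above are related by simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record does not state either convention; the function names are illustrative only.

```python
GENOME_LENGTH = 5_132_068          # "Length" field of NC_011750
START, END = 3_980_560, 3_982_473  # "Start" and "End" fields of yheS


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


length = gene_length(START, END)
codons = length // 3               # includes the terminal stop codon
print(length, codons - 1, centisome(START, GENOME_LENGTH))
```

Under these assumptions the sketch gives 1914 bases, 637 coding codons and a centisome position of 77.56, which agrees with the values listed in this record.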
Gene sequence:
>1914_bases ATGATTGTTTTCTCCTCGTTACAAATTCGTCGCGGCGTGCGCGTCCTGCTGGATAATGCCACCGCCACCATCAACCCCGG GCAGAAAGTCGGCCTGGTGGGTAAAAACGGCTGTGGTAAATCTACCCTGCTGGCATTGCTGAAAAATGAAATCAGCGCCG ACGGCGGCAGCTACACCTTTCCGGGAAGCTGGCAACTGGCGTGGGTGAATCAGGAAACGCCGGCGTTACCGCAAGCGGCG CTGGAATATGTCATTGACGGCGACCGTGAATATCGTCAACTGGAAGCGCAGCTACACGACGCCAACGAACGTAACGACGG GCACGCCATCGCCACTATTCATGGCAAGCTGGATGCTATTGACGCATGGAGTATTCGCTCCCGTGCCGCCAGCCTGCTGC ACGGCCTCGGTTTCAGCAATGAACAACTGGAGCGCCCGGTAAGTGATTTCTCCGGTGGCTGGCGTATGCGTCTTAACCTT GCCCAGGCGCTGATTTGCCGTTCAGACTTGCTGCTGCTCGACGAACCGACTAACCACCTCGATCTCGATGCCGTTATCTG GCTGGAAAAATGGCTGAAGAGCTATCAGGGCACGCTGATCCTGATCTCTCACGACCGCGACTTCCTCGATCCGATCGTTG AAAAAATTATTCATATCGAACAACAAAGCATGTTCGAGTACACCGGCAACTACAGTTCGTTTGAAGTACAGCGCGCCACC CGTCTGGCGCAGCAACAAGCGATGTACGAAAGCCAGCAGGAACGCGTAGCGCATCTGCAAAGTTATATCGACCGTTTCCG TGCCAAAGCCACCAAAGCGAAGCAGGCCCAGAGCCGTATTAAGATGCTGGAGCGTATGGAGCTGATTGCCCCGGCGCACG TCGACAACCCGTTCCGCTTTAGCTTCCGCGCGCCGGAAAGCCTGCCTAATCCGTTATTAAAGATGGAAAAAGTCAGCGCA GGCTATGGTGATCGCATTATTCTCGACTCGATTAAACTGAATCTGGTCCCCGGCTCGCGCATTGGTCTGCTAGGCCGCAA CGGCGCGGGTAAATCGACATTAATCAAACTGTTAGCCGGTGAACTTGCGCCAGTCAGCGGTGAAATTGGTCTGGCGAAAG GGATCAAGCTCGGCTACTTCGCCCAGCATCAACTGGAATACCTGCGCGCCGACGAATCGCCGATTCAACATCTGGCACGT TTAGCGCCGCAGGAGCTGGAGCAAAAACTGCGTGACTACCTCGGCGGCTTTGGTTTCCAGGGCGATAAAGTAACCGAAGA AACGCGCCGCTTCTCCGGTGGGGAAAAAGCCCGCCTGGTGCTGGCATTAATTGTCTGGCAGCGTCCGAATCTGCTGCTGC TCGACGAACCGACCAACCACCTTGACCTCGACATGCGTCAGGCACTCACCGAAGCATTAATCGAGTTCGAAGGCGCGCTG GTTGTCGTCTCGCACGACCGTCATTTGCTGCGTTCCACCACTGACGATCTCTACCTGGTTCACGATCGTAAAGTCGAACC GTTCGACGGCGATCTGGAAGATTATCAACAGTGGTTGAGCGACGTACAAAAGCAGGAAAACCAGACCGACGAAGCGCCAA AAGAGAACGCGAACAGCGCCCAGGCACGTAAAGATCAGAAGCGTCGGGAAGCGGAGCTGCGTGCGCAAACCCAGCCACTG CGTAAAGAGATTGCCCGTCTGGAAAAAGAGATGGAGAAGCTGAACGCGCAACTGGCGCAGGCGGAAGAGAAACTCGGCGA CAGCGAACTGTACGATCAAAGCCGTAAAGCGGAGTTGACCGCCTGCCTGCAACAGCAAGCCAGCGCCAAATCCGGCCTGG AAGAGTGCGAAATGGCATGGCTGGAAGCCCAGGAGCAGCTTGAGCAGATGCTGCTGGAAGGCCAAAACAACTGA
Upstream 100 bases:
>100_bases AGTCGCTGTTTTGGGCTACCATTGCGCCCGGTGCGGCAGCTCGCCCATACATTACATTATCATAATGATAAGTTAACATA GTCTGAACATACGGCGCCTT
Downstream 100 bases:
>100_bases TGGCGCAGATAACAACGACCGATGCCAATGAATTTAGCAGCAGTGCTGAATTCACCCCTATGCGCGGCTTTAGCAATTGT CATCTACAAACCATGCTGCC
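The sequence fields above use a FASTA-like layout in which a `>N_bases` token is followed by the sequence in space-separated chunks. A minimal sketch of how a GC percentage could be computed from such a field is shown here; `strip_header` and `gc_content` are hypothetical helper names, and whether the page's "GC content" value was computed in exactly this way is an assumption that is not verified here.

```python
def strip_header(record: str) -> str:
    """Drop a leading '>..._bases' token, if present (layout used on this page)."""
    record = record.strip()
    if record.startswith(">"):
        parts = record.split(None, 1)
        return parts[1] if len(parts) > 1 else ""
    return record


def gc_content(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters; anything else is ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return round(100.0 * gc / len(bases), 2)


# Toy input in the same layout as the fields above (not real data).
example = ">8_bases ATGC GGCC"
print(gc_content(strip_header(example)))
```

The header must be stripped first, because letters such as the "a" in "bases" would otherwise be counted as nucleotides.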
Product: putative ABC transporter ATP-binding protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 637; Mature: 637
Protein sequence:
>637_residues MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN
Sequences:
>Translated_637_residues MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN >Mature_637_residues MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN
Specific function: Unknown
COG id: COG0488
COG function: function code R; ATPase components of ABC transporters with duplicated ATPase domains
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 ABC transporter domains [H]
Homologues:
Organism=Homo sapiens, GI148612853, Length=537, Percent_Identity=37.6163873370577, Blast_Score=357, Evalue=3e-98
Organism=Homo sapiens, GI10947137, Length=513, Percent_Identity=36.2573099415205, Blast_Score=318, Evalue=1e-86
Organism=Homo sapiens, GI27881506, Length=513, Percent_Identity=36.2573099415205, Blast_Score=318, Evalue=1e-86
Organism=Homo sapiens, GI10947135, Length=525, Percent_Identity=35.4285714285714, Blast_Score=273, Evalue=4e-73
Organism=Homo sapiens, GI69354671, Length=525, Percent_Identity=35.4285714285714, Blast_Score=273, Evalue=4e-73
Organism=Homo sapiens, GI21536376, Length=185, Percent_Identity=28.6486486486486, Blast_Score=68, Evalue=2e-11
Organism=Escherichia coli, GI1789751, Length=637, Percent_Identity=99.6860282574568, Blast_Score=1298, Evalue=0.0
Organism=Escherichia coli, GI1787041, Length=529, Percent_Identity=33.0812854442344, Blast_Score=298, Evalue=1e-81
Organism=Escherichia coli, GI2367384, Length=532, Percent_Identity=31.390977443609, Blast_Score=231, Evalue=1e-61
Organism=Escherichia coli, GI1787182, Length=623, Percent_Identity=28.4109149277689, Blast_Score=231, Evalue=1e-61
Organism=Escherichia coli, GI1788165, Length=190, Percent_Identity=33.1578947368421, Blast_Score=96, Evalue=8e-21
Organism=Escherichia coli, GI87081782, Length=527, Percent_Identity=25.6166982922201, Blast_Score=88, Evalue=2e-18
Organism=Escherichia coli, GI87081791, Length=204, Percent_Identity=28.4313725490196, Blast_Score=74, Evalue=4e-14
Organism=Escherichia coli, GI1787164, Length=216, Percent_Identity=28.7037037037037, Blast_Score=73, Evalue=5e-14
Organism=Escherichia coli, GI48994997, Length=221, Percent_Identity=25.7918552036199, Blast_Score=67, Evalue=4e-12
Organism=Escherichia coli, GI48994883, Length=219, Percent_Identity=26.9406392694064, Blast_Score=67, Evalue=4e-12
Organism=Escherichia coli, GI87081709, Length=209, Percent_Identity=23.9234449760766, Blast_Score=67, Evalue=5e-12
Organism=Escherichia coli, GI1789891, Length=190, Percent_Identity=27.3684210526316, Blast_Score=65, Evalue=1e-11
Organism=Escherichia coli, GI1789586, Length=218, Percent_Identity=27.9816513761468, Blast_Score=64, Evalue=4e-11
Organism=Escherichia coli, GI1787500, Length=225, Percent_Identity=27.5555555555556, Blast_Score=63, Evalue=5e-11
Organism=Escherichia coli, GI1787758, Length=205, Percent_Identity=23.9024390243902, Blast_Score=62, Evalue=9e-11
Organism=Caenorhabditis elegans, GI17553372, Length=530, Percent_Identity=38.3018867924528, Blast_Score=353, Evalue=1e-97
Organism=Caenorhabditis elegans, GI17555318, Length=524, Percent_Identity=35.1145038167939, Blast_Score=323, Evalue=2e-88
Organism=Caenorhabditis elegans, GI17559834, Length=547, Percent_Identity=34.1864716636197, Blast_Score=310, Evalue=1e-84
Organism=Caenorhabditis elegans, GI71996809, Length=204, Percent_Identity=32.3529411764706, Blast_Score=70, Evalue=3e-12
Organism=Caenorhabditis elegans, GI193211017, Length=184, Percent_Identity=33.1521739130435, Blast_Score=69, Evalue=1e-11
Organism=Caenorhabditis elegans, GI193211015, Length=184, Percent_Identity=33.1521739130435, Blast_Score=69, Evalue=1e-11
Organism=Saccharomyces cerevisiae, GI6321121, Length=533, Percent_Identity=34.8968105065666, Blast_Score=335, Evalue=1e-92
Organism=Saccharomyces cerevisiae, GI6320874, Length=534, Percent_Identity=35.3932584269663, Blast_Score=322, Evalue=1e-88
Organism=Saccharomyces cerevisiae, GI6325030, Length=438, Percent_Identity=29.4520547945205, Blast_Score=160, Evalue=5e-40
Organism=Saccharomyces cerevisiae, GI6324314, Length=391, Percent_Identity=28.3887468030691, Blast_Score=152, Evalue=2e-37
Organism=Saccharomyces cerevisiae, GI6323278, Length=391, Percent_Identity=28.9002557544757, Blast_Score=150, Evalue=7e-37
Organism=Drosophila melanogaster, GI24666836, Length=520, Percent_Identity=37.5, Blast_Score=367, Evalue=1e-101
Organism=Drosophila melanogaster, GI24642252, Length=523, Percent_Identity=37.4760994263862, Blast_Score=352, Evalue=6e-97
Organism=Drosophila melanogaster, GI18859989, Length=523, Percent_Identity=37.4760994263862, Blast_Score=352, Evalue=6e-97
Organism=Drosophila melanogaster, GI24641342, Length=532, Percent_Identity=35.3383458646617, Blast_Score=311, Evalue=1e-84
Organism=Drosophila melanogaster, GI116007184, Length=164, Percent_Identity=30.4878048780488, Blast_Score=68, Evalue=2e-11
Organism=Drosophila melanogaster, GI221500365, Length=165, Percent_Identity=30.3030303030303, Blast_Score=68, Evalue=2e-11
Organism=Drosophila melanogaster, GI24661270, Length=191, Percent_Identity=28.7958115183246, Blast_Score=67, Evalue=3e-11
Organism=Drosophila melanogaster, GI21355589, Length=191, Percent_Identity=28.7958115183246, Blast_Score=67, Evalue=3e-11
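The homologue entries above follow a fixed `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be read into structured records. The sketch below is one possible way to do that with a regular expression; the `Hit` type and `parse_hits` function are hypothetical names introduced for illustration, and the pattern is assumed from the entries in this record rather than from any published format specification.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# \s* after each comma tolerates entries that wrap across lines.
HIT_RE = re.compile(
    r"Organism=([^,]+),\s*GI(\d+),\s*Length=(\d+),\s*"
    r"Percent_Identity=([\d.]+),\s*Blast_Score=(\d+),\s*Evalue=([^,\s]+)"
)


def parse_hits(text: str) -> list[Hit]:
    """Extract every homologue entry found in the given text."""
    return [
        Hit(org.strip(), gi, int(n), float(pid), int(score), float(ev))
        for org, gi, n, pid, score, ev in HIT_RE.findall(text)
    ]


# Two entries copied from the list above, one of them wrapped across lines.
sample = (
    "Organism=Escherichia coli, GI1789751, Length=637, "
    "Percent_Identity=99.6860282574568, Blast_Score=1298, Evalue=0.0, "
    "Organism=Escherichia coli, GI1789891, Length=190, \n"
    "Percent_Identity=27.3684210526316, Blast_Score=65, Evalue=1e-11,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.gi, best.percent_identity)
```

Once parsed, the hits can be sorted or filtered, for example by `blast_score` or by an `evalue` threshold.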
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003439
- InterPro: IPR017871
- InterPro: IPR003593 [H]
Pfam domain/function: PF00005 ABC_tran [H]
EC number: NA
Molecular weight: Translated: 71885; Mature: 71885
Theoretical pI: Translated: 5.47; Mature: 5.47
Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
1.7 %Met (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
1.7 %Met (Mature Protein)
2.4 %Cys+Met (Mature Protein)
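The percentages above are residue counts divided by the protein length. A minimal sketch of that calculation follows; `residue_percent` is a hypothetical helper, and the demo uses a synthetic stand-in sequence with 4 Cys and 11 Met in 637 residues. Those counts come from a manual tally of the protein sequence listed in this record and should be treated as an assumption.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions whose amino acid is one of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(aa in residues for aa in seq)
    return round(100.0 * count / len(seq), 1)


# Synthetic stand-in with the assumed composition (4 C, 11 M, 637 residues);
# it is not the real yheS sequence.
stand_in = "C" * 4 + "M" * 11 + "A" * 622

print(residue_percent(stand_in, "C"))
print(residue_percent(stand_in, "M"))
print(residue_percent(stand_in, "CM"))
```

With this composition the combined figure is rounded from 15/637, which would explain why the listed Cys+Met value (2.4) is not the sum of the separately rounded Cys (0.6) and Met (1.7) values.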
Secondary structure:
>Translated Secondary Structure MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTF CEEECCHHHHCCCEEEEECCEEEECCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCCEEC PGSWQLAWVNQETPALPQAALEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAI CCCEEEEEECCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCC DAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNLAQALICRSDLLLLDEPTNHL CHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEHHHHHHHHHHCCEEEEECCCCCC DLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT CHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEE SFRAPESLPNPLLKMEKVSAGYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAG EEECCCCCCCHHHHHHHHCCCCCCEEEEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHH ELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLARLAPQELEQKLRDYLGGFGFQ HCCCCCCCCCHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCHHHHHHHHHHHHCCCCCC GDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL CCCHHHHHHHCCCCCHHHHHHHHHEECCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCEE VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSA EEEECCCHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHH QARKDQKRREAELRAQTQPLRKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELT HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH ACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCC >Mature Secondary Structure MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTF CEEECCHHHHCCCEEEEECCEEEECCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCCEEC PGSWQLAWVNQETPALPQAALEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAI CCCEEEEEECCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCC DAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNLAQALICRSDLLLLDEPTNHL CHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCCCEEEEHHHHHHHHHHCCEEEEECCCCCC DLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVEKIIHIEQQSMFEYTGNYSSFEVQRAT CHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEE 
SFRAPESLPNPLLKMEKVSAGYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAG EEECCCCCCCHHHHHHHHCCCCCCEEEEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHH ELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLARLAPQELEQKLRDYLGGFGFQ HCCCCCCCCCHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCHHHHHHHHHHHHCCCCCC GDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL CCCHHHHHHHCCCCCHHHHHHHHHEECCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCEE VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSA EEEECCCHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHH QARKDQKRREAELRAQTQPLRKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELT HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH ACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQNN HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]