Definition: Escherichia coli O157:H7 str. EC4115, complete genome.
Accession: NC_011353
Length: 5,572,075

The map label for this gene is ychM

Identifier: 209397423

GI number: 209397423

Start: 1626502

End: 1628181

Strand: Reverse

Name: ychM

Synonym: ECH74115_1687

Alternate gene names: 209397423

Gene position: 1628181-1626502 (Counterclockwise)

Preceding gene: 209398206

Following gene: 209398573

Centisome position: 29.22
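
The position fields above are related by simple arithmetic. The following is a minimal sketch, not the database's own method: the function and variable names are illustrative, and the choice of the higher coordinate as the reference point for this reverse-strand gene is an assumption inferred from the numbers rather than stated by the record.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_572_075
START, END = 1_626_502, 1_628_181


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the higher coordinate (the gene's
# 5' end) is used as the reference position.
position_pct = centisome(END, GENOME_LENGTH)
print(length, position_pct)  # 1680 29.22
```

The 1680 bases correspond to 560 codons, that is 559 residues plus a stop codon, which agrees with the amino-acid count listed further down.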

GC content: 54.52

Gene sequence:

>1680_bases
GTGAACAAAATATTTTCCTCACATGTGATGCCTTTCCGCGCTCTGATCGACGCTTGCTGGAAAGAAAAATATACTGCCGC
ACGGTTTACCCGTGACCTGATTGCCGGGATAACCGTCGGGATTATTGCTATCCCGCTGGCGATGGCGTTGGCTATTGGTA
GTGGTGTGGCACCCCAGTACGGTTTATATACCGCAGCTGTTGCGGGGATTGTCATTGCTCTGACGGGTGGGTCACGCTTT
AGCGTTTCCGGTCCGACTGCGGCATTTGTGGTAATTCTCTATCCCGTTTCGCAACAGTTTGGGCTGGCAGGACTGCTGGT
TGCGACCTTGCTGTCGGGGATCTTTTTGATTCTGATGGGTCTGGCACGCTTTGGTCGCCTGATTGAGTATATTCCGGTTT
CCGTCACCTTAGGTTTCACCTCGGGTATCGGGATCACCATCGGTACCATGCAGATTAAAGATTTTCTCGGTCTGCAAATG
GCCCATGTCCCGGAACATTATCTACAAAAAGTCGGCGCATTATTTATGGCGCTGCCGACCATTAATGTGGGTGATGCTGC
CATTGGCATTGTGACGCTAGGTATTCTTGTTTTCTGGCCGCGTCTGGGCATTCGTTTACCCGGTCACCTTCCGGCCTTGC
TGGCTGGTTGCGCGGTGATGGGGATTGTTAACCTGCTCGGCGGACATGTTGCTACCATCGGTTCGCAATTCCACTACGTC
CTGGCCGATGGTTCTCAGGGTAACGGTATTCCGCAACTGCTACCGCAACTGGTGCTGCCGTGGGATCTGCCTAATTCAGA
ATTCACGCTAACCTGGGATTCTATTCGCACACTGCTGCCTGCGGCATTCTCAATGGCAATGCTCGGCGCAATCGAATCTC
TGCTCTGCGCCGTGGTGCTGGATGGTATGACCGGGACGAAGCACAAGGCGAATAGCGAACTGGTTGGACAGGGGCTGGGG
AATATCATCGCTCCGTTCTTTGGTGGTATTACCGCTACCGCTGCCATCGCGCGTTCTGCCGCTAACGTCCGTGCCGGGGC
AACTTCCCCTATCTCGGCGGTGATCCACTCTATTCTGGTTATTCTTGCCCTGCTGGTACTGGCACCGCTGCTCTCCTGGC
TGCCGCTTTCCGCTATGGCAGCCCTGCTGTTGATGGTGGCGTGGAACATGAGTGAAGCGCATAAAGTGGTCGACTTGCTG
CGTCATGCACCGAAAGATGACATCATTGTCATGCTGCTGTGCATGTCGCTGACCGTGCTGTTTGATATGGTTATTGCCAT
CAGCGTGGGGATCGTGCTGGCATCGCTGCTGTTTATGCGTCGTATCGCACGTATGACTCGCCTGGCACCGGTAGTCGTAG
ATGTTCCAGACGATGTCCTGGTTCTGCGCGTTATTGGCCCGCTGTTTTTTGCTGCTGCTGAAGGCTTATTCACGGACCTG
GAGTCACGTCTTGAAGGCAAACGGATTGTGATTCTGAAGTGGGATGCCGTTCCGGTACTTGATGCTGGTGGTCTTGATGC
GTTCCAGCGTTTTGTGAAGCGTCTGCCCGAAGGATGTGAACTGCGCGTGTGCAACGTGGAATTCCAGCCACTGCGCACTA
TGGCTCGCGCAGGCATTCAACCGATCCCGGGACGCCTCGCGTTCTTCCCGAATCGTCGCGCGGCGATGGCGGATTTATAA
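
The GC content field above (54.52) is the percentage of G and C bases in the gene sequence. As a hedged sketch of that calculation (the helper name `gc_content` is hypothetical, and the rounding to two decimals is an assumption based on how the record prints the value):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100 * gc / len(bases), 2)


# First two codons of the gene sequence above, as a small check.
print(gc_content("GTGAAC"))  # 50.0
```

Applied to the full 1680-base sequence, a count of 916 G or C bases would reproduce the listed 54.52. That count is inferred from the percentage and is not verified here.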

Upstream 100 bases:

>100_bases
GCGGCGGGTTTTTTTGTCTGTAATATCCATTTGTATGACCTATGCCTCCTTCACCTGCCATTTAGTTGACAGATGATGCG
CTCACGGATGAAACATTATT

Downstream 100 bases:

>100_bases
GTACGAGATTGACCAGTCAGCGCAAACTGACTGGTCAGCAAACTGCATTATTTGTTAGCTATGACGGCGATTATCGCTAC
GGCGGCAACGTTTGTCATAG

Product: putative sulfate transporter YchM

Products: Proton [Cytoplasm]; SO4(2-) [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 559; Mature: 559

Protein sequence:

>559_residues
MNKIFSSHVMPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRF
SVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQM
AHVPEHYLQKVGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYV
LADGSQGNGIPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLG
NIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLL
RHAPKDDIIVMLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDL
ESRLEGKRIVILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL
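
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last codons only. It is an illustration, not the pipeline that produced this record: the names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical, and reading an initiating GTG as Met is an assumption based on the bacterial translation table.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code amino acids in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: GTG and TTG act as alternative start codons and an
# initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAACAAAATATTTTCCTCACATGTGATG"))  # MNKIFSSHVM
# Last 12 bases, ending in the TAA stop codon.
print(translate("GCGGATTTATAA"))  # ADL
```

Both fragments agree with the start (MNKIFSSHVM) and end (ADL) of the protein sequence listed above.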

Sequences:

>Translated_559_residues
MNKIFSSHVMPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRF
SVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQM
AHVPEHYLQKVGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYV
LADGSQGNGIPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLG
NIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLL
RHAPKDDIIVMLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDL
ESRLEGKRIVILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL
>Mature_559_residues
MNKIFSSHVMPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRF
SVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQM
AHVPEHYLQKVGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYV
LADGSQGNGIPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLG
NIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLL
RHAPKDDIIVMLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDL
ESRLEGKRIVILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL

Specific function: Possible sulfate transporter

COG id: COG0659

COG function: function code P; Sulfate permease and related transporters (MFS superfamily)

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 STAS domain

Homologues:

Organism=Homo sapiens, GI262206105, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33,
Organism=Homo sapiens, GI262206075, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33,
Organism=Homo sapiens, GI262206069, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33,
Organism=Homo sapiens, GI262206063, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33,
Organism=Homo sapiens, GI4557535, Length=489, Percent_Identity=23.1083844580777, Blast_Score=89, Evalue=1e-17,
Organism=Homo sapiens, GI94721253, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI94721257, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI94721259, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI94721255, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI45827800, Length=389, Percent_Identity=22.879177377892, Blast_Score=80, Evalue=5e-15,
Organism=Homo sapiens, GI39752683, Length=389, Percent_Identity=22.879177377892, Blast_Score=80, Evalue=6e-15,
Organism=Homo sapiens, GI45827802, Length=402, Percent_Identity=21.8905472636816, Blast_Score=80, Evalue=7e-15,
Organism=Homo sapiens, GI4505697, Length=433, Percent_Identity=23.094688221709, Blast_Score=79, Evalue=9e-15,
Organism=Homo sapiens, GI20336282, Length=527, Percent_Identity=21.0626185958254, Blast_Score=76, Evalue=7e-14,
Organism=Homo sapiens, GI269784651, Length=379, Percent_Identity=23.2189973614776, Blast_Score=76, Evalue=7e-14,
Organism=Homo sapiens, GI16306483, Length=510, Percent_Identity=21.5686274509804, Blast_Score=76, Evalue=8e-14,
Organism=Homo sapiens, GI47131207, Length=545, Percent_Identity=23.8532110091743, Blast_Score=75, Evalue=1e-13,
Organism=Homo sapiens, GI20336272, Length=545, Percent_Identity=23.8532110091743, Blast_Score=75, Evalue=1e-13,
Organism=Homo sapiens, GI100913030, Length=478, Percent_Identity=23.0125523012552, Blast_Score=72, Evalue=1e-12,
Organism=Escherichia coli, GI87081859, Length=559, Percent_Identity=100, Blast_Score=1105, Evalue=0.0,
Organism=Caenorhabditis elegans, GI17566848, Length=569, Percent_Identity=24.780316344464, Blast_Score=102, Evalue=5e-22,
Organism=Caenorhabditis elegans, GI86564196, Length=445, Percent_Identity=24.2696629213483, Blast_Score=73, Evalue=5e-13,
Organism=Caenorhabditis elegans, GI193203292, Length=384, Percent_Identity=23.1770833333333, Blast_Score=69, Evalue=5e-12,
Organism=Saccharomyces cerevisiae, GI6323121, Length=456, Percent_Identity=23.9035087719298, Blast_Score=91, Evalue=4e-19,
Organism=Saccharomyces cerevisiae, GI6325260, Length=520, Percent_Identity=22.8846153846154, Blast_Score=87, Evalue=1e-17,
Organism=Saccharomyces cerevisiae, GI6319771, Length=181, Percent_Identity=30.939226519337, Blast_Score=72, Evalue=2e-13,
Organism=Drosophila melanogaster, GI85815873, Length=479, Percent_Identity=25.8872651356994, Blast_Score=118, Evalue=1e-26,
Organism=Drosophila melanogaster, GI19922482, Length=423, Percent_Identity=25.531914893617, Blast_Score=117, Evalue=2e-26,
Organism=Drosophila melanogaster, GI24663084, Length=443, Percent_Identity=27.765237020316, Blast_Score=117, Evalue=3e-26,
Organism=Drosophila melanogaster, GI21357695, Length=443, Percent_Identity=27.765237020316, Blast_Score=117, Evalue=3e-26,
Organism=Drosophila melanogaster, GI21358229, Length=463, Percent_Identity=26.5658747300216, Blast_Score=112, Evalue=7e-25,
Organism=Drosophila melanogaster, GI24649801, Length=453, Percent_Identity=26.4900662251656, Blast_Score=111, Evalue=1e-24,
Organism=Drosophila melanogaster, GI24651449, Length=448, Percent_Identity=24.1071428571429, Blast_Score=103, Evalue=2e-22,
Organism=Drosophila melanogaster, GI21358633, Length=434, Percent_Identity=25.8064516129032, Blast_Score=101, Evalue=1e-21,
Organism=Drosophila melanogaster, GI24647160, Length=178, Percent_Identity=30.3370786516854, Blast_Score=79, Evalue=9e-15,
Organism=Drosophila melanogaster, GI21355087, Length=171, Percent_Identity=30.9941520467836, Blast_Score=79, Evalue=9e-15,
Organism=Drosophila melanogaster, GI24666186, Length=442, Percent_Identity=25.3393665158371, Blast_Score=71, Evalue=2e-12,
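
Each homologue line above is a comma-separated list of key=value fields, with the GI number written without a key. As a rough sketch of how such a line could be read into a structured record (the function name `parse_homologue` and the output keys are assumptions made for illustration, not part of the source database):

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is the only field written without '='.
            record["GI"] = field[2:]
    # Convert the numeric fields; float() accepts forms such as '2e-33'.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI262206105, Length=503, "
    "Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
```

Keeping the GI as a string avoids treating an identifier as a quantity, while the score and E-value become numbers that can be sorted or filtered.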

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YCHM_ECO57 (P0AFR3)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   D85700
- PIR:   G90842
- RefSeq:   NP_287452.1
- RefSeq:   NP_309738.1
- ProteinModelPortal:   P0AFR3
- SMR:   P0AFR3
- EnsemblBacteria:   EBESCT00000025139
- EnsemblBacteria:   EBESCT00000055341
- GeneID:   913152
- GeneID:   960456
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1977
- KEGG:   ecs:ECs1711
- GeneTree:   EBGT00050000010289
- HOGENOM:   HBG564176
- OMA:   VFLTCFS
- ProtClustDB:   PRK11660
- BioCyc:   ECOL83334:ECS1711-MONOMER
- InterPro:   IPR018045
- InterPro:   IPR002645
- InterPro:   IPR001902
- InterPro:   IPR011547
- Gene3D:   G3DSA:3.30.750.24
- TIGRFAMs:   TIGR00815

Pfam domain/function: PF01740 STAS; PF00916 Sulfate_transp; SSF52091 STAS

EC number: NA

Molecular weight: Translated: 59430; Mature: 59430

Theoretical pI: Translated: 9.01; Mature: 9.01

Prosite motif: PS01130 SLC26A; PS50801 STAS

Important sites: NA

Signals:

None

Transmembrane regions:

10 predicted regions (coordinates not available)

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
4.8 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
4.8 %Cys+Met (Mature Protein)
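
The percentages above are residue counts divided by the protein length. A minimal sketch of that calculation follows; the helper name `residue_percent` is hypothetical and the one-decimal rounding is assumed from the way the record prints the values.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100 * count / len(protein), 1)


# First ten residues of the protein sequence, as a small check.
prefix = "MNKIFSSHVM"
print(residue_percent(prefix, "C"))   # 0.0
print(residue_percent(prefix, "M"))   # 20.0
print(residue_percent(prefix, "CM"))  # 20.0
```

For the full 559-residue protein, counts of 6 Cys and 21 Met would reproduce the listed 1.1, 3.8 and 4.8. Those counts are inferred from the percentages.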

Secondary structure:

>Translated Secondary Structure
MNKIFSSHVMPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQY
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH
GLYTAAVAGIVIALTGGSRFSVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMG
HHHHHHHHHHEEEEECCCEEECCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHH
LARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQKVGALFMALPT
HHHHHHHHHHHCCEEEEECCCCCCEEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCC
INVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYV
CCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEE
LADGSQGNGIPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVL
EECCCCCCCHHHHHHHHHCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DGMTGTKHKANSELVGQGLGNIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILV
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH
ILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIVMLLCMSLTVL
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
FDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHH
ESRLEGKRIVILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQ
HHHCCCCEEEEEEECCCCEECCCCHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCC
PIPGRLAFFPNRRAAMADL
CCCCCEEECCCCCCCCCCH
>Mature Secondary Structure
MNKIFSSHVMPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQY
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH
GLYTAAVAGIVIALTGGSRFSVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMG
HHHHHHHHHHEEEEECCCEEECCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHH
LARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQKVGALFMALPT
HHHHHHHHHHHCCEEEEECCCCCCEEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCC
INVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYV
CCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEE
LADGSQGNGIPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVL
EECCCCCCCHHHHHHHHHCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DGMTGTKHKANSELVGQGLGNIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILV
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH
ILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIVMLLCMSLTVL
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
FDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHH
ESRLEGKRIVILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQ
HHHCCCCEEEEEEECCCCEECCCCHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCC
PIPGRLAFFPNRRAAMADL
CCCCCEEECCCCCCCCCCH

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Proton [Periplasm]; SO4(2-) [Periplasm] [C]

Specific reaction: Proton [Periplasm] + SO4(2-) [Periplasm] = Proton [Cytoplasm] + SO4(2-) [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796