Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is ychM
Identifier: 157160711
GI number: 157160711
Start: 1307563
End: 1309215
Strand: Reverse
Name: ychM
Synonym: EcHS_A1311
Alternate gene names: 157160711
Gene position: 1309215-1307563 (Counterclockwise)
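The Start and End fields determine the gene length. A minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but it agrees with the `1653_bases` header of the gene sequence); the function name is hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature from inclusive, 1-based coordinates (assumed convention)."""
    lo, hi = sorted((start, end))  # accept either coordinate order
    return hi - lo + 1


# Coordinates copied from the Start and End fields of this record.
length = gene_length(1307563, 1309215)
print(length)           # 1653
print(length % 3 == 0)  # True: the gene is a whole number of codons
```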
Preceding gene: 157160712
Following gene: 157160709
Centisome position: 28.19
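Centisome position gives a gene's location as a percentage of the genome length. The value 28.19 is reproduced when the higher coordinate (1309215, where this reverse-strand gene begins) is divided by the genome length of 4,643,538. That choice of coordinate is inferred from the numbers, not stated in the record, and the function name is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to two decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 4_643_538  # Length field of this record

print(centisome(1309215, GENOME_LENGTH))  # 28.19 (matches the record)
print(centisome(1307563, GENOME_LENGTH))  # 28.16 (lower coordinate, does not match)
```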
GC content: 54.63
Gene sequence:
>1653_bases ATGCCTTTCCGCGCTCTGATCGACGCTTGCTGGAAAGAAAAATATACTGCCGCACGGTTTACCCGTGACCTGATTGCCGG GATAACCGTCGGGATTATTGCTATCCCGCTGGCGATGGCGTTGGCTATTGGTAGTGGTGTGGCACCCCAGTACGGTTTAT ATACCGCAGCTGTTGCGGGGATTGTCATTGCTCTGACGGGTGGGTCACGCTTTAGCGTTTCCGGTCCGACTGCGGCATTT GTGGTAATTCTCTATCCCGTGTCGCAACAGTTTGGACTGGCAGGACTGCTGGTTGCGACCTTGCTGTCGGGGATCTTTTT GATTCTGATGGGTCTGGCACGCTTTGGTCGCCTGATTGAGTATATTCCGGTTTCCGTCACCTTAGGTTTCACCTCGGGTA TCGGGATCACCATCGGTACCATGCAGATTAAAGATTTTCTCGGTCTGCAAATGGCCCATGTCCCGGAACATTATCTACAA AAAGTCGGCGCATTATTTATGGCGCTGCCGACCATTAATGTGGGTGATGCTGCCATTGGCATTGTGACGCTAGGTATTCT TGTTTTTTGGCCGCGTCTGGGCATTCGTTTACCCGGTCACCTTCCGGCCTTGCTGGCTGGTTGCGCGGTGATGGGGATTG TTAACCTGCTCGGCGGACATGTTGCTACCATCGGTTCGCAATTCCACTACGTCCTGGCCGATGGTTCTCAGGGTAACGGT ATTCCGCAACTGCTGCCGCAACTGGTGCTGCCGTGGGATCTGCCTAATTCAGAATTCACGCTAACCTGGGATTCTATTCG CACACTGCTGCCTGCGGCATTCTCAATGGCAATGCTCGGCGCAATCGAATCTCTGCTCTGCGCCGTGGTACTGGATGGTA TGACCGGGACGAAACACAAGGCGAACAGCGAACTGGTTGGACAGGGACTGGGGAATATTATCGCTCCGTTCTTTGGTGGT ATTACCGCTACAGCTGCCATCGCGCGTTCTGCCGCTAACGTCCGTGCCGGGGCAACTTCCCCTATCTCGGCGGTGATCCA CTCTATTCTGGTTATTCTTGCCCTGCTGGTACTGGCACCGCTGCTCTCCTGGCTGCCGCTTTCCGCTATGGCAGCCCTGC TGTTGATGGTGGCGTGGAACATGAGTGAAGCGCATAAAGTGGTCGACTTGCTGCGTCATGCACCGAAAGATGACATCATT GTCATGCTGCTGTGCATGTCGCTGACCGTGCTGTTTGATATGGTTATTGCCATCAGCGTGGGGATCGTGCTGGCATCGCT GCTGTTTATGCGTCGTATCGCACGTATGACTCGCCTGGCACCGGTAGTCGTAGATGTTCCAGACGATGTTCTGGTACTGC GCGTTATTGGCCCGCTGTTTTTTGCTGCTGCTGAAGGCTTGTTCACGGACCTGGAGTCACGTCTTGAAGGCAAACGGATT GTGATTCTGAAGTGGGATGCCGTTCCGGTACTTGATGCTGGTGGTCTTGATGCGTTCCAGCGTTTTGTGAAGCGTCTGCC CGAAGGATGTGAACTGCGCGTGTGCAACGTGGAATTCCAGCCACTGCGCACTATGGCTCGCGCAGGCATTCAACCGATCC CGGGACGCCTCGCGTTCTTCCCGAATCGTCGCGCGGCGATGGCGGATTTATAA
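The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation, assuming plain A/C/G/T input; the function name is hypothetical, and the example runs only on the first twelve bases of the sequence above, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace from line wrapping."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First twelve bases of the gene sequence: 7 of 12 are G or C.
print(gc_content("ATGCCTTTCCGC"))  # 58.33
```

Passing the full 1653-base sequence to the same function should give a value comparable to the GC content field.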
Upstream 100 bases:
>100_bases CATTTGTATGACCTATGCCTCCTTCACCTGCCATTTAGTTGACAGATGATGCGCTCACGGATGAAACATTATTGTGAACA AAATATTTTCCTCACATGTG
Downstream 100 bases:
>100_bases GTACGAGATTGACCAGTCAGCGCAAACTGACTGGTCAGCAAACTGCATTATTTGTTAGCTATGACGGCGATTATCGCTAC GGCGGCAACGTTTGTCATAG
Product: putative sulfate transporter YchM
Products: Proton [Cytoplasm]; SO42- [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 550; Mature: 549
Protein sequence:
>550_residues MPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRFSVSGPTAAF VVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQ KVGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYVLADGSQGNG IPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLGNIIAPFFGG ITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDII VMLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDLESRLEGKRI VILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL
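The residue count follows from the gene length: 1653 bases are 551 codons, and the last codon (TAA) is a stop, leaving 550 residues. A minimal sketch of the translation, assuming the standard genetic code; the helper names are hypothetical, and the example covers only the first twelve and last seven codons of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCTTTCCGCGCTCTGATCGACGCTTGCTGGAAA"))  # MPFRALIDACWK
print(translate("GCGGCGATGGCGGATTTATAA"))                 # AAMADL
```

Both outputs match the start and end of the protein sequence listed above.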
Sequences:
>Translated_550_residues MPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRFSVSGPTAAF VVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQ KVGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYVLADGSQGNG IPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLGNIIAPFFGG ITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDII VMLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDLESRLEGKRI VILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL >Mature_549_residues PFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRFSVSGPTAAFV VILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQK VGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYVLADGSQGNGI PQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLGNIIAPFFGGI TATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIV MLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDLESRLEGKRIV ILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL
Specific function: Possible sulfate transporter
COG id: COG0659
COG function: function code P; Sulfate permease and related transporters (MFS superfamily)
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 STAS domain
Homologues:
Organism=Homo sapiens, GI262206105, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33
Organism=Homo sapiens, GI262206075, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33
Organism=Homo sapiens, GI262206069, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33
Organism=Homo sapiens, GI262206063, Length=503, Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33
Organism=Homo sapiens, GI4557535, Length=489, Percent_Identity=23.1083844580777, Blast_Score=89, Evalue=1e-17
Organism=Homo sapiens, GI94721257, Length=482, Percent_Identity=23.2365145228216, Blast_Score=85, Evalue=2e-16
Organism=Homo sapiens, GI94721259, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16
Organism=Homo sapiens, GI94721253, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16
Organism=Homo sapiens, GI94721255, Length=482, Percent_Identity=23.2365145228216, Blast_Score=84, Evalue=3e-16
Organism=Homo sapiens, GI39752683, Length=389, Percent_Identity=22.879177377892, Blast_Score=80, Evalue=6e-15
Organism=Homo sapiens, GI45827800, Length=389, Percent_Identity=22.879177377892, Blast_Score=79, Evalue=8e-15
Organism=Homo sapiens, GI4505697, Length=433, Percent_Identity=23.094688221709, Blast_Score=79, Evalue=9e-15
Organism=Homo sapiens, GI45827802, Length=402, Percent_Identity=21.8905472636816, Blast_Score=79, Evalue=9e-15
Organism=Homo sapiens, GI269784651, Length=379, Percent_Identity=23.2189973614776, Blast_Score=77, Evalue=6e-14
Organism=Homo sapiens, GI20336282, Length=510, Percent_Identity=21.5686274509804, Blast_Score=76, Evalue=8e-14
Organism=Homo sapiens, GI16306483, Length=510, Percent_Identity=21.5686274509804, Blast_Score=76, Evalue=8e-14
Organism=Homo sapiens, GI47131207, Length=526, Percent_Identity=23.7642585551331, Blast_Score=75, Evalue=2e-13
Organism=Homo sapiens, GI20336272, Length=526, Percent_Identity=23.7642585551331, Blast_Score=75, Evalue=2e-13
Organism=Homo sapiens, GI100913030, Length=478, Percent_Identity=23.0125523012552, Blast_Score=72, Evalue=1e-12
Organism=Escherichia coli, GI87081859, Length=550, Percent_Identity=100, Blast_Score=1085, Evalue=0.0
Organism=Caenorhabditis elegans, GI17566848, Length=569, Percent_Identity=24.780316344464, Blast_Score=102, Evalue=5e-22
Organism=Caenorhabditis elegans, GI86564196, Length=445, Percent_Identity=24.2696629213483, Blast_Score=72, Evalue=5e-13
Organism=Caenorhabditis elegans, GI193203292, Length=384, Percent_Identity=23.1770833333333, Blast_Score=69, Evalue=6e-12
Organism=Saccharomyces cerevisiae, GI6323121, Length=456, Percent_Identity=23.9035087719298, Blast_Score=91, Evalue=6e-19
Organism=Saccharomyces cerevisiae, GI6325260, Length=551, Percent_Identity=22.3230490018149, Blast_Score=86, Evalue=1e-17
Organism=Saccharomyces cerevisiae, GI6319771, Length=193, Percent_Identity=30.0518134715026, Blast_Score=72, Evalue=3e-13
Organism=Drosophila melanogaster, GI85815873, Length=479, Percent_Identity=25.8872651356994, Blast_Score=118, Evalue=1e-26
Organism=Drosophila melanogaster, GI19922482, Length=423, Percent_Identity=25.531914893617, Blast_Score=117, Evalue=2e-26
Organism=Drosophila melanogaster, GI24663084, Length=443, Percent_Identity=27.765237020316, Blast_Score=116, Evalue=4e-26
Organism=Drosophila melanogaster, GI21357695, Length=443, Percent_Identity=27.765237020316, Blast_Score=116, Evalue=4e-26
Organism=Drosophila melanogaster, GI21358229, Length=463, Percent_Identity=26.5658747300216, Blast_Score=112, Evalue=7e-25
Organism=Drosophila melanogaster, GI24649801, Length=453, Percent_Identity=26.4900662251656, Blast_Score=111, Evalue=1e-24
Organism=Drosophila melanogaster, GI24651449, Length=448, Percent_Identity=24.1071428571429, Blast_Score=103, Evalue=2e-22
Organism=Drosophila melanogaster, GI21358633, Length=434, Percent_Identity=25.8064516129032, Blast_Score=101, Evalue=1e-21
Organism=Drosophila melanogaster, GI24647160, Length=178, Percent_Identity=30.3370786516854, Blast_Score=79, Evalue=9e-15
Organism=Drosophila melanogaster, GI21355087, Length=178, Percent_Identity=30.3370786516854, Blast_Score=79, Evalue=9e-15
Organism=Drosophila melanogaster, GI24666186, Length=442, Percent_Identity=25.3393665158371, Blast_Score=70, Evalue=3e-12
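Each homologue entry above carries the same six fields, so the list can be read into structured records. A minimal sketch, assuming the field order and spelling shown in this record; the regular expression and the function name are hypothetical and not part of any database tool.

```python
import re

# One hit: Organism, GI number, Length, Percent_Identity, Blast_Score, Evalue.
HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in HIT.finditer(text):
        hits.append({
            "organism": " ".join(m["organism"].split()),  # tolerate wrapped names
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI262206105, Length=503, "
    "Percent_Identity=27.037773359841, Blast_Score=141, Evalue=2e-33, "
    "Organism=Escherichia coli, GI87081859, Length=550, "
    "Percent_Identity=100, Blast_Score=1085, Evalue=0.0,"
)
hits = parse_hits(sample)
print(len(hits))            # 2
print(hits[1]["organism"])  # Escherichia coli
```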
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): YCHM_ECO57 (P0AFR3)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: D85700
- PIR: G90842
- RefSeq: NP_287452.1
- RefSeq: NP_309738.1
- ProteinModelPortal: P0AFR3
- SMR: P0AFR3
- EnsemblBacteria: EBESCT00000025139
- EnsemblBacteria: EBESCT00000055341
- GeneID: 913152
- GeneID: 960456
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z1977
- KEGG: ecs:ECs1711
- GeneTree: EBGT00050000010289
- HOGENOM: HBG564176
- OMA: VFLTCFS
- ProtClustDB: PRK11660
- BioCyc: ECOL83334:ECS1711-MONOMER
- InterPro: IPR018045
- InterPro: IPR002645
- InterPro: IPR001902
- InterPro: IPR011547
- Gene3D: G3DSA:3.30.750.24
- TIGRFAMs: TIGR00815
Pfam domain/function: PF01740 STAS; PF00916 Sulfate_transp; SSF52091 STAS
EC number: NA
Molecular weight: Translated: 58386; Mature: 58255
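The translated and mature weights differ by 131, which is consistent with loss of the initiator methionine (the mature sequence starts at residue 2). A short check; the methionine residue mass is the only value not taken from this record.

```python
MET_RESIDUE_MASS = 131.19  # average mass of a Met residue in Da (not from the record)

translated_mw = 58386  # Molecular weight, Translated
mature_mw = 58255      # Molecular weight, Mature

diff = translated_mw - mature_mw
print(diff)                                # 131
print(abs(diff - MET_RESIDUE_MASS) < 1.0)  # True
```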
Theoretical pI: Translated: 8.80; Mature: 8.80
Prosite motif: PS01130 SLC26A; PS50801 STAS
Important sites: NA
Signals:
None
Transmembrane regions:
Cys/Met content:
1.1 %Cys (Translated Protein)
3.6 %Met (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
3.5 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
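These percentages are residue counts divided by sequence length. A minimal sketch of the calculation, with a hypothetical function name; the example uses only the first twelve residues of the protein, so its output is not comparable to the full-length values above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    protein = "".join(protein.split()).upper()
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


prefix = "MPFRALIDACWK"  # first twelve residues of the translated protein
print(residue_percent(prefix, "C"))   # 8.3
print(residue_percent(prefix, "M"))   # 8.3
print(residue_percent(prefix, "CM"))  # 16.7
```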
Secondary structure:
>Translated Secondary Structure MPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAG CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH IVIALTGGSRFSVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIE HEEEEECCCEEECCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH YIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQKVGALFMALPTINVGDAAIG HHCCEEEEECCCCCCEEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCHHHHH IVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYVLADGSQGNG HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEEECCCCCCC IPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHK HHHHHHHHHCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC ANSELVGQGLGNIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAP CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH LLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIVMLLCMSLTVLFDMVIAISV HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH GIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDLESRLEGKRI HHHHHHHHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE VILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFF EEEEECCCCEECCCCHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCCCCCCCEEEC PNRRAAMADL CCCCCCCCCH >Mature Secondary Structure PFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAG CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH IVIALTGGSRFSVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIE HEEEEECCCEEECCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH YIPVSVTLGFTSGIGITIGTMQIKDFLGLQMAHVPEHYLQKVGALFMALPTINVGDAAIG HHCCEEEEECCCCCCEEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCHHHHH IVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYVLADGSQGNG HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEEECCCCCCC IPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHK HHHHHHHHHCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC ANSELVGQGLGNIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAP CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH LLSWLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIVMLLCMSLTVLFDMVIAISV 
HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH GIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDLESRLEGKRI HHHHHHHHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE VILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFF EEEEECCCCEECCCCHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCCCCCCCEEEC PNRRAAMADL CCCCCCCCCH
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: Proton [Periplasm]; SO42- [Periplasm] [C]
Specific reaction: Proton [Periplasm] + SO42- [Periplasm] = Proton [Cytoplasm] + SO42- [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796