Definition Geobacter sulfurreducens PCA chromosome, complete genome.
Accession NC_002939
Length 3,814,139

Click here to switch to the map view.

The map label for this gene is ychM [C]

Identifier: 39997410

GI number: 39997410

Start: 2529292

End: 2531064

Strand: Direct

Name: ychM [C]

Synonym: GSU2312

Alternate gene names: 39997410

Gene position: 2529292-2531064 (Clockwise)

Preceding gene: 39997409

Following gene: 39997418

Centisome position: 66.31

GC content: 61.87

Gene sequence:

>1773_bases
ATGAGGTTGACGCCCTCCTCTTTGCTGCCCGAATGGCTCCGCTCCTATCGTCCGGCCGATCTACTCCCGGATCTGGCGGC
AGGCGCAGTGGTTGCGGTGATACTGGCCCCCCAGGGAATGGCCTATGCGCTGCTTGCAGGGCTTCCCCCCATCATGGGGC
TTTATGCTGCTACGGTGCCGCTGCTGGCCTATGCCCTGGCCGGGTCGTCGCGCCACCTGTCCGTGGGACCCGTTGCCATC
GTATCGCTGCTTGTGCACGTAGCCTGCAGCAAGGTTGCCCACGCGGGTTCAGCGAGCTATGTGTCCGCAGCCCTGCAACT
TGCCCTACTGACAGGTGTGCTGCAACTGCTTTTGGGAACCGTCCGGGCCGGTTTCATGGTCAACTTCCTCTCCCGGGCCG
CCATCGGAGGGTTCACCTCGGCGGCGGCGCTTCTCATCAGCCTGAGCCAGTTTAAGAACCTGCTTGGAATATCCGGCGAC
GGCGGCGAGTCCGCTCTGGAGCTGGCCGCCGGCGTGGTCCGGAACATTGGGACGCTCCACCTCCTGACCAGCGTAATGGG
GCTGGCGGCCATCTGCATGCTGCTTCTCCTGCAACGGTTCGCGCCCCGCTTTCCCGCTCCGCTGGCGGCAATCGTCCTCG
GCATTCCGCTGACGGCCCTTTTGCACCTGGATCAGGCAGGGGTCAGGACTGTCGGTGATCTTCCCCATGGGCTTCCCCCC
CTTTCCCTGCCGCCATTCGCCGCGGATCAAATACTTACGCTCCTGCCGGCCGCCGTGACCATCGCCCTGATCGGCTATCT
GGAATCATTTGCCGTTGCCGGTCTCATTGCCGACCGGGAAAAATACCCGATCTACCCGAACCGTGAACTGGTCGGACTCG
GCATTGCCAATGTGGCTGCGGCATTTTTTTCAGGCTATCCGGTCACCGGCGGCTTTTCCCGCACCGCGGTCAACCATCGG
GCCGGTGCCAGAACAGGCCTGGCCGGCATGATTACGGCAACTCTCATCGGCATCATACTGCTTCACTTCACTCACCTCTT
CCACTACCTTCCAAAAACGATCCTGGCTGCAATCGTCATTGTGGCCGTTGCCGGCCTGGTGGAGGCAGCCGAAGCCCGCT
ACCTTTTTCGGGTGAAGCCCAGCGACGGCTACACGTTTGTTCTGACGTTCCTGGTTACGCTCGGTTTCGGCGTGGAGGCA
GGCATCGTAGCGGGCGTCATCTTCTCGCTGCTGGTTTTCATATGGCGGAGTGCCCATCCCCACATCGCCGAACTGGGGTG
GCTTGAAGAGGAAGGGGTCTTCCGTAACATCCGCCGCTACCCTCATGCCGTTGTGCCTCGCGGCATGCTGCTCGTGCGGG
TCGACGCTTCCCTCTACTTCGCCAACATGGCGTTTGTAGGGGACTGGCTGCGGGCTACCCTAGCAGAGCGGGCGGATGTG
CGCCAAATCATATTCGATCTCTCGGGGGTCAACGATATGGATGCGGTAGCGTTGGCGGCACTGGAGGTGATCATCGAAGG
CCACGGGGAAAGGGGAATTGTCGTGGCATTCGCCGGCATGAAGGGGCCGGTCCGGGATCTGGCCCAACGGGCCGGCTGGC
AGGAACGATATGGGAACCTGATCAGCTTTCTTTCACTGAACCAAGCGGTCCGACAGATGTCGACGGAAGATATGATCCTG
GCTGGACTCCACAGCAAGGAGAGAGAGTCGGAGACATGCAGCGTGCCCGCTACGCGTCCCACCGGCTCGACCAATCATGG
TGATCCCGCCTGA

Upstream 100 bases:

>100_bases
CCGGAACTGTTGCGGGTGACCCCTTGGGTGGTTATCCTGCCCCTGTCGGCGTTGCTGATCGGTTTACTGGTCTGGCTGGA
GCGGGCCGGTCTGTGACTCC

Downstream 100 bases:

>100_bases
ACCACGCGGTCCCGTCCTGACCTTTTAGCCCGATACAAGGCATCATCGGCGGCAGCCAGCAGTTCGTCCAGACCGGCGGT
GCCGTCGCACGCGGCAACAC

Product: sulfate transporter family protein

Products: Proton [Cytoplasm]; SO42- [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 590; Mature: 590

Protein sequence:

>590_residues
MRLTPSSLLPEWLRSYRPADLLPDLAAGAVVAVILAPQGMAYALLAGLPPIMGLYAATVPLLAYALAGSSRHLSVGPVAI
VSLLVHVACSKVAHAGSASYVSAALQLALLTGVLQLLLGTVRAGFMVNFLSRAAIGGFTSAAALLISLSQFKNLLGISGD
GGESALELAAGVVRNIGTLHLLTSVMGLAAICMLLLLQRFAPRFPAPLAAIVLGIPLTALLHLDQAGVRTVGDLPHGLPP
LSLPPFAADQILTLLPAAVTIALIGYLESFAVAGLIADREKYPIYPNRELVGLGIANVAAAFFSGYPVTGGFSRTAVNHR
AGARTGLAGMITATLIGIILLHFTHLFHYLPKTILAAIVIVAVAGLVEAAEARYLFRVKPSDGYTFVLTFLVTLGFGVEA
GIVAGVIFSLLVFIWRSAHPHIAELGWLEEEGVFRNIRRYPHAVVPRGMLLVRVDASLYFANMAFVGDWLRATLAERADV
RQIIFDLSGVNDMDAVALAALEVIIEGHGERGIVVAFAGMKGPVRDLAQRAGWQERYGNLISFLSLNQAVRQMSTEDMIL
AGLHSKERESETCSVPATRPTGSTNHGDPA

Sequences:

>Translated_590_residues
MRLTPSSLLPEWLRSYRPADLLPDLAAGAVVAVILAPQGMAYALLAGLPPIMGLYAATVPLLAYALAGSSRHLSVGPVAI
VSLLVHVACSKVAHAGSASYVSAALQLALLTGVLQLLLGTVRAGFMVNFLSRAAIGGFTSAAALLISLSQFKNLLGISGD
GGESALELAAGVVRNIGTLHLLTSVMGLAAICMLLLLQRFAPRFPAPLAAIVLGIPLTALLHLDQAGVRTVGDLPHGLPP
LSLPPFAADQILTLLPAAVTIALIGYLESFAVAGLIADREKYPIYPNRELVGLGIANVAAAFFSGYPVTGGFSRTAVNHR
AGARTGLAGMITATLIGIILLHFTHLFHYLPKTILAAIVIVAVAGLVEAAEARYLFRVKPSDGYTFVLTFLVTLGFGVEA
GIVAGVIFSLLVFIWRSAHPHIAELGWLEEEGVFRNIRRYPHAVVPRGMLLVRVDASLYFANMAFVGDWLRATLAERADV
RQIIFDLSGVNDMDAVALAALEVIIEGHGERGIVVAFAGMKGPVRDLAQRAGWQERYGNLISFLSLNQAVRQMSTEDMIL
AGLHSKERESETCSVPATRPTGSTNHGDPA
>Mature_590_residues
MRLTPSSLLPEWLRSYRPADLLPDLAAGAVVAVILAPQGMAYALLAGLPPIMGLYAATVPLLAYALAGSSRHLSVGPVAI
VSLLVHVACSKVAHAGSASYVSAALQLALLTGVLQLLLGTVRAGFMVNFLSRAAIGGFTSAAALLISLSQFKNLLGISGD
GGESALELAAGVVRNIGTLHLLTSVMGLAAICMLLLLQRFAPRFPAPLAAIVLGIPLTALLHLDQAGVRTVGDLPHGLPP
LSLPPFAADQILTLLPAAVTIALIGYLESFAVAGLIADREKYPIYPNRELVGLGIANVAAAFFSGYPVTGGFSRTAVNHR
AGARTGLAGMITATLIGIILLHFTHLFHYLPKTILAAIVIVAVAGLVEAAEARYLFRVKPSDGYTFVLTFLVTLGFGVEA
GIVAGVIFSLLVFIWRSAHPHIAELGWLEEEGVFRNIRRYPHAVVPRGMLLVRVDASLYFANMAFVGDWLRATLAERADV
RQIIFDLSGVNDMDAVALAALEVIIEGHGERGIVVAFAGMKGPVRDLAQRAGWQERYGNLISFLSLNQAVRQMSTEDMIL
AGLHSKERESETCSVPATRPTGSTNHGDPA

Specific function: Expression in E.coli induces sulfate uptake during early-to mid-log phase growth. Uptake is maximal at pH 6.0, is sulfate-specific, requires E.coli CysA and the transmembrane segment but not the STAS domain of the protein [H]

COG id: COG0659

COG function: function code P; Sulfate permease and related transporters (MFS superfamily)

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 STAS domain [H]

Homologues:

Organism=Homo sapiens, GI45827800, Length=495, Percent_Identity=30.5050505050505, Blast_Score=232, Evalue=9e-61,
Organism=Homo sapiens, GI39752683, Length=493, Percent_Identity=30.6288032454361, Blast_Score=231, Evalue=1e-60,
Organism=Homo sapiens, GI94721259, Length=480, Percent_Identity=32.0833333333333, Blast_Score=229, Evalue=6e-60,
Organism=Homo sapiens, GI94721255, Length=480, Percent_Identity=32.0833333333333, Blast_Score=229, Evalue=7e-60,
Organism=Homo sapiens, GI94721253, Length=480, Percent_Identity=32.0833333333333, Blast_Score=229, Evalue=8e-60,
Organism=Homo sapiens, GI94721257, Length=480, Percent_Identity=32.0833333333333, Blast_Score=228, Evalue=1e-59,
Organism=Homo sapiens, GI4505697, Length=509, Percent_Identity=31.6306483300589, Blast_Score=214, Evalue=3e-55,
Organism=Homo sapiens, GI4557535, Length=505, Percent_Identity=29.5049504950495, Blast_Score=211, Evalue=2e-54,
Organism=Homo sapiens, GI269784651, Length=492, Percent_Identity=29.4715447154472, Blast_Score=210, Evalue=2e-54,
Organism=Homo sapiens, GI45827802, Length=445, Percent_Identity=31.0112359550562, Blast_Score=209, Evalue=5e-54,
Organism=Homo sapiens, GI47131207, Length=637, Percent_Identity=27.1585557299843, Blast_Score=173, Evalue=4e-43,
Organism=Homo sapiens, GI20336272, Length=637, Percent_Identity=27.1585557299843, Blast_Score=173, Evalue=4e-43,
Organism=Homo sapiens, GI100913030, Length=555, Percent_Identity=25.7657657657658, Blast_Score=166, Evalue=4e-41,
Organism=Homo sapiens, GI16418413, Length=495, Percent_Identity=27.6767676767677, Blast_Score=166, Evalue=8e-41,
Organism=Homo sapiens, GI217272867, Length=495, Percent_Identity=27.6767676767677, Blast_Score=165, Evalue=1e-40,
Organism=Homo sapiens, GI16418457, Length=529, Percent_Identity=25.8979206049149, Blast_Score=144, Evalue=2e-34,
Organism=Homo sapiens, GI301601599, Length=529, Percent_Identity=25.8979206049149, Blast_Score=144, Evalue=2e-34,
Organism=Homo sapiens, GI20336282, Length=517, Percent_Identity=25.1450676982592, Blast_Score=144, Evalue=3e-34,
Organism=Homo sapiens, GI16306483, Length=444, Percent_Identity=26.8018018018018, Blast_Score=144, Evalue=3e-34,
Organism=Homo sapiens, GI262206105, Length=339, Percent_Identity=30.0884955752212, Blast_Score=139, Evalue=6e-33,
Organism=Homo sapiens, GI262206075, Length=339, Percent_Identity=30.0884955752212, Blast_Score=139, Evalue=6e-33,
Organism=Homo sapiens, GI262206069, Length=339, Percent_Identity=30.0884955752212, Blast_Score=139, Evalue=6e-33,
Organism=Homo sapiens, GI262206063, Length=339, Percent_Identity=30.0884955752212, Blast_Score=139, Evalue=6e-33,
Organism=Homo sapiens, GI45827804, Length=257, Percent_Identity=32.295719844358, Blast_Score=128, Evalue=1e-29,
Organism=Homo sapiens, GI65506789, Length=350, Percent_Identity=31.7142857142857, Blast_Score=116, Evalue=6e-26,
Organism=Homo sapiens, GI301601602, Length=226, Percent_Identity=23.8938053097345, Blast_Score=90, Evalue=6e-18,
Organism=Homo sapiens, GI20336274, Length=83, Percent_Identity=48.1927710843374, Blast_Score=73, Evalue=8e-13,
Organism=Escherichia coli, GI87081859, Length=432, Percent_Identity=28.7037037037037, Blast_Score=130, Evalue=2e-31,
Organism=Caenorhabditis elegans, GI17551690, Length=561, Percent_Identity=27.9857397504456, Blast_Score=205, Evalue=5e-53,
Organism=Caenorhabditis elegans, GI17566848, Length=555, Percent_Identity=29.5495495495495, Blast_Score=199, Evalue=3e-51,
Organism=Caenorhabditis elegans, GI86564196, Length=598, Percent_Identity=23.9130434782609, Blast_Score=199, Evalue=4e-51,
Organism=Caenorhabditis elegans, GI17562578, Length=615, Percent_Identity=26.6666666666667, Blast_Score=182, Evalue=5e-46,
Organism=Caenorhabditis elegans, GI86565215, Length=542, Percent_Identity=26.3837638376384, Blast_Score=177, Evalue=1e-44,
Organism=Caenorhabditis elegans, GI86564876, Length=565, Percent_Identity=25.4867256637168, Blast_Score=155, Evalue=4e-38,
Organism=Caenorhabditis elegans, GI193203292, Length=501, Percent_Identity=26.9461077844311, Blast_Score=155, Evalue=5e-38,
Organism=Caenorhabditis elegans, GI86565213, Length=258, Percent_Identity=29.0697674418605, Blast_Score=104, Evalue=1e-22,
Organism=Caenorhabditis elegans, GI86565209, Length=240, Percent_Identity=25.8333333333333, Blast_Score=84, Evalue=2e-16,
Organism=Caenorhabditis elegans, GI86565211, Length=240, Percent_Identity=25.8333333333333, Blast_Score=84, Evalue=2e-16,
Organism=Saccharomyces cerevisiae, GI6325260, Length=574, Percent_Identity=24.5644599303136, Blast_Score=164, Evalue=4e-41,
Organism=Saccharomyces cerevisiae, GI6319771, Length=468, Percent_Identity=25.8547008547009, Blast_Score=161, Evalue=3e-40,
Organism=Saccharomyces cerevisiae, GI6323121, Length=477, Percent_Identity=27.4633123689727, Blast_Score=161, Evalue=3e-40,
Organism=Drosophila melanogaster, GI24649801, Length=554, Percent_Identity=29.0613718411552, Blast_Score=216, Evalue=2e-56,
Organism=Drosophila melanogaster, GI24651449, Length=586, Percent_Identity=26.2798634812287, Blast_Score=194, Evalue=1e-49,
Organism=Drosophila melanogaster, GI19922482, Length=435, Percent_Identity=30.8045977011494, Blast_Score=184, Evalue=2e-46,
Organism=Drosophila melanogaster, GI24663084, Length=535, Percent_Identity=29.5327102803738, Blast_Score=179, Evalue=6e-45,
Organism=Drosophila melanogaster, GI21357695, Length=535, Percent_Identity=29.5327102803738, Blast_Score=179, Evalue=6e-45,
Organism=Drosophila melanogaster, GI85815873, Length=442, Percent_Identity=30.9954751131222, Blast_Score=177, Evalue=2e-44,
Organism=Drosophila melanogaster, GI24666186, Length=526, Percent_Identity=27.1863117870722, Blast_Score=172, Evalue=8e-43,
Organism=Drosophila melanogaster, GI21358229, Length=455, Percent_Identity=30.5494505494506, Blast_Score=165, Evalue=9e-41,
Organism=Drosophila melanogaster, GI24647160, Length=475, Percent_Identity=26.7368421052632, Blast_Score=162, Evalue=5e-40,
Organism=Drosophila melanogaster, GI21355087, Length=475, Percent_Identity=26.7368421052632, Blast_Score=162, Evalue=6e-40,
Organism=Drosophila melanogaster, GI21358633, Length=538, Percent_Identity=25.6505576208178, Blast_Score=134, Evalue=1e-31,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002645
- InterPro:   IPR001902
- InterPro:   IPR011547 [H]

Pfam domain/function: PF01740 STAS; PF00916 Sulfate_transp [H]

EC number: NA

Molecular weight: Translated: 62293; Mature: 62293

Theoretical pI: Translated: 8.14; Mature: 8.14

Prosite motif: PS50801 STAS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.7 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MRLTPSSLLPEWLRSYRPADLLPDLAAGAVVAVILAPQGMAYALLAGLPPIMGLYAATVP
CCCCHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHH
LLAYALAGSSRHLSVGPVAIVSLLVHVACSKVAHAGSASYVSAALQLALLTGVLQLLLGT
HHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
VRAGFMVNFLSRAAIGGFTSAAALLISLSQFKNLLGISGDGGESALELAAGVVRNIGTLH
HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH
LLTSVMGLAAICMLLLLQRFAPRFPAPLAAIVLGIPLTALLHLDQAGVRTVGDLPHGLPP
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCC
LSLPPFAADQILTLLPAAVTIALIGYLESFAVAGLIADREKYPIYPNRELVGLGIANVAA
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCHHHHHH
AFFSGYPVTGGFSRTAVNHRAGARTGLAGMITATLIGIILLHFTHLFHYLPKTILAAIVI
HHHCCCCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
VAVAGLVEAAEARYLFRVKPSDGYTFVLTFLVTLGFGVEAGIVAGVIFSLLVFIWRSAHP
HHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCC
HIAELGWLEEEGVFRNIRRYPHAVVPRGMLLVRVDASLYFANMAFVGDWLRATLAERADV
CHHHHCCCCHHHHHHHHHHCCHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHH
RQIIFDLSGVNDMDAVALAALEVIIEGHGERGIVVAFAGMKGPVRDLAQRAGWQERYGNL
HHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHCCHHHHHHHH
ISFLSLNQAVRQMSTEDMILAGLHSKERESETCSVPATRPTGSTNHGDPA
HHHHHHHHHHHHHCCCHHHHHCCCCHHCCCCCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MRLTPSSLLPEWLRSYRPADLLPDLAAGAVVAVILAPQGMAYALLAGLPPIMGLYAATVP
CCCCHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHH
LLAYALAGSSRHLSVGPVAIVSLLVHVACSKVAHAGSASYVSAALQLALLTGVLQLLLGT
HHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
VRAGFMVNFLSRAAIGGFTSAAALLISLSQFKNLLGISGDGGESALELAAGVVRNIGTLH
HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH
LLTSVMGLAAICMLLLLQRFAPRFPAPLAAIVLGIPLTALLHLDQAGVRTVGDLPHGLPP
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCCCC
LSLPPFAADQILTLLPAAVTIALIGYLESFAVAGLIADREKYPIYPNRELVGLGIANVAA
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCHHHHHH
AFFSGYPVTGGFSRTAVNHRAGARTGLAGMITATLIGIILLHFTHLFHYLPKTILAAIVI
HHHCCCCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
VAVAGLVEAAEARYLFRVKPSDGYTFVLTFLVTLGFGVEAGIVAGVIFSLLVFIWRSAHP
HHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCC
HIAELGWLEEEGVFRNIRRYPHAVVPRGMLLVRVDASLYFANMAFVGDWLRATLAERADV
CHHHHCCCCHHHHHHHHHHCCHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHH
RQIIFDLSGVNDMDAVALAALEVIIEGHGERGIVVAFAGMKGPVRDLAQRAGWQERYGNL
HHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHCCHHHHHHHH
ISFLSLNQAVRQMSTEDMILAGLHSKERESETCSVPATRPTGSTNHGDPA
HHHHHHHHHHHHHCCCHHHHHCCCCHHCCCCCCCCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Proton [Periplasm]; SO42- [Periplasm] [C]

Specific reaction: Proton [Periplasm] + SO42- [Periplasm] = Proton [Cytoplasm] + SO42- [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9634230; 12218036 [H]