Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is hvsT [C]
Identifier: 15888861
GI number: 15888861
Start: 1528866
End: 1530479
Strand: Direct
Name: hvsT [C]
Synonym: Atu1539
Alternate gene names: 15888861
Gene position: 1528866-1530479 (Clockwise)
Preceding gene: 159184804
Following gene: 15888862
Centisome position: 53.8
GC content: 61.59
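The position fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Neither convention is stated in the record. The helper names (`gene_length`, `centisome`, `gc_percent`) are hypothetical.

```python
GENOME_LENGTH = 2_841_580          # chromosome length from this record
START, END = 1_528_866, 1_530_479  # gene coordinates from this record


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome (assumed definition)."""
    return round(100 * start / genome_length, 1)


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(b in "GC" for b in bases)
    return round(100 * gc / len(bases), 2)


print(gene_length(START, END))          # 1614, matching the ">1614_bases" header
print(centisome(START, GENOME_LENGTH))  # 53.8, matching the centisome field
```

Passing the gene sequence from this record to `gc_percent` (spaces are ignored) should give a value comparable to the GC content field, provided the record used the same definition.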
Gene sequence:
>1614_bases ATGGGTAACAACGGGCGGGATATTCTCGCAGGGCTTTCCGTCGCCGGGCTGATGTTACCGGAGGCGATCGCTTATTCCGG CATCGCCGGCGTTCCGCCCCAGCATGCATTATATGCGGCAATGGCAGGTTGTCTTGTTTATGCCCTTCTTGGCCAGAGCC GTTTCGCCATCATCTCGCCAACCTCATCATCCGCCGCGATCCTTGCAGCCATGCTGGCCGCGCTCGTGCCGCAGCCCGGA CAGAAGATGCTGCTGGTCGCCGTCGCGGTGTTTCTTGTCGGGCTGTTTTTCCTTGCCGCCGGCACATTGCGGCTGGGCGC CCTGTCGAGCATCATTTCGAGGCCCGTGCTGCGCGGCTTTGCATTCGGACTGGCGATCCTCATTTCGCTCAAACAGTTTC CCGCGATCTTCGGCATGCCGCAAGCGGGAGCCGGCACATTCGAGGCGATCTTGCAGATATTGACCAATCCCGGCCAGTGG AACGGGTTCAGCCTGTCGATCGGCATCGCCGCCCTTATCCTTCTGCTTTTCGCAAGACGTTATCCCCAGATTCCCGGAAG CCTCATCATCATAGCGCTGGCGATCCCCATCTCCGTCATGTTCGATTTGCAGCAACGCGGTGTCGATGTCGTCGGCCCAA TCGATCTGTCCGGCATATGGGGAAGTGTCACCACGCTCTCGCTCGATGAACTGGCGCATGTGGCGCGTTTCGCCCCGCCT CTGGTGCTGATCCTGTTTGCCGAATCCTGGGGAACGATACGGGGCCTGTCGCTACGGCACGGGGAAGACGTGGATGCCAA TCGTGAACTGAAAACGCTTGGTATTGCCAATGTCGCAAGCGCCGCATTACAGGGAATGCCCGTCGGGGCCGGGTTTTCCG CGGGTGCTGCAAGCGAAGCGGCCAATCCGCGCACGAGAATGGCCTCGGCCATCGCCGCGATCGGGCTTGCCGGCTTCACC TTTGCCGCGGCCGACTGGTTTGCCTATATTCCGCATGCCGCGCTTTCCGCCATCATCATCGTGGCGCTGCTTCACGCGCT GGATCCCTCTCCGTTTTTGAGGCTGTGGCGGCTGCGACAGGACCTCGTGCTTGCCCTGGCGGCAACCGCCGGCGTGCTTT TCCTTGGCGTCCTCAACGGAATGCTGGCAGCAATCGTGCTGTCTTTCGCCGTATTCCTGCAAAGACTTTCCTCCCCGCGC ATCGTGATGCTCGGCCGCCTCGGCGCGAGCCACGACTTCGTGGATGTGAAGCGGCATCCGGATGCCGCGGAGCCGGCCGG CATGCTCGTCCTGCGCCCGGCGCAACCGCTTTTCTTCGGAAATGCCGAGCCGACATTCGCGGAAATCACCCGCCGCATAC TCGCGAGCCCTGACATCAACGCCGTCATCATCAGCCTCGAGGAGACATTTGAACTCGATACGACAGCCCTTGAGGCGCTG CTGGAATTCGACGCCAGCCTGCGCGGGCGCAACATCGGAATTCGCTACGCCCCAATGCACGATGCGGTGCGCGACGTTGT CGCCGCCGGTGGCGGAGACGATCTTTTGCGTCGGGCAAACTACAGCGTTGACGATGCCGTTGCCGCGATGGACGGCCTCA AGGAGGAAAAATGA
Upstream 100 bases:
>100_bases GATCCGCGCTGCCAAGCCACTGGATTTCCATCGAGACTTCACGCTAATCCTGATATTGTCGTGTTTGGCGCGAGAACTCC GGTGAGGATCACGGTGCGGC
Downstream 100 bases:
>100_bases CGCATATTTATGAGCCTCGTCTGTCTCGTATTGCCATCGACAAGTTACGACCGACGCAGATCGCGGTGGGCTTCCGCGAG GTCGAACTGAAAAGAAAGGA
Product: sulfate permease
Products: Proton [Cytoplasm]; SO₄²⁻ [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 537; Mature: 536
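The two residue counts follow from the gene length. The sketch below assumes the 1614-base gene ends in a stop codon (the sequence ends in TGA) and that the "Mature" form differs only by removal of the initiator methionine. The second point is an inference from the sequences listed in this record, and `residue_counts` is a hypothetical helper.

```python
GENE_LENGTH_BASES = 1614  # from the gene sequence header in this record


def residue_counts(n_bases: int, has_stop: bool = True,
                   met_cleaved: bool = True) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1 if has_stop else codons          # stop codon codes for no residue
    mature = translated - 1 if met_cleaved else translated   # assumed Met removal
    return translated, mature


print(residue_counts(GENE_LENGTH_BASES))  # (537, 536)
```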
Protein sequence:
>537_residues MGNNGRDILAGLSVAGLMLPEAIAYSGIAGVPPQHALYAAMAGCLVYALLGQSRFAIISPTSSSAAILAAMLAALVPQPG QKMLLVAVAVFLVGLFFLAAGTLRLGALSSIISRPVLRGFAFGLAILISLKQFPAIFGMPQAGAGTFEAILQILTNPGQW NGFSLSIGIAALILLLFARRYPQIPGSLIIIALAIPISVMFDLQQRGVDVVGPIDLSGIWGSVTTLSLDELAHVARFAPP LVLILFAESWGTIRGLSLRHGEDVDANRELKTLGIANVASAALQGMPVGAGFSAGAASEAANPRTRMASAIAAIGLAGFT FAAADWFAYIPHAALSAIIIVALLHALDPSPFLRLWRLRQDLVLALAATAGVLFLGVLNGMLAAIVLSFAVFLQRLSSPR IVMLGRLGASHDFVDVKRHPDAAEPAGMLVLRPAQPLFFGNAEPTFAEITRRILASPDINAVIISLEETFELDTTALEAL LEFDASLRGRNIGIRYAPMHDAVRDVVAAGGGDDLLRRANYSVDDAVAAMDGLKEEK
Sequences:
>Translated_537_residues MGNNGRDILAGLSVAGLMLPEAIAYSGIAGVPPQHALYAAMAGCLVYALLGQSRFAIISPTSSSAAILAAMLAALVPQPG QKMLLVAVAVFLVGLFFLAAGTLRLGALSSIISRPVLRGFAFGLAILISLKQFPAIFGMPQAGAGTFEAILQILTNPGQW NGFSLSIGIAALILLLFARRYPQIPGSLIIIALAIPISVMFDLQQRGVDVVGPIDLSGIWGSVTTLSLDELAHVARFAPP LVLILFAESWGTIRGLSLRHGEDVDANRELKTLGIANVASAALQGMPVGAGFSAGAASEAANPRTRMASAIAAIGLAGFT FAAADWFAYIPHAALSAIIIVALLHALDPSPFLRLWRLRQDLVLALAATAGVLFLGVLNGMLAAIVLSFAVFLQRLSSPR IVMLGRLGASHDFVDVKRHPDAAEPAGMLVLRPAQPLFFGNAEPTFAEITRRILASPDINAVIISLEETFELDTTALEAL LEFDASLRGRNIGIRYAPMHDAVRDVVAAGGGDDLLRRANYSVDDAVAAMDGLKEEK >Mature_536_residues GNNGRDILAGLSVAGLMLPEAIAYSGIAGVPPQHALYAAMAGCLVYALLGQSRFAIISPTSSSAAILAAMLAALVPQPGQ KMLLVAVAVFLVGLFFLAAGTLRLGALSSIISRPVLRGFAFGLAILISLKQFPAIFGMPQAGAGTFEAILQILTNPGQWN GFSLSIGIAALILLLFARRYPQIPGSLIIIALAIPISVMFDLQQRGVDVVGPIDLSGIWGSVTTLSLDELAHVARFAPPL VLILFAESWGTIRGLSLRHGEDVDANRELKTLGIANVASAALQGMPVGAGFSAGAASEAANPRTRMASAIAAIGLAGFTF AAADWFAYIPHAALSAIIIVALLHALDPSPFLRLWRLRQDLVLALAATAGVLFLGVLNGMLAAIVLSFAVFLQRLSSPRI VMLGRLGASHDFVDVKRHPDAAEPAGMLVLRPAQPLFFGNAEPTFAEITRRILASPDINAVIISLEETFELDTTALEALL EFDASLRGRNIGIRYAPMHDAVRDVVAAGGGDDLLRRANYSVDDAVAAMDGLKEEK
Specific function: Expression in E. coli induces sulfate uptake during early- to mid-log phase growth. Uptake is maximal at pH 6.0, is sulfate-specific, and requires E. coli CysA and the transmembrane segment, but not the STAS domain, of the protein [H]
COG id: COG0659
COG function: function code P; Sulfate permease and related transporters (MFS superfamily)
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 STAS domain [H]
Homologues:
Organism=Homo sapiens, GI45827800, Length=486, Percent_Identity=25.7201646090535, Blast_Score=154, Evalue=3e-37
Organism=Homo sapiens, GI39752683, Length=486, Percent_Identity=25.7201646090535, Blast_Score=154, Evalue=3e-37
Organism=Homo sapiens, GI269784651, Length=482, Percent_Identity=23.4439834024896, Blast_Score=129, Evalue=6e-30
Organism=Homo sapiens, GI94721259, Length=460, Percent_Identity=27.1739130434783, Blast_Score=128, Evalue=1e-29
Organism=Homo sapiens, GI94721255, Length=460, Percent_Identity=27.3913043478261, Blast_Score=128, Evalue=1e-29
Organism=Homo sapiens, GI94721253, Length=460, Percent_Identity=27.3913043478261, Blast_Score=128, Evalue=1e-29
Organism=Homo sapiens, GI45827802, Length=435, Percent_Identity=26.6666666666667, Blast_Score=127, Evalue=2e-29
Organism=Homo sapiens, GI94721257, Length=460, Percent_Identity=27.1739130434783, Blast_Score=127, Evalue=2e-29
Organism=Homo sapiens, GI47131207, Length=483, Percent_Identity=27.9503105590062, Blast_Score=118, Evalue=2e-26
Organism=Homo sapiens, GI20336272, Length=483, Percent_Identity=27.9503105590062, Blast_Score=118, Evalue=2e-26
Organism=Homo sapiens, GI4557535, Length=475, Percent_Identity=22.3157894736842, Blast_Score=94, Evalue=2e-19
Organism=Homo sapiens, GI262206105, Length=134, Percent_Identity=40.2985074626866, Blast_Score=86, Evalue=8e-17
Organism=Homo sapiens, GI262206075, Length=134, Percent_Identity=40.2985074626866, Blast_Score=86, Evalue=8e-17
Organism=Homo sapiens, GI262206069, Length=134, Percent_Identity=40.2985074626866, Blast_Score=86, Evalue=8e-17
Organism=Homo sapiens, GI262206063, Length=134, Percent_Identity=40.2985074626866, Blast_Score=86, Evalue=8e-17
Organism=Homo sapiens, GI100913030, Length=491, Percent_Identity=21.5885947046843, Blast_Score=81, Evalue=2e-15
Organism=Homo sapiens, GI4505697, Length=478, Percent_Identity=21.7573221757322, Blast_Score=80, Evalue=5e-15
Organism=Homo sapiens, GI16418413, Length=483, Percent_Identity=22.1532091097308, Blast_Score=77, Evalue=3e-14
Organism=Homo sapiens, GI16418457, Length=537, Percent_Identity=20.8566108007449, Blast_Score=77, Evalue=3e-14
Organism=Homo sapiens, GI301601599, Length=537, Percent_Identity=20.8566108007449, Blast_Score=77, Evalue=3e-14
Organism=Homo sapiens, GI217272867, Length=483, Percent_Identity=22.1532091097308, Blast_Score=77, Evalue=5e-14
Organism=Homo sapiens, GI45827804, Length=241, Percent_Identity=27.3858921161826, Blast_Score=74, Evalue=3e-13
Organism=Escherichia coli, GI87081859, Length=546, Percent_Identity=24.3589743589744, Blast_Score=64, Evalue=2e-11
Organism=Caenorhabditis elegans, GI17551690, Length=589, Percent_Identity=23.2597623089983, Blast_Score=121, Evalue=9e-28
Organism=Caenorhabditis elegans, GI17566848, Length=474, Percent_Identity=24.2616033755274, Blast_Score=102, Evalue=4e-22
Organism=Caenorhabditis elegans, GI86564196, Length=485, Percent_Identity=22.0618556701031, Blast_Score=100, Evalue=2e-21
Organism=Caenorhabditis elegans, GI17562578, Length=478, Percent_Identity=24.0585774058577, Blast_Score=99, Evalue=7e-21
Organism=Caenorhabditis elegans, GI86565215, Length=526, Percent_Identity=22.8136882129278, Blast_Score=95, Evalue=1e-19
Organism=Caenorhabditis elegans, GI86564876, Length=511, Percent_Identity=20.5479452054795, Blast_Score=79, Evalue=8e-15
Organism=Saccharomyces cerevisiae, GI6325260, Length=564, Percent_Identity=23.936170212766, Blast_Score=121, Evalue=3e-28
Organism=Saccharomyces cerevisiae, GI6319771, Length=452, Percent_Identity=22.1238938053097, Blast_Score=95, Evalue=3e-20
Organism=Saccharomyces cerevisiae, GI6323121, Length=159, Percent_Identity=27.0440251572327, Blast_Score=65, Evalue=4e-11
Organism=Drosophila melanogaster, GI21358633, Length=432, Percent_Identity=26.1574074074074, Blast_Score=110, Evalue=3e-24
Organism=Drosophila melanogaster, GI24649801, Length=439, Percent_Identity=25.9681093394077, Blast_Score=106, Evalue=5e-23
Organism=Drosophila melanogaster, GI24651449, Length=173, Percent_Identity=28.3236994219653, Blast_Score=82, Evalue=1e-15
Organism=Drosophila melanogaster, GI21358229, Length=163, Percent_Identity=33.7423312883436, Blast_Score=81, Evalue=2e-15
Organism=Drosophila melanogaster, GI85815873, Length=493, Percent_Identity=26.369168356998, Blast_Score=80, Evalue=3e-15
Organism=Drosophila melanogaster, GI21355087, Length=160, Percent_Identity=30.625, Blast_Score=79, Evalue=1e-14
Organism=Drosophila melanogaster, GI24647160, Length=160, Percent_Identity=30.625, Blast_Score=79, Evalue=1e-14
Organism=Drosophila melanogaster, GI19922482, Length=179, Percent_Identity=30.7262569832402, Blast_Score=78, Evalue=2e-14
Organism=Drosophila melanogaster, GI24663084, Length=184, Percent_Identity=30.4347826086957, Blast_Score=75, Evalue=2e-13
Organism=Drosophila melanogaster, GI21357695, Length=184, Percent_Identity=30.4347826086957, Blast_Score=75, Evalue=2e-13
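The homologue entries use a regular `key=value` layout, so they can be read into structured records for sorting or filtering. The following is a minimal sketch, not a tool belonging to the source database: the `Homologue` type and `parse_homologues` function are hypothetical names. It assumes each entry carries the six fields in the order shown above, and it tolerates entries separated by commas, spaces, or line breaks.

```python
import re
from typing import NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...
_RECORD = re.compile(
    r"Organism=([^,]+),\s*GI(\d+),\s*Length=(\d+),\s*"
    r"Percent_Identity=([\d.]+),\s*Blast_Score=(\d+),\s*Evalue=([\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[Homologue]:
    """Extract every homologue entry found in a block of record text."""
    return [
        Homologue(m[1].strip(), m[2], int(m[3]), float(m[4]), int(m[5]), float(m[6]))
        for m in _RECORD.finditer(text)
    ]


# Two entries copied from the list above.
sample = (
    "Organism=Homo sapiens, GI45827800, Length=486, "
    "Percent_Identity=25.7201646090535, Blast_Score=154, Evalue=3e-37, "
    "Organism=Escherichia coli, GI87081859, Length=546, "
    "Percent_Identity=24.3589743589744, Blast_Score=64, Evalue=2e-11,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi, best.blast_score)
```

GI numbers are kept as strings because they are identifiers rather than quantities.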
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002645
- InterPro: IPR001902
- InterPro: IPR011547 [H]
Pfam domain/function: PF01740 STAS; PF00916 Sulfate_transp [H]
EC number: NA
Molecular weight: Translated: 56346; Mature: 56215
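The gap between the two molecular weights is consistent with loss of a single methionine residue. The sketch below checks this, assuming an average methionine residue mass of about 131.19 Da. That is a standard textbook value rather than something stated in this record, and the variable names are hypothetical.

```python
TRANSLATED_MW = 56346     # from this record
MATURE_MW = 56215         # from this record
MET_RESIDUE_MASS = 131.19  # assumed average mass of a Met residue, in Da

difference = TRANSLATED_MW - MATURE_MW
print(difference)  # 131

# The difference matches one Met residue to within rounding of the reported weights.
print(abs(difference - MET_RESIDUE_MASS) < 1.0)  # True
```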
Theoretical pI: Translated: 6.52; Mature: 6.52
Prosite motif: PS50801 STAS
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
2.4 %Met (Mature Protein)
2.6 %Cys+Met (Mature Protein)
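These percentages are residue counts divided by protein length. The sketch below shows the calculation with a hypothetical helper, `residue_percent`. The residue counts in the second half (1 Cys, with 14 Met in the translated form and 13 in the mature form) are inferred from the reported percentages and lengths. They are not stated in the record and should be confirmed against the protein sequence.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100 * count / len(seq), 1)


# Toy sequence: 1 Met and 1 Cys in 10 residues.
print(residue_percent("MCAAAAAAAA", "C"))   # 10.0
print(residue_percent("MCAAAAAAAA", "CM"))  # 20.0

# Consistency check using inferred (assumed) counts for this protein.
print(round(100 * 1 / 537, 1), round(100 * 14 / 537, 1), round(100 * 15 / 537, 1))  # 0.2 2.6 2.8
print(round(100 * 1 / 536, 1), round(100 * 13 / 536, 1), round(100 * 14 / 536, 1))  # 0.2 2.4 2.6
```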
Secondary structure:
>Translated Secondary Structure MGNNGRDILAGLSVAGLMLPEAIAYSGIAGVPPQHALYAAMAGCLVYALLGQSRFAIISP CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECC TSSSAAILAAMLAALVPQPGQKMLLVAVAVFLVGLFFLAAGTLRLGALSSIISRPVLRGF CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH AFGLAILISLKQFPAIFGMPQAGAGTFEAILQILTNPGQWNGFSLSIGIAALILLLFARR HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHH YPQIPGSLIIIALAIPISVMFDLQQRGVDVVGPIDLSGIWGSVTTLSLDELAHVARFAPP CCCCCCHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCHHEEEHHHHHHHHHHCCH LVLILFAESWGTIRGLSLRHGEDVDANRELKTLGIANVASAALQGMPVGAGFSAGAASEA HEEEEECCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC ANPRTRMASAIAAIGLAGFTFAAADWFAYIPHAALSAIIIVALLHALDPSPFLRLWRLRQ CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH DLVLALAATAGVLFLGVLNGMLAAIVLSFAVFLQRLSSPRIVMLGRLGASHDFVDVKRHP HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCEECCCCC DAAEPAGMLVLRPAQPLFFGNAEPTFAEITRRILASPDINAVIISLEETFELDTTALEAL CCCCCCCEEEECCCCCEEECCCCCHHHHHHHHHHCCCCCCEEEEEEHHHHCHHHHHHHHH LEFDASLRGRNIGIRYAPMHDAVRDVVAAGGGDDLLRRANYSVDDAVAAMDGLKEEK HHHHHHCCCCCCCEEECCHHHHHHHHHHCCCCHHHHHHCCCCHHHHHHHHHCCCCCC >Mature Secondary Structure GNNGRDILAGLSVAGLMLPEAIAYSGIAGVPPQHALYAAMAGCLVYALLGQSRFAIISP CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECC TSSSAAILAAMLAALVPQPGQKMLLVAVAVFLVGLFFLAAGTLRLGALSSIISRPVLRGF CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH AFGLAILISLKQFPAIFGMPQAGAGTFEAILQILTNPGQWNGFSLSIGIAALILLLFARR HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHH YPQIPGSLIIIALAIPISVMFDLQQRGVDVVGPIDLSGIWGSVTTLSLDELAHVARFAPP CCCCCCHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCHHEEEHHHHHHHHHHCCH LVLILFAESWGTIRGLSLRHGEDVDANRELKTLGIANVASAALQGMPVGAGFSAGAASEA HEEEEECCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC ANPRTRMASAIAAIGLAGFTFAAADWFAYIPHAALSAIIIVALLHALDPSPFLRLWRLRQ CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH DLVLALAATAGVLFLGVLNGMLAAIVLSFAVFLQRLSSPRIVMLGRLGASHDFVDVKRHP 
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCEECCCCC DAAEPAGMLVLRPAQPLFFGNAEPTFAEITRRILASPDINAVIISLEETFELDTTALEAL CCCCCCCEEEECCCCCEEECCCCCHHHHHHHHHHCCCCCCEEEEEEHHHHCHHHHHHHHH LEFDASLRGRNIGIRYAPMHDAVRDVVAAGGGDDLLRRANYSVDDAVAAMDGLKEEK HHHHHHCCCCCCCEEECCHHHHHHHHHHCCCCHHHHHHCCCCHHHHHHHHHCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: Proton [Periplasm]; SO₄²⁻ [Periplasm] [C]
Specific reaction: Proton [Periplasm] + SO₄²⁻ [Periplasm] = Proton [Cytoplasm] + SO₄²⁻ [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036 [H]