| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is sat
Identifier: 113476232
GI number: 113476232
Start: 4062549
End: 4063715
Strand: Reverse
Name: sat
Synonym: Tery_2625
Alternate gene names: 113476232
Gene position: 4063715-4062549 (Counterclockwise)
Preceding gene: 113476234
Following gene: 113476231
Centisome position: 52.43
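The gene length and centisome position can be cross-checked from the coordinates above. The sketch below assumes that the centisome value is the gene's 5' coordinate (4063715 for this reverse-strand gene) expressed as a percentage of the chromosome length. That assumption reproduces the listed 52.43, but the record does not state the definition itself. The function names are illustrative.

```python
GENOME_LENGTH = 7_750_108            # chromosome length from the record header
START, END = 4_062_549, 4_063_715    # gene coordinates as listed (reverse strand)


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature in bases."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the 5' end is the larger coordinate (END).
centi = round(centisome(END, GENOME_LENGTH), 2)
print(length, centi)
```

Using the smaller coordinate (4062549) instead gives 52.42, which is why the 5' end is assumed here.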
GC content: 40.27
Gene sequence:
>1167_bases TTGAATCAGCATCTCAATGGTATACCAGCACACGGTGGCCATCTAATTAATCGAATTGCCACACCAGCAGAACGACAAGA ATTTATCGAAAAAGCTGAAAGCTTGCCAAAGATACAACTAGACAAACGGGCTCTTTCAGACTTAGAAATGATAGCTATAG GAGGTTTTAGTCCCCTCAATGGCTTTATGGACAAAGATGACTATGAAAGTGTAGTTGTTGATATGCGTCTTAAAAATGGT CTGCCTTGGAGTATACCTGTAACATTATCAGTATCAGAAGAAGTAGCAGATTCAATAAAAGAAGGAAGCTGGGTAGGTCT GTCTTCTCCAGAAGGAGAATTTGCTGGAGTTTTAGAATTAACTCAAAAGTTCCACTACAACAAAGCTCATGAGGCAATCA ATGTTTATAGTACACAAGAAATCAAACACCCAGGAGTGAAAGTCCTGTATGATGCTGGCCCAGTCAACTTAGCAGGCCCA GTTTGGTTATTAGAACGTCATCCTCACCCATTATTTCCCAAATATCAAATAGATCCAGCTGAGTCCAGAAAACTATTTCA GGAAAAAAATTGGAAGACAATAGTTGGTTTCCAAACCCGTAACCCTATCCATCGTGCCCACGAATATATTCAAAAATGTG CTTTAGAAGTAGTAGACGGTTTATTTTTACATCCTTTGGTTGGTGCTACTAAATCAGATGATATTCCTGCTGATGTCAGG ATGCGTTGCTACGAGATTATGTTAGAAAAATACTTTCCTGAAAATCGTGTCATGATGGCCATTAATCCCTCAGCAATGCG TTATGCTGGCCCGCGGGAGGCAATTTTTCATGCTTTGGTGCGGAAAAACTATGGTTGTACTCACTTCATTGTTGGACGAG ATCATGCTGGAGTTGGAGATTATTATGGAACCTATGATGCTCAATATATTTTTGATGAGTTTGAACCTAGAGAGTTAGAT ATTGTCCCAATGAAGTTTGAACACGCTTTTTACTGTACTCGTACCCAAGGTATGGCCACAAGTAAAACAAGTCCAAGTAC AGGAGAAGAAAGAATTCATCTATCAGGGACAAAAGTGCGAGAAATGTTACGTCGGGGTGAGTTGCCACCACCAGAATTTT CACGACCAGAAGTAGCAGCCGAGTTAGCTAAGGCTATGAAAATCTAA
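The GC content listed above is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the example runs on the first 20 bases only rather than the full 1167-base sequence.

```python
def gc_content(seq: str) -> float:
    """Percent G+C among A/C/G/T bases; whitespace and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence (a space is included to show it is skipped).
example = "TTGAATCAGC ATCTCAATGG"
print(round(gc_content(example), 2))
```

Applied to the full gene sequence, the same function should return a value comparable to the 40.27 reported in the record.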
Upstream 100 bases:
>100_bases ATCAAACTTCAAGATAGATGTACCTCTAAAGATGTAGGATCAACATTGGAAAACTTAATCTTTCTCAACTATAGTCGTCA ATATATAGAGAGGAATCACC
Downstream 100 bases:
>100_bases AGGAAGTTAAGCTAACCCTACTGAGAAATTTTTCAGGTGCCAAAGTGTTTATCATTGCTCTGTTCGGGCAAGGGAAATAC GCAAATATTGGTAATAATGG
Product: sulfate adenylyltransferase
Products: NA
Alternate protein names: ATP-sulfurylase; Sulfate adenylate transferase; SAT
Number of amino acids: Translated: 388; Mature: 388
Protein sequence:
>388_residues MNQHLNGIPAHGGHLINRIATPAERQEFIEKAESLPKIQLDKRALSDLEMIAIGGFSPLNGFMDKDDYESVVVDMRLKNG LPWSIPVTLSVSEEVADSIKEGSWVGLSSPEGEFAGVLELTQKFHYNKAHEAINVYSTQEIKHPGVKVLYDAGPVNLAGP VWLLERHPHPLFPKYQIDPAESRKLFQEKNWKTIVGFQTRNPIHRAHEYIQKCALEVVDGLFLHPLVGATKSDDIPADVR MRCYEIMLEKYFPENRVMMAINPSAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGDYYGTYDAQYIFDEFEPRELD IVPMKFEHAFYCTRTQGMATSKTSPSTGEERIHLSGTKVREMLRRGELPPPEFSRPEVAAELAKAMKI
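The protein sequence corresponds to a codon-by-codon translation of the gene sequence: 1167 bases give 389 codons, the last of which (TAA) is a stop, leaving 388 residues. The sketch below uses the standard codon assignments and assumes that the first codon (TTG here) is read as methionine because it acts as the start codon. The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and is read as Met.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate_cds("TTGAATCAGCATCTCAATGGT"))  # MNQHLNG
```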
Sequences:
>Translated_388_residues MNQHLNGIPAHGGHLINRIATPAERQEFIEKAESLPKIQLDKRALSDLEMIAIGGFSPLNGFMDKDDYESVVVDMRLKNG LPWSIPVTLSVSEEVADSIKEGSWVGLSSPEGEFAGVLELTQKFHYNKAHEAINVYSTQEIKHPGVKVLYDAGPVNLAGP VWLLERHPHPLFPKYQIDPAESRKLFQEKNWKTIVGFQTRNPIHRAHEYIQKCALEVVDGLFLHPLVGATKSDDIPADVR MRCYEIMLEKYFPENRVMMAINPSAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGDYYGTYDAQYIFDEFEPRELD IVPMKFEHAFYCTRTQGMATSKTSPSTGEERIHLSGTKVREMLRRGELPPPEFSRPEVAAELAKAMKI
>Mature_388_residues MNQHLNGIPAHGGHLINRIATPAERQEFIEKAESLPKIQLDKRALSDLEMIAIGGFSPLNGFMDKDDYESVVVDMRLKNG LPWSIPVTLSVSEEVADSIKEGSWVGLSSPEGEFAGVLELTQKFHYNKAHEAINVYSTQEIKHPGVKVLYDAGPVNLAGP VWLLERHPHPLFPKYQIDPAESRKLFQEKNWKTIVGFQTRNPIHRAHEYIQKCALEVVDGLFLHPLVGATKSDDIPADVR MRCYEIMLEKYFPENRVMMAINPSAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGDYYGTYDAQYIFDEFEPRELD IVPMKFEHAFYCTRTQGMATSKTSPSTGEERIHLSGTKVREMLRRGELPPPEFSRPEVAAELAKAMKI
Specific function: Unknown
COG id: COG2046
COG function: function code P; ATP sulfurylase (sulfate adenylyltransferase)
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sulfate adenylyltransferase family
Homologues:
Organism=Homo sapiens, GI34447231, Length=381, Percent_Identity=29.3963254593176, Blast_Score=140, Evalue=2e-33
Organism=Homo sapiens, GI46094058, Length=405, Percent_Identity=27.9012345679012, Blast_Score=138, Evalue=9e-33
Organism=Homo sapiens, GI62912492, Length=386, Percent_Identity=28.7564766839378, Blast_Score=138, Evalue=1e-32
Organism=Caenorhabditis elegans, GI17542422, Length=395, Percent_Identity=29.873417721519, Blast_Score=155, Evalue=5e-38
Organism=Saccharomyces cerevisiae, GI6322469, Length=390, Percent_Identity=37.9487179487179, Blast_Score=242, Evalue=6e-65
Organism=Drosophila melanogaster, GI24667032, Length=391, Percent_Identity=31.4578005115089, Blast_Score=152, Evalue=4e-37
Organism=Drosophila melanogaster, GI24667028, Length=391, Percent_Identity=31.4578005115089, Blast_Score=152, Evalue=4e-37
Organism=Drosophila melanogaster, GI24667036, Length=391, Percent_Identity=31.4578005115089, Blast_Score=152, Evalue=4e-37
Organism=Drosophila melanogaster, GI24667040, Length=391, Percent_Identity=31.4578005115089, Blast_Score=152, Evalue=4e-37
Organism=Drosophila melanogaster, GI116007838, Length=391, Percent_Identity=31.4578005115089, Blast_Score=152, Evalue=5e-37
Organism=Drosophila melanogaster, GI24667044, Length=391, Percent_Identity=31.4578005115089, Blast_Score=152, Evalue=5e-37
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): SAT_TRIEI (Q111K4)
Other databases:
- EMBL: CP000393
- RefSeq: YP_722293.1
- ProteinModelPortal: Q111K4
- SMR: Q111K4
- STRING: Q111K4
- GeneID: 4245350
- GenomeReviews: CP000393_GR
- KEGG: ter:Tery_2625
- NMPDR: fig|203124.1.peg.1094
- eggNOG: COG2046
- HOGENOM: HBG480761
- OMA: RMESYEV
- PhylomeDB: Q111K4
- ProtClustDB: PRK04149
- BioCyc: TERY203124:TERY_2625-MONOMER
- HAMAP: MF_00066
- InterPro: IPR015947
- InterPro: IPR014729
- InterPro: IPR020792
- InterPro: IPR002650
- Gene3D: G3DSA:3.40.50.620
- TIGRFAMs: TIGR00339
Pfam domain/function: PF01747 ATP-sulfurylase; SSF88697 PUA-like
EC number: 2.7.7.4
Molecular weight: Translated: 43906; Mature: 43906
Theoretical pI: Translated: 6.62; Mature: 6.62
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
3.4 %Met (Translated Protein)
4.4 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
3.4 %Met (Mature Protein)
4.4 %Cys+Met (Mature Protein)
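The Cys/Met percentages follow directly from residue counts in the 388-residue protein. A minimal sketch of the calculation, using the protein sequence from this record, is shown below; the helper name `residue_percent` is hypothetical.

```python
# Protein sequence as listed in this record (388 residues).
PROTEIN = (
    "MNQHLNGIPAHGGHLINRIATPAERQEFIEKAESLPKIQLDKRALSDLEMIAIGGFSPLNGFMDKDDYESVVVDMRLKNG"
    "LPWSIPVTLSVSEEVADSIKEGSWVGLSSPEGEFAGVLELTQKFHYNKAHEAINVYSTQEIKHPGVKVLYDAGPVNLAGP"
    "VWLLERHPHPLFPKYQIDPAESRKLFQEKNWKTIVGFQTRNPIHRAHEYIQKCALEVVDGLFLHPLVGATKSDDIPADVR"
    "MRCYEIMLEKYFPENRVMMAINPSAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGDYYGTYDAQYIFDEFEPRELD"
    "IVPMKFEHAFYCTRTQGMATSKTSPSTGEERIHLSGTKVREMLRRGELPPPEFSRPEVAAELAKAMKI"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


cys = round(residue_percent(PROTEIN, "C"), 1)
met = round(residue_percent(PROTEIN, "M"), 1)
cys_met = round(residue_percent(PROTEIN, "CM"), 1)
print(cys, met, cys_met)
```

The translated and mature proteins are identical in this record, so one calculation covers both sets of values.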
Secondary structure:
>Translated Secondary Structure
MNQHLNGIPAHGGHLINRIATPAERQEFIEKAESLPKIQLDKRALSDLEMIAIGGFSPLN
CCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHCCCCEEEHHHHHCCCEEEEECCCCCCC
GFMDKDDYESVVVDMRLKNGLPWSIPVTLSVSEEVADSIKEGSWVGLSSPEGEFAGVLEL
CCCCCCCHHHEEEEEECCCCCCEEEEEEEEECHHHHHHHHCCCEECCCCCCCHHHHHHHH
TQKFHYNKAHEAINVYSTQEIKHPGVKVLYDAGPVNLAGPVWLLERHPHPLFPKYQIDPA
HHHHHCCHHHHHEEECCHHHHCCCCCEEEEECCCCCCCCCEEEEECCCCCCCCCEECCHH
ESRKLFQEKNWKTIVGFQTRNPIHRAHEYIQKCALEVVDGLFLHPLVGATKSDDIPADVR
HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH
MRCYEIMLEKYFPENRVMMAINPSAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGD
HHHHHHHHHHHCCCCCEEEEECCHHHHCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCC
YYGTYDAQYIFDEFEPRELDIVPMKFEHAFYCTRTQGMATSKTSPSTGEERIHLSGTKVR
CCCCCCHHHHHCCCCCCCEEEEEEEECCEEEEECCCCCCCCCCCCCCCCCEEEECHHHHH
EMLRRGELPPPEFSRPEVAAELAKAMKI
HHHHCCCCCCCCCCCHHHHHHHHHHHCC
>Mature Secondary Structure
MNQHLNGIPAHGGHLINRIATPAERQEFIEKAESLPKIQLDKRALSDLEMIAIGGFSPLN
CCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHCCCCEEEHHHHHCCCEEEEECCCCCCC
GFMDKDDYESVVVDMRLKNGLPWSIPVTLSVSEEVADSIKEGSWVGLSSPEGEFAGVLEL
CCCCCCCHHHEEEEEECCCCCCEEEEEEEEECHHHHHHHHCCCEECCCCCCCHHHHHHHH
TQKFHYNKAHEAINVYSTQEIKHPGVKVLYDAGPVNLAGPVWLLERHPHPLFPKYQIDPA
HHHHHCCHHHHHEEECCHHHHCCCCCEEEEECCCCCCCCCEEEEECCCCCCCCCEECCHH
ESRKLFQEKNWKTIVGFQTRNPIHRAHEYIQKCALEVVDGLFLHPLVGATKSDDIPADVR
HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH
MRCYEIMLEKYFPENRVMMAINPSAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGD
HHHHHHHHHHHCCCCCEEEEECCHHHHCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCC
YYGTYDAQYIFDEFEPRELDIVPMKFEHAFYCTRTQGMATSKTSPSTGEERIHLSGTKVR
CCCCCCHHHHHCCCCCCCEEEEEEEECCEEEEECCCCCCCCCCCCCCCCCEEEECHHHHH
EMLRRGELPPPEFSRPEVAAELAKAMKI
HHHHCCCCCCCCCCCHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA