| Definition | Streptococcus pneumoniae D39, complete genome. |
|---|---|
| Accession | NC_008533 |
| Length | 2,046,115 |
The map label for this gene is thiI
Identifier: 116516659
GI number: 116516659
Start: 789665
End: 790879
Strand: Direct
Name: thiI
Synonym: SPD_0777
Alternate gene names: 116516659
Gene position: 789665-790879 (Clockwise)
Preceding gene: 116515406
Following gene: 116515554
Centisome position: 38.59
GC content: 42.96
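The coordinate fields above are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the gene start expressed as a percentage of the genome length; that definition is an inference, chosen because it reproduces the listed 38.59. The function names are illustrative and do not come from the source database. The GC helper is shown on a toy sequence only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Values taken from the record above.
START, END, GENOME_LENGTH = 789665, 790879, 2046115

print(gene_length(START, END))                     # 1215
print(round(centisome(START, GENOME_LENGTH), 2))   # 38.59
print(gc_content("GGCCAATT"))                      # toy sequence: 50.0
```

The computed length of 1215 bases agrees with the `>1215_bases` header of the gene sequence.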
Gene sequence:
>1215_bases
ATGCAGTATTCAGAAATTATGATTCGCTACGGAGAGTTGTCAACCAAGGGTAAAAACCGTATGCGTTTCATCAATAAACT
TCGTAATAATATTTCGGACGTTTTGTCTATCTATAGCCAAGTTAAGGTAACAGCAGATCGCGACCGTGCCCACGCTTACC
TCAATGGAGCTGATTACACAGCAGTTGCAGAATCTCTCAAACAAGTTTTTGGAATTCAAAACTTTTCTCCTGTTTATAAG
GTTGAAAAATCTGTAGAAGTTTTGAAGTCTTCTGTCCAAGAGATTATGCGGGACATCTACAAGGAAGGTATGACCTTTAA
GATTTCTAGCAAGCGTAGCGACCACAACTTTGAACTTGATAGTCGTGAACTCAACCAAACACTTGGAGGGGCTGTATTCG
AAGCCATTCCAAATGTGCAAGTTCAAATGAAAAGTCCTGACATCAATCTTCAGGTGGAGATTCGTGAAGAAGCAGCCTAT
CTTTCTTATGAAACCATTCGTGGGGCTGGTGGTTTGCCAGTTGGAACTTCAGGTAAAGGGATGCTCATGTTGTCAGGAGG
GATTGACTCACCTGTAGCAGGTTATCTTGCTCTTAAGCGTGGGGTGGATATCGAGGCAGTTCACTTTGCTAGTCCACCAT
ATACTAGTCCTGGTGCCCTCAAGAAAGCGCAGGACTTGACTCGTAAATTGACCAAGTTTGGCGGAAATATCCAGTTTATC
GAGGTGCCTTTCACAGAGATTCAAGAGGAAATCAAAGCCAAAGCGCCAGAAGCTTATTTGATGACTCTAACTCGTCGCTT
TATGATGCGGATTACTGACCGTATTCGTGAGGTACGAAATGGTTTGGTTATCATCAATGGGGAAAGTCTAGGTCAAGTAG
CCAGCCAAACCCTTGAAAGTATGAAGGCTATCAATGCTGTTACCAACACTCCCATCATTCGTCCTGTGGTTACCATGGAC
AAGTTGGAAATCATTGACATCGCCCAGGAAATCGATACCTTTGACATTTCAATCCAACCGTTTGAAGACTGTTGTACCAT
TTTTGCACCAGATCGTCCAAAAACAAATCCTAAAATTAAGAATGCGGAGCAGTACGAAGCGCGTATGGATGTTGAAGGCT
TGGTTGAGCGAGCAGTGGCTGGAATCATGATTACTGAAATCACACCTCAAGCCGAAAAAGATGAAGTTGATGACTTGATT
GACAATCTGCTCTAA
Upstream 100 bases:
>100_bases
TTAGCCTAGACTTGGAAAATGATATGAGTCAGGTCGAGCAGTTTTTGACCAAGTTAAAATTGATTTACAATCAAACTAGA
AAAGTAAGATAGGAGCATTC
Downstream 100 bases:
>100_bases
TTCAGAAAATCCAAAAGAATAGCGAAAATCAGTAAAAAAAGTTAGTTTTTTCTCTAAAAACAGGTAAAAAACTAACTTTT
TTTATTTTTATGATATAATG
Product: thiamine biosynthesis protein ThiI
Products: NA
Alternate protein names: Sulfur carrier protein ThiS sulfurtransferase; Thiamine biosynthesis protein thiI; tRNA 4-thiouridine synthase
Number of amino acids: Translated: 404; Mature: 404
Protein sequence:
>404_residues
MQYSEIMIRYGELSTKGKNRMRFINKLRNNISDVLSIYSQVKVTADRDRAHAYLNGADYTAVAESLKQVFGIQNFSPVYK
VEKSVEVLKSSVQEIMRDIYKEGMTFKISSKRSDHNFELDSRELNQTLGGAVFEAIPNVQVQMKSPDINLQVEIREEAAY
LSYETIRGAGGLPVGTSGKGMLMLSGGIDSPVAGYLALKRGVDIEAVHFASPPYTSPGALKKAQDLTRKLTKFGGNIQFI
EVPFTEIQEEIKAKAPEAYLMTLTRRFMMRITDRIREVRNGLVIINGESLGQVASQTLESMKAINAVTNTPIIRPVVTMD
KLEIIDIAQEIDTFDISIQPFEDCCTIFAPDRPKTNPKIKNAEQYEARMDVEGLVERAVAGIMITEITPQAEKDEVDDLI
DNLL
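The relationship between the 1215-base gene sequence and the 404-residue protein (405 codons, the last being a stop codon) can be illustrated by codon-by-codon translation. The following is a minimal sketch that assumes the standard genetic code; the `translate` helper and the table construction are illustrative and are not part of the source database. It is applied only to the first nine and last five codons of the gene sequence listed in this record.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the gene sequence in this record.
print(translate("ATGCAGTATTCAGAAATTATGATTCGC"))  # MQYSEIMIR
# Last 15 bases, ending in the TAA stop codon.
print(translate("GACAATCTGCTCTAA"))              # DNLL
```

Both fragments agree with the start (`MQYSEIMIR`) and end (`DNLL`) of the listed protein sequence.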
Sequences:
>Translated_404_residues
MQYSEIMIRYGELSTKGKNRMRFINKLRNNISDVLSIYSQVKVTADRDRAHAYLNGADYTAVAESLKQVFGIQNFSPVYK
VEKSVEVLKSSVQEIMRDIYKEGMTFKISSKRSDHNFELDSRELNQTLGGAVFEAIPNVQVQMKSPDINLQVEIREEAAY
LSYETIRGAGGLPVGTSGKGMLMLSGGIDSPVAGYLALKRGVDIEAVHFASPPYTSPGALKKAQDLTRKLTKFGGNIQFI
EVPFTEIQEEIKAKAPEAYLMTLTRRFMMRITDRIREVRNGLVIINGESLGQVASQTLESMKAINAVTNTPIIRPVVTMD
KLEIIDIAQEIDTFDISIQPFEDCCTIFAPDRPKTNPKIKNAEQYEARMDVEGLVERAVAGIMITEITPQAEKDEVDDLI
DNLL
>Mature_404_residues
MQYSEIMIRYGELSTKGKNRMRFINKLRNNISDVLSIYSQVKVTADRDRAHAYLNGADYTAVAESLKQVFGIQNFSPVYK
VEKSVEVLKSSVQEIMRDIYKEGMTFKISSKRSDHNFELDSRELNQTLGGAVFEAIPNVQVQMKSPDINLQVEIREEAAY
LSYETIRGAGGLPVGTSGKGMLMLSGGIDSPVAGYLALKRGVDIEAVHFASPPYTSPGALKKAQDLTRKLTKFGGNIQFI
EVPFTEIQEEIKAKAPEAYLMTLTRRFMMRITDRIREVRNGLVIINGESLGQVASQTLESMKAINAVTNTPIIRPVVTMD
KLEIIDIAQEIDTFDISIQPFEDCCTIFAPDRPKTNPKIKNAEQYEARMDVEGLVERAVAGIMITEITPQAEKDEVDDLI
DNLL
Specific function: Catalyzes the ATP-dependent transfer of a sulfur to tRNA to produce 4-thiouridine in position 8 of tRNAs, which functions as a near-UV photosensor. Also catalyzes the transfer of sulfur to the sulfur carrier protein ThiS, forming ThiS-thiocarboxylate. Thi
COG id: COG0301
COG function: function code H; Thiamine biosynthesis ATP pyrophosphatase
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 THUMP domain
Homologues:
Organism=Escherichia coli, GI1786625, Length=374, Percent_Identity=27.0053475935829, Blast_Score=130, Evalue=1e-31
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): THII_STRP2 (Q04L36)
Other databases:
- EMBL: CP000410
- RefSeq: YP_816262.1
- ProteinModelPortal: Q04L36
- SMR: Q04L36
- STRING: Q04L36
- EnsemblBacteria: EBSTRT00000018823
- GeneID: 4443063
- GenomeReviews: CP000410_GR
- KEGG: spd:SPD_0777
- eggNOG: COG0301
- GeneTree: EBGT00050000027439
- HOGENOM: HBG646072
- OMA: ESIGQVA
- ProtClustDB: PRK01565
- GO: GO:0005737
- HAMAP: MF_00021
- InterPro: IPR014729
- InterPro: IPR003720
- InterPro: IPR020536
- InterPro: IPR004114
- Gene3D: G3DSA:3.40.50.620
- SMART: SM00981
- TIGRFAMs: TIGR00342
Pfam domain/function: PF02568 ThiI; PF02926 THUMP
EC number: 2.8.1.4
Molecular weight: Translated: 45147; Mature: 45147
Theoretical pI: Translated: 5.13; Mature: 5.13
Prosite motif: PS51165 THUMP
Important sites: BINDING 265-265; BINDING 287-287; BINDING 296-296
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
3.7 %Met (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
3.7 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
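The Cys/Met percentages above are residue counts divided by protein length. A hand count of the listed 404-residue sequence gives 2 Cys and 15 Met, which is consistent with the reported values. The sketch below assumes one-decimal rounding; the `residue_percent` helper is illustrative and not part of the source database, and it is shown on a toy peptide rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = protein.upper()
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy 10-residue peptide with one Cys and one Met.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0

# The same arithmetic with the hand-counted values for this record
# (2 Cys, 15 Met, 404 residues; the counts are an assumption of this sketch).
print(round(100.0 * 2 / 404, 1))   # 0.5
print(round(100.0 * 15 / 404, 1))  # 3.7
print(round(100.0 * 17 / 404, 1))  # 4.2
```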
Secondary structure:
>Translated Secondary Structure
MQYSEIMIRYGELSTKGKNRMRFINKLRNNISDVLSIYSQVKVTADRDRAHAYLNGADYT
CCHHHHEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCCEEEECCCHHH
AVAESLKQVFGIQNFSPVYKVEKSVEVLKSSVQEIMRDIYKEGMTFKISSKRSDHNFELD
HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC
SRELNQTLGGAVFEAIPNVQVQMKSPDINLQVEIREEAAYLSYETIRGAGGLPVGTSGKG
HHHHHHHHHHHHHHHCCCCEEEEECCCCEEEEEEECCHHHHHHHHHCCCCCCCCCCCCCE
MLMLSGGIDSPVAGYLALKRGVDIEAVHFASPPYTSPGALKKAQDLTRKLTKFGGNIQFI
EEEEECCCCCCHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE
EVPFTEIQEEIKAKAPEAYLMTLTRRFMMRITDRIREVRNGLVIINGESLGQVASQTLES
ECCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHH
MKAINAVTNTPIIRPVVTMDKLEIIDIAQEIDTFDISIQPFEDCCTIFAPDRPKTNPKIK
HHHHHHHCCCCCEEEEHHHHHHHHHHHHHHCCEEEEEECCHHHHHEEECCCCCCCCCCCC
NAEQYEARMDVEGLVERAVAGIMITEITPQAEKDEVDDLIDNLL
CCHHHHHHCCHHHHHHHHHHCEEEEECCCCCCHHHHHHHHHHCC
>Mature Secondary Structure
MQYSEIMIRYGELSTKGKNRMRFINKLRNNISDVLSIYSQVKVTADRDRAHAYLNGADYT
CCHHHHEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCCEEEECCCHHH
AVAESLKQVFGIQNFSPVYKVEKSVEVLKSSVQEIMRDIYKEGMTFKISSKRSDHNFELD
HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC
SRELNQTLGGAVFEAIPNVQVQMKSPDINLQVEIREEAAYLSYETIRGAGGLPVGTSGKG
HHHHHHHHHHHHHHHCCCCEEEEECCCCEEEEEEECCHHHHHHHHHCCCCCCCCCCCCCE
MLMLSGGIDSPVAGYLALKRGVDIEAVHFASPPYTSPGALKKAQDLTRKLTKFGGNIQFI
EEEEECCCCCCHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE
EVPFTEIQEEIKAKAPEAYLMTLTRRFMMRITDRIREVRNGLVIINGESLGQVASQTLES
ECCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHH
MKAINAVTNTPIIRPVVTMDKLEIIDIAQEIDTFDISIQPFEDCCTIFAPDRPKTNPKIK
HHHHHHHCCCCCEEEEHHHHHHHHHHHHHHCCEEEEEECCHHHHHEEECCCCCCCCCCCC
NAEQYEARMDVEGLVERAVAGIMITEITPQAEKDEVDDLIDNLL
CCHHHHHHCCHHHHHHHHHHCEEEEECCCCCCHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA