| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is crtX [H]
Identifier: 113475759
GI number: 113475759
Start: 3296981
End: 3298258
Strand: Reverse
Name: crtX [H]
Synonym: Tery_2111
Alternate gene names: 113475759
Gene position: 3298258-3296981 (Counterclockwise)
Preceding gene: 113475760
Following gene: 113475758
Centisome position: 42.56
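The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates. It also assumes that the centisome value is the position of the gene's first base, expressed as a percentage of the chromosome length; for this reverse-strand gene that base is the End coordinate. The record states neither convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 7_750_108           # chromosome length from the record header
START, END = 3_296_981, 3_298_258   # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


print(gene_length(START, END))                   # 1278
print(round(centisome(END, GENOME_LENGTH), 2))   # 42.56
```

Under these assumptions the results agree with the 1278-base gene sequence and the listed centisome position of 42.56. Using the Start coordinate instead gives 42.54.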
GC content: 41.16
Gene sequence:
>1278_bases ATGACTCATTTTGGTCTGATTTGTCCGGCATTGACAGGACACCTAAATCCAATGCTTCCCATAGGACAAGAATTAAAAAG GCGTGGTCATCGTGTCACGACGATAGGGATACTTGACGCTGAAGCTAAGACACTAGCAGCAGGATTAGAATTTGTTGCCT ATGGCACGGAAGAATATTCTAAGGGCAGCACAGCAGAAGCTTTAAATCACCTGAGTAAACTCAGCGGGTTAGCTGCATTT CGCTATACAATTACACTATTAAAAGACTGGACAAATGTTTTGCTTCGGGATGCTCCGCAAGTCATCAAAAATGCTGGTGT AGATGCTTTGTTAATCGACCAGGCTTCATTAGGAGAATCTATAGGGGATTTTCTAGACATTCCCTTTATTACTATTTGTA GTGCACTGGTACTCAATCAAGATGAGAATGTTCCCCACCCTGTAAGCAACTGGAAATATAACCCCGCCTGGTGGGCAAAA CTGCGTAATAGAGCTACTTGGAGTTTCTATCAAATCTTAGGCAAACCTATTAACAAGGTAGTAGCTGAGTATCGTCGTCA ATGGAATTTACCTTTGTACTCTGACCCCAATGATGCTTATTCTCTACTGGCTCAAATTAGTCAGCAACCTGCTGAGTTAG AATTTCCCAGAGAAAATTTACCTAAGTGTTTTCATTTCACAGGACCTTATCATTATTCAGGTACTAGAGAACCTGTTTCC TTTCCTTGGGAACAGTTGACAGGTAAACCTTTAATTTATGCCTCTATGGGAACTATACAAAATCGTTTGGTTGAGGTATT TTATCAAATTACAGCAGCTTGTGAGGGGTTGGATGCTCAGTTAGTTATTTCTCTGGGAGGTTCTGCCACTCCAGAATCTC TACCCAACTTAGCAGGAAATCCTCTAGTTGTTGAATATGCACCCCAATTAGAAATACTGCAAAAAGCTACTCTCACTATT ACTCATGCAGGTATGAATACAACTCTAGAATGTTTAAGTAATGCAGTACCAATGGTTGCTATTCCTATTGCTAACGATCA ACCAGGAGTAGCGGCACGAATAGCTTGGGCTGGAGCTGGAGTAGCGATAACACTGAAACGTTTAACAGTACCTCGGTTAC GAACAGCTATTTCTCAGGTGCTCACACAACCGTCATATAAGCAAAATGCTTTGAGATTACAGAAAGCAATTAAACGAGCA GGTGGAGTCACTCGTGCTGCTGATATTATTGAACAGGCAGTATCAACAGGTAAACCAGTTTTAACAGGAACTATATAA
Upstream 100 bases:
>100_bases ACGGGGATGCCAGATGTGTATTGGCAGACAATGATTGATTTTCGTAATGCTTGATAGTATGATTGTAGGGGAAAATCAGA AAACCAGGAAAAAACTTACT
Downstream 100 bases:
>100_bases CTAGCTAGGGTCTGCTAAAAAAGTCTGTTTAAGAAGTAAGGAAGTAGGTAGAGGAAGCGGTGGAATATTATTTGTCTCTT GATTTTTCTGGCATAGGAGT
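The GC content field can be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the full gene length; the record does not say how the value was derived, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, used only as a short demonstration.
demo = "ATGACTCATTTTGGTCTGAT"
print(round(gc_content(demo), 2))  # 35.0
```

Passing the full 1278-base sequence (spaces included) to `gc_content` should reproduce the listed figure if the assumption holds. A value of 41.16 over 1278 bases corresponds to 526 G or C bases.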
Product: glycosyl transferase family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 425; Mature: 424
Protein sequence:
>425_residues MTHFGLICPALTGHLNPMLPIGQELKRRGHRVTTIGILDAEAKTLAAGLEFVAYGTEEYSKGSTAEALNHLSKLSGLAAF RYTITLLKDWTNVLLRDAPQVIKNAGVDALLIDQASLGESIGDFLDIPFITICSALVLNQDENVPHPVSNWKYNPAWWAK LRNRATWSFYQILGKPINKVVAEYRRQWNLPLYSDPNDAYSLLAQISQQPAELEFPRENLPKCFHFTGPYHYSGTREPVS FPWEQLTGKPLIYASMGTIQNRLVEVFYQITAACEGLDAQLVISLGGSATPESLPNLAGNPLVVEYAPQLEILQKATLTI THAGMNTTLECLSNAVPMVAIPIANDQPGVAARIAWAGAGVAITLKRLTVPRLRTAISQVLTQPSYKQNALRLQKAIKRA GGVTRAADIIEQAVSTGKPVLTGTI
Sequences:
>Translated_425_residues MTHFGLICPALTGHLNPMLPIGQELKRRGHRVTTIGILDAEAKTLAAGLEFVAYGTEEYSKGSTAEALNHLSKLSGLAAF RYTITLLKDWTNVLLRDAPQVIKNAGVDALLIDQASLGESIGDFLDIPFITICSALVLNQDENVPHPVSNWKYNPAWWAK LRNRATWSFYQILGKPINKVVAEYRRQWNLPLYSDPNDAYSLLAQISQQPAELEFPRENLPKCFHFTGPYHYSGTREPVS FPWEQLTGKPLIYASMGTIQNRLVEVFYQITAACEGLDAQLVISLGGSATPESLPNLAGNPLVVEYAPQLEILQKATLTI THAGMNTTLECLSNAVPMVAIPIANDQPGVAARIAWAGAGVAITLKRLTVPRLRTAISQVLTQPSYKQNALRLQKAIKRA GGVTRAADIIEQAVSTGKPVLTGTI
>Mature_424_residues THFGLICPALTGHLNPMLPIGQELKRRGHRVTTIGILDAEAKTLAAGLEFVAYGTEEYSKGSTAEALNHLSKLSGLAAFR YTITLLKDWTNVLLRDAPQVIKNAGVDALLIDQASLGESIGDFLDIPFITICSALVLNQDENVPHPVSNWKYNPAWWAKL RNRATWSFYQILGKPINKVVAEYRRQWNLPLYSDPNDAYSLLAQISQQPAELEFPRENLPKCFHFTGPYHYSGTREPVSF PWEQLTGKPLIYASMGTIQNRLVEVFYQITAACEGLDAQLVISLGGSATPESLPNLAGNPLVVEYAPQLEILQKATLTIT HAGMNTTLECLSNAVPMVAIPIANDQPGVAARIAWAGAGVAITLKRLTVPRLRTAISQVLTQPSYKQNALRLQKAIKRAG GVTRAADIIEQAVSTGKPVLTGTI
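The relationship between the 1278-base gene and the 425-residue translated protein can be illustrated with a short translation sketch. It assumes the standard codon assignments, which the bacterial translation table shares apart from its alternative start codons, and it stops at the first stop codon. The `translate` helper is hypothetical and is not the tool used to build this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence listed in this record.
print(translate("ATGACTCATTTTGGTCTGATT"))  # MTHFGLI
# 1278 bases are 426 codons; the last one is the TAA stop codon.
print(1278 // 3 - 1)  # 425
```

The mature form listed above is one residue shorter (424) because it lacks the initiator methionine.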
Specific function: Catalyzes the glycosylation reaction which converts zeaxanthin to zeaxanthin-beta-diglucoside [H]
COG id: COG1819
COG function: function code GC; Glycosyl transferases, related to UDP-glucuronosyltransferase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the UDP-glycosyltransferase family [H]
Homologues:
Organism=Homo sapiens, GI45827765, Length=428, Percent_Identity=25.4672897196262, Blast_Score=91, Evalue=2e-18
Organism=Homo sapiens, GI157787091, Length=440, Percent_Identity=25.6818181818182, Blast_Score=86, Evalue=8e-17
Organism=Homo sapiens, GI110611919, Length=190, Percent_Identity=31.5789473684211, Blast_Score=81, Evalue=2e-15
Organism=Homo sapiens, GI29789078, Length=434, Percent_Identity=22.1198156682028, Blast_Score=80, Evalue=3e-15
Organism=Homo sapiens, GI31377618, Length=437, Percent_Identity=22.4256292906178, Blast_Score=77, Evalue=3e-14
Organism=Homo sapiens, GI41282213, Length=326, Percent_Identity=23.3128834355828, Blast_Score=76, Evalue=7e-14
Organism=Homo sapiens, GI11276085, Length=433, Percent_Identity=22.8637413394919, Blast_Score=75, Evalue=1e-13
Organism=Homo sapiens, GI193211427, Length=433, Percent_Identity=24.9422632794457, Blast_Score=74, Evalue=2e-13
Organism=Homo sapiens, GI13487900, Length=428, Percent_Identity=21.9626168224299, Blast_Score=73, Evalue=5e-13
Organism=Homo sapiens, GI270132412, Length=421, Percent_Identity=22.5653206650831, Blast_Score=72, Evalue=9e-13
Organism=Homo sapiens, GI270132420, Length=332, Percent_Identity=23.4939759036145, Blast_Score=71, Evalue=2e-12
Organism=Homo sapiens, GI45827767, Length=100, Percent_Identity=38, Blast_Score=71, Evalue=2e-12
Organism=Homo sapiens, GI8850236, Length=105, Percent_Identity=37.1428571428571, Blast_Score=70, Evalue=3e-12
Organism=Homo sapiens, GI46249404, Length=105, Percent_Identity=37.1428571428571, Blast_Score=70, Evalue=3e-12
Organism=Homo sapiens, GI189491660, Length=420, Percent_Identity=22.8571428571429, Blast_Score=70, Evalue=4e-12
Organism=Homo sapiens, GI40254471, Length=420, Percent_Identity=22.8571428571429, Blast_Score=70, Evalue=4e-12
Organism=Homo sapiens, GI6005930, Length=105, Percent_Identity=37.1428571428571, Blast_Score=70, Evalue=4e-12
Organism=Drosophila melanogaster, GI21357689, Length=99, Percent_Identity=41.4141414141414, Blast_Score=67, Evalue=2e-11
Organism=Drosophila melanogaster, GI24584725, Length=98, Percent_Identity=36.734693877551, Blast_Score=66, Evalue=5e-11
Organism=Drosophila melanogaster, GI24584723, Length=98, Percent_Identity=36.734693877551, Blast_Score=66, Evalue=5e-11
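The homologue entries above share a fixed field order (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so they can be pulled into structured records with a regular expression. This is a minimal sketch that assumes the field order never varies; `parse_homologues` and the dictionary keys are illustrative names, not part of any database API.

```python
import re

# One hit: fields in the fixed order used by this record. Whitespace,
# including line breaks, is tolerated between fields.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dictionary per BLAST hit found in the text."""
    hits = []
    for m in RECORD.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


# Two entries copied from the list above; the second spans a line break.
sample = (
    "Organism=Homo sapiens, GI45827765, Length=428, "
    "Percent_Identity=25.4672897196262, Blast_Score=91, Evalue=2e-18, "
    "Organism=Drosophila melanogaster, GI21357689, Length=99, "
    "Percent_Identity=41.4141414141414, \nBlast_Score=67, Evalue=2e-11,"
)
hits = parse_homologues(sample)
print(len(hits), hits[0]["organism"], hits[1]["score"])
```

Once parsed, the hits can be sorted or filtered, for example by keeping only those whose alignment length is at least 300.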
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002213
- InterPro: IPR006326 [H]
Pfam domain/function: PF00201 UDPGT [H]
EC number: NA
Molecular weight: Translated: 46431; Mature: 46300
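The 131-unit gap between the translated and mature molecular weights is what removal of a single methionine residue would produce. The sketch below estimates a protein's average molecular weight as the sum of commonly tabulated average residue masses plus one water molecule. The record does not say which mass table or method produced its figures, so both the table and the `molecular_weight` helper are assumptions.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Dropping the initiator methionine lowers the weight by one Met residue.
delta = molecular_weight("MTHF") - molecular_weight("THF")
print(round(delta, 2))   # 131.19
print(46431 - 46300)     # 131
```

Applying `molecular_weight` to the full translated and mature sequences gives estimates to compare with the listed values. Small differences are possible if the record used a different mass table.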
Theoretical pI: Translated: 8.43; Mature: 8.43
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein)
1.2 %Met (Translated Protein)
2.4 %Cys+Met (Translated Protein)
1.2 %Cys (Mature Protein)
0.9 %Met (Mature Protein)
2.1 %Cys+Met (Mature Protein)
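The Cys/Met percentages above are residue counts divided by protein length. The sketch below assumes that definition and rounding to one decimal place; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


# First 20 residues of the translated protein, used only as a short demo.
demo = "MTHFGLICPALTGHLNPMLP"
print(round(residue_percent(demo, "C"), 1))   # 5.0
print(round(residue_percent(demo, "M"), 1))   # 10.0
print(round(residue_percent(demo, "CM"), 1))  # 15.0
```

For the full sequences, 5 Cys and 5 Met in 425 residues give 1.2 % each and 2.4 % combined. Removing the initiator methionine leaves 4 Met in 424 residues, which is 0.9 %, and 2.1 % for Cys+Met, matching the listed values.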
Secondary structure:
>Translated Secondary Structure
MTHFGLICPALTGHLNPMLPIGQELKRRGHRVTTIGILDAEAKTLAAGLEFVAYGTEEYS
CCCCHHHHHHHHCCCCCCCCCCHHHHHCCCEEEEEEEECCHHHHHHHHHHHEEECCHHHC
KGSTAEALNHLSKLSGLAAFRYTITLLKDWTNVLLRDAPQVIKNAGVDALLIDQASLGES
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHH
IGDFLDIPFITICSALVLNQDENVPHPVSNWKYNPAWWAKLRNRATWSFYQILGKPINKV
HHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHCCHHHHH
VAEYRRQWNLPLYSDPNDAYSLLAQISQQPAELEFPRENLPKCFHFTGPYHYSGTREPVS
HHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHCCCCEEECCCCCCCCCCCCCCC
FPWEQLTGKPLIYASMGTIQNRLVEVFYQITAACEGLDAQLVISLGGSATPESLPNLAGN
CCHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHCCCCCCC
PLVVEYAPQLEILQKATLTITHAGMNTTLECLSNAVPMVAIPIANDQPGVAARIAWAGAG
CEEEEECCHHHHHHHHHEEEEECCCHHHHHHHHCCCCEEEEEECCCCCCCEEEEEEECCC
VAITLKRLTVPRLRTAISQVLTQPSYKQNALRLQKAIKRAGGVTRAADIIEQAVSTGKPV
CEEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCE
LTGTI
EECCC
>Mature Secondary Structure
THFGLICPALTGHLNPMLPIGQELKRRGHRVTTIGILDAEAKTLAAGLEFVAYGTEEYS
CCCHHHHHHHHCCCCCCCCCCHHHHHCCCEEEEEEEECCHHHHHHHHHHHEEECCHHHC
KGSTAEALNHLSKLSGLAAFRYTITLLKDWTNVLLRDAPQVIKNAGVDALLIDQASLGES
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHH
IGDFLDIPFITICSALVLNQDENVPHPVSNWKYNPAWWAKLRNRATWSFYQILGKPINKV
HHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHCCHHHHH
VAEYRRQWNLPLYSDPNDAYSLLAQISQQPAELEFPRENLPKCFHFTGPYHYSGTREPVS
HHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHCCCCEEECCCCCCCCCCCCCCC
FPWEQLTGKPLIYASMGTIQNRLVEVFYQITAACEGLDAQLVISLGGSATPESLPNLAGN
CCHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHCCCCCCC
PLVVEYAPQLEILQKATLTITHAGMNTTLECLSNAVPMVAIPIANDQPGVAARIAWAGAG
CEEEEECCHHHHHHHHHEEEEECCCHHHHHHHHCCCCEEEEEECCCCCCCEEEEEEECCC
VAITLKRLTVPRLRTAISQVLTQPSYKQNALRLQKAIKRAGGVTRAADIIEQAVSTGKPV
CEEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCE
LTGTI
EECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2254247 [H]