Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
---|---|
Accession | NC_008312 |
Length | 7,750,108 |
The map label for this gene is 113475613
Identifier: 113475613
GI number: 113475613
Start: 3018030
End: 3019724
Strand: Reverse
Name: 113475613
Synonym: Tery_1947
Alternate gene names: NA
Gene position: 3019724-3018030 (Counterclockwise)
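The coordinates above can be checked against the sequence and protein lengths reported in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); the function name `gene_length` is hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return abs(end - start) + 1

# Start and End values taken from this record.
length = gene_length(3018030, 3019724)
codons = length // 3        # includes the terminal stop codon
residues = codons - 1       # translated protein length without the stop

print(length, codons, residues)  # 1695 565 564
```

Under that assumption the result agrees with the 1695-base gene sequence and the 564-residue translated protein listed in the record.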
Preceding gene: 113475614
Following gene: 113475610
Centisome position: 38.96
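The centisome value appears to be the gene's position expressed as a percentage of the chromosome length. The sketch below reproduces 38.96 under the assumption that the position used is the higher coordinate (3019724, the 5' end of this reverse-strand gene); that choice is inferred from the numbers, not stated in the record, and the function name `centisome` is hypothetical.

```python
GENOME_LENGTH = 7_750_108    # Length field of this record
GENE_5PRIME_END = 3_019_724  # End coordinate; 5' end on the reverse strand (assumed reference point)

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the total chromosome length."""
    return 100.0 * position / genome_length

value = centisome(GENE_5PRIME_END, GENOME_LENGTH)
print(f"{value:.2f}")  # 38.96
```

Using the lower coordinate (3018030) instead gives 38.94, which is why the higher coordinate is assumed here.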
GC content: 32.92
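GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows; the function name `gc_percent` is hypothetical, and whitespace is stripped because the sequence in this record is printed in space-separated chunks.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# Toy input; pass the full gene sequence to compare against the reported value.
print(gc_percent("ATGC atgc"))  # 50.0
```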
Gene sequence:
>1695_bases ATGAGCAAAACTCATGCTAAATCATCGGAAAAATTGACATTTTTAACTACATTAAATACTAAAAATAATCAAACTATTTC CTCTTCTAAGGGAATTTTTAGCATCTTTTCTGCGTTCCCAGAATATGACCGTGGTAACAGACTGTATGAAATGGGTAGAT ACGAGTCAGCTATTCCCTACTACGAAAATGCAGTGAAAATAAAGCCAGACTGGGCTATAGGTTGGTTAAAACTAGCTGAG GCCTTATCTAAATTGCAAAAGTATGAACAAGCAGTAGAGGCTTATAAAAGATCCCTATCTCTCAAACAAAACGCTCATCA AGCTTGGCATAGTTATGGAGTTGTATTATCTAATTTAAAGCAGTATGAGCAAGCGATCGCTTGCTTTGACAAAGCAATTA AAATTAATCCAAATGATTATCAATCATGGTTTAATAAAGCAATTATTTTAAGCGAATTAAAACAAGATTTACCTGCGATA TACTGCTACAAAGAAGCACTAAAAATACAACCTATGAAGGGAGAAATTTGGTATGGTCAAGGTCAAGCATTATTAAATGT GCAAAAATATGCTGAAGCATTAGCAGCTTATGATTGTGCTGCGAAGCTGCAACCTGATAATTATGATATTTGGTTTAAGA GAGGATTAGCTTTATTTCAAACTCAACGTTATGCAGAAGCAGTTATCAGTTATGGCCACGCTATAGAATTACAACCAGAG AATTATCTAGGTTGGTTTAACTTAGGTATTGCTCAAAGTAAACTACATAAATATCACGATGCAGTCTCTTCTTTTAATAA GGCAATTAAATTAAATCCTGATGATTATGAAGCTTGGTATTATAAAGGATTAGCTTTAAAAAATCATTGGAAAGAAGGAG GAGTTGCTTGTTTAGATAAGGCAATTAATTTTAACCCTAATTTACCAGAAATTTGGATTAGTCGTGGTTATATTTTATTA GATTTATTTAAATATCGTGAGGCATTAGAGTCTTTTAATAAGGCAATTACAATTAACTCTAATTATCCCGAATCTTGGTT AGGTAGAGGTAAAGCATGGATGGCTCTAGGTAAATATAATGAAGCTCTTATTGCTTATGGTAATGCTGTTAGTATTGAGC CATATTTTTTAGAGGCTTGGAATTGTCGAGGTGAAGCATTAGAAAGAGTCCAAAATTATGATCAAGCATTGGCAGCTTAT GACAAAGTGATAAAAATGAGTTTTGAGCAAGGAGTTTCTGTTGCTAAAGTAGGTTTACAGAGAGGAGCAGCTTTAGAAAA GTTAGAGCGATATCCTGAAGCAATAGAAGCTTATAATTTGGTAATTGAAAAACAACCAAATAATTTTGATGGTTGGTTAA ACCGGGGATTAAACTTAGAAAAAATGGCAAATTATGAAGAAGCTGTTTTGAGTTATAGTCGAGCTATTAGTATATGGCCT AGTAATTATCAAGCTTGGTTACAATTAGCTTTAATGCTGGAAAAATTAGAGAGGTTAGATGAAGCGATCGTTGCCTATAA CAAAATCATTTCTTTAAGGCCTGGTAATCATGAAACTTGGTTGAAAAGAGGATTAATTCTGGAAAGATTAGGATATGTTC AAGAAGCTGTTAGTTCTTACAAAATTGTATTAGAAATTAAACCTGACTATCACGAAGCAATTGAAAGAAAAAAACGATTA GAATTAACTGTTTAA
Upstream 100 bases:
>100_bases GTTTAAGAGATAAGTTTCTACTTAAAAATTAGATATTAAATAATCACAAAAAATATAATATTATTCCAGAATTTGTCCTA TCAGCTTCTAGGAAAAAATT
Downstream 100 bases:
>100_bases TTTAATTTTGAGTAGTATTTGAGGCGGTAGATAATCCTTAAATTAGATCAAATCCAGATCATTTATTATAGGTCTGTAGA GTGTACCTTATCTATACAAG
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 564; Mature: 563
Protein sequence:
>564_residues MSKTHAKSSEKLTFLTTLNTKNNQTISSSKGIFSIFSAFPEYDRGNRLYEMGRYESAIPYYENAVKIKPDWAIGWLKLAE ALSKLQKYEQAVEAYKRSLSLKQNAHQAWHSYGVVLSNLKQYEQAIACFDKAIKINPNDYQSWFNKAIILSELKQDLPAI YCYKEALKIQPMKGEIWYGQGQALLNVQKYAEALAAYDCAAKLQPDNYDIWFKRGLALFQTQRYAEAVISYGHAIELQPE NYLGWFNLGIAQSKLHKYHDAVSSFNKAIKLNPDDYEAWYYKGLALKNHWKEGGVACLDKAINFNPNLPEIWISRGYILL DLFKYREALESFNKAITINSNYPESWLGRGKAWMALGKYNEALIAYGNAVSIEPYFLEAWNCRGEALERVQNYDQALAAY DKVIKMSFEQGVSVAKVGLQRGAALEKLERYPEAIEAYNLVIEKQPNNFDGWLNRGLNLEKMANYEEAVLSYSRAISIWP SNYQAWLQLALMLEKLERLDEAIVAYNKIISLRPGNHETWLKRGLILERLGYVQEAVSSYKIVLEIKPDYHEAIERKKRL ELTV
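The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments and uses only the first and last few codons of the gene sequence from this record; the names `CODON_TABLE`, `translate`, and `mature_form` are hypothetical. The mature form is modeled simply as removal of the initiator methionine, which matches the 564 versus 563 residue counts here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

def mature_form(protein: str) -> str:
    """Drop a leading methionine, if present (simplified model of maturation)."""
    return protein[1:] if protein.startswith("M") else protein

print(translate("ATGAGCAAAACTCATGCT"))               # MSKTHA
print(translate("GAATTAACTGTTTAA"))                  # ELTV
print(mature_form(translate("ATGAGCAAAACTCATGCT")))  # SKTHA
```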
Sequences:
>Translated_564_residues
MSKTHAKSSEKLTFLTTLNTKNNQTISSSKGIFSIFSAFPEYDRGNRLYEMGRYESAIPYYENAVKIKPDWAIGWLKLAE ALSKLQKYEQAVEAYKRSLSLKQNAHQAWHSYGVVLSNLKQYEQAIACFDKAIKINPNDYQSWFNKAIILSELKQDLPAI YCYKEALKIQPMKGEIWYGQGQALLNVQKYAEALAAYDCAAKLQPDNYDIWFKRGLALFQTQRYAEAVISYGHAIELQPE NYLGWFNLGIAQSKLHKYHDAVSSFNKAIKLNPDDYEAWYYKGLALKNHWKEGGVACLDKAINFNPNLPEIWISRGYILL DLFKYREALESFNKAITINSNYPESWLGRGKAWMALGKYNEALIAYGNAVSIEPYFLEAWNCRGEALERVQNYDQALAAY DKVIKMSFEQGVSVAKVGLQRGAALEKLERYPEAIEAYNLVIEKQPNNFDGWLNRGLNLEKMANYEEAVLSYSRAISIWP SNYQAWLQLALMLEKLERLDEAIVAYNKIISLRPGNHETWLKRGLILERLGYVQEAVSSYKIVLEIKPDYHEAIERKKRL ELTV
>Mature_563_residues
SKTHAKSSEKLTFLTTLNTKNNQTISSSKGIFSIFSAFPEYDRGNRLYEMGRYESAIPYYENAVKIKPDWAIGWLKLAEA LSKLQKYEQAVEAYKRSLSLKQNAHQAWHSYGVVLSNLKQYEQAIACFDKAIKINPNDYQSWFNKAIILSELKQDLPAIY CYKEALKIQPMKGEIWYGQGQALLNVQKYAEALAAYDCAAKLQPDNYDIWFKRGLALFQTQRYAEAVISYGHAIELQPEN YLGWFNLGIAQSKLHKYHDAVSSFNKAIKLNPDDYEAWYYKGLALKNHWKEGGVACLDKAINFNPNLPEIWISRGYILLD LFKYREALESFNKAITINSNYPESWLGRGKAWMALGKYNEALIAYGNAVSIEPYFLEAWNCRGEALERVQNYDQALAAYD KVIKMSFEQGVSVAKVGLQRGAALEKLERYPEAIEAYNLVIEKQPNNFDGWLNRGLNLEKMANYEEAVLSYSRAISIWPS NYQAWLQLALMLEKLERLDEAIVAYNKIISLRPGNHETWLKRGLILERLGYVQEAVSSYKIVLEIKPDYHEAIERKKRLE LTV
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 8 TPR repeats [H]
Homologues:
Organism=Homo sapiens, GI32307148, Length=437, Percent_Identity=25.858123569794, Blast_Score=124, Evalue=2e-28
Organism=Homo sapiens, GI32307150, Length=437, Percent_Identity=25.858123569794, Blast_Score=124, Evalue=2e-28
Organism=Homo sapiens, GI83415184, Length=440, Percent_Identity=21.8181818181818, Blast_Score=90, Evalue=5e-18
Organism=Homo sapiens, GI301336134, Length=441, Percent_Identity=21.7687074829932, Blast_Score=90, Evalue=6e-18
Organism=Homo sapiens, GI224809432, Length=258, Percent_Identity=24.4186046511628, Blast_Score=77, Evalue=4e-14
Organism=Homo sapiens, GI25952122, Length=318, Percent_Identity=25.1572327044025, Blast_Score=72, Evalue=1e-12
Organism=Homo sapiens, GI167466177, Length=200, Percent_Identity=24, Blast_Score=70, Evalue=5e-12
Organism=Homo sapiens, GI167466175, Length=200, Percent_Identity=24, Blast_Score=70, Evalue=5e-12
Organism=Homo sapiens, GI310123097, Length=512, Percent_Identity=21.6796875, Blast_Score=70, Evalue=8e-12
Organism=Homo sapiens, GI118766330, Length=238, Percent_Identity=24.7899159663866, Blast_Score=69, Evalue=1e-11
Organism=Homo sapiens, GI118766328, Length=238, Percent_Identity=24.7899159663866, Blast_Score=69, Evalue=1e-11
Organism=Homo sapiens, GI310110582, Length=512, Percent_Identity=21.6796875, Blast_Score=68, Evalue=2e-11
Organism=Homo sapiens, GI7706671, Length=272, Percent_Identity=25, Blast_Score=68, Evalue=2e-11
Organism=Homo sapiens, GI310131789, Length=512, Percent_Identity=21.6796875, Blast_Score=68, Evalue=2e-11
Organism=Caenorhabditis elegans, GI115532692, Length=430, Percent_Identity=25.1162790697674, Blast_Score=103, Evalue=2e-22
Organism=Caenorhabditis elegans, GI115532690, Length=430, Percent_Identity=25.1162790697674, Blast_Score=103, Evalue=2e-22
Organism=Caenorhabditis elegans, GI25147174, Length=214, Percent_Identity=24.7663551401869, Blast_Score=70, Evalue=2e-12
Organism=Saccharomyces cerevisiae, GI6319387, Length=270, Percent_Identity=23.7037037037037, Blast_Score=75, Evalue=3e-14
Organism=Drosophila melanogaster, GI17647755, Length=437, Percent_Identity=25.858123569794, Blast_Score=128, Evalue=1e-29
Organism=Drosophila melanogaster, GI24585827, Length=437, Percent_Identity=25.858123569794, Blast_Score=128, Evalue=1e-29
Organism=Drosophila melanogaster, GI24585829, Length=437, Percent_Identity=25.858123569794, Blast_Score=128, Evalue=1e-29
Organism=Drosophila melanogaster, GI19920486, Length=334, Percent_Identity=23.6526946107784, Blast_Score=85, Evalue=1e-16
Organism=Drosophila melanogaster, GI161076610, Length=334, Percent_Identity=23.6526946107784, Blast_Score=85, Evalue=1e-16
Organism=Drosophila melanogaster, GI24647123, Length=223, Percent_Identity=25.1121076233184, Blast_Score=75, Evalue=1e-13
Organism=Drosophila melanogaster, GI24659892, Length=273, Percent_Identity=24.5421245421245, Blast_Score=67, Evalue=3e-11
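Each homologue entry follows the same field pattern (organism, GI number, alignment length, percent identity, BLAST score, E-value), so the list can be read into structured records. This is a minimal sketch using a regular expression; the names `HOMOLOGUE_PATTERN` and `parse_homologues` are hypothetical, and the sample text is two entries copied from this record.

```python
import re

# One homologue entry; whitespace (including line breaks) between fields is tolerated.
HOMOLOGUE_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.]+(?:[eE][+-]?\d+)?)"
)

def parse_homologues(text: str) -> list[dict]:
    """Extract homologue entries as dictionaries with typed values."""
    records = []
    for match in HOMOLOGUE_PATTERN.finditer(text):
        records.append({
            "organism": match["organism"].strip(),
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": int(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return records

sample = (
    "Organism=Homo sapiens, GI32307148, Length=437, Percent_Identity=25.858123569794, "
    "Blast_Score=124, Evalue=2e-28, Organism=Saccharomyces cerevisiae, GI6319387, "
    "Length=270, Percent_Identity=23.7037037037037, Blast_Score=75, \nEvalue=3e-14,"
)
hits = parse_homologues(sample)
best = min(hits, key=lambda hit: hit["evalue"])
print(len(hits), best["organism"], best["evalue"])
```

Sorting or filtering on `evalue` or `score` then makes it straightforward to pick out the strongest hits per organism.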
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001440
- InterPro: IPR013026
- InterPro: IPR011990
- InterPro: IPR019734 [H]
Pfam domain/function: PF00515 TPR_1 [H]
EC number: NA
Molecular weight: Translated: 64951; Mature: 64820
Theoretical pI: Translated: 8.64; Mature: 8.64
Prosite motif: PS50005 TPR ; PS50293 TPR_REGION
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
1.2 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
1.1 %Met (Mature Protein)
2.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MSKTHAKSSEKLTFLTTLNTKNNQTISSSKGIFSIFSAFPEYDRGNRLYEMGRYESAIPY
CCCCCCCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHCCCCCCCCC
YENAVKIKPDWAIGWLKLAEALSKLQKYEQAVEAYKRSLSLKQNAHQAWHSYGVVLSNLK
CCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QYEQAIACFDKAIKINPNDYQSWFNKAIILSELKQDLPAIYCYKEALKIQPMKGEIWYGQ
HHHHHHHHHHHHEECCCHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHEECCCCCCEEECC
GQALLNVQKYAEALAAYDCAAKLQPDNYDIWFKRGLALFQTQRYAEAVISYGHAIELQPE
CHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHHHHHHHHHHCCCEEEECCC
NYLGWFNLGIAQSKLHKYHDAVSSFNKAIKLNPDDYEAWYYKGLALKNHWKEGGVACLDK
CCCCEEECCHHHHHHHHHHHHHHHHCCEEECCCCCCCHHHHCCCHHHCCCCCCCHHHHHH
AINFNPNLPEIWISRGYILLDLFKYREALESFNKAITINSNYPESWLGRGKAWMALGKYN
HCCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHCCCCEEEEECCCC
EALIAYGNAVSIEPYFLEAWNCRGEALERVQNYDQALAAYDKVIKMSFEQGVSVAKVGLQ
CEEEEECCEEECCHHEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
RGAALEKLERYPEAIEAYNLVIEKQPNNFDGWLNRGLNLEKMANYEEAVLSYSRAISIWP
CCHHHHHHHHHHHHHHHHHEEEECCCCCCHHHHHCCCCHHHHCCHHHHHHHHHHEEEECC
SNYQAWLQLALMLEKLERLDEAIVAYNKIISLRPGNHETWLKRGLILERLGYVQEAVSSY
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHHHHHHHHHCCE
KIVLEIKPDYHEAIERKKRLELTV
EEEEEECCCHHHHHHHHHHCCCCC
>Mature Secondary Structure
SKTHAKSSEKLTFLTTLNTKNNQTISSSKGIFSIFSAFPEYDRGNRLYEMGRYESAIPY
CCCCCCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHCCCCCCCCC
YENAVKIKPDWAIGWLKLAEALSKLQKYEQAVEAYKRSLSLKQNAHQAWHSYGVVLSNLK
CCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QYEQAIACFDKAIKINPNDYQSWFNKAIILSELKQDLPAIYCYKEALKIQPMKGEIWYGQ
HHHHHHHHHHHHEECCCHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHEECCCCCCEEECC
GQALLNVQKYAEALAAYDCAAKLQPDNYDIWFKRGLALFQTQRYAEAVISYGHAIELQPE
CHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHHHHHHHHHHCCCEEEECCC
NYLGWFNLGIAQSKLHKYHDAVSSFNKAIKLNPDDYEAWYYKGLALKNHWKEGGVACLDK
CCCCEEECCHHHHHHHHHHHHHHHHCCEEECCCCCCCHHHHCCCHHHCCCCCCCHHHHHH
AINFNPNLPEIWISRGYILLDLFKYREALESFNKAITINSNYPESWLGRGKAWMALGKYN
HCCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHCCCCEEEEECCCC
EALIAYGNAVSIEPYFLEAWNCRGEALERVQNYDQALAAYDKVIKMSFEQGVSVAKVGLQ
CEEEEECCEEECCHHEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
RGAALEKLERYPEAIEAYNLVIEKQPNNFDGWLNRGLNLEKMANYEEAVLSYSRAISIWP
CCHHHHHHHHHHHHHHHHHEEEECCCCCCHHHHHCCCCHHHHCCHHHHHHHHHHEEEECC
SNYQAWLQLALMLEKLERLDEAIVAYNKIISLRPGNHETWLKRGLILERLGYVQEAVSSY
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHHHHHHHHHCCE
KIVLEIKPDYHEAIERKKRLELTV
EEEEEECCCHHHHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8688087 [H]