| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is 113476275
Identifier: 113476275
GI number: 113476275
Start: 4138435
End: 4140921
Strand: Reverse
Name: 113476275
Synonym: Tery_2671
Alternate gene names: NA
Gene position: 4140921-4138435 (Counterclockwise)
Preceding gene: 113476276
Following gene: 113476272
Centisome position: 53.43
GC content: 41.58
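The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and it assumes that the centisome position is the coordinate of the gene's first base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The record does not state that definition; it is simply the reading that reproduces the listed value. The function names are illustrative, not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals.

    Assumption: the record reports the position of the gene's first base.
    """
    return round(100.0 * position / genome_length, 2)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Values taken from the record above.
GENOME_LENGTH = 7_750_108
START, END = 4_138_435, 4_140_921

print(gene_length(START, END))        # matches the ">2487_bases" header
print(centisome(END, GENOME_LENGTH))  # reverse strand: gene starts at END
```

`gc_percent` can be applied to the gene sequence listed below to check the GC content field; whether the database rounds the same way is an assumption.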
Gene sequence:
>2487_bases ATGAAACGAATTATTGTTACCTGTCTAATAGGAATAACAACCCTGACTACAACTGCCCTAATACTGCCGCTATTACCACA AAGCAGTTATGCTCAAAGTTCAAACTCTCAGGCAGAAAAGTTAGAGCAACTAATAAAAACAGCACAGCAACAAATAGGGC AATACCAATATCAGGAGTCTATAGAAACGCTTCAAGAAGCTTTAGCTATTGCTAGAAAAATTAAAGACCGAAAATATGAG GCAGTGGCTAACCTTGGTCTTGGTTATGTTTACGACCAAACAGGCCAACCCCAAAAGGCATTAGAATTCTACGAAAAAGC TTTACCTATCTGGCAAGAAGTCGGCTATCGCTTTGGAGAAGCTACTACTCTCAATAATATTGGTGGAGTTTACTCTGACA TAGGTCAACCCCAAAAGGCGTTAGAATTCTACGAAAAAGCTTTACCTATCTCGCAAGAAGTGGGCGCTCGCTCACAAGAA GCTACTACTCTCAATAATATTGGTCTAGTTTACGATAATATAGGTCAACCCCAAAAGGCATTAGAATACTACGAAAAAGC TTTACCTATCTCGCAAGAAGTGGGCGCTCGCTCACAAGAAGCTACTACTCTCAATAATATTGGTCTAGTTTACAGCAGTA TAGGTCAACCCCAAAAGGCATTAGAATACTACGAAAAAGCTTTACCTATCTCGCAAGAAATGGGCGCTCACTCACAAGAA GCTACTACTCTCAATAATATTGGTCTAGTTTACTCTAATATAGGTCAACCCCAAAAGGCATTAGAATTCTTCGAAAAAGC TTTACCTATCTGGCAAGAAGTGGGCTATCGCTCACAAGAAGCTACTACTCTGAATAATATTGGTAACGTTTACAGCAGTA TAGGTCAACCCCAAAAGGCATTAGAATACTTACAGAAAGCTTTACCTATCATGCAAGAAGTGGGCGCTCGCTCATTAGAA TCTACTACTCTCAGTAGTATTGGTAAAGTTTACCGAGATAGCAACCAACCAGAAAAAGCCATAGACCACTTGGAAAAGTC AGTAAAAATTACCCTAGAAATTCGTGGGGGTTTAAAAAAAGAAAATCGCAAAACTTTTTTAAATTTCCAATATAATAAGT GGACACCGATCGCCCTGATCGATCTTCTCATCGACCAAAACCAACCCGAAGCCGCTTGGAAATGGTATAATCTAGCCACC ACTTTCGACCTCGCCGACTACACCCGCCTTATTAAGGCCAAAGTCAAAAACCCAGAAGCCCAAAAACTGATTAACCAATG GGAAGAAAACTATCAAAAATTGCAAGCTCTCTATAGCAAAATAGAAGATGGAACAACTACCCAACTTTCCCAACAAATTA AGCAGTTACAAGCAGAAAACAACCAACTCGCAGAAAACGCTAGCCAGAAATATCCAGAAGTTGCCGAGCTTTTTGAATTT GAACCCAAAGACATCGACAAACTCAAAGCCAATATTCCACCGGGCACCGTTGTTATTCAACCTGCTCTTTTAACTGGTCT GAAAAGTGTCCCCGATAGCATAGCTATATTCCTCGTGACCAGAGACCAAGCCACCCTCGTCAAAAAACTCCCCATAGATG CTAAAGAATTTGATAGCATCCTCACCGAATATCGCAGCCAACTAGAAAAACCCAACGCCGACAACTACGCCACTAACCAA GAAAAACTCTACGACTATCTCATCCGCCCCATAGAAACAGAAATAGCCGCCTACTCCCCCAAACGACTAGCCATCATCCC CACAGGAAAATTGCGTTATATTCCCTTTGAAACCCTATATGATAATCAAACCGAGCAATATTTAATAGCCAAATATCCCA TCCACTACCTGACCCGCATCTCTGCCACCAGACAGGAGCCAAAAGAGCCGACAAAGTCCCTGAAAGTATTGGCATTTGGA 
AATCCCCAGCCCACAGAGATCAACCTCCGCGGTGCAGAAGATGAAGCGAAAATTATCGCGGAAAACTTATCAGGAAAATC CTTAACTCGAGAAAAAGCGACCCTCAGCAGTTTTGAAAACGAATCTCCCGGTTTTCCCTTAGTGCATCTAGCTACCCACG GCTGCTTTCAAAAAGGTGGTTGCCGTGAGCAGGGTTTAGAAGAAAATACTATTTTGTTTGCCAACAATAAAACCTTTAAT ATTGCCAATGCAGCACGTTTAGGATTAGAAAATACAGATTTAATCGCTTTGAGTGCATGCCAAACAGCCATGAAAGCGGA CTCCAACGGAGAAGAAATAGCAGGAGTTGCCTATTTATTTGAGCGTGCAGGTGCGGATGCTGTCATAGCCAGTTTATGGA ATGCCGAGGATGGAACAACAAAGGATATTATGGTGAAATTTTATGATAATTTGAAACAAGGAATGACCAAGGTGGAAGCA TTACGTCAGGCGAAATTAAGTTATGCTAGGTCAGATGTGAGTCCGTTTTATTGGTCGCCTTTTATTTTGATAGGGGATGG AGAATAA
Upstream 100 bases:
>100_bases TCTCCCCAAGGGGGGCGCTCAAAAGTCATCCCTGACTGAATTCTGACTAGGGATAAGTCCTGATTAATTATTTAATTATT TATTTGGAGAAAATCAGAAA
Downstream 100 bases:
>100_bases GTCAAAAGTTAACCGACCCCCCCTTTCTTCTTGAATTAATAAACCAGCATATTCATTGATCCCTTCCTCGTATTTTTAGG CGATCGCTTTTCTTGAAAAA
Product: hypothetical protein
Products: NA
Alternate protein names: Tetratricopeptide Repeat Domain Protein; Tetratricopeptide Repeat Family; Tetratricopeptide TPR_2 Repeat Protein; SARP Family Transcriptional Regulator; Haemagglutination Activity Domain Protein; TPR Domain-Containing Protein; NB-ARC Domain-Containing Protein; Tetratricopeptide Repeat Family Protein; Kinesin Light Chain; Diguanylate Cyclase/Serine/Threonine Protein Kinase; Tetratricopeptide Repeat-Containing Protein; Tetratricopeptide Domain-Containing Protein; Tetratricopeptide Domain Protein; Tetratricopeptide Region; Filamentous Haemagglutinin Outer Membrane Protein; Tetratricopeptide Tpr_1 Repeat-Containing Protein; Tetratricopeptide Protein; SLEI Family; Histidine Kinase; TPR Domain Family Protein; Filamentous Hemagglutinin Family Outer Membrane Protein; Tetratricopeptide TPR2 Protein; Tetratricopeptide Tpr_3 Repeat-Containing Protein; Transcriptional Regulator SARP Family; XRE Family Transcriptional Regulator With TPR Repeat Domain; Tetratricopeptide TPR-2 Repeat-Containing Protein; Signal Transduction Protein; WD-40 Repeat-Containing Protein; Peptidase Domain-Containing Protein; Hemagglutination Activity Domain-Containing Protein; O-Linked GlcNAc Transferase; Transcriptional Activator Domain Containing Protein; Peptidase-Like; Fis Family Transcriptional Regulator
Number of amino acids: Translated: 828; Mature: 828
Protein sequence:
>828_residues MKRIIVTCLIGITTLTTTALILPLLPQSSYAQSSNSQAEKLEQLIKTAQQQIGQYQYQESIETLQEALAIARKIKDRKYE AVANLGLGYVYDQTGQPQKALEFYEKALPIWQEVGYRFGEATTLNNIGGVYSDIGQPQKALEFYEKALPISQEVGARSQE ATTLNNIGLVYDNIGQPQKALEYYEKALPISQEVGARSQEATTLNNIGLVYSSIGQPQKALEYYEKALPISQEMGAHSQE ATTLNNIGLVYSNIGQPQKALEFFEKALPIWQEVGYRSQEATTLNNIGNVYSSIGQPQKALEYLQKALPIMQEVGARSLE STTLSSIGKVYRDSNQPEKAIDHLEKSVKITLEIRGGLKKENRKTFLNFQYNKWTPIALIDLLIDQNQPEAAWKWYNLAT TFDLADYTRLIKAKVKNPEAQKLINQWEENYQKLQALYSKIEDGTTTQLSQQIKQLQAENNQLAENASQKYPEVAELFEF EPKDIDKLKANIPPGTVVIQPALLTGLKSVPDSIAIFLVTRDQATLVKKLPIDAKEFDSILTEYRSQLEKPNADNYATNQ EKLYDYLIRPIETEIAAYSPKRLAIIPTGKLRYIPFETLYDNQTEQYLIAKYPIHYLTRISATRQEPKEPTKSLKVLAFG NPQPTEINLRGAEDEAKIIAENLSGKSLTREKATLSSFENESPGFPLVHLATHGCFQKGGCREQGLEENTILFANNKTFN IANAARLGLENTDLIALSACQTAMKADSNGEEIAGVAYLFERAGADAVIASLWNAEDGTTKDIMVKFYDNLKQGMTKVEA LRQAKLSYARSDVSPFYWSPFILIGDGE
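The relationship between the 2487-base gene and the 828-residue protein (829 codons, the last being a stop) can be illustrated with a minimal translation sketch. It assumes the standard amino-acid assignments, which the bacterial translation table shares; the compact table construction and the `translate` helper are illustrative, not the database's own tooling.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First five codons and last four codons of the gene sequence above.
print(translate("ATGAAACGAATTATT"))  # start of the protein
print(translate("GATGGAGAATAA"))     # end of the protein, then stop
print(2487 // 3 - 1)                 # codons minus the stop codon
```

The gene sequence is listed with its ATG first even though the gene lies on the reverse strand, so no reverse complement is needed before translating.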
Sequences:
>Translated_828_residues MKRIIVTCLIGITTLTTTALILPLLPQSSYAQSSNSQAEKLEQLIKTAQQQIGQYQYQESIETLQEALAIARKIKDRKYE AVANLGLGYVYDQTGQPQKALEFYEKALPIWQEVGYRFGEATTLNNIGGVYSDIGQPQKALEFYEKALPISQEVGARSQE ATTLNNIGLVYDNIGQPQKALEYYEKALPISQEVGARSQEATTLNNIGLVYSSIGQPQKALEYYEKALPISQEMGAHSQE ATTLNNIGLVYSNIGQPQKALEFFEKALPIWQEVGYRSQEATTLNNIGNVYSSIGQPQKALEYLQKALPIMQEVGARSLE STTLSSIGKVYRDSNQPEKAIDHLEKSVKITLEIRGGLKKENRKTFLNFQYNKWTPIALIDLLIDQNQPEAAWKWYNLAT TFDLADYTRLIKAKVKNPEAQKLINQWEENYQKLQALYSKIEDGTTTQLSQQIKQLQAENNQLAENASQKYPEVAELFEF EPKDIDKLKANIPPGTVVIQPALLTGLKSVPDSIAIFLVTRDQATLVKKLPIDAKEFDSILTEYRSQLEKPNADNYATNQ EKLYDYLIRPIETEIAAYSPKRLAIIPTGKLRYIPFETLYDNQTEQYLIAKYPIHYLTRISATRQEPKEPTKSLKVLAFG NPQPTEINLRGAEDEAKIIAENLSGKSLTREKATLSSFENESPGFPLVHLATHGCFQKGGCREQGLEENTILFANNKTFN IANAARLGLENTDLIALSACQTAMKADSNGEEIAGVAYLFERAGADAVIASLWNAEDGTTKDIMVKFYDNLKQGMTKVEA LRQAKLSYARSDVSPFYWSPFILIGDGE >Mature_828_residues MKRIIVTCLIGITTLTTTALILPLLPQSSYAQSSNSQAEKLEQLIKTAQQQIGQYQYQESIETLQEALAIARKIKDRKYE AVANLGLGYVYDQTGQPQKALEFYEKALPIWQEVGYRFGEATTLNNIGGVYSDIGQPQKALEFYEKALPISQEVGARSQE ATTLNNIGLVYDNIGQPQKALEYYEKALPISQEVGARSQEATTLNNIGLVYSSIGQPQKALEYYEKALPISQEMGAHSQE ATTLNNIGLVYSNIGQPQKALEFFEKALPIWQEVGYRSQEATTLNNIGNVYSSIGQPQKALEYLQKALPIMQEVGARSLE STTLSSIGKVYRDSNQPEKAIDHLEKSVKITLEIRGGLKKENRKTFLNFQYNKWTPIALIDLLIDQNQPEAAWKWYNLAT TFDLADYTRLIKAKVKNPEAQKLINQWEENYQKLQALYSKIEDGTTTQLSQQIKQLQAENNQLAENASQKYPEVAELFEF EPKDIDKLKANIPPGTVVIQPALLTGLKSVPDSIAIFLVTRDQATLVKKLPIDAKEFDSILTEYRSQLEKPNADNYATNQ EKLYDYLIRPIETEIAAYSPKRLAIIPTGKLRYIPFETLYDNQTEQYLIAKYPIHYLTRISATRQEPKEPTKSLKVLAFG NPQPTEINLRGAEDEAKIIAENLSGKSLTREKATLSSFENESPGFPLVHLATHGCFQKGGCREQGLEENTILFANNKTFN IANAARLGLENTDLIALSACQTAMKADSNGEEIAGVAYLFERAGADAVIASLWNAEDGTTKDIMVKFYDNLKQGMTKVEA LRQAKLSYARSDVSPFYWSPFILIGDGE
Specific function: Unknown
COG id: COG0457
COG function: function code R; FOG: TPR repeat
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI224028289, Length=328, Percent_Identity=26.5243902439024, Blast_Score=131, Evalue=2e-30
Organism=Homo sapiens, GI224548927, Length=298, Percent_Identity=29.5302013422819, Blast_Score=127, Evalue=5e-29
Organism=Homo sapiens, GI164519122, Length=346, Percent_Identity=27.4566473988439, Blast_Score=126, Evalue=7e-29
Organism=Homo sapiens, GI224548925, Length=298, Percent_Identity=29.5302013422819, Blast_Score=126, Evalue=9e-29
Organism=Homo sapiens, GI38488692, Length=299, Percent_Identity=23.4113712374582, Blast_Score=78, Evalue=3e-14
Organism=Homo sapiens, GI282165719, Length=324, Percent_Identity=23.7654320987654, Blast_Score=77, Evalue=5e-14
Organism=Homo sapiens, GI34304360, Length=288, Percent_Identity=25.6944444444444, Blast_Score=72, Evalue=3e-12
Organism=Caenorhabditis elegans, GI25147165, Length=289, Percent_Identity=22.4913494809689, Blast_Score=72, Evalue=1e-12
Organism=Drosophila melanogaster, GI24650658, Length=299, Percent_Identity=27.7591973244147, Blast_Score=126, Evalue=5e-29
Organism=Drosophila melanogaster, GI24660950, Length=290, Percent_Identity=25.1724137931034, Blast_Score=119, Evalue=6e-27
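Each homologue entry carries the same six fields (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so the list can be turned into structured records with a regular expression. This is a minimal sketch that assumes the field order and spelling shown in this record; `parse_homologues` and the dictionary keys are hypothetical names chosen for illustration.

```python
import re

# One homologue entry; field order is assumed to match this record.
ENTRY_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Extract every homologue entry found in the text as a dictionary."""
    hits = []
    for match in ENTRY_RE.finditer(text):
        hits.append({
            "organism": match["organism"].strip(),
            "gi": match["gi"],                       # kept as a string identifier
            "length": int(match["length"]),          # alignment length
            "identity": float(match["identity"]),    # percent identity
            "score": int(match["score"]),            # BLAST score
            "evalue": float(match["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI224028289, Length=328, "
    "Percent_Identity=26.5243902439024, Blast_Score=131, Evalue=2e-30, "
    "Organism=Caenorhabditis elegans, GI25147165, Length=289, "
    "Percent_Identity=22.4913494809689, Blast_Score=72, Evalue=1e-12,"
)
for hit in parse_homologues(sample):
    print(hit["organism"], hit["gi"], round(hit["identity"], 1), hit["evalue"])
```

Because the pattern is applied with `finditer`, it works whether the entries are separated by commas on one line or listed one per line.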
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 92715; Mature: 92715
Theoretical pI: Translated: 5.10; Mature: 5.10
Prosite motif: PS50005 TPR ; PS50293 TPR_REGION
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
0.7 %Met (Translated Protein)
1.2 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
0.7 %Met (Mature Protein)
1.2 %Cys+Met (Mature Protein)
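The Cys/Met percentages are simple residue frequencies over the protein length. The helper below is a minimal sketch of that calculation; rounding to one decimal place is an assumption made to match the precision shown in the record, and `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    protein = protein.upper()
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy example; pass the 828-residue sequence from the record for the real values.
toy = "MKCA"
print(residue_percent(toy, "C"))   # cysteine only
print(residue_percent(toy, "M"))   # methionine only
print(residue_percent(toy, "CM"))  # cysteine plus methionine
```

As a sanity check on scale, 4 cysteines and 6 methionines in 828 residues would round to the 0.5, 0.7 and 1.2 percent values listed above.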
Secondary structure:
>Translated Secondary Structure MKRIIVTCLIGITTLTTTALILPLLPQSSYAQSSNSQAEKLEQLIKTAQQQIGQYQYQES CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH IETLQEALAIARKIKDRKYEAVANLGLGYVYDQTGQPQKALEFYEKALPIWQEVGYRFGE HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCC ATTLNNIGGVYSDIGQPQKALEFYEKALPISQEVGARSQEATTLNNIGLVYDNIGQPQKA CCCHHHCCCHHHHCCCHHHHHHHHHHHCCCCHHHCCCCHHCHHHHCCCEEECCCCCHHHH LEYYEKALPISQEVGARSQEATTLNNIGLVYSSIGQPQKALEYYEKALPISQEMGAHSQE HHHHHHHCCCCHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHCCCCHH ATTLNNIGLVYSNIGQPQKALEFFEKALPIWQEVGYRSQEATTLNNIGNVYSSIGQPQKA HHHHHCCCHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHH LEYLQKALPIMQEVGARSLESTTLSSIGKVYRDSNQPEKAIDHLEKSVKITLEIRGGLKK HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCEEEEEEECCCCCC ENRKTFLNFQYNKWTPIALIDLLIDQNQPEAAWKWYNLATTFDLADYTRLIKAKVKNPEA CCCCEEEEEECCCCCHHHHHHHHHCCCCCHHHHHHEEHHEEHHHHHHHHHHHHHCCCCHH QKLINQWEENYQKLQALYSKIEDGTTTQLSQQIKQLQAENNQLAENASQKYPEVAELFEF HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHCCHHHHHHHCC EPKDIDKLKANIPPGTVVIQPALLTGLKSVPDSIAIFLVTRDQATLVKKLPIDAKEFDSI CCCCHHHHHCCCCCCEEEECHHHHHHHHHCCCCEEEEEEECCHHHHHHHCCCCHHHHHHH LTEYRSQLEKPNADNYATNQEKLYDYLIRPIETEIAAYSPKRLAIIPTGKLRYIPFETLY HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCEEECCHHHHH DNQTEQYLIAKYPIHYLTRISATRQEPKEPTKSLKVLAFGNPQPTEINLRGAEDEAKIIA CCCCCCEEEECCCHHHHHHHHHHHCCCCCHHHHEEEEECCCCCCCEEEEECCCHHHHHHH ENLSGKSLTREKATLSSFENESPGFPLVHLATHGCFQKGGCREQGLEENTILFANNKTFN HCCCCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHCCCCCHHCCCCCCEEEEECCCEEE IANAARLGLENTDLIALSACQTAMKADSNGEEIAGVAYLFERAGADAVIASLWNAEDGTT CCHHHHCCCCCCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCCC KDIMVKFYDNLKQGMTKVEALRQAKLSYARSDVSPFYWSPFILIGDGE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC >Mature Secondary Structure MKRIIVTCLIGITTLTTTALILPLLPQSSYAQSSNSQAEKLEQLIKTAQQQIGQYQYQES CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH IETLQEALAIARKIKDRKYEAVANLGLGYVYDQTGQPQKALEFYEKALPIWQEVGYRFGE HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCC 
ATTLNNIGGVYSDIGQPQKALEFYEKALPISQEVGARSQEATTLNNIGLVYDNIGQPQKA CCCHHHCCCHHHHCCCHHHHHHHHHHHCCCCHHHCCCCHHCHHHHCCCEEECCCCCHHHH LEYYEKALPISQEVGARSQEATTLNNIGLVYSSIGQPQKALEYYEKALPISQEMGAHSQE HHHHHHHCCCCHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHCCCCHH ATTLNNIGLVYSNIGQPQKALEFFEKALPIWQEVGYRSQEATTLNNIGNVYSSIGQPQKA HHHHHCCCHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHH LEYLQKALPIMQEVGARSLESTTLSSIGKVYRDSNQPEKAIDHLEKSVKITLEIRGGLKK HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCEEEEEEECCCCCC ENRKTFLNFQYNKWTPIALIDLLIDQNQPEAAWKWYNLATTFDLADYTRLIKAKVKNPEA CCCCEEEEEECCCCCHHHHHHHHHCCCCCHHHHHHEEHHEEHHHHHHHHHHHHHCCCCHH QKLINQWEENYQKLQALYSKIEDGTTTQLSQQIKQLQAENNQLAENASQKYPEVAELFEF HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHCCHHHHHHHCC EPKDIDKLKANIPPGTVVIQPALLTGLKSVPDSIAIFLVTRDQATLVKKLPIDAKEFDSI CCCCHHHHHCCCCCCEEEECHHHHHHHHHCCCCEEEEEEECCHHHHHHHCCCCHHHHHHH LTEYRSQLEKPNADNYATNQEKLYDYLIRPIETEIAAYSPKRLAIIPTGKLRYIPFETLY HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCEEECCHHHHH DNQTEQYLIAKYPIHYLTRISATRQEPKEPTKSLKVLAFGNPQPTEINLRGAEDEAKIIA CCCCCCEEEECCCHHHHHHHHHHHCCCCCHHHHEEEEECCCCCCCEEEEECCCHHHHHHH ENLSGKSLTREKATLSSFENESPGFPLVHLATHGCFQKGGCREQGLEENTILFANNKTFN HCCCCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHCCCCCHHCCCCCCEEEEECCCEEE IANAARLGLENTDLIALSACQTAMKADSNGEEIAGVAYLFERAGADAVIASLWNAEDGTT CCHHHHCCCCCCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCCC KDIMVKFYDNLKQGMTKVEALRQAKLSYARSDVSPFYWSPFILIGDGE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA