| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is yebC [C]
Identifier: 113475772
GI number: 113475772
Start: 3321810
End: 3322571
Strand: Reverse
Name: yebC [C]
Synonym: Tery_2125
Alternate gene names: 113475772
Gene position: 3322571-3321810 (Counterclockwise)
Preceding gene: 113475773
Following gene: 113475768
Centisome position: 42.87
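The coordinate fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that the gene length is `End - Start + 1` and that the centisome position is the gene's first transcribed coordinate (the higher coordinate on the reverse strand) expressed as a percentage of the chromosome length. The record does not state either formula; both assumptions simply reproduce the listed values. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 7_750_108          # Length
START, END = 3_321_810, 3_322_571  # Start / End (reverse strand)


def gene_length(start: int, end: int) -> int:
    """Length of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its length (assumed formula)."""
    return round(100 * position / genome_length, 2)


# Assumption: for a reverse-strand gene the reference point is the
# higher coordinate, matching "Gene position: 3322571-3321810".
reference_point = END

print(gene_length(START, END))                    # 762
print(centisome(reference_point, GENOME_LENGTH))  # 42.87
```

Using the lower coordinate (3321810) instead would give 42.86, so the listed value of 42.87 is consistent with the higher coordinate being the reference point.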
GC content: 41.34
Gene sequence:
>762_bases
ATGGCAGGACATAGTAAGTGGGCCAATATTAAACGCCAAAAAGCAAGAGTTGATGCTGTTAAAGGAAAAGTTTTCGCTAA
AGTATCTCGGCAAATTATTGTTGCTGCTCGCAGTGGAGCTGATCCTGCTGGTAATTTTCAGTTACGCACGGCGATAGAGA
AGGCAAAGACAGTAGGTATTCCCAATGATAATATTGAACGAGCGATCGCTAAAGGTTCAGGCCAATTAAATGATGGCAGT
CAATTGGAGGAGATTCGTTATGAGGGCTATGGTGCTGGAGGTATAGCAATTATAATTGAGGCCTTGACAGATAACCGCAA
CCGTACAGCAGCAGACCTAAGAAGTGCGTTTACCAAAAATGGTGGTAATTTAGGTGAGACTGGTTGCGTGAGTTGGATGT
TTGACCAAAAAGGTGTGGTTAGTATTACAGGAAGTTATGATGAGGATGAGTTACTCGAAGCATCTGTGGAAGGGGAAGCG
GAATATTATGAAGTAATTGCAGAAGATGATTTTCAAGGTGTGGAGGTTTTCACGGAAACTACTAATTTAGAAAATTTGAG
TCAGGTATTACAAGAGAAGGGCTTTGATATTAGTGAGGTAGAATTTCGGTGGGTTTCTGCTCATACTATTGAGGTGAGTG
ATCCGGAACAGGCGCGATCGCTTCTCAAGTTAATGGATGCTCTTGACGATCTTGATGATGTGCAGAATATTACTGCTAAT
TTCGATATAGCAAACAAACTTTTGAAGACTTTAGCTAGTTGA
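The GC content field (41.34) is presumably the percentage of G and C bases in the 762-base gene sequence, which would correspond to 315 G or C bases. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the formula is an assumption rather than something the record documents.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace is ignored, so a sequence pasted with the spaces or
    line breaks used in the record can be passed in directly.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return round(100 * gc_count / len(seq), 2)


# Usage with the first seven codons of the gene sequence above.
print(gc_content("ATGGCAGGACATAGTAAGTGG"))
```

Passing the full gene sequence from the record would be the way to compare the result against the listed value.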
Upstream 100 bases:
>100_bases
TCTGAGTACATTGTTTCAGCTCCCGGTACAGAGTATGTCAAGTGTAACTAGTTATGAAAATTACTGAAGTTAGGATATAT
TTTTTACAAGATTTGATTTT
Downstream 100 bases:
>100_bases
ATATTCTAAGTTATAGGGCTAGGAGCAAGTAAGATAGGATAGCAGTTAGATGGGCGAGGTACAGGAGTCAAAAATTAGAA
GTAATTTCTGAGGTGGTTTC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 253; Mature: 252
Protein sequence:
>253_residues
MAGHSKWANIKRQKARVDAVKGKVFAKVSRQIIVAARSGADPAGNFQLRTAIEKAKTVGIPNDNIERAIAKGSGQLNDGS
QLEEIRYEGYGAGGIAIIIEALTDNRNRTAADLRSAFTKNGGNLGETGCVSWMFDQKGVVSITGSYDEDELLEASVEGEA
EYYEVIAEDDFQGVEVFTETTNLENLSQVLQEKGFDISEVEFRWVSAHTIEVSDPEQARSLLKLMDALDDLDDVQNITAN
FDIANKLLKTLAS
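The 762-base gene sequence holds 254 codons: 253 amino acids plus a stop codon (TGA), which matches the translated length. The mature form is one residue shorter because the initiator methionine is removed. The sketch below shows how the protein can be derived from the gene sequence. It assumes the standard codon assignments (which the bacterial translation table 11 shares for all sense and stop codons); the record itself does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven and last three codons of the gene sequence in this record.
print(translate("ATGGCAGGACATAGTAAGTGG"))  # MAGHSKW
print(translate("GCTAGTTGA"))              # AS
```

Both fragments agree with the start (MAGHSKW) and end (AS) of the protein sequence listed in the record.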
Sequences:
>Translated_253_residues
MAGHSKWANIKRQKARVDAVKGKVFAKVSRQIIVAARSGADPAGNFQLRTAIEKAKTVGIPNDNIERAIAKGSGQLNDGS
QLEEIRYEGYGAGGIAIIIEALTDNRNRTAADLRSAFTKNGGNLGETGCVSWMFDQKGVVSITGSYDEDELLEASVEGEA
EYYEVIAEDDFQGVEVFTETTNLENLSQVLQEKGFDISEVEFRWVSAHTIEVSDPEQARSLLKLMDALDDLDDVQNITAN
FDIANKLLKTLAS
>Mature_252_residues
AGHSKWANIKRQKARVDAVKGKVFAKVSRQIIVAARSGADPAGNFQLRTAIEKAKTVGIPNDNIERAIAKGSGQLNDGSQ
LEEIRYEGYGAGGIAIIIEALTDNRNRTAADLRSAFTKNGGNLGETGCVSWMFDQKGVVSITGSYDEDELLEASVEGEAE
YYEVIAEDDFQGVEVFTETTNLENLSQVLQEKGFDISEVEFRWVSAHTIEVSDPEQARSLLKLMDALDDLDDVQNITANF
DIANKLLKTLAS
Specific function: Unknown
COG id: COG0217
COG function: function code S; Uncharacterized conserved protein
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the TACO1 family
Homologues:
Organism=Homo sapiens, GI27545315, Length=251, Percent_Identity=25.4980079681275, Blast_Score=86, Evalue=2e-17
Organism=Escherichia coli, GI1788171, Length=252, Percent_Identity=42.0634920634921, Blast_Score=194, Evalue=5e-51
Organism=Escherichia coli, GI1788294, Length=239, Percent_Identity=38.0753138075314, Blast_Score=141, Evalue=4e-35
Organism=Caenorhabditis elegans, GI17556100, Length=247, Percent_Identity=30.7692307692308, Blast_Score=87, Evalue=7e-18
Organism=Saccharomyces cerevisiae, GI6321458, Length=263, Percent_Identity=34.6007604562738, Blast_Score=127, Evalue=2e-30
Organism=Drosophila melanogaster, GI24583305, Length=255, Percent_Identity=30.1960784313725, Blast_Score=91, Evalue=8e-19
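The homologue entries use a repeating `Organism=..., GI..., key=value` layout, so they can be turned into structured records for sorting or filtering. The following is a rough sketch under that assumption; `parse_homologues` is a hypothetical helper, not part of any tool that produced this record, and the sample text is copied from two of the entries above.

```python
def parse_homologues(text: str) -> list[dict]:
    """Split a homologue listing into one dict per 'Organism=' entry."""
    records = []
    for chunk in text.split("Organism=")[1:]:
        fields = [f.strip() for f in chunk.split(",") if f.strip()]
        # First field is the organism name, second is the GI identifier.
        record = {"Organism": fields[0], "GI": fields[1]}
        for field in fields[2:]:
            key, value = field.split("=", 1)
            record[key] = float(value)  # also handles values like 5e-51
        records.append(record)
    return records


SAMPLE = (
    "Organism=Homo sapiens, GI27545315, Length=251, "
    "Percent_Identity=25.4980079681275, Blast_Score=86, Evalue=2e-17, "
    "Organism=Escherichia coli, GI1788171, Length=252, "
    "Percent_Identity=42.0634920634921, Blast_Score=194, Evalue=5e-51,"
)

homologues = parse_homologues(SAMPLE)
best = max(homologues, key=lambda rec: rec["Blast_Score"])
print(best["Organism"], best["GI"], round(best["Percent_Identity"], 1))
# Escherichia coli GI1788171 42.1
```

The parser works whether the entries sit on one line or on separate lines, because it splits on the `Organism=` token rather than on line breaks.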
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y2125_TRIEI (Q113G4)
Other databases:
- EMBL: CP000393
- RefSeq: YP_721833.1
- ProteinModelPortal: Q113G4
- SMR: Q113G4
- STRING: Q113G4
- GeneID: 4244097
- GenomeReviews: CP000393_GR
- KEGG: ter:Tery_2125
- NMPDR: fig|203124.1.peg.6260
- eggNOG: COG0217
- HOGENOM: HBG715231
- OMA: VYANFDI
- PhylomeDB: Q113G4
- ProtClustDB: PRK00110
- BioCyc: TERY203124:TERY_2125-MONOMER
- HAMAP: MF_00693
- InterPro: IPR002876
- InterPro: IPR017856
- Gene3D: G3DSA:1.10.10.200
- PANTHER: PTHR12532
- TIGRFAMs: TIGR01033
Pfam domain/function: PF01709 DUF28; SSF75625 DUF28
EC number: NA
Molecular weight: Translated: 27607; Mature: 27476
Theoretical pI: Translated: 4.37; Mature: 4.37
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
1.2 %Met (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
0.8 %Met (Mature Protein)
1.2 %Cys+Met (Mature Protein)
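These percentages can be reproduced by counting residues in the protein sequences listed earlier in the record. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal place; the helper name `residue_percent` is hypothetical. The translated sequence is copied from the record, and the mature form is taken to be the same sequence without the initial methionine, as the record's own mature sequence shows.

```python
TRANSLATED = (
    "MAGHSKWANIKRQKARVDAVKGKVFAKVSRQIIVAARSGADPAGNFQLRTAIEKAKTVGIPNDNIERAIAKGSGQLNDGS"
    "QLEEIRYEGYGAGGIAIIIEALTDNRNRTAADLRSAFTKNGGNLGETGCVSWMFDQKGVVSITGSYDEDELLEASVEGEA"
    "EYYEVIAEDDFQGVEVFTETTNLENLSQVLQEKGFDISEVEFRWVSAHTIEVSDPEQARSLLKLMDALDDLDDVQNITAN"
    "FDIANKLLKTLAS"
)
MATURE = TRANSLATED[1:]  # initiator methionine removed


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
# Translated 0.4 1.2 1.6
# Mature 0.4 0.8 1.2
```

The drop in methionine content from 1.2% to 0.8% reflects the loss of one of the three methionines when the first residue is removed.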
Secondary structure:
>Translated Secondary Structure
MAGHSKWANIKRQKARVDAVKGKVFAKVSRQIIVAARSGADPAGNFQLRTAIEKAKTVGI
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCCCHHHHHHHHHHCCCC
PNDNIERAIAKGSGQLNDGSQLEEIRYEGYGAGGIAIIIEALTDNRNRTAADLRSAFTKN
CCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHEEEEHHHCCCCCHHHHHHHHHHCC
GGNLGETGCVSWMFDQKGVVSITGSYDEDELLEASVEGEAEYYEVIAEDDFQGVEVFTET
CCCCCCCHHEEEEECCCCEEEEECCCCHHHHHHHHCCCCHHHHHHHHCCCCCCEEEEEEC
TNLENLSQVLQEKGFDISEVEFRWVSAHTIEVSDPEQARSLLKLMDALDDLDDVQNITAN
CCHHHHHHHHHHCCCCCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCC
FDIANKLLKTLAS
HHHHHHHHHHHCC
>Mature Secondary Structure
AGHSKWANIKRQKARVDAVKGKVFAKVSRQIIVAARSGADPAGNFQLRTAIEKAKTVGI
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCCCHHHHHHHHHHCCCC
PNDNIERAIAKGSGQLNDGSQLEEIRYEGYGAGGIAIIIEALTDNRNRTAADLRSAFTKN
CCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHEEEEHHHCCCCCHHHHHHHHHHCC
GGNLGETGCVSWMFDQKGVVSITGSYDEDELLEASVEGEAEYYEVIAEDDFQGVEVFTET
CCCCCCCHHEEEEECCCCEEEEECCCCHHHHHHHHCCCCHHHHHHHHCCCCCCEEEEEEC
TNLENLSQVLQEKGFDISEVEFRWVSAHTIEVSDPEQARSLLKLMDALDDLDDVQNITAN
CCHHHHHHHHHHCCCCCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCC
FDIANKLLKTLAS
HHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA