| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is prfA
Identifier: 113476549
GI number: 113476549
Start: 4637845
End: 4638954
Strand: Reverse
Name: prfA
Synonym: Tery_2982
Alternate gene names: 113476549
Gene position: 4638954-4637845 (Counterclockwise)
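The Start and End fields above are consistent with the 1110-base gene sequence given in this record if the coordinates are read as 1-based and inclusive. A minimal sketch of that check follows; the function name `gene_length` is illustrative and not part of the source database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates.

    abs() makes the result independent of strand, since reverse-strand
    genes are listed here with the larger coordinate first.
    """
    return abs(end - start) + 1

start, end = 4637845, 4638954      # Start / End fields of this record
length = gene_length(start, end)
print(length)                      # 1110
print(length % 3 == 0)             # True: a whole number of codons
```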
Preceding gene: 113476550
Following gene: 113476547
Centisome position: 59.86
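The centisome value appears to be the gene's position expressed as a percentage of the chromosome length (7,750,108). A small sketch under that assumption; the record does not state the formula, and `centisome` is a hypothetical helper name. The reported 59.86 is reproduced when the larger coordinate (the first base of this reverse-strand gene) is used.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)

genome_length = 7750108                    # Length field of this record
print(centisome(4638954, genome_length))   # 59.86 (matches the record)
print(centisome(4637845, genome_length))   # 59.84
```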
GC content: 39.82
Gene sequence:
>1110_bases
ATGGCTGAAACATATTTATTAGATAAGTTAAAATCTGTAGAGCAAACATATAATGAATTAACTCTGCGTTTGGCCGATCC
AGATGTTGCTAAAGACCCCAGTGAGTTCCAAAAATTAGCTAAAGCACGTTCTTCCCTGGAAGAAGTTGTAAATTGTTATG
TAGAGTGGAAAAATGCTCAAGAAGAATTAGCTGATGCTAAGGAAATATTGAAAGAAGCTGTTGGCGATTTGGAAATGCAA
GAGATGGCCAAAGTTGAAGTAGAAGATTTGGAAGCCAAACTAGAATCTCTAGAAAATCAAATGAAAATTGCTCTACTGCC
ACGAGACCCCAACGATGATAAGAATATTATGTTGGAGATTAGAGCTGGTACAGGAGGAGATGAAGCTAGTATTTGGGCAG
GAGATCTCGTTAGAATGTACTCCCGCTACTCAGAGAACCAAAGCTGGAAGGTGAGCTTGCTGAGTGAATCTTTAGCAGAT
ATGGGCGGTTTTAAAGAAGCAATTTTAGAAATTAAAGGGGATCACGTTTATAGTAAGCTCAAATTTGAAGCAGGAGTTCA
TCGGGTTCAAAGAGTGCCAGTTACAGAAGCAGGTGGAAGAGTACATACTTCTACTGCTACAGTGGCGATAATGCCAGAGG
TGGATGATGTGGAAGTTGAAATAGACCAAAAGGATATTGAATTGTCAACTGCTCGTTCTGGTGGAGCTGGTGGACAAAAT
GTCAACAAGGTTGAAACTGCTGTAGATTTGTTTCACAAGCCTACGGGAATTAGAATTTTTTGTACCCAGGAGCGGAGCCA
GCTACAAAACAGGGAGCGAGCAATGCAGATTTTGCGGGCTAAACTTTATGAGATTAAGTTACAAGAGCAACAAGCAGAAG
TGAGTTCTATAAGGCGATCGCAAGTAGGTACTGGTTCCCGTTCTGAAAAAATTCGTACTTATAATTATAAAGATAATCGG
GTGACAGATCATCGCTTAAATCAGAACTTTTCTCTTGTACCTCTTTTGGAGGGAGATATAGAAAATGTTATTCAAGCTTG
TATTACCCAAGATCAACAGGAGCGTTTACAAGAGTTAGCAGCATCTAGTTCTACTCCTATTTCAGTATAG
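The GC content field can be recomputed from the gene sequence above. The sketch below assumes GC content means (G + C) / total bases, expressed as a percentage; `gc_content` is an illustrative name. Under that definition, 39.82 over 1110 bases corresponds to 442 G or C bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)

# Toy input; paste the full 1110-base sequence to check the record's value.
print(gc_content("ATGC"))             # 50.0
print(round(100.0 * 442 / 1110, 2))   # 39.82
```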
Upstream 100 bases:
>100_bases
TTTTGCATTTTATCCATCGCTATAATCAATCATTACTTGTTTAGCACTACTGCAAAGGACAACGGACAATGGCTGAAACA
TATTTAAAGGACAACTTATA
Downstream 100 bases:
>100_bases
CTAATATGTTCATTCCCCTTTATGAGAGGAATTGTGGCACTTTTTCTCAAATCTTCTCAGGGTTGTGTTTTGGATTTAGG
AAAAACACCAATAGATCGGC
Product: peptide chain release factor 1
Products: NA
Alternate protein names: RF-1
Number of amino acids: Translated: 369; Mature: 368
Protein sequence:
>369_residues
MAETYLLDKLKSVEQTYNELTLRLADPDVAKDPSEFQKLAKARSSLEEVVNCYVEWKNAQEELADAKEILKEAVGDLEMQ
EMAKVEVEDLEAKLESLENQMKIALLPRDPNDDKNIMLEIRAGTGGDEASIWAGDLVRMYSRYSENQSWKVSLLSESLAD
MGGFKEAILEIKGDHVYSKLKFEAGVHRVQRVPVTEAGGRVHTSTATVAIMPEVDDVEVEIDQKDIELSTARSGGAGGQN
VNKVETAVDLFHKPTGIRIFCTQERSQLQNRERAMQILRAKLYEIKLQEQQAEVSSIRRSQVGTGSRSEKIRTYNYKDNR
VTDHRLNQNFSLVPLLEGDIENVIQACITQDQQERLQELAASSSTPISV
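The protein sequence above is the conceptual translation of the 1110-base gene: 370 codons, of which the last is a stop codon, giving 369 residues. The sketch below translates DNA with the standard codon assignments and stops at the first stop codon. It is an illustration rather than the database's own pipeline; the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the input is already in the coding (5' to 3') orientation, as the gene sequence in this record is.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: translation terminates here
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the gene sequence in this record.
print(translate("ATGGCTGAAACATATTTATTAGATAAG"))  # MAETYLLDK
# Last three codons: the gene ends in TAG, one of the two stop codons RF-1 reads.
print(translate("TCAGTATAG"))                    # SV
```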
Sequences:
>Translated_369_residues
MAETYLLDKLKSVEQTYNELTLRLADPDVAKDPSEFQKLAKARSSLEEVVNCYVEWKNAQEELADAKEILKEAVGDLEMQ
EMAKVEVEDLEAKLESLENQMKIALLPRDPNDDKNIMLEIRAGTGGDEASIWAGDLVRMYSRYSENQSWKVSLLSESLAD
MGGFKEAILEIKGDHVYSKLKFEAGVHRVQRVPVTEAGGRVHTSTATVAIMPEVDDVEVEIDQKDIELSTARSGGAGGQN
VNKVETAVDLFHKPTGIRIFCTQERSQLQNRERAMQILRAKLYEIKLQEQQAEVSSIRRSQVGTGSRSEKIRTYNYKDNR
VTDHRLNQNFSLVPLLEGDIENVIQACITQDQQERLQELAASSSTPISV
>Mature_368_residues
AETYLLDKLKSVEQTYNELTLRLADPDVAKDPSEFQKLAKARSSLEEVVNCYVEWKNAQEELADAKEILKEAVGDLEMQE
MAKVEVEDLEAKLESLENQMKIALLPRDPNDDKNIMLEIRAGTGGDEASIWAGDLVRMYSRYSENQSWKVSLLSESLADM
GGFKEAILEIKGDHVYSKLKFEAGVHRVQRVPVTEAGGRVHTSTATVAIMPEVDDVEVEIDQKDIELSTARSGGAGGQNV
NKVETAVDLFHKPTGIRIFCTQERSQLQNRERAMQILRAKLYEIKLQEQQAEVSSIRRSQVGTGSRSEKIRTYNYKDNRV
TDHRLNQNFSLVPLLEGDIENVIQACITQDQQERLQELAASSSTPISV
Specific function: Peptide chain release factor 1 directs the termination of translation in response to the peptide chain termination codons UAG and UAA
COG id: COG0216
COG function: function code J; Protein chain release factor A
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the prokaryotic/mitochondrial release factor family
Homologues:
Organism=Homo sapiens, GI166795303, Length=281, Percent_Identity=41.6370106761566, Blast_Score=236, Evalue=3e-62
Organism=Homo sapiens, GI34577120, Length=254, Percent_Identity=41.7322834645669, Blast_Score=222, Evalue=4e-58
Organism=Homo sapiens, GI166795305, Length=196, Percent_Identity=37.7551020408163, Blast_Score=152, Evalue=5e-37
Organism=Escherichia coli, GI1787462, Length=357, Percent_Identity=42.5770308123249, Blast_Score=301, Evalue=3e-83
Organism=Escherichia coli, GI2367172, Length=334, Percent_Identity=37.125748502994, Blast_Score=208, Evalue=4e-55
Organism=Caenorhabditis elegans, GI17542784, Length=302, Percent_Identity=36.7549668874172, Blast_Score=194, Evalue=5e-50
Organism=Saccharomyces cerevisiae, GI6321295, Length=336, Percent_Identity=37.797619047619, Blast_Score=207, Evalue=3e-54
Organism=Drosophila melanogaster, GI19921226, Length=364, Percent_Identity=34.8901098901099, Blast_Score=209, Evalue=3e-54
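The homologue hits above follow a fixed six-field layout (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so they can be parsed mechanically. The sketch below is one way to do that with a regular expression; the field names come from the record, while `HIT` and `parse_homologues` are hypothetical names introduced for illustration. It tolerates hits separated by either commas or line breaks.

```python
import re

# One hit = six comma-separated fields, as laid out in the Homologues field.
HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)

def parse_homologues(text: str) -> list[dict]:
    """Return one dict per BLAST hit found in the Homologues text."""
    hits = []
    for m in HIT.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits

# Two hits copied from this record.
sample = (
    "Organism=Homo sapiens, GI166795303, Length=281, "
    "Percent_Identity=41.6370106761566, Blast_Score=236, Evalue=3e-62, "
    "Organism=Escherichia coli, GI1787462, Length=357, "
    "Percent_Identity=42.5770308123249, Blast_Score=301, Evalue=3e-83,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"])  # Escherichia coli 1787462
```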
Paralogues:
None
Copy number: 1,800 Molecules/Cell In: Glucose minimal media [C]
Swissprot (AC and ID): RF1_TRIEI (Q110D7)
Other databases:
- EMBL: CP000393
- RefSeq: YP_722610.1
- ProteinModelPortal: Q110D7
- SMR: Q110D7
- STRING: Q110D7
- GeneID: 4245098
- GenomeReviews: CP000393_GR
- KEGG: ter:Tery_2982
- NMPDR: fig|203124.1.peg.5476
- eggNOG: COG0216
- HOGENOM: HBG629764
- OMA: SEQGGYK
- ProtClustDB: PRK00591
- BioCyc: TERY203124:TERY_2982-MONOMER
- GO: GO:0005737
- HAMAP: MF_00093
- InterPro: IPR005139
- InterPro: IPR000352
- InterPro: IPR004373
- SMART: SM00937
- TIGRFAMs: TIGR00019
Pfam domain/function: PF03462 PCRF; PF00472 RF-1
EC number: NA
Molecular weight: Translated: 41475; Mature: 41344
Theoretical pI: Translated: 4.63; Mature: 4.63
Prosite motif: PS00745 RF_PROK_I
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
2.4 %Met (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
3.0 %Cys+Met (Mature Protein)
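The Cys/Met percentages above can be reproduced from residue counts. Counting by hand in the translated sequence of this record gives 3 Cys and 9 Met in 369 residues; the mature form lacks the initiator Met, leaving 8 Met in 368 residues. A minimal sketch, assuming each percentage is simply count / length rounded to one decimal; `residue_percent` is an illustrative helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)

# From the hand counts: translated protein (369 residues).
print(round(100.0 * 3 / 369, 1))    # 0.8  %Cys
print(round(100.0 * 9 / 369, 1))    # 2.4  %Met
print(round(100.0 * 12 / 369, 1))   # 3.3  %Cys+Met
# Mature protein (368 residues, initiator Met removed).
print(round(100.0 * 8 / 368, 1))    # 2.2  %Met
print(round(100.0 * 11 / 368, 1))   # 3.0  %Cys+Met
```

Calling `residue_percent(sequence, "CM")` on the full protein sequence gives the combined figure directly.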
Secondary structure:
>Translated Secondary Structure
MAETYLLDKLKSVEQTYNELTLRLADPDVAKDPSEFQKLAKARSSLEEVVNCYVEWKNAQ
CCCHHHHHHHHHHHHHHHHEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH
EELADAKEILKEAVGDLEMQEMAKVEVEDLEAKLESLENQMKIALLPRDPNDDKNIMLEI
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCEEEEEE
RAGTGGDEASIWAGDLVRMYSRYSENQSWKVSLLSESLADMGGFKEAILEIKGDHVYSKL
EECCCCCCCHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHCCCHHHHHHHHCCCHHHHHH
KFEAGVHRVQRVPVTEAGGRVHTSTATVAIMPEVDDVEVEIDQKDIELSTARSGGAGGQN
HHHHHHHHHHCCCCCCCCCEEEECCEEEEEECCCCCCEEEECCCCCEEEECCCCCCCCCC
VNKVETAVDLFHKPTGIRIFCTQERSQLQNRERAMQILRAKLYEIKLQEQQAEVSSIRRS
CHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QVGTGSRSEKIRTYNYKDNRVTDHRLNQNFSLVPLLEGDIENVIQACITQDQQERLQELA
HCCCCCCCCCEEEECCCCCCCCHHHCCCCCCEEEEECCCHHHHHHHHHCCHHHHHHHHHH
ASSSTPISV
CCCCCCCCC
>Mature Secondary Structure
AETYLLDKLKSVEQTYNELTLRLADPDVAKDPSEFQKLAKARSSLEEVVNCYVEWKNAQ
CCHHHHHHHHHHHHHHHHEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH
EELADAKEILKEAVGDLEMQEMAKVEVEDLEAKLESLENQMKIALLPRDPNDDKNIMLEI
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCEEEEEE
RAGTGGDEASIWAGDLVRMYSRYSENQSWKVSLLSESLADMGGFKEAILEIKGDHVYSKL
EECCCCCCCHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHCCCHHHHHHHHCCCHHHHHH
KFEAGVHRVQRVPVTEAGGRVHTSTATVAIMPEVDDVEVEIDQKDIELSTARSGGAGGQN
HHHHHHHHHHCCCCCCCCCEEEECCEEEEEECCCCCCEEEECCCCCEEEECCCCCCCCCC
VNKVETAVDLFHKPTGIRIFCTQERSQLQNRERAMQILRAKLYEIKLQEQQAEVSSIRRS
CHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QVGTGSRSEKIRTYNYKDNRVTDHRLNQNFSLVPLLEGDIENVIQACITQDQQERLQELA
HCCCCCCCCCEEEECCCCCCCCHHHCCCCCCEEEEECCCHHHHHHHHHCCHHHHHHHHHH
ASSSTPISV
CCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA