Definition | Methanocaldococcus jannaschii DSM 2661 chromosome, complete genome. |
---|---|
Accession | NC_000909 |
Length | 1,664,970 |
Click here to switch to the map view.
The map label for this gene is yfiQ [C]
Identifier: 15668770
GI number: 15668770
Start: 522027
End: 524141
Strand: Direct
Name: yfiQ [C]
Synonym: MJ0590
Alternate gene names: 15668770
Gene position: 522027-524141 (Clockwise)
Preceding gene: 15668769
Following gene: 15668771
Centisome position: 31.35
GC content: 31.35
Gene sequence:
>2115_bases ATGTGGGGGAGGGATTATGAGCTTAAATATATTTCCTATCCAAAATCAGTTGCTATTATTGGAGCTTCAAAAACTGAAGG AAAGGTTGGATATGCAATAATGAAAAATTTAAAAGACTTTAATGGAAAAATCTATCCCATAAATCCAAAATATGATGAAA TATTCGGAATAAAATGCTATAAATCAGTTTTGGACGTTGAGGATGACATAGATTTGGCAGTTATAGTAGTTCCAAATATT GTTGTTCCTAAGGTATTGGAAGAATGTGGAAAAAAAGGGGTTAAAGGGGCTGTAATTATTACAGCTGGCTTTTCAGAAGT AGGAAATTATGAGTTGGAAAATAAAATTAAAGAAATAGCAAAAAGATACAACATAAGAATTATAGGGCCTAATTGTTTAG GTATAATGAACACCCATATAAACTTAAATGCCACATTTGCGAAGGTATTTCCTCCAAAAGGAGGAGTTTCAATAATCTCA CAAAGTGGGGCTGTTTTAAATGCCATATTAGACATAGCCCCTTTATTGAATATTGGCTTTTCTAAAGTTGTTAGCATTGG AAATAAAGCTGATATTCAGGAAAGTGATTTATTAGAGTATTTTTTAGATGATGAAGATACTAAGATAGTTGTTTTATACA TAGAAGGATTAAAGGATAAGAGATTTTTAAAAGTAGCTAAAAAATTATCTAAGAAAAAGCCAATAATTGCCCTAAAATCT GGAAGAACTGAAGTAGGAAAGAAAGCGGCAAAATCCCACACTGGCTCTTTAGCTGGAGAAGATGTTATCTATGAGGCAGC GTTTAAAGAAGCTGGGATAATTAGGGCATATACGTTTGAGGAGTTAGTTGATTTAATCCATTTATTCTCAACACAGCCAA CAATAAGCTCAAATGAAATTGGAATAATAACAAATGCAGGAGGATTTGGAGTTTTAGCAGCTGATAGCTGTGTTGATTAT AACATGAAGCTATCTAACTTTGAAAAATCAACAATAGAAAAGCTTAAAAATATTCTGCCACCAACTGCCAATATATCAAA TCCATTGGATATTATAGGAGATGCCACACCAGAGAGATATAAAAAGGTTATAGAAGTTTTAGCTGAAGATAGCAATGTTA AGGGGCTTTTAGTTATCTTAACTCCACAAGAGATGACAAAACCATTAGAAGTTGCTAAATCTATTATAGAAGTTAAAAAT TCCCATAAAGAATTTAAAAATAAACCGTTAATTACTTCATTTGTTGGAGGAGTTTCAGTTAAAGGAGCTAAAAGTTATTT AAGGAAGAATGGAATCCCTGCATACATAACTCCAGAAAATGGTGTCAAAGCCCTATCTCATCTCTATAAATATAGCTTAA TGAAAGTTAAGGAAGATTATGATGAATACTTAGAAAATATTAAAGAAGAGTTCATAAAAATTACTGAAGAAAATAAAGAA ATTATTAAAGAATTATTATCAAATCCAAATGAATACACTGCTAAAAAATTATTAAGCATTTATGGTCTTCCAGTTCCTAA GGGCTATTTAGCTAAAAATGAAGATGAAGCTTTAGAATATTGCAAAAAATTAGGTAAATGCGTAATGAAAATTGTCTCAC CACAAATAATACATAAAACGGAGGCAGGAGGAGTTATAATAAATCCAAAAAATCCTAAAGAGGCATTTAAAAAATTAATT GAAAATGCTAAGGAATATGCAAAAAGAATGGGCATTGATAATTTAATTATAGAGGGAGTGTTAGTTGAAGAGTTCATTGA GAAAGATATGATGGAAATTATAATAGGGGCTAAGAGGGATGATATTTTTGGCTCTGTAGTTATGGTTGGGTTAGGAGGAG TATTTGTTGAGGTTTTAAAAGATGTATCTTTTGGCATTTCGCCAATAACAAGGGACTTTGCTCATGAGATGTTGAGGGAA TTGAAATCCTATAAAGTCTTAGAAGGCGTTAGAGGAAGACCTAAGAGAGATATTAACTTTATTGTTGATACCCTAATAAA GATTGGAGTATTTATGGATATTCACAAAGAGATTAAAGAGCTTGATTTAAACCCAGTATTTGTCTTTAATGAAAAAGAGG GAGGATGTATAGGTGATGCAAGAATAATTAAATAA
Upstream 100 bases:
>100_bases CTCCATAATCGTTATTTGTTTAGGAGGTGAGAGTATATAAATGTTAAAAGAGATAATAATTAGAATATTAAAAATCACTA TAAAAATTTTTCTCTGAAAA
Downstream 100 bases:
>100_bases ATTTAATTTATTTCTGAAAAAACAGATAAATCCACACAATAATAGAAGCTAACCGGTCCCAAAATTGGGTTCCCGGTTAG CACTCACTACTACCTTCTAT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 704; Mature: 704
Protein sequence:
>704_residues MWGRDYELKYISYPKSVAIIGASKTEGKVGYAIMKNLKDFNGKIYPINPKYDEIFGIKCYKSVLDVEDDIDLAVIVVPNI VVPKVLEECGKKGVKGAVIITAGFSEVGNYELENKIKEIAKRYNIRIIGPNCLGIMNTHINLNATFAKVFPPKGGVSIIS QSGAVLNAILDIAPLLNIGFSKVVSIGNKADIQESDLLEYFLDDEDTKIVVLYIEGLKDKRFLKVAKKLSKKKPIIALKS GRTEVGKKAAKSHTGSLAGEDVIYEAAFKEAGIIRAYTFEELVDLIHLFSTQPTISSNEIGIITNAGGFGVLAADSCVDY NMKLSNFEKSTIEKLKNILPPTANISNPLDIIGDATPERYKKVIEVLAEDSNVKGLLVILTPQEMTKPLEVAKSIIEVKN SHKEFKNKPLITSFVGGVSVKGAKSYLRKNGIPAYITPENGVKALSHLYKYSLMKVKEDYDEYLENIKEEFIKITEENKE IIKELLSNPNEYTAKKLLSIYGLPVPKGYLAKNEDEALEYCKKLGKCVMKIVSPQIIHKTEAGGVIINPKNPKEAFKKLI ENAKEYAKRMGIDNLIIEGVLVEEFIEKDMMEIIIGAKRDDIFGSVVMVGLGGVFVEVLKDVSFGISPITRDFAHEMLRE LKSYKVLEGVRGRPKRDINFIVDTLIKIGVFMDIHKEIKELDLNPVFVFNEKEGGCIGDARIIK
Sequences:
>Translated_704_residues MWGRDYELKYISYPKSVAIIGASKTEGKVGYAIMKNLKDFNGKIYPINPKYDEIFGIKCYKSVLDVEDDIDLAVIVVPNI VVPKVLEECGKKGVKGAVIITAGFSEVGNYELENKIKEIAKRYNIRIIGPNCLGIMNTHINLNATFAKVFPPKGGVSIIS QSGAVLNAILDIAPLLNIGFSKVVSIGNKADIQESDLLEYFLDDEDTKIVVLYIEGLKDKRFLKVAKKLSKKKPIIALKS GRTEVGKKAAKSHTGSLAGEDVIYEAAFKEAGIIRAYTFEELVDLIHLFSTQPTISSNEIGIITNAGGFGVLAADSCVDY NMKLSNFEKSTIEKLKNILPPTANISNPLDIIGDATPERYKKVIEVLAEDSNVKGLLVILTPQEMTKPLEVAKSIIEVKN SHKEFKNKPLITSFVGGVSVKGAKSYLRKNGIPAYITPENGVKALSHLYKYSLMKVKEDYDEYLENIKEEFIKITEENKE IIKELLSNPNEYTAKKLLSIYGLPVPKGYLAKNEDEALEYCKKLGKCVMKIVSPQIIHKTEAGGVIINPKNPKEAFKKLI ENAKEYAKRMGIDNLIIEGVLVEEFIEKDMMEIIIGAKRDDIFGSVVMVGLGGVFVEVLKDVSFGISPITRDFAHEMLRE LKSYKVLEGVRGRPKRDINFIVDTLIKIGVFMDIHKEIKELDLNPVFVFNEKEGGCIGDARIIK >Mature_704_residues MWGRDYELKYISYPKSVAIIGASKTEGKVGYAIMKNLKDFNGKIYPINPKYDEIFGIKCYKSVLDVEDDIDLAVIVVPNI VVPKVLEECGKKGVKGAVIITAGFSEVGNYELENKIKEIAKRYNIRIIGPNCLGIMNTHINLNATFAKVFPPKGGVSIIS QSGAVLNAILDIAPLLNIGFSKVVSIGNKADIQESDLLEYFLDDEDTKIVVLYIEGLKDKRFLKVAKKLSKKKPIIALKS GRTEVGKKAAKSHTGSLAGEDVIYEAAFKEAGIIRAYTFEELVDLIHLFSTQPTISSNEIGIITNAGGFGVLAADSCVDY NMKLSNFEKSTIEKLKNILPPTANISNPLDIIGDATPERYKKVIEVLAEDSNVKGLLVILTPQEMTKPLEVAKSIIEVKN SHKEFKNKPLITSFVGGVSVKGAKSYLRKNGIPAYITPENGVKALSHLYKYSLMKVKEDYDEYLENIKEEFIKITEENKE IIKELLSNPNEYTAKKLLSIYGLPVPKGYLAKNEDEALEYCKKLGKCVMKIVSPQIIHKTEAGGVIINPKNPKEAFKKLI ENAKEYAKRMGIDNLIIEGVLVEEFIEKDMMEIIIGAKRDDIFGSVVMVGLGGVFVEVLKDVSFGISPITRDFAHEMLRE LKSYKVLEGVRGRPKRDINFIVDTLIKIGVFMDIHKEIKELDLNPVFVFNEKEGGCIGDARIIK
Specific function: Unknown
COG id: COG1042
COG function: function code C; Acyl-CoA synthetase (NDP forming)
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: To E.coli yfiQ
Homologues:
Organism=Escherichia coli, GI1788938, Length=692, Percent_Identity=29.1907514450867, Blast_Score=311, Evalue=7e-86,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y590_METJA (Q58010)
Other databases:
- EMBL: L77117 - PIR: F64373 - RefSeq: NP_247570.1 - ProteinModelPortal: Q58010 - GeneID: 1451455 - GenomeReviews: L77117_GR - KEGG: mja:MJ_0590 - NMPDR: fig|243232.1.peg.607 - TIGR: MJ0590 - HOGENOM: HBG695583 - OMA: TPQEMTK - ProtClustDB: CLSK876257 - BioCyc: MJAN243232:MJ_0590-MONOMER - GO: GO:0005488 - InterPro: IPR014089 - InterPro: IPR013650 - InterPro: IPR003781 - InterPro: IPR016040 - InterPro: IPR005809 - InterPro: IPR016102 - Gene3D: G3DSA:3.40.50.720 - Gene3D: G3DSA:3.40.50.261 - PANTHER: PTHR11815 - SMART: SM00881 - TIGRFAMs: TIGR02717
Pfam domain/function: PF08442 ATP-grasp_2; PF02629 CoA_binding; SSF52210 CoA_ligase
EC number: NA
Molecular weight: Translated: 78173; Mature: 78173
Theoretical pI: Translated: 8.51; Mature: 8.51
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein) 1.8 %Met (Translated Protein) 2.8 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 1.8 %Met (Mature Protein) 2.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MWGRDYELKYISYPKSVAIIGASKTEGKVGYAIMKNLKDFNGKIYPINPKYDEIFGIKCY CCCCCEEEEEEECCCEEEEEECCCCCCCHHHHHHHHHHHHCCEEEECCCCHHHHHHHHHH KSVLDVEDDIDLAVIVVPNIVVPKVLEECGKKGVKGAVIITAGFSEVGNYELENKIKEIA HHHHCCCCCCCEEEEEECCCHHHHHHHHHCCCCCCEEEEEEECCHHHCCCHHHHHHHHHH KRYNIRIIGPNCLGIMNTHINLNATFAKVFPPKGGVSIISQSGAVLNAILDIAPLLNIGF HHCCEEEECCCCEEEEECEEEEEEEEEEECCCCCCCCEEECCCHHHHHHHHHHHHHHCCH SKVVSIGNKADIQESDLLEYFLDDEDTKIVVLYIEGLKDKRFLKVAKKLSKKKPIIALKS HHHHHCCCCCCCCHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHHHHHCCCCEEEEEC GRTEVGKKAAKSHTGSLAGEDVIYEAAFKEAGIIRAYTFEELVDLIHLFSTQPTISSNEI CCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCEEEEECHHHHHHHHHHHCCCCCCCCCCE GIITNAGGFGVLAADSCVDYNMKLSNFEKSTIEKLKNILPPTANISNPLDIIGDATPERY EEEECCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHH KKVIEVLAEDSNVKGLLVILTPQEMTKPLEVAKSIIEVKNSHKEFKNKPLITSFVGGVSV HHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCC KGAKSYLRKNGIPAYITPENGVKALSHLYKYSLMKVKEDYDEYLENIKEEFIKITEENKE HHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH IIKELLSNPNEYTAKKLLSIYGLPVPKGYLAKNEDEALEYCKKLGKCVMKIVSPQIIHKT HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCHHHEEEC EAGGVIINPKNPKEAFKKLIENAKEYAKRMGIDNLIIEGVLVEEFIEKDMMEIIIGAKRD CCCCEEECCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC DIFGSVVMVGLGGVFVEVLKDVSFGISPITRDFAHEMLRELKSYKVLEGVRGRPKRDINF HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH IVDTLIKIGVFMDIHKEIKELDLNPVFVFNEKEGGCIGDARIIK HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCEECCCEECC >Mature Secondary Structure MWGRDYELKYISYPKSVAIIGASKTEGKVGYAIMKNLKDFNGKIYPINPKYDEIFGIKCY CCCCCEEEEEEECCCEEEEEECCCCCCCHHHHHHHHHHHHCCEEEECCCCHHHHHHHHHH KSVLDVEDDIDLAVIVVPNIVVPKVLEECGKKGVKGAVIITAGFSEVGNYELENKIKEIA HHHHCCCCCCCEEEEEECCCHHHHHHHHHCCCCCCEEEEEEECCHHHCCCHHHHHHHHHH KRYNIRIIGPNCLGIMNTHINLNATFAKVFPPKGGVSIISQSGAVLNAILDIAPLLNIGF HHCCEEEECCCCEEEEECEEEEEEEEEEECCCCCCCCEEECCCHHHHHHHHHHHHHHCCH SKVVSIGNKADIQESDLLEYFLDDEDTKIVVLYIEGLKDKRFLKVAKKLSKKKPIIALKS HHHHHCCCCCCCCHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHHHHHCCCCEEEEEC GRTEVGKKAAKSHTGSLAGEDVIYEAAFKEAGIIRAYTFEELVDLIHLFSTQPTISSNEI CCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCEEEEECHHHHHHHHHHHCCCCCCCCCCE GIITNAGGFGVLAADSCVDYNMKLSNFEKSTIEKLKNILPPTANISNPLDIIGDATPERY EEEECCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHH KKVIEVLAEDSNVKGLLVILTPQEMTKPLEVAKSIIEVKNSHKEFKNKPLITSFVGGVSV HHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCC KGAKSYLRKNGIPAYITPENGVKALSHLYKYSLMKVKEDYDEYLENIKEEFIKITEENKE HHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH IIKELLSNPNEYTAKKLLSIYGLPVPKGYLAKNEDEALEYCKKLGKCVMKIVSPQIIHKT HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCHHHEEEC EAGGVIINPKNPKEAFKKLIENAKEYAKRMGIDNLIIEGVLVEEFIEKDMMEIIIGAKRD CCCCEEECCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC DIFGSVVMVGLGGVFVEVLKDVSFGISPITRDFAHEMLRELKSYKVLEGVRGRPKRDINF HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH IVDTLIKIGVFMDIHKEIKELDLNPVFVFNEKEGGCIGDARIIK HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCEECCCEECC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8688087