Definition | Methanocaldococcus jannaschii DSM 2661 chromosome, complete genome. |
---|---|
Accession | NC_000909 |
Length | 1,664,970 |
Click here to switch to the map view.
The map label for this gene is nth [C]
Identifier: 15668794
GI number: 15668794
Start: 543933
End: 544967
Strand: Reverse
Name: nth [C]
Synonym: MJ0613
Alternate gene names: 15668794
Gene position: 544967-543933 (Counterclockwise)
Preceding gene: 15668795
Following gene: 15668790
Centisome position: 32.73
GC content: 27.34
Gene sequence:
>1035_bases ATGGAGTTAATTGAAATATTGCTAAAAAAACTAAACAAAAATGCAGTAGTTACAGAGATAGCCAAAGATAAAGACCCTTT TAAGGTTTTAATATCAACTATAATAAGTGCAAGAACAAAGGATGAAGTAACTGAAGAGGTTTCTAAAAAACTATTTAAAG AGATTAAGGATGTTGATGATTTATTAAACATAGATGAAGAAAAATTAGCAGATTTGATATACCCAGCAGGATTTTATAAA AATAAGGCAAAAAATTTAAAAAAATTAGCCAAAATTTTAAAAGAAAATTATAATGGGAAAGTTCCAGATTCTTTGGAAGA GTTGTTAAAGCTCCCAGGGGTTGGAAGGAAAACAGCTAATTTGGTTATAACCTTAGCTTTCAACAAAGATGGGATTTGTG TAGATACCCATGTCCATAGGATATGTAATAGATGGGAAATAGTTGATACTGAGACTCCTGAAGAGACAGAGTTTGAATTA AGAAAAAAGCTTCCTAAAAAATATTGGAAAGTAATAAATAATTTGTTGGTGGTTTTTGGAAGGGAGATTTGTTCTTCAAA ATCTAAGTGTGATAAATGTTTTAAAGAAATTAAAGAGAAATGCCCTTACTATGAAAAAATTAAGCACTTTGAAAATATAT TAAAAAAATTCAATTTTAGAAAAGTCTCAAAAAACAAAATCCCTAATGAAAAAGGAACTTACATCTTAAAAATTAGGTTA AAAGAAGGTAAAAAAATAAAATTTGGAAAAACAGAGAGATTTTTTAAAAAAGGATATTATTTCTACATTGGCTCTGCCTT TGGAAATTCAATGAACTTAAAAAATAGGATAGAGAGGCATTTAAAGGATGATAAAAAGATGCACTGGCATATTGATTATT TATTAAAATATGGTAAGATTGAAGAGATTTATATTACAAATGAGAGAGTTGAGTGTGAGGTTGCAAATGAATTTATAAAA AAATTTGATTTTGTTGAGAACTTTGGATGTTCTGATTGTAAATGTAAGAGTCATTTATTTTATTTGAAACCATAG
Upstream 100 bases:
>100_bases AAAGAGAGTAAGTAATTTACTAAAAACTTTTTTATTATTTCTTTATAGAAAATTTAACAAAATTTTATTATTTTTGAAAG ATGCTAATTTTGGGATTCCT
Downstream 100 bases:
>100_bases AGGGGGCGTAGCCCCCTCTATGGTGTGGATACCCAGAGCGGGGCTTCACTACGTTCAGCCCCACTTAATTAAGAGGCATT GCCGAGTGAAGCGAGGTAAT
Product: endonuclease III
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 344; Mature: 344
Protein sequence:
>344_residues MELIEILLKKLNKNAVVTEIAKDKDPFKVLISTIISARTKDEVTEEVSKKLFKEIKDVDDLLNIDEEKLADLIYPAGFYK NKAKNLKKLAKILKENYNGKVPDSLEELLKLPGVGRKTANLVITLAFNKDGICVDTHVHRICNRWEIVDTETPEETEFEL RKKLPKKYWKVINNLLVVFGREICSSKSKCDKCFKEIKEKCPYYEKIKHFENILKKFNFRKVSKNKIPNEKGTYILKIRL KEGKKIKFGKTERFFKKGYYFYIGSAFGNSMNLKNRIERHLKDDKKMHWHIDYLLKYGKIEEIYITNERVECEVANEFIK KFDFVENFGCSDCKCKSHLFYLKP
Sequences:
>Translated_344_residues MELIEILLKKLNKNAVVTEIAKDKDPFKVLISTIISARTKDEVTEEVSKKLFKEIKDVDDLLNIDEEKLADLIYPAGFYK NKAKNLKKLAKILKENYNGKVPDSLEELLKLPGVGRKTANLVITLAFNKDGICVDTHVHRICNRWEIVDTETPEETEFEL RKKLPKKYWKVINNLLVVFGREICSSKSKCDKCFKEIKEKCPYYEKIKHFENILKKFNFRKVSKNKIPNEKGTYILKIRL KEGKKIKFGKTERFFKKGYYFYIGSAFGNSMNLKNRIERHLKDDKKMHWHIDYLLKYGKIEEIYITNERVECEVANEFIK KFDFVENFGCSDCKCKSHLFYLKP >Mature_344_residues MELIEILLKKLNKNAVVTEIAKDKDPFKVLISTIISARTKDEVTEEVSKKLFKEIKDVDDLLNIDEEKLADLIYPAGFYK NKAKNLKKLAKILKENYNGKVPDSLEELLKLPGVGRKTANLVITLAFNKDGICVDTHVHRICNRWEIVDTETPEETEFEL RKKLPKKYWKVINNLLVVFGREICSSKSKCDKCFKEIKEKCPYYEKIKHFENILKKFNFRKVSKNKIPNEKGTYILKIRL KEGKKIKFGKTERFFKKGYYFYIGSAFGNSMNLKNRIERHLKDDKKMHWHIDYLLKYGKIEEIYITNERVECEVANEFIK KFDFVENFGCSDCKCKSHLFYLKP
Specific function: Has Both An Apurinic And/Or Apyrimidinic Endonuclease Activity And A DNA N-Glycosylase Activity. Incises Damaged DNA At Cytosines, Thymines And Guanines. Acts On A Damaged Strand, 5' From The Damaged Site. Required For The Repair Of Both Oxidative DNA Da
COG id: COG0177
COG function: function code L; Predicted EndoIII-related endonuclease
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the Nth/MutY family
Homologues:
Organism=Homo sapiens, GI4505471, Length=185, Percent_Identity=40.5405405405405, Blast_Score=155, Evalue=5e-38, Organism=Escherichia coli, GI1787920, Length=205, Percent_Identity=34.1463414634146, Blast_Score=120, Evalue=2e-28, Organism=Escherichia coli, GI1789331, Length=227, Percent_Identity=25.9911894273128, Blast_Score=79, Evalue=4e-16, Organism=Caenorhabditis elegans, GI17554540, Length=170, Percent_Identity=44.7058823529412, Blast_Score=144, Evalue=6e-35, Organism=Saccharomyces cerevisiae, GI6319304, Length=178, Percent_Identity=38.7640449438202, Blast_Score=121, Evalue=1e-28, Organism=Saccharomyces cerevisiae, GI6324530, Length=207, Percent_Identity=36.7149758454106, Blast_Score=120, Evalue=3e-28, Organism=Drosophila melanogaster, GI45550361, Length=180, Percent_Identity=38.8888888888889, Blast_Score=126, Evalue=2e-29,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y613_METJA (Q58030)
Other databases:
- EMBL: L77117 - RefSeq: NP_247597.1 - ProteinModelPortal: Q58030 - SMR: Q58030 - GeneID: 1451479 - GenomeReviews: L77117_GR - KEGG: mja:MJ_0613 - NMPDR: fig|243232.1.peg.634 - TIGR: MJ0613 - HOGENOM: HBG541186 - OMA: CKCKSHL - ProtClustDB: CLSK876263 - BioCyc: MJAN243232:MJ_0613-MONOMER - GO: GO:0005622 - InterPro: IPR011257 - InterPro: IPR002837 - InterPro: IPR004036 - InterPro: IPR003651 - InterPro: IPR003265 - InterPro: IPR000445 - InterPro: IPR003583 - InterPro: IPR023170 - InterPro: IPR000305 - Gene3D: G3DSA:1.10.340.30 - Gene3D: G3DSA:1.10.1670.10 - SMART: SM00478 - SMART: SM00525 - SMART: SM00465 - SMART: SM00278
Pfam domain/function: PF01986 DUF123; PF00633 HHH; PF00730 HhH-GPD; SSF48150 DNA_glycsylse
EC number: 4.2.99.18
Molecular weight: Translated: 40568; Mature: 40568
Theoretical pI: Translated: 9.81; Mature: 9.81
Prosite motif: PS00764 ENDONUCLEASE_III_1; PS01155 ENDONUCLEASE_III_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.9 %Cys (Translated Protein) 0.9 %Met (Translated Protein) 3.8 %Cys+Met (Translated Protein) 2.9 %Cys (Mature Protein) 0.9 %Met (Mature Protein) 3.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MELIEILLKKLNKNAVVTEIAKDKDPFKVLISTIISARTKDEVTEEVSKKLFKEIKDVDD CHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LLNIDEEKLADLIYPAGFYKNKAKNLKKLAKILKENYNGKVPDSLEELLKLPGVGRKTAN HHCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCCC LVITLAFNKDGICVDTHVHRICNRWEIVDTETPEETEFELRKKLPKKYWKVINNLLVVFG EEEEEEECCCCEEEHHHHHHHHCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH REICSSKSKCDKCFKEIKEKCPYYEKIKHFENILKKFNFRKVSKNKIPNEKGTYILKIRL HHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCEEEEEEE KEGKKIKFGKTERFFKKGYYFYIGSAFGNSMNLKNRIERHLKDDKKMHWHIDYLLKYGKI CCCCEECCCCHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHCCCCHHEEHHHHHHHCCCE EEIYITNERVECEVANEFIKKFDFVENFGCSDCKCKSHLFYLKP EEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECC >Mature Secondary Structure MELIEILLKKLNKNAVVTEIAKDKDPFKVLISTIISARTKDEVTEEVSKKLFKEIKDVDD CHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LLNIDEEKLADLIYPAGFYKNKAKNLKKLAKILKENYNGKVPDSLEELLKLPGVGRKTAN HHCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCCC LVITLAFNKDGICVDTHVHRICNRWEIVDTETPEETEFELRKKLPKKYWKVINNLLVVFG EEEEEEECCCCEEEHHHHHHHHCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH REICSSKSKCDKCFKEIKEKCPYYEKIKHFENILKKFNFRKVSKNKIPNEKGTYILKIRL HHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCEEEEEEE KEGKKIKFGKTERFFKKGYYFYIGSAFGNSMNLKNRIERHLKDDKKMHWHIDYLLKYGKI CCCCEECCCCHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHCCCCHHEEHHHHHHHCCCE EEIYITNERVECEVANEFIKKFDFVENFGCSDCKCKSHLFYLKP EEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: 4Fe-4S Cluster. [C]
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Carbon-Oxygen Lyase [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8688087