Definition | Jannaschia sp. CCS1 chromosome, complete genome. |
---|---|
Accession | NC_007802 |
Length | 4,317,977 |
The map label for this gene is nqo3 [H]
Identifier: 89053674
GI number: 89053674
Start: 1144969
End: 1146987
Strand: Direct
Name: nqo3 [H]
Synonym: Jann_1183
Alternate gene names: 89053674
Gene position: 1144969-1146987 (Clockwise)
Preceding gene: 89053673
Following gene: 89053675
Centisome position: 26.52
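The coordinate fields in this record are related by simple arithmetic. As a minimal sketch, the gene length and centisome position can be recomputed from the start, end, and genome length. The function names are illustrative. The 1-based, inclusive coordinate convention is an assumption; it reproduces the 2019-base length given for the gene sequence.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start expressed as a percentage of the genome length."""
    return 100.0 * start / genome_length


# Values taken from the Start, End, and Length fields of this record.
length = gene_length(1144969, 1146987)
position = round(centisome(1144969, 4317977), 2)
print(length, position)
```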
GC content: 65.03
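GC content is the percentage of bases that are G or C. The sketch below shows one way to compute it. The helper name `gc_content` is hypothetical, and the short example string is only the first 12 bases of the gene sequence in this record, so its value is not expected to match the full-gene figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence (illustration only).
example = "ATGACCGACCTG"
print(round(gc_content(example), 2))
```

Passing the complete gene sequence, with its space-separated chunks, should give a value comparable to the GC content field, assuming that field was computed over the coding sequence alone.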
Gene sequence:
>2019_bases ATGACCGACCTGCGCACGATCATCATCGACGACAACGAGGTCGAGGTTGATCCCGCCATGACCCTGATCCAGGCCTGTGA GCAGGCCGGGATCGAGATCCCACGGTTCTGTTATCATGAGCGTCTGACCATCGCGGGCAATTGCCGCATGTGTCTGGTCG AAGTCGTGGGCGGGCCGCCGAAACCTGCGGCAAGCTGCGCGATGCAGGTGAAGGATCTGCGCCCCGGCCCCGAGGGCGCG CCGCCGGTGATCAAGACGAACTCGCCCATGGTCAAGAAGGCCCGCGAGGGGGTGATGGAGTTTCTGCTGATCAACCATCC GCTGGATTGCCCGATCTGCGATCAGGGCGGCGAGTGCGATTTGCAGGATCAGGCGATGGCGTATGGCGTGGATTTCTCGC GCTTCCGCGAGCCCAAGCGCGCCGTGGATAACCTGGAACTTGGCCCGCTGGTCGGCACCGCGATGACGCGCTGCATTTCC TGCACCCGCTGCGTGCGGTTTATCACTGAGGTCGCGGGCATGCCCGAGATGGGCCAGACCGGCCGGGGCGAGGATGCGGA GATCACCTCCTATCTGGGGGCAACGCTGGAATCAGAGATGCAGGGCAATATCGTGGATCTGTGCCCCGTCGGCGCGCTGA CCAACAAGCCCTACAGCTTCACGGCACGCCCGTGGGAGTTGACGAAGACGGAAAGCATCGACGTGATGGATGCGCTTGGC TCCAACATCCGGGTGGACACCAAGGGCCGCGAAGTGATGCGCATCCTACCGCGCAACCATGACGGCGTGAATGAGGAATG GCTGAGCGACAAGTCCCGCTATATCTGGGACGGGTTGAAGCGCCAGCGTCTGGACCAGCCCTATATCCGTGAAAACGGCA AGCTGCGCCCGGCCAGCTGGGGCGAGGCGATGGGCCTTGCGGCGTCGGAGATCAAGGGGGCCACGAAGCTGGCCGGGTTG GTGGGCGATCTGGCCTCCACCGAGGCGGCGTTTGCGTTGAAATCCCTGGTCGAGGGGCAGGGCGGTGTCGTGGAATGCCG CACCGACGGCGCGAAGCTGCCCGCCGGCAACCGCGCGGCCTACGTTGGCACCGCAAGCATCGAGGATATCGATGCGGCTG AATATATCCAGCTGATCGGCACCAATCCCCGCGCCGAGGCTCCGGTCCTGAACGCCCGTCTGCGCAAGGCGTGGCTGCGG GGCGCAAAGATCGGGCTGGTGGGCGAGGCCGTGGACCTGACCTATGAGTATGCCCATGTCGGCACCGGGTTCCCGGCGCT GCGGACCCTTGCGGATCAGCAATATGATCAGGTGCTTGAGGCCAGTTCTCTGGTGATCGTGGGTCAGGGCGCTTTGACTG GCGAGGGCGGGGCGGATGCACTGGCGCTTGCGATGCGAATGGCCGAACGGTCCCGGTCGGGTTTGCTTGTCCTGCATACG GCGGCGGGCCGCGTGGGTGCCATGGATGTGGGGGCGACCAACTCCGATGGGATGGCCACCGTGCAGGATGCGGATGTGAT TTACAACATGGGTGCGGATGAGGTGGAGGTCTCCACCGGGTCTTTTGTCATTTACCAAGGCTCCCACGGGGATCGGGGTG CGCACCGCGCGGATGTGATCCTGCCGGGCGCGGCCTACACCGAGGAGACCGGCATCTTCGTGAACACCGAAGGCCGCCCG CAAATGGCGCAGCGCGCGGGCTTCCCCCCGGGTGATGCGCGCGAGAACTGGGCGATCCTGCGGGCCCTGTCGGCGGAGGT CGGCGCGACGTTGCCGTTTGATACCATTGCGGCGCTGCGCAAGGCGATGATGGCAGAGGCGCCGCATCTGAAGATGATCG ACGAGGTGGCCGAGAATGAAGGCGAGGCGCTGGAGATCACGGATTTGGGGCAGGGGGATTTCACCAATGCGGTCAGCGAT 
CACTACCTGACCAATCCGATCGCGCGGGCCTCGGGGCTGATGGCGGAGCTGAGCGCGGGCGCGAAGGCGCGTGGCCAGTC CAGGATCGCGGCGGAGTAA
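The protein sequence listed in this record can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and an ATG start codon, as this gene has. The names `CODON_TABLE` and `translate` are illustrative, and alternative bacterial start codons such as GTG or TTG are not handled.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between chunks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First nine and last four codons of the gene sequence in this record.
print(translate("ATGACCGACCTGCGCACGATCATCATC"))
print(translate("GCGGCGGAGTAA"))
```

The 2019 bases correspond to 673 codons, that is, 672 residues plus the terminal TAA stop codon.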
Upstream 100 bases:
>100_bases TTACCGCAGGGACGCCGATTGGCAGATTGCTGCGGTCGCGTTAGAAGCGGGGTCAGACGGTGGCGCGCGCCTGCGCCAAT GACATGAGGGACCGACCGCC
Downstream 100 bases:
>100_bases GGGGGATGCGGGTCGCTCCGCTCCGCTCAGCCCGCTTTTCCACCCTGGCGGGTGTCAGCGCGGCATGTCTGATCGGGTGT CTGCCCGCCACCGAGGGGCC
Product: NADH dehydrogenase subunit G
Products: NA
Alternate protein names: NADH dehydrogenase I, chain 3; NDH-1, chain 3 [H]
Number of amino acids: Translated: 672; Mature: 671
Protein sequence:
>672_residues MTDLRTIIIDDNEVEVDPAMTLIQACEQAGIEIPRFCYHERLTIAGNCRMCLVEVVGGPPKPAASCAMQVKDLRPGPEGA PPVIKTNSPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYGVDFSRFREPKRAVDNLELGPLVGTAMTRCIS CTRCVRFITEVAGMPEMGQTGRGEDAEITSYLGATLESEMQGNIVDLCPVGALTNKPYSFTARPWELTKTESIDVMDALG SNIRVDTKGREVMRILPRNHDGVNEEWLSDKSRYIWDGLKRQRLDQPYIRENGKLRPASWGEAMGLAASEIKGATKLAGL VGDLASTEAAFALKSLVEGQGGVVECRTDGAKLPAGNRAAYVGTASIEDIDAAEYIQLIGTNPRAEAPVLNARLRKAWLR GAKIGLVGEAVDLTYEYAHVGTGFPALRTLADQQYDQVLEASSLVIVGQGALTGEGGADALALAMRMAERSRSGLLVLHT AAGRVGAMDVGATNSDGMATVQDADVIYNMGADEVEVSTGSFVIYQGSHGDRGAHRADVILPGAAYTEETGIFVNTEGRP QMAQRAGFPPGDARENWAILRALSAEVGATLPFDTIAALRKAMMAEAPHLKMIDEVAENEGEALEITDLGQGDFTNAVSD HYLTNPIARASGLMAELSAGAKARGQSRIAAE
Sequences:
>Translated_672_residues MTDLRTIIIDDNEVEVDPAMTLIQACEQAGIEIPRFCYHERLTIAGNCRMCLVEVVGGPPKPAASCAMQVKDLRPGPEGA PPVIKTNSPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYGVDFSRFREPKRAVDNLELGPLVGTAMTRCIS CTRCVRFITEVAGMPEMGQTGRGEDAEITSYLGATLESEMQGNIVDLCPVGALTNKPYSFTARPWELTKTESIDVMDALG SNIRVDTKGREVMRILPRNHDGVNEEWLSDKSRYIWDGLKRQRLDQPYIRENGKLRPASWGEAMGLAASEIKGATKLAGL VGDLASTEAAFALKSLVEGQGGVVECRTDGAKLPAGNRAAYVGTASIEDIDAAEYIQLIGTNPRAEAPVLNARLRKAWLR GAKIGLVGEAVDLTYEYAHVGTGFPALRTLADQQYDQVLEASSLVIVGQGALTGEGGADALALAMRMAERSRSGLLVLHT AAGRVGAMDVGATNSDGMATVQDADVIYNMGADEVEVSTGSFVIYQGSHGDRGAHRADVILPGAAYTEETGIFVNTEGRP QMAQRAGFPPGDARENWAILRALSAEVGATLPFDTIAALRKAMMAEAPHLKMIDEVAENEGEALEITDLGQGDFTNAVSD HYLTNPIARASGLMAELSAGAKARGQSRIAAE >Mature_671_residues TDLRTIIIDDNEVEVDPAMTLIQACEQAGIEIPRFCYHERLTIAGNCRMCLVEVVGGPPKPAASCAMQVKDLRPGPEGAP PVIKTNSPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYGVDFSRFREPKRAVDNLELGPLVGTAMTRCISC TRCVRFITEVAGMPEMGQTGRGEDAEITSYLGATLESEMQGNIVDLCPVGALTNKPYSFTARPWELTKTESIDVMDALGS NIRVDTKGREVMRILPRNHDGVNEEWLSDKSRYIWDGLKRQRLDQPYIRENGKLRPASWGEAMGLAASEIKGATKLAGLV GDLASTEAAFALKSLVEGQGGVVECRTDGAKLPAGNRAAYVGTASIEDIDAAEYIQLIGTNPRAEAPVLNARLRKAWLRG AKIGLVGEAVDLTYEYAHVGTGFPALRTLADQQYDQVLEASSLVIVGQGALTGEGGADALALAMRMAERSRSGLLVLHTA AGRVGAMDVGATNSDGMATVQDADVIYNMGADEVEVSTGSFVIYQGSHGDRGAHRADVILPGAAYTEETGIFVNTEGRPQ MAQRAGFPPGDARENWAILRALSAEVGATLPFDTIAALRKAMMAEAPHLKMIDEVAENEGEALEITDLGQGDFTNAVSDH YLTNPIARASGLMAELSAGAKARGQSRIAAE
Specific function: NDH-1 shuttles electrons from NADH, via FMN and iron-sulfur (Fe-S) centers, to quinones in the respiratory chain. The immediate electron acceptor for the enzyme in this species is believed to be ubiquinone. Couples the redox reaction to proton translocat
COG id: COG1034
COG function: function code C; NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
Gene ontology:
Cell location: Cell inner membrane; Peripheral membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 2Fe-2S ferredoxin-type domain [H]
Homologues:
Organism=Homo sapiens, GI33519475, Length=707, Percent_Identity=45.968882602546, Blast_Score=588, Evalue=1e-168
Organism=Escherichia coli, GI145693161, Length=336, Percent_Identity=34.2261904761905, Blast_Score=201, Evalue=9e-53
Organism=Caenorhabditis elegans, GI17565758, Length=687, Percent_Identity=46.5793304221252, Blast_Score=572, Evalue=1e-163
Organism=Caenorhabditis elegans, GI32566231, Length=584, Percent_Identity=48.6301369863014, Blast_Score=522, Evalue=1e-148
Organism=Caenorhabditis elegans, GI193209088, Length=283, Percent_Identity=55.4770318021201, Blast_Score=327, Evalue=1e-89
Organism=Drosophila melanogaster, GI24640559, Length=680, Percent_Identity=47.3529411764706, Blast_Score=573, Evalue=1e-163
Organism=Drosophila melanogaster, GI24640557, Length=680, Percent_Identity=47.3529411764706, Blast_Score=573, Evalue=1e-163
Paralogues:
None
Copy number: 60 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012675
- InterPro: IPR001041
- InterPro: IPR006656
- InterPro: IPR000283
- InterPro: IPR010228
- InterPro: IPR019574
- InterPro: IPR015405 [H]
Pfam domain/function: PF09326 DUF1982; PF00111 Fer2; PF00384 Molybdopterin; PF10588 NADH-G_4Fe-4S_3 [H]
EC number: 1.6.99.5 [H]
Molecular weight: Translated: 71884; Mature: 71753
Theoretical pI: Translated: 4.61; Mature: 4.61
Prosite motif: PS00641 COMPLEX1_75K_1; PS00642 COMPLEX1_75K_2; PS00643 COMPLEX1_75K_3; PS51085 2FE2S_FER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein)
3.6 %Met (Translated Protein)
5.5 %Cys+Met (Translated Protein)
1.9 %Cys (Mature Protein)
3.4 %Met (Mature Protein)
5.4 %Cys+Met (Mature Protein)
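The translated and mature figures differ only in the Met percentage, which is consistent with the mature protein being the translated protein minus its initiator methionine (672 versus 671 residues). A minimal sketch of that calculation follows. The function names are hypothetical and the toy sequence is not the protein in this record.

```python
def strip_initiator_met(seq: str) -> str:
    """Return the sequence without a leading Met, if one is present."""
    return seq[1:] if seq.startswith("M") else seq


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = seq.upper()
    if not seq:
        return 0.0
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


# Toy 10-residue sequence, used purely for illustration.
translated = "MCAAAAAAAA"
mature = strip_initiator_met(translated)

for label, seq in (("Translated", translated), ("Mature", mature)):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```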
Secondary structure:
>Translated Secondary Structure
MTDLRTIIIDDNEVEVDPAMTLIQACEQAGIEIPRFCYHERLTIAGNCRMCLVEVVGGPP
CCCCEEEEECCCCEECCHHHHHHHHHHHCCCCCCHHHHHCCEEEECCCCEEEEEECCCCC
KPAASCAMQVKDLRPGPEGAPPVIKTNSPMVKKAREGVMEFLLINHPLDCPICDQGGECD
CHHHHHHHHHHHCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
LQDQAMAYGVDFSRFREPKRAVDNLELGPLVGTAMTRCISCTRCVRFITEVAGMPEMGQT
CCCCHHHCCCCHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
GRGEDAEITSYLGATLESEMQGNIVDLCPVGALTNKPYSFTARPWELTKTESIDVMDALG
CCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCEECCCCCCCCCCCCCHHHHHCC
SNIRVDTKGREVMRILPRNHDGVNEEWLSDKSRYIWDGLKRQRLDQPYIRENGKLRPASW
CCEEEECCCHHHHHHCCCCCCCCCHHHHCCCHHHHHHHHHHHHCCCCHHHCCCCCCCCCC
GEAMGLAASEIKGATKLAGLVGDLASTEAAFALKSLVEGQGGVVECRTDGAKLPAGNRAA
CHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCEE
YVGTASIEDIDAAEYIQLIGTNPRAEAPVLNARLRKAWLRGAKIGLVGEAVDLTYEYAHV
EEECCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCEEEEECHHHEEEEEHHC
GTGFPALRTLADQQYDQVLEASSLVIVGQGALTGEGGADALALAMRMAERSRSGLLVLHT
CCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEE
AAGRVGAMDVGATNSDGMATVQDADVIYNMGADEVEVSTGSFVIYQGSHGDRGAHRADVI
CCCCCCEEECCCCCCCCCEEEECCCEEEECCCCEEEECCCCEEEEECCCCCCCCCCCCEE
LPGAAYTEETGIFVNTEGRPQMAQRAGFPPGDARENWAILRALSAEVGATLPFDTIAALR
ECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHH
KAMMAEAPHLKMIDEVAENEGEALEITDLGQGDFTNAVSDHYLTNPIARASGLMAELSAG
HHHHHCCCCHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHCCHHHHHHHHHHHHHCC
AKARGQSRIAAE
CCCCCCCCCCCC
>Mature Secondary Structure
TDLRTIIIDDNEVEVDPAMTLIQACEQAGIEIPRFCYHERLTIAGNCRMCLVEVVGGPP
CCCEEEEECCCCEECCHHHHHHHHHHHCCCCCCHHHHHCCEEEECCCCEEEEEECCCCC
KPAASCAMQVKDLRPGPEGAPPVIKTNSPMVKKAREGVMEFLLINHPLDCPICDQGGECD
CHHHHHHHHHHHCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
LQDQAMAYGVDFSRFREPKRAVDNLELGPLVGTAMTRCISCTRCVRFITEVAGMPEMGQT
CCCCHHHCCCCHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
GRGEDAEITSYLGATLESEMQGNIVDLCPVGALTNKPYSFTARPWELTKTESIDVMDALG
CCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCEECCCCCCCCCCCCCHHHHHCC
SNIRVDTKGREVMRILPRNHDGVNEEWLSDKSRYIWDGLKRQRLDQPYIRENGKLRPASW
CCEEEECCCHHHHHHCCCCCCCCCHHHHCCCHHHHHHHHHHHHCCCCHHHCCCCCCCCCC
GEAMGLAASEIKGATKLAGLVGDLASTEAAFALKSLVEGQGGVVECRTDGAKLPAGNRAA
CHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCEE
YVGTASIEDIDAAEYIQLIGTNPRAEAPVLNARLRKAWLRGAKIGLVGEAVDLTYEYAHV
EEECCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCEEEEECHHHEEEEEHHC
GTGFPALRTLADQQYDQVLEASSLVIVGQGALTGEGGADALALAMRMAERSRSGLLVLHT
CCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEE
AAGRVGAMDVGATNSDGMATVQDADVIYNMGADEVEVSTGSFVIYQGSHGDRGAHRADVI
CCCCCCEEECCCCCCCCCEEEECCCEEEECCCCEEEECCCCEEEEECCCCCCCCCCCCEE
LPGAAYTEETGIFVNTEGRPQMAQRAGFPPGDARENWAILRALSAEVGATLPFDTIAALR
ECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHH
KAMMAEAPHLKMIDEVAENEGEALEITDLGQGDFTNAVSDHYLTNPIARASGLMAELSAG
HHHHHCCCCHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHCCHHHHHHHHHHHHHCC
AKARGQSRIAAE
CCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 1605643; 8422400 [H]