| Definition | Clostridium botulinum A str. Hall, complete genome. |
|---|---|
| Accession | NC_009698 |
| Length | 3,760,560 |
Click here to switch to the map view.
The map label for this gene is nuoG [C]
Identifier: 153935257
GI number: 153935257
Start: 1898579
End: 1900312
Strand: Reverse
Name: nuoG [C]
Synonym: CLC_1790
Alternate gene names: 153935257
Gene position: 1900312-1898579 (Counterclockwise)
Preceding gene: 153936195
Following gene: 153937826
Centisome position: 50.53
GC content: 33.16
Gene sequence:
>1734_bases ATGAGTTTAGTAACTTTAAATATAAATGGTAAAGATCTTAAAGTAGAAAATGGAACTACTATATTAGATGCAGCAAAACT TTTAAATATAAATATACCAACTCTATGTAATTTTCATCTTAATGATAATAAAACAGAAAATAAACCTGGTTCCTGTAGGG TCTGTGTAGTAGAAGTAGAAGGGAGAAAAAACTTAGCGCCAGCATGTTGTACCCCTGTAGGGGAAGGAATGATAGTAAAA ACTAATTCCATAAGGGCTATAAAAGCTAGAAGAGCTATAGTAGAATTACTTTTATCTGATCATCCAAAAGACTGTTTGCT TTGTGAGAAAAATACAAAATGTGAGCTACAAAAATTAGCTGCAGATATGGGTATAAGAGAAATGAAATATCAAGGTGATG TTTCAATGTATCCTATAGATATTTCTAGCTATTCCATAGTTAGGGATATGGATAAATGTATACTTTGTAGAAGATGCATA ACTATATGTAATGAAGTTCAAACTGTAGGAACCCTATCTGCTATTGGAAGAGGTTTTGAAACTGTAGTAGCACCAGCCTT TTCTGAAGCTATAAAAAATACTAATTGTACTTTCTGTGGACAATGTGTTTCTGTCTGTCCAACTGGGGCTTTAACAGAAG TAAATAATACAAGTAAAGTTTGGGATGCTCTATCACAAAAGGATAAAATTGTTATTGTACAAACAGCTCCAGCTATTAGA GCTGCTTTAGGTGAAGAATTTGGATTAGAACCTGGAACAACTGTTACAGGAAAAATGGTAGCGGCTCTTCGTCAATTAGG ATTCAGTAAAGTTTTTGATACAGATTTTGCAGCAGACTTGACTATTATGGAGGAAGCTTCAGAATTTATTCATAGATTAG AACATGGCGGAACACTTCCAATGCTTACAAGTTGTTGTCCAGGATGGATAAAATTCTTTGAACACAACTTTAATGATTTA ATGGATATACCATCTAGTTGTAAATCACCTCAGCAAATGTTTGGAGCTATAGCTAAAAGTTATTTAGCAGAAAAAATGAA GGTAGATCCTAAAGATATTATAGTAGTATCTGTAATGCCTTGTCTTGCTAAAAAGTATGAGGCAAAAAGAGAAGAAATGA AGAGAAATGGAATCCCTGATGTTGATATTGTTATAAGTACAAGAGAATTAGCTAAAATGATAGTAGAAGCGGGTATAGAT TTTAATTCTCTACAGGAGGAAGAATTTGATAATCCCCTAGGTGAATCTACAGGGGCTTCAGTAATTTTTGGAACTACCGG CGGTGTTATGGAAGCGGCTTTAAGAACTGCCTATGAATGGGTTACTAAAGGTACTTTAAAAGATGTAGAATTTATAGAAG TTCGCGGTGAAGATGGTATAAGAGAAGCTACAGTAAACATAAAGGATACAGAGGTTAAAGTAGCTATAGCTAGTGGATTA GGTAATGCTAGAAAGCTTTTAAATGATATAAGAAATGGAAAGTCTAAATATCATATGATCGAAATCATGGCGTGTCCATC AGGATGTGTAGATGGTGGTGGTCAACCTTATATCTATGGAGATACAAATATATTGAAAAAAAGAACAGAAGCCCTATATA AAGAAGATAGTAATAAAGAAATAAGGAAGTCTCACGAAAATCCATATATAAAGAAACTTTATGAAGAATATTTAGGTAAA CCTTATGGTGAAAAAGCTCATGAACTTCTTCATACTAAATATAGAGTTAGATAA
Upstream 100 bases:
>100_bases CATTTTATAAATCAAGAAAAATGTATTAAATGTGGAAATTGTTATAGCGCTTGTCCAGTTGGGGCTATTATAAAGAAATA GGAAGAAAGAGGTGAATAAA
Downstream 100 bases:
>100_bases CATGAAATAAACACAAATTTTCCTATTAATATACACCAAATAAAACCAGTATCAGTATGGTTTGAAACTATAATCAGTTA TAAGTTTAAAGTAATTTCAT
Product: [Fe] hydrogenase
Products: NA
Alternate protein names: CpI; Fe-only hydrogenase; [Fe] hydrogenase [H]
Number of amino acids: Translated: 577; Mature: 576
Protein sequence:
>577_residues MSLVTLNINGKDLKVENGTTILDAAKLLNINIPTLCNFHLNDNKTENKPGSCRVCVVEVEGRKNLAPACCTPVGEGMIVK TNSIRAIKARRAIVELLLSDHPKDCLLCEKNTKCELQKLAADMGIREMKYQGDVSMYPIDISSYSIVRDMDKCILCRRCI TICNEVQTVGTLSAIGRGFETVVAPAFSEAIKNTNCTFCGQCVSVCPTGALTEVNNTSKVWDALSQKDKIVIVQTAPAIR AALGEEFGLEPGTTVTGKMVAALRQLGFSKVFDTDFAADLTIMEEASEFIHRLEHGGTLPMLTSCCPGWIKFFEHNFNDL MDIPSSCKSPQQMFGAIAKSYLAEKMKVDPKDIIVVSVMPCLAKKYEAKREEMKRNGIPDVDIVISTRELAKMIVEAGID FNSLQEEEFDNPLGESTGASVIFGTTGGVMEAALRTAYEWVTKGTLKDVEFIEVRGEDGIREATVNIKDTEVKVAIASGL GNARKLLNDIRNGKSKYHMIEIMACPSGCVDGGGQPYIYGDTNILKKRTEALYKEDSNKEIRKSHENPYIKKLYEEYLGK PYGEKAHELLHTKYRVR
Sequences:
>Translated_577_residues MSLVTLNINGKDLKVENGTTILDAAKLLNINIPTLCNFHLNDNKTENKPGSCRVCVVEVEGRKNLAPACCTPVGEGMIVK TNSIRAIKARRAIVELLLSDHPKDCLLCEKNTKCELQKLAADMGIREMKYQGDVSMYPIDISSYSIVRDMDKCILCRRCI TICNEVQTVGTLSAIGRGFETVVAPAFSEAIKNTNCTFCGQCVSVCPTGALTEVNNTSKVWDALSQKDKIVIVQTAPAIR AALGEEFGLEPGTTVTGKMVAALRQLGFSKVFDTDFAADLTIMEEASEFIHRLEHGGTLPMLTSCCPGWIKFFEHNFNDL MDIPSSCKSPQQMFGAIAKSYLAEKMKVDPKDIIVVSVMPCLAKKYEAKREEMKRNGIPDVDIVISTRELAKMIVEAGID FNSLQEEEFDNPLGESTGASVIFGTTGGVMEAALRTAYEWVTKGTLKDVEFIEVRGEDGIREATVNIKDTEVKVAIASGL GNARKLLNDIRNGKSKYHMIEIMACPSGCVDGGGQPYIYGDTNILKKRTEALYKEDSNKEIRKSHENPYIKKLYEEYLGK PYGEKAHELLHTKYRVR >Mature_576_residues SLVTLNINGKDLKVENGTTILDAAKLLNINIPTLCNFHLNDNKTENKPGSCRVCVVEVEGRKNLAPACCTPVGEGMIVKT NSIRAIKARRAIVELLLSDHPKDCLLCEKNTKCELQKLAADMGIREMKYQGDVSMYPIDISSYSIVRDMDKCILCRRCIT ICNEVQTVGTLSAIGRGFETVVAPAFSEAIKNTNCTFCGQCVSVCPTGALTEVNNTSKVWDALSQKDKIVIVQTAPAIRA ALGEEFGLEPGTTVTGKMVAALRQLGFSKVFDTDFAADLTIMEEASEFIHRLEHGGTLPMLTSCCPGWIKFFEHNFNDLM DIPSSCKSPQQMFGAIAKSYLAEKMKVDPKDIIVVSVMPCLAKKYEAKREEMKRNGIPDVDIVISTRELAKMIVEAGIDF NSLQEEEFDNPLGESTGASVIFGTTGGVMEAALRTAYEWVTKGTLKDVEFIEVRGEDGIREATVNIKDTEVKVAIASGLG NARKLLNDIRNGKSKYHMIEIMACPSGCVDGGGQPYIYGDTNILKKRTEALYKEDSNKEIRKSHENPYIKKLYEEYLGKP YGEKAHELLHTKYRVR
Specific function: Ndh-1 Shuttles Electrons From NADH, Via Fmn And Iron- Sulfur (Fe-S) Centers, To Quinones In The Respiratory Chain. The Immediate Electron Acceptor For The Enzyme In This Species Is Believed To Be Ubiquinone. Couples The Redox Reaction To Proton Translocat
COG id: COG4624
COG function: function code R; Iron only hydrogenase large subunit, C-terminal domain
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 4Fe-4S ferredoxin-type domains [H]
Homologues:
Organism=Homo sapiens, GI11968051, Length=402, Percent_Identity=29.8507462686567, Blast_Score=180, Evalue=3e-45, Organism=Homo sapiens, GI6912524, Length=402, Percent_Identity=28.1094527363184, Blast_Score=162, Evalue=6e-40, Organism=Homo sapiens, GI134284357, Length=363, Percent_Identity=28.9256198347107, Blast_Score=160, Evalue=4e-39, Organism=Homo sapiens, GI84452151, Length=363, Percent_Identity=28.9256198347107, Blast_Score=159, Evalue=5e-39, Organism=Homo sapiens, GI14165461, Length=450, Percent_Identity=26.8888888888889, Blast_Score=146, Evalue=5e-35, Organism=Homo sapiens, GI33519475, Length=224, Percent_Identity=30.3571428571429, Blast_Score=114, Evalue=2e-25, Organism=Escherichia coli, GI145693161, Length=246, Percent_Identity=26.0162601626016, Blast_Score=95, Evalue=1e-20, Organism=Caenorhabditis elegans, GI71995015, Length=403, Percent_Identity=31.2655086848635, Blast_Score=137, Evalue=1e-32, Organism=Caenorhabditis elegans, GI17565758, Length=225, Percent_Identity=32.4444444444444, Blast_Score=116, Evalue=3e-26, Organism=Caenorhabditis elegans, GI32566231, Length=225, Percent_Identity=31.5555555555556, Blast_Score=116, Evalue=3e-26, Organism=Caenorhabditis elegans, GI193209088, Length=229, Percent_Identity=32.3144104803493, Blast_Score=114, Evalue=2e-25, Organism=Saccharomyces cerevisiae, GI6324089, Length=394, Percent_Identity=25.3807106598985, Blast_Score=100, Evalue=1e-21, Organism=Drosophila melanogaster, GI116007470, Length=373, Percent_Identity=31.3672922252011, Blast_Score=174, Evalue=1e-43, Organism=Drosophila melanogaster, GI116007466, Length=373, Percent_Identity=31.3672922252011, Blast_Score=174, Evalue=1e-43, Organism=Drosophila melanogaster, GI116007464, Length=373, Percent_Identity=31.3672922252011, Blast_Score=174, Evalue=2e-43, Organism=Drosophila melanogaster, GI116007468, Length=373, Percent_Identity=31.3672922252011, Blast_Score=173, Evalue=2e-43, Organism=Drosophila melanogaster, GI116007462, Length=287, Percent_Identity=33.4494773519164, Blast_Score=146, Evalue=3e-35, Organism=Drosophila melanogaster, GI24640559, Length=205, Percent_Identity=33.6585365853659, Blast_Score=110, Evalue=2e-24, Organism=Drosophila melanogaster, GI24640557, Length=205, Percent_Identity=33.6585365853659, Blast_Score=110, Evalue=2e-24,
Paralogues:
None
Copy number: 60 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR017896 - InterPro: IPR017900 - InterPro: IPR012675 - InterPro: IPR009016 - InterPro: IPR004108 - InterPro: IPR003149 - InterPro: IPR013352 - InterPro: IPR001041 [H]
Pfam domain/function: PF02906 Fe_hyd_lg_C; PF02256 Fe_hyd_SSU [H]
EC number: =1.12.7.2 [H]
Molecular weight: Translated: 63695; Mature: 63564
Theoretical pI: Translated: 6.58; Mature: 6.58
Prosite motif: PS51085 2FE2S_FER_2 ; PS00198 4FE4S_FERREDOXIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
3.8 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 6.9 %Cys+Met (Translated Protein) 3.8 %Cys (Mature Protein) 3.0 %Met (Mature Protein) 6.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSLVTLNINGKDLKVENGTTILDAAKLLNINIPTLCNFHLNDNKTENKPGSCRVCVVEVE CEEEEEEECCCEEEECCCCCCEEHHHHHCCCCCCEEEEEECCCCCCCCCCCEEEEEEEEC GRKNLAPACCTPVGEGMIVKTNSIRAIKARRAIVELLLSDHPKDCLLCEKNTKCELQKLA CCCCCCCHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHH ADMGIREMKYQGDVSMYPIDISSYSIVRDMDKCILCRRCITICNEVQTVGTLSAIGRGFE HHCCHHHHEECCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH TVVAPAFSEAIKNTNCTFCGQCVSVCPTGALTEVNNTSKVWDALSQKDKIVIVQTAPAIR HHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCEEEEECCHHHH AALGEEFGLEPGTTVTGKMVAALRQLGFSKVFDTDFAADLTIMEEASEFIHRLEHGGTLP HHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCH MLTSCCPGWIKFFEHNFNDLMDIPSSCKSPQQMFGAIAKSYLAEKMKVDPKDIIVVSVMP HHHHCCHHHHHHHHCCHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHEEEEHHHH CLAKKYEAKREEMKRNGIPDVDIVISTRELAKMIVEAGIDFNSLQEEEFDNPLGESTGAS HHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHCCCHHHCCHHHCCCCCCCCCCCE VIFGTTGGVMEAALRTAYEWVTKGTLKDVEFIEVRGEDGIREATVNIKDTEVKVAIASGL EEEECCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEEEEEEECCEEEEEEECCC GNARKLLNDIRNGKSKYHMIEIMACPSGCVDGGGQPYIYGDTNILKKRTEALYKEDSNKE CHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCHH IRKSHENPYIKKLYEEYLGKPYGEKAHELLHTKYRVR HHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC >Mature Secondary Structure SLVTLNINGKDLKVENGTTILDAAKLLNINIPTLCNFHLNDNKTENKPGSCRVCVVEVE EEEEEEECCCEEEECCCCCCEEHHHHHCCCCCCEEEEEECCCCCCCCCCCEEEEEEEEC GRKNLAPACCTPVGEGMIVKTNSIRAIKARRAIVELLLSDHPKDCLLCEKNTKCELQKLA CCCCCCCHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHH ADMGIREMKYQGDVSMYPIDISSYSIVRDMDKCILCRRCITICNEVQTVGTLSAIGRGFE HHCCHHHHEECCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH TVVAPAFSEAIKNTNCTFCGQCVSVCPTGALTEVNNTSKVWDALSQKDKIVIVQTAPAIR HHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCEEEEECCHHHH AALGEEFGLEPGTTVTGKMVAALRQLGFSKVFDTDFAADLTIMEEASEFIHRLEHGGTLP HHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCH MLTSCCPGWIKFFEHNFNDLMDIPSSCKSPQQMFGAIAKSYLAEKMKVDPKDIIVVSVMP HHHHCCHHHHHHHHCCHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHEEEEHHHH CLAKKYEAKREEMKRNGIPDVDIVISTRELAKMIVEAGIDFNSLQEEEFDNPLGESTGAS HHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHCCCHHHCCHHHCCCCCCCCCCCE VIFGTTGGVMEAALRTAYEWVTKGTLKDVEFIEVRGEDGIREATVNIKDTEVKVAIASGL EEEECCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEEEEEEECCEEEEEEECCC GNARKLLNDIRNGKSKYHMIEIMACPSGCVDGGGQPYIYGDTNILKKRTEALYKEDSNKE CHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCHH IRKSHENPYIKKLYEEYLGKPYGEKAHELLHTKYRVR HHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 1911757; 9836629; 10529166 [H]