Definition Candidatus Blochmannia pennsylvanicus str. BPEN, complete genome.
Accession NC_007292
Length 791,654

The map label for this gene is nuoCD

Identifier: 71892264

GI number: 71892264

Start: 606847

End: 608637

Strand: Reverse

Name: nuoCD

Synonym: BPEN_507

Alternate gene names: 71892264

Gene position: 608637-606847 (Counterclockwise)

Preceding gene: 71892265

Following gene: 71892263

Centisome position: 76.88
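
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the centisome position is taken from the gene's first base on the reverse strand (608637). The record does not state either convention, but they reproduce the listed values.

```python
# Values copied from the record above.
GENOME_LENGTH = 791_654                   # "Length" field
GENE_START, GENE_END = 606_847, 608_637   # "Start" / "End" fields

def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(position / genome_length * 100, 2)

length = gene_length(GENE_START, GENE_END)
codons = length // 3          # includes the terminal stop codon
residues = codons - 1         # stop codon is not translated

# Assumption: for a reverse-strand gene the centisome is computed from the
# higher coordinate, i.e. where transcription begins.
position_pct = centisome(GENE_END, GENOME_LENGTH)

print(length, codons, residues)   # 1791 597 596
print(position_pct)               # 76.88
```

Under these assumptions the result agrees with the 1791-base gene sequence and the 596-residue protein reported in this record.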

GC content: 34.45

Gene sequence:

>1791_bases
TTGGTAGATATTATGTGTAATGATTCTACAGGTGTTTCTTTAGTTAAAAATATTTACCCGATATTAGATGATTTATTCTC
CGTATTCAGTTCTGTAGATTTTGTGCTACAACCTACTCATACTGGAATTTTAATAATTTGGATTAAACGTGAGATGGTAA
TCCCGGTGTTGACATTTTTAAAAACAACATCAAAACCGTATATTATGTTATATGATTTGCATGGCATAGATGAAAGGTTG
CGCATACATCGCGAAGGATTGCCAGAATCAGATTTTACAGTGTTTTATCATTTAATTTCCATATTACGTAATGATGATAT
AATAATAAAAGTTCCTTTGCTAGAGCAATCTCTATACATAGACACAGTAGTGTCTGTGTTTGCAAATGCGAATTGGTATG
AACGGGAAACGTGGGAAATGTTTGGAATTCATTTTAATAAGCATCCTAATTTAACGCGTATAATTATGCCAAAAAACTGG
AATGGATATCCGTTACGCAAAGAATATCCAGCTCGTGCTACAGAATTTAATCCTTTTATTCTTACGAAACAAAAGGAAGA
TTTGGCAATGGAAGGATTATTATTTAAACCCGAAGAATGGGGCATGCATAAACATAGTAAACATGAAAATTTTATGTTTC
TTAATTTAGGACCTAATCATCCTTCAGTGCATGGAGTATTTCGTATTATTTTACAATTAAATGGAGAAGAAATTATAGAT
TGTGTTCCAGATATTGGTTATCATCATCGAGGTGCTGAAAAAATGGGAGAACGACAAACTTGGCATAGTTACATTCCTTA
TACTGATCGCATTGAATATTTAGGGGGCTGTGTCAATGAAATGCCCTATATTTTAGCTGTTGAAAAACTTGCTGGAATTA
CAGTACCGGATAGAGTAAAAGTGATCCGCATTATGTTGTCTGAATTATTTAGAATTAATAGCCATTTGTTGTATATTAGT
ACTTATTTACAAGATGTAGGTGCTATGTCTCCAGTTTTTTTAGCGTTCACTGACAGACAGAAGATTTATGATGTTATTGA
ATCAATTACCGGATCTCGTATGCATCCTGCATGGTTTCGTATTGGAGGGGTTGCTCATGATTTGCCTAGAGGTTGGGAGT
GTTTATTGAGAAAATGTCTTGACTGGATTCCACATCGCGTTTCTTTTTATGTTAAATCAACATTAGAAAACAGTATATTT
AAAAAACGAGCGTGTGGCATTGGCGCATATAATGCCAAGGATGCATTAGATTGGGGAGTAACTGGAGCAGGATTGCGTGC
TACCGGCATTGAATTTGATATACGTAAGTCACGTCCTTATTCTGGATATGAAAATTTTGATTTCGATGTACCAATAGGAA
ATGGAATCAGCGATTCTTACAGCCGAGTAATGTTAAAAGTAGAAGAAATATATCAAAGTGTACGTATTTTGGAACAATGT
TTGCAGAACATGCCAATAGGCCCATTTAAATCAGATCACCCTTTAGCAACTCCTCCTATGAAAGAATACGCCCTACAACA
TATAGAGACTCTTATTACTCATTTTCTGCAAGTATCATGGGGTCCTGTTATTCCGGCTAATGAGTCCTTTCAAATGATTG
AAGCCACAAAAGGTATTAATAGTTATTATTTAATTAGTGATGGGAATACTATGAGTTATCGTACTCGTATTCGTACTCCT
AGTTTTCCTCATCTACAGCAGATTCCACATGTTATTCGTGGTAGTTTAATATCTGATTTAATTGTGTATTTAGGTAGTAT
TGATTTTGTTATGTCTGATGTAGATCGTTAA

Upstream 100 bases:

>100_bases
ACTGTTTTATCTTCTCCAGATGATGAACTTAATATATATTCAAAACAAAACGGTGAGGCATAATCAAGATTTTAAGAATT
TTTGTGATAATTGTCATTAG

Downstream 100 bases:

>100_bases
TAAATGGAGAAATATGAGTAATATTAAAATTAATGATATGTCCACTAGTTTAACTAGTAATGGTTTTTTTCAATTAAGTC
AAGAAGAATGTAATGCTATT

Product: bifunctional NADH:ubiquinone oxidoreductase subunit C/D

Products: NA

Alternate protein names: NADH dehydrogenase I subunit C/D; NDH-1 subunit C/D

Number of amino acids: Translated: 596; Mature: 596

Protein sequence:

>596_residues
MVDIMCNDSTGVSLVKNIYPILDDLFSVFSSVDFVLQPTHTGILIIWIKREMVIPVLTFLKTTSKPYIMLYDLHGIDERL
RIHREGLPESDFTVFYHLISILRNDDIIIKVPLLEQSLYIDTVVSVFANANWYERETWEMFGIHFNKHPNLTRIIMPKNW
NGYPLRKEYPARATEFNPFILTKQKEDLAMEGLLFKPEEWGMHKHSKHENFMFLNLGPNHPSVHGVFRIILQLNGEEIID
CVPDIGYHHRGAEKMGERQTWHSYIPYTDRIEYLGGCVNEMPYILAVEKLAGITVPDRVKVIRIMLSELFRINSHLLYIS
TYLQDVGAMSPVFLAFTDRQKIYDVIESITGSRMHPAWFRIGGVAHDLPRGWECLLRKCLDWIPHRVSFYVKSTLENSIF
KKRACGIGAYNAKDALDWGVTGAGLRATGIEFDIRKSRPYSGYENFDFDVPIGNGISDSYSRVMLKVEEIYQSVRILEQC
LQNMPIGPFKSDHPLATPPMKEYALQHIETLITHFLQVSWGPVIPANESFQMIEATKGINSYYLISDGNTMSYRTRIRTP
SFPHLQQIPHVIRGSLISDLIVYLGSIDFVMSDVDR

Sequences:

>Translated_596_residues
MVDIMCNDSTGVSLVKNIYPILDDLFSVFSSVDFVLQPTHTGILIIWIKREMVIPVLTFLKTTSKPYIMLYDLHGIDERL
RIHREGLPESDFTVFYHLISILRNDDIIIKVPLLEQSLYIDTVVSVFANANWYERETWEMFGIHFNKHPNLTRIIMPKNW
NGYPLRKEYPARATEFNPFILTKQKEDLAMEGLLFKPEEWGMHKHSKHENFMFLNLGPNHPSVHGVFRIILQLNGEEIID
CVPDIGYHHRGAEKMGERQTWHSYIPYTDRIEYLGGCVNEMPYILAVEKLAGITVPDRVKVIRIMLSELFRINSHLLYIS
TYLQDVGAMSPVFLAFTDRQKIYDVIESITGSRMHPAWFRIGGVAHDLPRGWECLLRKCLDWIPHRVSFYVKSTLENSIF
KKRACGIGAYNAKDALDWGVTGAGLRATGIEFDIRKSRPYSGYENFDFDVPIGNGISDSYSRVMLKVEEIYQSVRILEQC
LQNMPIGPFKSDHPLATPPMKEYALQHIETLITHFLQVSWGPVIPANESFQMIEATKGINSYYLISDGNTMSYRTRIRTP
SFPHLQQIPHVIRGSLISDLIVYLGSIDFVMSDVDR
>Mature_596_residues
MVDIMCNDSTGVSLVKNIYPILDDLFSVFSSVDFVLQPTHTGILIIWIKREMVIPVLTFLKTTSKPYIMLYDLHGIDERL
RIHREGLPESDFTVFYHLISILRNDDIIIKVPLLEQSLYIDTVVSVFANANWYERETWEMFGIHFNKHPNLTRIIMPKNW
NGYPLRKEYPARATEFNPFILTKQKEDLAMEGLLFKPEEWGMHKHSKHENFMFLNLGPNHPSVHGVFRIILQLNGEEIID
CVPDIGYHHRGAEKMGERQTWHSYIPYTDRIEYLGGCVNEMPYILAVEKLAGITVPDRVKVIRIMLSELFRINSHLLYIS
TYLQDVGAMSPVFLAFTDRQKIYDVIESITGSRMHPAWFRIGGVAHDLPRGWECLLRKCLDWIPHRVSFYVKSTLENSIF
KKRACGIGAYNAKDALDWGVTGAGLRATGIEFDIRKSRPYSGYENFDFDVPIGNGISDSYSRVMLKVEEIYQSVRILEQC
LQNMPIGPFKSDHPLATPPMKEYALQHIETLITHFLQVSWGPVIPANESFQMIEATKGINSYYLISDGNTMSYRTRIRTP
SFPHLQQIPHVIRGSLISDLIVYLGSIDFVMSDVDR

Specific function: NDH-1 shuttles electrons from NADH, via FMN and iron-sulfur (Fe-S) centers, to quinones in the respiratory chain. The immediate electron acceptor for the enzyme in this species is believed to be ubiquinone. Couples the redox reaction to proton translocat

COG id: COG0649

COG function: function code C; NADH:ubiquinone oxidoreductase 49 kD subunit 7

Gene ontology:

Cell location: Cell inner membrane; Peripheral membrane protein; Cytoplasmic side

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: In the C-terminal section; belongs to the complex I 49 kDa subunit family

Homologues:

Organism=Homo sapiens, GI4758786, Length=383, Percent_Identity=38.6422976501305, Blast_Score=303, Evalue=5e-82
Organism=Homo sapiens, GI260898743, Length=373, Percent_Identity=38.3378016085791, Blast_Score=293, Evalue=4e-79
Organism=Homo sapiens, GI4758788, Length=128, Percent_Identity=39.84375, Blast_Score=90, Evalue=6e-18
Organism=Escherichia coli, GI145693162, Length=577, Percent_Identity=74.6967071057192, Blast_Score=944, Evalue=0.0
Organism=Escherichia coli, GI1789076, Length=514, Percent_Identity=25.4863813229572, Blast_Score=150, Evalue=2e-37
Organism=Escherichia coli, GI1788832, Length=503, Percent_Identity=25.4473161033797, Blast_Score=135, Evalue=1e-32
Organism=Caenorhabditis elegans, GI17555284, Length=428, Percent_Identity=41.3551401869159, Blast_Score=334, Evalue=7e-92
Organism=Caenorhabditis elegans, GI17568379, Length=428, Percent_Identity=40.6542056074766, Blast_Score=331, Evalue=6e-91
Organism=Caenorhabditis elegans, GI32563621, Length=119, Percent_Identity=35.2941176470588, Blast_Score=75, Evalue=1e-13
Organism=Caenorhabditis elegans, GI71990788, Length=119, Percent_Identity=35.2941176470588, Blast_Score=75, Evalue=1e-13
Organism=Drosophila melanogaster, GI221459469, Length=384, Percent_Identity=38.8020833333333, Blast_Score=308, Evalue=6e-84
Organism=Drosophila melanogaster, GI24638644, Length=385, Percent_Identity=38.961038961039, Blast_Score=305, Evalue=5e-83
Organism=Drosophila melanogaster, GI24656494, Length=136, Percent_Identity=35.2941176470588, Blast_Score=84, Evalue=4e-16

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NUOCD_BLOPB (Q492H8)

Other databases:

- EMBL:   CP000016
- RefSeq:   YP_277998.1
- HSSP:   Q56220
- ProteinModelPortal:   Q492H8
- SMR:   Q492H8
- STRING:   Q492H8
- GeneID:   3563024
- GenomeReviews:   CP000016_GR
- KEGG:   bpn:BPEN_507
- eggNOG:   COG0649
- HOGENOM:   HBG459705
- OMA:   PHLQQIP
- ProtClustDB:   PRK11742
- BioCyc:   CBLO291272:BPEN_507-MONOMER
- GO:   GO:0006810
- HAMAP:   MF_01359
- InterPro:   IPR010219
- InterPro:   IPR010218
- InterPro:   IPR023062
- InterPro:   IPR001135
- InterPro:   IPR001268
- InterPro:   IPR014029
- InterPro:   IPR022885
- ProDom:   PD001581
- TIGRFAMs:   TIGR01961
- TIGRFAMs:   TIGR01962

Pfam domain/function: PF00329 Complex1_30kDa; PF00346 Complex1_49kDa

EC number: 1.6.99.5

Molecular weight: Translated: 68704; Mature: 68704

Theoretical pI: Translated: 6.77; Mature: 6.77

Prosite motif: PS00542 COMPLEX1_30K; PS00535 COMPLEX1_49K

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)
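
The Cys/Met percentages above can be recomputed from the 596-residue protein sequence given in this record. The following is a minimal sketch, not the method the database used: `residue_percent` is a hypothetical helper, and rounding to one decimal place is an assumption chosen to match the listed figures.

```python
# Protein sequence copied from the ">596_residues" entry in this record.
PROTEIN = (
    "MVDIMCNDSTGVSLVKNIYPILDDLFSVFSSVDFVLQPTHTGILIIWIKREMVIPVLTFLKTTSKPYIMLYDLHGIDERL"
    "RIHREGLPESDFTVFYHLISILRNDDIIIKVPLLEQSLYIDTVVSVFANANWYERETWEMFGIHFNKHPNLTRIIMPKNW"
    "NGYPLRKEYPARATEFNPFILTKQKEDLAMEGLLFKPEEWGMHKHSKHENFMFLNLGPNHPSVHGVFRIILQLNGEEIID"
    "CVPDIGYHHRGAEKMGERQTWHSYIPYTDRIEYLGGCVNEMPYILAVEKLAGITVPDRVKVIRIMLSELFRINSHLLYIS"
    "TYLQDVGAMSPVFLAFTDRQKIYDVIESITGSRMHPAWFRIGGVAHDLPRGWECLLRKCLDWIPHRVSFYVKSTLENSIF"
    "KKRACGIGAYNAKDALDWGVTGAGLRATGIEFDIRKSRPYSGYENFDFDVPIGNGISDSYSRVMLKVEEIYQSVRILEQC"
    "LQNMPIGPFKSDHPLATPPMKEYALQHIETLITHFLQVSWGPVIPANESFQMIEATKGINSYYLISDGNTMSYRTRIRTP"
    "SFPHLQQIPHVIRGSLISDLIVYLGSIDFVMSDVDR"
)

def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any letter in `residues`."""
    hits = sum(sequence.count(r) for r in residues)
    return round(hits / len(sequence) * 100, 1)   # one decimal place (assumed)

cys_pct = residue_percent(PROTEIN, "C")
met_pct = residue_percent(PROTEIN, "M")
cys_met_pct = residue_percent(PROTEIN, "CM")

print(len(PROTEIN))                    # 596
print(cys_pct, met_pct, cys_met_pct)   # 1.2 3.4 4.5
```

Because the translated and mature sequences in this record are identical, the same figures apply to both.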

Secondary structure:

>Translated Secondary Structure
MVDIMCNDSTGVSLVKNIYPILDDLFSVFSSVDFVLQPTHTGILIIWIKREMVIPVLTFL
CEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCEEEEEECCHHHHHHHHHH
KTTSKPYIMLYDLHGIDERLRIHREGLPESDFTVFYHLISILRNDDIIIKVPLLEQSLYI
HCCCCCEEEEEECCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCHHHH
DTVVSVFANANWYERETWEMFGIHFNKHPNLTRIIMPKNWNGYPLRKEYPARATEFNPFI
HHHHHHHHCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCEE
LTKQKEDLAMEGLLFKPEEWGMHKHSKHENFMFLNLGPNHPSVHGVFRIILQLNGEEIID
EECCHHHHHHCCEEECCHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCHHHHH
CVPDIGYHHRGAEKMGERQTWHSYIPYTDRIEYLGGCVNEMPYILAVEKLAGITVPDRVK
HCCCCCCCCCCHHHHCCCCHHHCCCCCHHHHHHHHHHHHHCCHHEEHHHHHCCCCCHHHH
VIRIMLSELFRINSHLLYISTYLQDVGAMSPVFLAFTDRQKIYDVIESITGSRMHPAWFR
HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHCCCCCCCHHHE
IGGVAHDLPRGWECLLRKCLDWIPHRVSFYVKSTLENSIFKKRACGIGAYNAKDALDWGV
ECCHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCC
TGAGLRATGIEFDIRKSRPYSGYENFDFDVPIGNGISDSYSRVMLKVEEIYQSVRILEQC
CCCCCEECCCEEEEECCCCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
LQNMPIGPFKSDHPLATPPMKEYALQHIETLITHFLQVSWGPVIPANESFQMIEATKGIN
HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCC
SYYLISDGNTMSYRTRIRTPSFPHLQQIPHVIRGSLISDLIVYLGSIDFVMSDVDR
EEEEEECCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCC
>Mature Secondary Structure
MVDIMCNDSTGVSLVKNIYPILDDLFSVFSSVDFVLQPTHTGILIIWIKREMVIPVLTFL
CEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCEEEEEECCHHHHHHHHHH
KTTSKPYIMLYDLHGIDERLRIHREGLPESDFTVFYHLISILRNDDIIIKVPLLEQSLYI
HCCCCCEEEEEECCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCHHHH
DTVVSVFANANWYERETWEMFGIHFNKHPNLTRIIMPKNWNGYPLRKEYPARATEFNPFI
HHHHHHHHCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCEE
LTKQKEDLAMEGLLFKPEEWGMHKHSKHENFMFLNLGPNHPSVHGVFRIILQLNGEEIID
EECCHHHHHHCCEEECCHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCHHHHH
CVPDIGYHHRGAEKMGERQTWHSYIPYTDRIEYLGGCVNEMPYILAVEKLAGITVPDRVK
HCCCCCCCCCCHHHHCCCCHHHCCCCCHHHHHHHHHHHHHCCHHEEHHHHHCCCCCHHHH
VIRIMLSELFRINSHLLYISTYLQDVGAMSPVFLAFTDRQKIYDVIESITGSRMHPAWFR
HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHCCCCCCCHHHE
IGGVAHDLPRGWECLLRKCLDWIPHRVSFYVKSTLENSIFKKRACGIGAYNAKDALDWGV
ECCHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCC
TGAGLRATGIEFDIRKSRPYSGYENFDFDVPIGNGISDSYSRVMLKVEEIYQSVRILEQC
CCCCCEECCCEEEEECCCCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
LQNMPIGPFKSDHPLATPPMKEYALQHIETLITHFLQVSWGPVIPANESFQMIEATKGIN
HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCC
SYYLISDGNTMSYRTRIRTPSFPHLQQIPHVIRGSLISDLIVYLGSIDFVMSDVDR
EEEEEECCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA