Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is nuoG [H]
Identifier: 159184690
GI number: 159184690
Start: 1262252
End: 1264333
Strand: Direct
Name: nuoG [H]
Synonym: Atu1276
Alternate gene names: 159184690
Gene position: 1262252-1264333 (Clockwise)
Preceding gene: 15888604
Following gene: 15888606
Centisome position: 44.42
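The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of the chromosome length. The record does not state that definition, but it reproduces the listed value. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_841_580   # Length of NC_003062
GENE_START = 1_262_252
GENE_END = 1_264_333


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(GENE_START, GENE_END))     # 2082
print(centisome(GENE_START, GENOME_LENGTH))  # 44.42
```

The computed length of 2082 agrees with the `>2082_bases` header of the gene sequence.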
GC content: 61.38
Gene sequence:
>2082_bases ATGGCAAAGCTCAAGGTTGACGGTAAAGAGATCGAGGTTCCGGATCATTTCACGCTGTTGCAGGCGTGCGAGGAGGCTGG TGCCGAAGTTCCGCGCTTTTGTTTTCATGAGCGCTTGTCGGTCGCGGGTAACTGCCGCATGTGTCTCATCGAGGTGAAGG GCGGACCACCGAAGCCGGCCGCCTCCTGCGCCATGGGCGTGCGCGATGTTCGCGGCGGCCCGAATGGCGAATTGCCTGAG GTTTTCACCAACACACCGATGGTCAAAAAGGCCCGCGAAGGTGTGATGGAGTTCCTGCTCATCAACCACCCGCTCGATTG CCCGATCTGCGACCAGGGCGGCGAATGCGACCTGCAGGACCAGGCCATGGCCTTCGGTATCGCCGGTTCGCGTTACGCGG AAAACAAGCGCGCGGTTGAAGACAAATATATCGGCCCGCTCGTCAAGACGGTCATGAACCGCTGCATTCACTGCACGCGC TGCGTCCGTTTCACCACGGAAGTGGCCGGCATTGCCGAACTCGGCCTCATCGGCCGCGGCGAAGATGCCGAAATCACCAC CTATCTCGAACAGGCCATGACGTCTGAGCTTCAGGGCAACGTCGTCGACCTTTGCCCGGTTGGTGCGCTCACTTCCAAGC CATTCGCTTTCACCGCCCGTCCGTGGGAACTGAACAAGACCGAAACCATCGACGTGATGGACGCGCTCGGTTCGGCCATC CGCGTCGATACCCGTGGCCGCGAAGTCATGCGCGTCATGCCGCGTGTCAACGAGGCGATCAACGAAGAGTGGATTTCCGA CAAGAGCCGCTTCATCTGGGACGGCCTGAAGACGCAGCGCCTCGACCGGCCTTACGTCCGCAAGGATGGCCGGCTGCAGC CCGCCACTTGGGGCGAAGCCTTTGGCGCCATCAAGCAGGCTGTAGCCGCAACGTCGGGTTCGAAAATTGGCGCAATCGCC GGCGATCTGTCCTCCGTCGAAGAGATGTTCGCGCTGAAGTCGCTGCTCGCTTCGCTCGGCTCGGCCAATGTCGATTGCCG TCAGGATGGCGCAGCGCTTGATCCGTCTTTGGGTCGTGCCAGCTATATCTTCAATTCCACTATTGAAGGCATCGAACACG CCGACGCTCTGCTGATCGTTGGTTCCAACCCGCGCTTCGAAGCGGCCGTGCTCAACGCCCGCATCCGCAAGCGCTGGCGT CGCGGCGGTTTCCCGATCGGTGTGATCGGTGAGGCCGGTGAGCTGCGCTACAATTACGAATATCTCGGCTCGGGCGCGGA AACGCTTTCCGATCTCGCCAATGGCTCGCACAGCTTCATCGACAAGCTGAAATCCGCCAAGAACCCACTGATCATCATCG GTCAGGGCGCTCTGGCACGTGCCGATGGTGCTGCCGTTCTCGCGGCTGCTGCAAAACTTGCCGTTTCGGTTGGCGCTGTC ACCGAAGAGTGGAATGGCTTCTCGGTGTTGCACACTGCGGCTGCCCGCGTCGGCGGTCTCGATATCGGCTTCGTGCCGGG CGAGGGCGCTGTTGCTGCCGCCGAGATGGTGACGTCCATGGACGTCCTGTTCCTACTCGGCGCCGATGAAATTGACCTGT CCAACAAGGGCGCCAAGTTCACCGTCTATATCGGCAGCCACGGCGATCAGGGCGCGATGAACGCCGACGTCATCCTTCCG GGTGCGGCCTATACCGAAAAATCCGGTATCTGGGTCAACACCGAAGGCCGTGTTCAGGTGGGCAACCGCGCCGGTTTCGC GCCGGGCGAAGCCCGTGAAGACTGGGCGATCCTGCGCGCGCTGTCCGACGTGCTCGGCAAAAAGCTGCCGTTCGATTCGC TGTCTGCCCTGCGCGGCCAGCTTTATGTCGCGCATCCGCATCTGGCGGAAACCGACGAGATCGTTGCCGGCAAGGCCACG 
GATATCGAGGCGCTGGCTGGTAAAACCGGCTCGCTCACGAAGTCGGCGTTTGCCTCGCCGGTAAAAGACTTTTATTTGAC GAACCCGATTGCGCGCGCCTCAGCTGTGATGGCCGAGTGCTCCGCATTGGCCCGCAACAATTTCCAGGCTGCGGCAGAGT AA
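A GC content figure such as the one listed above is the percentage of G and C bases in the sequence. The following is a minimal sketch, assuming the `>2082_bases` header has already been removed and that the spaces and line breaks in the record are layout only. `gc_content` is a hypothetical helper, not part of any library.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that the record uses for layout.
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# First ten bases of the gene sequence above.
print(gc_content("ATGGCAAAGC"))  # 50.0
```

Passing the full gene sequence to the same function should allow the listed GC content to be checked.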
Upstream 100 bases:
>100_bases TGAAGCGCCTGGCGAAACTCTGGATGCCTGGCGTGATTAAGGATTTGTTGCGGACCGTGTGAGACATGGGACGAACAGGC AACAGGACGTAAGTAGAACC
Downstream 100 bases:
>100_bases GGGTAGGGGATTTATGGAACAGTTTTTCTGGAGCTACGTCTGGCCCGCGCTGATCATGGTCGGCCAGTCCCTGCTGCTTC TCGTTTGCCTTCTGGTGGCC
Product: NADH dehydrogenase subunit G
Products: NA
Alternate protein names: NADH dehydrogenase I, chain 3; NDH-1, chain 3 [H]
Number of amino acids: Translated: 693; Mature: 692
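The two residue counts follow from the gene length. The 2082 bases form 694 codons, the last of which is the TAA stop codon, leaving 693 translated residues. The mature sequence in this record lacks the initial methionine, giving 692. A small sketch of that arithmetic, with a hypothetical function name:

```python
from typing import Tuple


def residue_counts(cds_length: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one terminal stop codon and that the
    mature protein differs only by removal of the initial methionine.
    """
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    translated = cds_length // 3 - 1  # subtract the stop codon
    mature = translated - 1           # subtract the initial Met
    return translated, mature


print(residue_counts(2082))  # (693, 692)
```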
Protein sequence:
>693_residues MAKLKVDGKEIEVPDHFTLLQACEEAGAEVPRFCFHERLSVAGNCRMCLIEVKGGPPKPAASCAMGVRDVRGGPNGELPE VFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAFGIAGSRYAENKRAVEDKYIGPLVKTVMNRCIHCTR CVRFTTEVAGIAELGLIGRGEDAEITTYLEQAMTSELQGNVVDLCPVGALTSKPFAFTARPWELNKTETIDVMDALGSAI RVDTRGREVMRVMPRVNEAINEEWISDKSRFIWDGLKTQRLDRPYVRKDGRLQPATWGEAFGAIKQAVAATSGSKIGAIA GDLSSVEEMFALKSLLASLGSANVDCRQDGAALDPSLGRASYIFNSTIEGIEHADALLIVGSNPRFEAAVLNARIRKRWR RGGFPIGVIGEAGELRYNYEYLGSGAETLSDLANGSHSFIDKLKSAKNPLIIIGQGALARADGAAVLAAAAKLAVSVGAV TEEWNGFSVLHTAAARVGGLDIGFVPGEGAVAAAEMVTSMDVLFLLGADEIDLSNKGAKFTVYIGSHGDQGAMNADVILP GAAYTEKSGIWVNTEGRVQVGNRAGFAPGEAREDWAILRALSDVLGKKLPFDSLSALRGQLYVAHPHLAETDEIVAGKAT DIEALAGKTGSLTKSAFASPVKDFYLTNPIARASAVMAECSALARNNFQAAAE
Sequences:
>Translated_693_residues MAKLKVDGKEIEVPDHFTLLQACEEAGAEVPRFCFHERLSVAGNCRMCLIEVKGGPPKPAASCAMGVRDVRGGPNGELPE VFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAFGIAGSRYAENKRAVEDKYIGPLVKTVMNRCIHCTR CVRFTTEVAGIAELGLIGRGEDAEITTYLEQAMTSELQGNVVDLCPVGALTSKPFAFTARPWELNKTETIDVMDALGSAI RVDTRGREVMRVMPRVNEAINEEWISDKSRFIWDGLKTQRLDRPYVRKDGRLQPATWGEAFGAIKQAVAATSGSKIGAIA GDLSSVEEMFALKSLLASLGSANVDCRQDGAALDPSLGRASYIFNSTIEGIEHADALLIVGSNPRFEAAVLNARIRKRWR RGGFPIGVIGEAGELRYNYEYLGSGAETLSDLANGSHSFIDKLKSAKNPLIIIGQGALARADGAAVLAAAAKLAVSVGAV TEEWNGFSVLHTAAARVGGLDIGFVPGEGAVAAAEMVTSMDVLFLLGADEIDLSNKGAKFTVYIGSHGDQGAMNADVILP GAAYTEKSGIWVNTEGRVQVGNRAGFAPGEAREDWAILRALSDVLGKKLPFDSLSALRGQLYVAHPHLAETDEIVAGKAT DIEALAGKTGSLTKSAFASPVKDFYLTNPIARASAVMAECSALARNNFQAAAE >Mature_692_residues AKLKVDGKEIEVPDHFTLLQACEEAGAEVPRFCFHERLSVAGNCRMCLIEVKGGPPKPAASCAMGVRDVRGGPNGELPEV FTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAFGIAGSRYAENKRAVEDKYIGPLVKTVMNRCIHCTRC VRFTTEVAGIAELGLIGRGEDAEITTYLEQAMTSELQGNVVDLCPVGALTSKPFAFTARPWELNKTETIDVMDALGSAIR VDTRGREVMRVMPRVNEAINEEWISDKSRFIWDGLKTQRLDRPYVRKDGRLQPATWGEAFGAIKQAVAATSGSKIGAIAG DLSSVEEMFALKSLLASLGSANVDCRQDGAALDPSLGRASYIFNSTIEGIEHADALLIVGSNPRFEAAVLNARIRKRWRR GGFPIGVIGEAGELRYNYEYLGSGAETLSDLANGSHSFIDKLKSAKNPLIIIGQGALARADGAAVLAAAAKLAVSVGAVT EEWNGFSVLHTAAARVGGLDIGFVPGEGAVAAAEMVTSMDVLFLLGADEIDLSNKGAKFTVYIGSHGDQGAMNADVILPG AAYTEKSGIWVNTEGRVQVGNRAGFAPGEAREDWAILRALSDVLGKKLPFDSLSALRGQLYVAHPHLAETDEIVAGKATD IEALAGKTGSLTKSAFASPVKDFYLTNPIARASAVMAECSALARNNFQAAAE
Specific function: NDH-1 shuttles electrons from NADH, via FMN and iron-sulfur (Fe-S) centers, to quinones in the respiratory chain. The immediate electron acceptor for the enzyme in this species is believed to be ubiquinone. Couples the redox reaction to proton translocat
COG id: COG1034
COG function: function code C; NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
Gene ontology:
Cell location: Cell inner membrane; Peripheral membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 2Fe-2S ferredoxin-type domain [H]
Homologues:
- Organism=Homo sapiens, GI33519475, Length=707, Percent_Identity=49.6463932107496, Blast_Score=677, Evalue=0.0
- Organism=Escherichia coli, GI145693161, Length=387, Percent_Identity=29.9741602067183, Blast_Score=180, Evalue=3e-46
- Organism=Caenorhabditis elegans, GI17565758, Length=700, Percent_Identity=48.8571428571429, Blast_Score=622, Evalue=1e-178
- Organism=Caenorhabditis elegans, GI32566231, Length=587, Percent_Identity=51.1073253833049, Blast_Score=566, Evalue=1e-161
- Organism=Caenorhabditis elegans, GI193209088, Length=258, Percent_Identity=59.6899224806201, Blast_Score=319, Evalue=4e-87
- Organism=Drosophila melanogaster, GI24640559, Length=690, Percent_Identity=50.2898550724638, Blast_Score=643, Evalue=0.0
- Organism=Drosophila melanogaster, GI24640557, Length=690, Percent_Identity=50.2898550724638, Blast_Score=643, Evalue=0.0
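Each homologue entry above is a comma-separated run of `key=value` fields plus a bare GI identifier. The sketch below shows one possible way to turn a single entry into a typed dictionary. `parse_homologue` is a hypothetical helper, and it assumes organism names contain no commas, which holds for the entries listed here.

```python
def parse_homologue(entry: str) -> dict:
    """Parse one homologue entry into a dictionary with typed values."""
    record = {}
    # Tolerate a leading list marker and a trailing comma.
    cleaned = entry.strip().lstrip("- ").rstrip(",")
    for field in cleaned.split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI145693161, Length=387, "
    "Percent_Identity=29.9741602067183, Blast_Score=180, Evalue=3e-46,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 145693161 30.0
```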
Paralogues:
None
Copy number: 60 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012675
- InterPro: IPR001041
- InterPro: IPR006656
- InterPro: IPR000283
- InterPro: IPR010228
- InterPro: IPR019574
- InterPro: IPR015405 [H]
Pfam domain/function: PF09326 DUF1982; PF00111 Fer2; PF00384 Molybdopterin; PF10588 NADH-G_4Fe-4S_3 [H]
EC number: 1.6.99.5 [H]
Molecular weight: Translated: 73881; Mature: 73750
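The two molecular weights differ by the mass of one methionine residue, which is consistent with the mature form lacking the initial Met. A short check, assuming an average methionine residue mass of about 131.19 Da (the free amino acid, 149.21 Da, minus one water); the names below are illustrative:

```python
MET_RESIDUE_MASS = 131.19  # Da, average mass of a Met residue (assumed constant)

TRANSLATED_MW = 73881  # from the record above
MATURE_MW = 73750      # from the record above


def mass_difference(translated: int, mature: int) -> int:
    """Mass lost between the translated and mature forms, in Da."""
    return translated - mature


diff = mass_difference(TRANSLATED_MW, MATURE_MW)
print(diff)                                # 131
print(abs(diff - MET_RESIDUE_MASS) < 1.0)  # True
```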
Theoretical pI: Translated: 5.28; Mature: 5.28
Prosite motif: PS00642 COMPLEX1_75K_2 ; PS00643 COMPLEX1_75K_3 ; PS51085 2FE2S_FER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
4.3 %Cys+Met (Translated Protein)
2.0 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
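Percentages like those above are residue counts divided by the protein length. The sketch below is a hypothetical helper that assumes one-letter amino acid codes and rounding to one decimal place, as the record appears to use. It is demonstrated on a short fragment rather than the full protein.

```python
def residue_percent(protein: str, residue: str) -> float:
    """Percentage of a given one-letter residue in a protein sequence."""
    # Drop the spaces and line breaks that the record uses for layout.
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return round(100.0 * cleaned.count(residue.upper()) / len(cleaned), 1)


fragment = "MAKLKVDGKE"  # first ten residues of the translated protein
print(residue_percent(fragment, "M"))  # 10.0
print(residue_percent(fragment, "C"))  # 0.0
```

Applying the same function to the translated and mature sequences should allow the listed Cys and Met values to be checked.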
Secondary structure:
>Translated Secondary Structure MAKLKVDGKEIEVPDHFTLLQACEEAGAEVPRFCFHERLSVAGNCRMCLIEVKGGPPKPA CCEEECCCCEECCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEEEECCCCCCCH ASCAMGVRDVRGGPNGELPEVFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQD HHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC QAMAFGIAGSRYAENKRAVEDKYIGPLVKTVMNRCIHCTRCVRFTTEVAGIAELGLIGRG CHHEEECCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC EDAEITTYLEQAMTSELQGNVVDLCPVGALTSKPFAFTARPWELNKTETIDVMDALGSAI CCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCEEECCCCCCCCCCHHHHHHHCCCCE RVDTRGREVMRVMPRVNEAINEEWISDKSRFIWDGLKTQRLDRPYVRKDGRLQPATWGEA EECCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHCCHHHHCCCCCCCCCCCCCCCCHHHHH FGAIKQAVAATSGSKIGAIAGDLSSVEEMFALKSLLASLGSANVDCRQDGAALDPSLGRA HHHHHHHHHHCCCCCCEEHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCH SYIFNSTIEGIEHADALLIVGSNPRFEAAVLNARIRKRWRRGGFPIGVIGEAGELRYNYE HHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCEEECHH YLGSGAETLSDLANGSHSFIDKLKSAKNPLIIIGQGALARADGAAVLAAAAKLAVSVGAV HHCCCHHHHHHHHCCCHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCH TEEWNGFSVLHTAAARVGGLDIGFVPGEGAVAAAEMVTSMDVLFLLGADEIDLSNKGAKF HHCCCCHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCEEEEECCCCCCCCCCCCEE TVYIGSHGDQGAMNADVILPGAAYTEKSGIWVNTEGRVQVGNRAGFAPGEAREDWAILRA EEEEECCCCCCCCCCCEEECCCCEECCCCEEECCCCCEEECCCCCCCCCCCHHHHHHHHH LSDVLGKKLPFDSLSALRGQLYVAHPHLAETDEIVAGKATDIEALAGKTGSLTKSAFASP HHHHHCCCCCHHHHHHHCCEEEEECCCCCCCCCEECCCCCCHHHHCCCCCCCCHHHHHHH VKDFYLTNPIARASAVMAECSALARNNFQAAAE HHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCC >Mature Secondary Structure AKLKVDGKEIEVPDHFTLLQACEEAGAEVPRFCFHERLSVAGNCRMCLIEVKGGPPKPA CEEECCCCEECCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEEEECCCCCCCH ASCAMGVRDVRGGPNGELPEVFTNTPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQD HHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC QAMAFGIAGSRYAENKRAVEDKYIGPLVKTVMNRCIHCTRCVRFTTEVAGIAELGLIGRG CHHEEECCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC EDAEITTYLEQAMTSELQGNVVDLCPVGALTSKPFAFTARPWELNKTETIDVMDALGSAI CCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCEEECCCCCCCCCCHHHHHHHCCCCE 
RVDTRGREVMRVMPRVNEAINEEWISDKSRFIWDGLKTQRLDRPYVRKDGRLQPATWGEA EECCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHCCHHHHCCCCCCCCCCCCCCCCHHHHH FGAIKQAVAATSGSKIGAIAGDLSSVEEMFALKSLLASLGSANVDCRQDGAALDPSLGRA HHHHHHHHHHCCCCCCEEHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCH SYIFNSTIEGIEHADALLIVGSNPRFEAAVLNARIRKRWRRGGFPIGVIGEAGELRYNYE HHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCEEECHH YLGSGAETLSDLANGSHSFIDKLKSAKNPLIIIGQGALARADGAAVLAAAAKLAVSVGAV HHCCCHHHHHHHHCCCHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCH TEEWNGFSVLHTAAARVGGLDIGFVPGEGAVAAAEMVTSMDVLFLLGADEIDLSNKGAKF HHCCCCHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCEEEEECCCCCCCCCCCCEE TVYIGSHGDQGAMNADVILPGAAYTEKSGIWVNTEGRVQVGNRAGFAPGEAREDWAILRA EEEEECCCCCCCCCCCEEECCCCEECCCCEEECCCCCEEECCCCCCCCCCCHHHHHHHHH LSDVLGKKLPFDSLSALRGQLYVAHPHLAETDEIVAGKATDIEALAGKTGSLTKSAFASP HHHHHCCCCCHHHHHHHCCEEEEECCCCCCCCCEECCCCCCHHHHCCCCCCCCHHHHHHH VKDFYLTNPIARASAVMAECSALARNNFQAAAE HHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 1605643; 8422400 [H]