Definition | Frankia sp. EAN1pec chromosome, complete genome. |
---|---|
Accession | NC_009921 |
Length | 8,982,042 |
The map label for this gene is yhhX [H]
Identifier: 158313675
GI number: 158313675
Start: 2208902
End: 2210458
Strand: Direct
Name: yhhX [H]
Synonym: Franean1_1839
Alternate gene names: 158313675
Gene position: 2208902-2210458 (Clockwise)
Preceding gene: 158313674
Following gene: 158313676
Centisome position: 24.59
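The coordinate fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of the chromosome length; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    # Values copied from the Start, End and Length fields of this record.
    print(gene_length(2208902, 2210458))   # bases spanned by the gene
    print(centisome(2208902, 8982042))     # position on a 0-100 scale
```

Under these assumptions the span is 1557 bases, matching the header of the gene sequence, and the computed position matches the reported 24.59.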
GC content: 74.63
Gene sequence:
>1557_bases GTGCCCCCCGTGCCACGCATTCACCTGGTGTCGAACCTTCCCGGCGTGAATGCGCTCGGTGATCATCTACGGGCCGCCGG GCTCGGGCCGACGTCGGCCCGCTCGGCCGACGCGCTGCTCGTCCTCATCGACCGCCCGCTTGACCATGTCGAGCAGGAGC TGCTCGACCGGGCCCGGCAGTCCGTGCCCGTCCTGCTGGCCGGGCCGACCGTCCGGTCGCTGTCGCCGGACAGCCCGCTC ATCGACGCCTCCGGGCTCACCCCCGGCCGGGTGACGCCCCCCTACGACCTGCCGCTGATCCCCGGGCCGGACGGCGCCGC CGTGGCCGCCCGCCTCGGGGACTTCCGGCCTCGGGAGTCCTGGGTCATCCCGGAGAAGGTGGCCGAGGACGTCGAACGGC TGCTCATGGTGCGCCACGAGATGGGGGAGCACCCGATCTGCACCTGGCGGCCGTCCACCGGCCTGGGCATCTTCACCCTC GGGGCCGGCGAGGAACTGCTCGCCGATCCCCGCTACCAGCGGCTCGTCGGCCGCTGGCTGCGCCACGCGCTCGGCGTGAC CGACGCCGGGCCGGTCAAGGTCGGGCTCATCGGCGCCCCGGACGTCTTCGGTGTGCACATCGACGCCGTCGACGCCGTCG AGGGCCTGGAGCTGGCCGCCCTCTGCGACGGTGGGATGGCCCGCCCGCCCGGCCAGGACACCGACCGGCCGGCCCGCCGG GTCGACGACCCCGACGACCTGGTCAACGACCCCGAGCTCAACCTGGTGATCGTCGCGACCCCCACCCACACCCATGTCGA GTGGGCCCGCCGCGCGCTCGAGGCCGGCAAGCAGGTGGTCGTGCACGCCCCGATGTGCCTGTCGACCCACGAGGTGGACG AGCTGACCGAGCTTGCCCAGAGCCGCTCGCTGCTGCTCGCCGTCTACCCCGACGGCCAGGACGACCCAGGTCACCGGGCG ATGCGCACGGCCGTGCACCGCGGGGACGTCGGCGAGGTGATGTGGATCGACGTCTTCTCCGGTGGTCTGCGGCGGCCTGC CGGCACCTGGCACGACGACGAGCGGATCAGCGGCGGGCTGATCTTCGACCGGGGCGCCGCGCAGCTCGGCCGTGTCCTCG ATCTCGTCGACGACCAGGTCGAGTGGGTCAGCGCCTTCGGGCACAAGCGGGTGTGGCACCACGTCACGAACGCCGACCAC GCCCGGGTGCTCCTCCACTTCGCCGGCGGCTGCGAGGCCCAGGTGACGATTTCGGACGTGTCCGCCGCCGCGCGGCCCGG CACCCAGGTGCTTGGCAGCATCGGCTCGTTGCTCGCCCGCGACGGCCACGGCGGGGCGCCGCGGTCACCGCGGCTGCTCG CCCACGACGGGACGCGCACCGCGCTGCCGACCGGGTCGCACGACGCCACGAGGTTCCATCGCGAGCTCGCGGACTGCCTG GTCGCCGGCTGGCCACTGGCCGAGCACCAGCCGGAGGACGCCCGCCGGCTGGTCGCGGTGCTCGAGGCGGCCCGCAGGTC GGCGGCGGCCGGCGGCGCGCAGATGGCGCCCGGCTGA
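The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is an illustrative name, and whitespace is stripped because the sequence is listed here in space-separated chunks.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


if __name__ == "__main__":
    # Toy input only; paste the full gene sequence to check the reported value.
    print(gc_content("GGCC AT"))
```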
Upstream 100 bases:
>100_bases CTACTTTTCGTAACGATATTGTTTACTTTTTGTCGTGGAGTGCGTGGCGTGCTCGCGTGCCGTGAACCCACTGTTCCCGT CCGGGTGGGTTCTGACTAAA
Downstream 100 bases:
>100_bases CCAGCCCGGCGCAGGGTGTTTACTGTTGACCATGGATTGGCTGAACGTGCCCGACGTGGCCGAGAGTCTGGGAGTGCCCG TCACCCGGGTGCGCCAGATG
Product: oxidoreductase domain-containing protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 518; Mature: 517
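Both residue counts follow from the gene length. The sketch below assumes the final codon is a stop codon (the gene sequence ends in TGA) and that the mature form differs only by removal of the initiator methionine, which is what the listed mature sequence suggests; the record does not state how the mature form was derived, and `residue_counts` is an illustrative name.

```python
def residue_counts(n_bases: int, has_stop: bool = True,
                   removes_initiator_met: bool = True) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - (1 if has_stop else 0)          # stop codon adds no residue
    mature = translated - (1 if removes_initiator_met else 0)
    return translated, mature


if __name__ == "__main__":
    print(residue_counts(1557))
```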
Protein sequence:
>518_residues MPPVPRIHLVSNLPGVNALGDHLRAAGLGPTSARSADALLVLIDRPLDHVEQELLDRARQSVPVLLAGPTVRSLSPDSPL IDASGLTPGRVTPPYDLPLIPGPDGAAVAARLGDFRPRESWVIPEKVAEDVERLLMVRHEMGEHPICTWRPSTGLGIFTL GAGEELLADPRYQRLVGRWLRHALGVTDAGPVKVGLIGAPDVFGVHIDAVDAVEGLELAALCDGGMARPPGQDTDRPARR VDDPDDLVNDPELNLVIVATPTHTHVEWARRALEAGKQVVVHAPMCLSTHEVDELTELAQSRSLLLAVYPDGQDDPGHRA MRTAVHRGDVGEVMWIDVFSGGLRRPAGTWHDDERISGGLIFDRGAAQLGRVLDLVDDQVEWVSAFGHKRVWHHVTNADH ARVLLHFAGGCEAQVTISDVSAAARPGTQVLGSIGSLLARDGHGGAPRSPRLLAHDGTRTALPTGSHDATRFHRELADCL VAGWPLAEHQPEDARRLVAVLEAARRSAAAGGAQMAPG
Sequences:
>Translated_518_residues MPPVPRIHLVSNLPGVNALGDHLRAAGLGPTSARSADALLVLIDRPLDHVEQELLDRARQSVPVLLAGPTVRSLSPDSPL IDASGLTPGRVTPPYDLPLIPGPDGAAVAARLGDFRPRESWVIPEKVAEDVERLLMVRHEMGEHPICTWRPSTGLGIFTL GAGEELLADPRYQRLVGRWLRHALGVTDAGPVKVGLIGAPDVFGVHIDAVDAVEGLELAALCDGGMARPPGQDTDRPARR VDDPDDLVNDPELNLVIVATPTHTHVEWARRALEAGKQVVVHAPMCLSTHEVDELTELAQSRSLLLAVYPDGQDDPGHRA MRTAVHRGDVGEVMWIDVFSGGLRRPAGTWHDDERISGGLIFDRGAAQLGRVLDLVDDQVEWVSAFGHKRVWHHVTNADH ARVLLHFAGGCEAQVTISDVSAAARPGTQVLGSIGSLLARDGHGGAPRSPRLLAHDGTRTALPTGSHDATRFHRELADCL VAGWPLAEHQPEDARRLVAVLEAARRSAAAGGAQMAPG >Mature_517_residues PPVPRIHLVSNLPGVNALGDHLRAAGLGPTSARSADALLVLIDRPLDHVEQELLDRARQSVPVLLAGPTVRSLSPDSPLI DASGLTPGRVTPPYDLPLIPGPDGAAVAARLGDFRPRESWVIPEKVAEDVERLLMVRHEMGEHPICTWRPSTGLGIFTLG AGEELLADPRYQRLVGRWLRHALGVTDAGPVKVGLIGAPDVFGVHIDAVDAVEGLELAALCDGGMARPPGQDTDRPARRV DDPDDLVNDPELNLVIVATPTHTHVEWARRALEAGKQVVVHAPMCLSTHEVDELTELAQSRSLLLAVYPDGQDDPGHRAM RTAVHRGDVGEVMWIDVFSGGLRRPAGTWHDDERISGGLIFDRGAAQLGRVLDLVDDQVEWVSAFGHKRVWHHVTNADHA RVLLHFAGGCEAQVTISDVSAAARPGTQVLGSIGSLLARDGHGGAPRSPRLLAHDGTRTALPTGSHDATRFHRELADCLV AGWPLAEHQPEDARRLVAVLEAARRSAAAGGAQMAPG
Specific function: Unknown
COG id: COG0673
COG function: function code R; Predicted dehydrogenases and related proteins
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the gfo/idh/mocA family. Biliverdin reductase subfamily [H]
Homologues:
Organism=Escherichia coli, GI1789848, Length=95, Percent_Identity=35.7894736842105, Blast_Score=72, Evalue=7e-14
Organism=Escherichia coli, GI87081947, Length=227, Percent_Identity=26.8722466960352, Blast_Score=71, Evalue=2e-13
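The homologue entries are comma-separated `key=value` fields, which makes them easy to load for filtering by E-value or identity. This is a sketch only: `parse_homologues` is a hypothetical helper, and it assumes every hit begins with `Organism=` and that the second field is a bare GI identifier, as in the two hits listed here.

```python
def parse_homologues(text: str) -> list[dict]:
    """Parse 'Organism=..., GI..., key=value, ...' hit records into dicts."""
    hits = []
    for chunk in text.split("Organism=")[1:]:
        # Drop surrounding whitespace and the trailing comma, then split fields.
        fields = [f.strip() for f in chunk.strip().strip(",").split(",")]
        hit = {"Organism": fields[0], "GI": fields[1]}
        for field in fields[2:]:
            if "=" in field:
                key, value = field.split("=", 1)
                hit[key.strip()] = float(value)   # numeric fields, e.g. Evalue=7e-14
        hits.append(hit)
    return hits


RECORD = (
    "Organism=Escherichia coli, GI1789848, Length=95, "
    "Percent_Identity=35.7894736842105, Blast_Score=72, Evalue=7e-14, "
    "Organism=Escherichia coli, GI87081947, Length=227, "
    "Percent_Identity=26.8722466960352, Blast_Score=71, Evalue=2e-13,"
)

if __name__ == "__main__":
    for hit in parse_homologues(RECORD):
        print(hit["GI"], hit["Length"], hit["Evalue"])
```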
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016040
- InterPro: IPR000683
- InterPro: IPR004104 [H]
Pfam domain/function: PF01408 GFO_IDH_MocA; PF02894 GFO_IDH_MocA_C [H]
EC number: 1.-.-.- [C]
Molecular weight: Translated: 55388; Mature: 55257
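The gap between the two molecular weights is about what removal of a single methionine residue would produce. The sketch below checks this, assuming an average methionine residue mass of roughly 131.19 Da and an arbitrary tolerance of 1 Da to absorb rounding in the reported values; the names are illustrative and the record itself does not say how the weights were calculated.

```python
MET_RESIDUE_MASS = 131.19  # approximate average mass of a Met residue in Da (assumed)


def mass_lost(translated_mw: float, mature_mw: float) -> float:
    """Mass difference between the translated and mature forms."""
    return translated_mw - mature_mw


def consistent_with_met_removal(translated_mw: float, mature_mw: float,
                                tolerance: float = 1.0) -> bool:
    """True if the mass difference matches one methionine residue within tolerance."""
    return abs(mass_lost(translated_mw, mature_mw) - MET_RESIDUE_MASS) <= tolerance


if __name__ == "__main__":
    print(mass_lost(55388, 55257))
    print(consistent_with_met_removal(55388, 55257))
```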
Theoretical pI: Translated: 5.99; Mature: 5.99
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
2.5 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
2.3 %Cys+Met (Mature Protein)
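These percentages can be recomputed from the protein sequences. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal place; `percent_content` is an illustrative name. The counts in the usage lines (5 Cys and 8 Met in the 518-residue translation, one Met fewer in the mature form) were tallied by hand from the sequence in this record and should be re-derived rather than trusted.

```python
def percent_content(sequence: str, residues: str) -> float:
    """Percentage of the sequence made up of any residue in `residues`."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * count / len(seq), 1)


if __name__ == "__main__":
    # Toy sequence; pass the Translated or Mature sequence for the real values.
    print(percent_content("MCAAAAAAAA", "C"), percent_content("MCAAAAAAAA", "CM"))
    # Hand-tallied counts (assumption): 5 Cys, 8 Met in 518 residues.
    print(round(100 * 5 / 518, 1), round(100 * 8 / 518, 1), round(100 * 13 / 518, 1))
    # Mature form, assuming only the initiator Met is removed: 5 Cys, 7 Met in 517.
    print(round(100 * 5 / 517, 1), round(100 * 7 / 517, 1), round(100 * 12 / 517, 1))
```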
Secondary structure:
>Translated Secondary Structure
MPPVPRIHLVSNLPGVNALGDHLRAAGLGPTSARSADALLVLIDRPLDHVEQELLDRARQ
CCCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHH
SVPVLLAGPTVRSLSPDSPLIDASGLTPGRVTPPYDLPLIPGPDGAAVAARLGDFRPRES
CCCEEEECCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCC
WVIPEKVAEDVERLLMVRHEMGEHPICTWRPSTGLGIFTLGAGEELLADPRYQRLVGRWL
CCCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCEEEEECCCHHHHCCHHHHHHHHHHH
RHALGVTDAGPVKVGLIGAPDVFGVHIDAVDAVEGLELAALCDGGMARPPGQDTDRPARR
HHHHCCCCCCCEEEEEEECCCEEEEEEEHHHHHCCCEEEEECCCCCCCCCCCCCCCCHHC
VDDPDDLVNDPELNLVIVATPTHTHVEWARRALEAGKQVVVHAPMCLSTHEVDELTELAQ
CCCHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHH
SRSLLLAVYPDGQDDPGHRAMRTAVHRGDVGEVMWIDVFSGGLRRPAGTWHDDERISGGL
CCCEEEEECCCCCCCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCE
IFDRGAAQLGRVLDLVDDQVEWVSAFGHKRVWHHVTNADHARVLLHFAGGCEAQVTISDV
EEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHEEEEEECCCCCEEEEHHHH
SAAARPGTQVLGSIGSLLARDGHGGAPRSPRLLAHDGTRTALPTGSHDATRFHRELADCL
HHHCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH
VAGWPLAEHQPEDARRLVAVLEAARRSAAAGGAQMAPG
HHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC
>Mature Secondary Structure
PPVPRIHLVSNLPGVNALGDHLRAAGLGPTSARSADALLVLIDRPLDHVEQELLDRARQ
CCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHH
SVPVLLAGPTVRSLSPDSPLIDASGLTPGRVTPPYDLPLIPGPDGAAVAARLGDFRPRES
CCCEEEECCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCC
WVIPEKVAEDVERLLMVRHEMGEHPICTWRPSTGLGIFTLGAGEELLADPRYQRLVGRWL
CCCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCEEEEECCCHHHHCCHHHHHHHHHHH
RHALGVTDAGPVKVGLIGAPDVFGVHIDAVDAVEGLELAALCDGGMARPPGQDTDRPARR
HHHHCCCCCCCEEEEEEECCCEEEEEEEHHHHHCCCEEEEECCCCCCCCCCCCCCCCHHC
VDDPDDLVNDPELNLVIVATPTHTHVEWARRALEAGKQVVVHAPMCLSTHEVDELTELAQ
CCCHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHH
SRSLLLAVYPDGQDDPGHRAMRTAVHRGDVGEVMWIDVFSGGLRRPAGTWHDDERISGGL
CCCEEEEECCCCCCCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCE
IFDRGAAQLGRVLDLVDDQVEWVSAFGHKRVWHHVTNADHARVLLHFAGGCEAQVTISDV
EEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHEEEEEECCCCCEEEEHHHH
SAAARPGTQVLGSIGSLLARDGHGGAPRSPRLLAHDGTRTALPTGSHDATRFHRELADCL
HHHCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH
VAGWPLAEHQPEDARRLVAVLEAARRSAAAGGAQMAPG
HHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9278503; 10493123 [H]