| Definition | Escherichia coli 55989, complete genome. |
|---|---|
| Accession | NC_011748 |
| Length | 5,154,862 |
The map label for this gene is ygjR
Identifier: 218696791
GI number: 218696791
Start: 3585582
End: 3586586
Strand: Direct
Name: ygjR
Synonym: EC55989_3501
Alternate gene names: 218696791
Gene position: 3585582-3586586 (Clockwise)
Preceding gene: 218696790
Following gene: 218696792
Centisome position: 69.56
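The coordinate fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length (this definition is an assumption, although it reproduces the listed value). The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_154_862            # Length field of NC_011748
START, END = 3_585_582, 3_586_586    # Start / End fields of ygjR


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # 1005
print(round(centisome(START, GENOME_LENGTH), 2))   # 69.56
```

The computed length agrees with the `>1005_bases` header of the gene sequence.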
GC content: 52.54
Gene sequence:
>1005_bases
ATGGAGTATCCACGCGTTATGATACGTTTCGCTGTGATTGGTACGAACTGGATCACTCGCCAGTTCGTCGAGGCCGCCCA
TGAGAGCGGTAAATACAAGTTAACCGCCGTATATTCCCGCAGCCTTGAGCAGGCTCAGCACTTCGCCAATGATTTTTCTG
TCGAGCATCTGTTTACCTCGCTGGAAGCGATGGCGGAAAGCGATGCCATTGACGCGGTGTATATTGCCAGCCCGAATGCC
CTGCACTTTTCCCAGACACAACTTTTCCTTAGCCATAAAATTCATGTGATTTGCGAGAAACCACTGGCGTCGAATCTGGC
GGAAGTGGATGCCGCCATTGCCTGTGCGCGGGAAAATCAGGTGGTGCTGTTTGAGGCATTTAAAACCGCCTGCCTGCCGA
ACTTTCATTTGCTGCGCCAGGCGCTGCCGAAAGTCGGCAAATTGCGTAAAGTCTTTTTCAACTATTGCCAGTATTCCTCG
CGCTATCAACGTTACCTGGACGGTGAAAATCCCAACACCTTTAATCCGTCATTCTCTAACGGTTCAATTATGGATATCGG
CTTTTACTGCCTGGCGTCGGCGGTGGCGTTATTTGGTGAGCCGAAAAGCGTGCAGGCAACCGCCAGTTTGCTGGCAAGCG
GTGTAGATGCCCACGGCGTGGTGGTGATGGATTACGGTGATTTCAGCGTCACCTTGCAGCACTCAAAAGTCAGTGATTCT
GTCCTGGCGAGCGAGATTCAGGGCGAAGCAGGATCGCTGGTGATTGAAAAACTGTCTGAATGCCAGAAAGTGTGCTTCGT
GCCGCGTGGCAGCCAAATGCAGGATCTCACCCAGCCGCAGCATATTAATACCATGCTCTACGAAGCAGAGCTGTTCGCTA
CCCTGGTGGATGAGCATCTGGTGGATCATCCGGGGCTGGCGGTCAGTCGCATCACCGCCAAACTGCTGACCGAGATCCGC
CGCCAGACTGGGGTGATTTTTCCGGTAGATAGCGTAAAACTATAA
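The GC content field can in principle be recomputed from the gene sequence. A minimal sketch follows, assuming GC content is the percentage of G and C among the A/C/G/T characters of the sequence. Whitespace and any other characters are ignored here, which is a simplification (ambiguity codes such as N are not counted). The function name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among the A, C, G, T characters of a sequence.

    Whitespace, line breaks and non-ACGT symbols are skipped, so the
    space-separated or multi-line sequence text can be passed as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# Toy input; pass the full 1005-base sequence to compare with the GC content field.
print(gc_content("ATGC"))  # 50.0
```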
Upstream 100 bases:
>100_bases
TGGGCCGTTTAGTGAGCATGGCTGTCCGGCAAAAGAATAATGCGTATCTGCGCACGTCGAAGATGAAAAAGGCGTGCTAC
ATTGACGACAGAATCCCTTT
Downstream 100 bases:
>100_bases
TTGCCAAAGTAAAACAGTGTAAAAGGTATGTAACAGACCATTGACTGGCTGAATGGTCTGTCATACTTTGTTACCTGCAA
AGGGGAGTAACTTCATTGCC
Product: putative NAD(P)-binding dehydrogenase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 334; Mature: 334
Protein sequence:
>334_residues
MEYPRVMIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTSLEAMAESDAIDAVYIASPNA
LHFSQTQLFLSHKIHVICEKPLASNLAEVDAAIACARENQVVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSS
RYQRYLDGENPNTFNPSFSNGSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAHGVVVMDYGDFSVTLQHSKVSDS
VLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHLVDHPGLAVSRITAKLLTEIR
RQTGVIFPVDSVKL
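The protein sequence corresponds to the gene sequence read codon by codon: 1005 bases give 335 codons, and the final TAA stop codon leaves 334 residues. The sketch below illustrates this with a hand-built copy of the standard codon table. It assumes translation in frame 1 from the first base and stops at the first stop codon; bacterial alternative start codons are not handled. The `translate` helper is illustrative, not the tool used to generate this record.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (third codon position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is removed first; unknown codons are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven and last four codons of the gene sequence in this record.
print(translate("ATGGAGTATCCACGCGTTATG"))  # MEYPRVM
print(translate("GTAAAACTATAA"))           # VKL
```

Both fragments match the start (`MEYPRVM`) and end (`VKL`) of the listed protein sequence.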
Sequences:
>Translated_334_residues
MEYPRVMIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTSLEAMAESDAIDAVYIASPNA
LHFSQTQLFLSHKIHVICEKPLASNLAEVDAAIACARENQVVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSS
RYQRYLDGENPNTFNPSFSNGSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAHGVVVMDYGDFSVTLQHSKVSDS
VLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHLVDHPGLAVSRITAKLLTEIR
RQTGVIFPVDSVKL
>Mature_334_residues
MEYPRVMIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTSLEAMAESDAIDAVYIASPNA
LHFSQTQLFLSHKIHVICEKPLASNLAEVDAAIACARENQVVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSS
RYQRYLDGENPNTFNPSFSNGSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAHGVVVMDYGDFSVTLQHSKVSDS
VLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHLVDHPGLAVSRITAKLLTEIR
RQTGVIFPVDSVKL
Specific function: Unknown
COG id: COG0673
COG function: function code R; Predicted dehydrogenases and related proteins
Gene ontology:
Cell location: Membrane; Single-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the gfo/idh/mocA family [H]
Homologues:
Organism=Homo sapiens, GI7657212, Length=348, Percent_Identity=25.2873563218391, Blast_Score=95, Evalue=9e-20
Organism=Escherichia coli, GI145693182, Length=328, Percent_Identity=98.4756097560976, Blast_Score=671, Evalue=0.0
Organism=Drosophila melanogaster, GI24581117, Length=344, Percent_Identity=25.2906976744186, Blast_Score=103, Evalue=1e-22
Organism=Drosophila melanogaster, GI24581115, Length=341, Percent_Identity=24.0469208211144, Blast_Score=80, Evalue=1e-15
Organism=Drosophila melanogaster, GI24584727, Length=213, Percent_Identity=26.7605633802817, Blast_Score=74, Evalue=1e-13
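The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records for sorting or filtering. The sketch below is one possible way to do this with a regular expression. The field names in the resulting dictionaries and the `parse_hits` helper are illustrative choices, and the sample text holds only two of the hits listed above.

```python
import re

# One hit = Organism, GI, Length, Percent_Identity, Blast_Score, Evalue, in that order.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> list[dict]:
    """Extract every homologue hit from a Homologues field into a dict."""
    hits = []
    for m in HIT_RE.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": int(m["gi"]),
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI7657212, Length=348, "
    "Percent_Identity=25.2873563218391, Blast_Score=95, Evalue=9e-20, "
    "Organism=Escherichia coli, GI145693182, Length=328, "
    "Percent_Identity=98.4756097560976, Blast_Score=671, Evalue=0.0,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"])  # Escherichia coli 145693182
```

The pattern works whether the hits are separated by commas on one line or placed on separate lines.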
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016040
- InterPro: IPR000683 [H]
Pfam domain/function: PF01408 GFO_IDH_MocA [H]
EC number: 1.-.-.- [C]
Molecular weight: Translated: 37051; Mature: 37051
Theoretical pI: Translated: 6.23; Mature: 6.23
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
4.2 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
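The Cys/Met percentages can be reproduced by counting residues in the protein sequence. A minimal sketch, assuming the percentage is simply the residue count divided by the sequence length; the `residue_percent` helper is illustrative, and the sequence is the 334-residue protein from this record.

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MEYPRVMIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTSLEAMAESDAIDAVYIASPNA"
    "LHFSQTQLFLSHKIHVICEKPLASNLAEVDAAIACARENQVVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSS"
    "RYQRYLDGENPNTFNPSFSNGSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAHGVVVMDYGDFSVTLQHSKVSDS"
    "VLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHLVDHPGLAVSRITAKLLTEIR"
    "RQTGVIFPVDSVKL"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


print(round(residue_percent(PROTEIN, "C"), 1))   # 2.1
print(round(residue_percent(PROTEIN, "M"), 1))   # 2.1
print(round(residue_percent(PROTEIN, "CM"), 1))  # 4.2
```

Because the translated and mature proteins are identical in this record, the same values apply to both.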
Secondary structure:
>Translated Secondary Structure
MEYPRVMIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTS
CCCCCEEEEEEECCCHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHCHHHHHHHH
LEAMAESDAIDAVYIASPNALHFSQTQLFLSHKIHVICEKPLASNLAEVDAAIACARENQ
HHHHHCCCCCCEEEEECCCCEEHHHHHHHHHCCEEEEECCHHHHHHHHHHHHHHCCCCCC
VVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSSRYQRYLDGENPNTFNPSFSN
EEEEEHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC
GSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAHGVVVMDYGDFSVTLQHSKVSDS
CCEEHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCEEEEEECCEEEEEECCCCHHH
VLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHL
HHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHH
VDHPGLAVSRITAKLLTEIRRQTGVIFPVDSVKL
HCCCCHHHHHHHHHHHHHHHHHCCEEECCCCCCC
>Mature Secondary Structure
MEYPRVMIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTS
CCCCCEEEEEEECCCHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHCHHHHHHHH
LEAMAESDAIDAVYIASPNALHFSQTQLFLSHKIHVICEKPLASNLAEVDAAIACARENQ
HHHHHCCCCCCEEEEECCCCEEHHHHHHHHHCCEEEEECCHHHHHHHHHHHHHHCCCCCC
VVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSSRYQRYLDGENPNTFNPSFSN
EEEEEHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC
GSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAHGVVVMDYGDFSVTLQHSKVSDS
CCEEHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCEEEEEECCEEEEEECCCCHHH
VLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHL
HHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHH
VDHPGLAVSRITAKLLTEIRRQTGVIFPVDSVKL
HCCCCHHHHHHHHHHHHHHHHHCCEEECCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]