Definition Streptococcus pneumoniae D39, complete genome.
Accession NC_008533
Length 2,046,115

Click here to switch to the map view.

The map label for this gene is yjhC [C]

Identifier: 116516108

GI number: 116516108

Start: 1514630

End: 1515733

Strand: Reverse

Name: yjhC [C]

Synonym: SPD_1498

Alternate gene names: 116516108

Gene position: 1515733-1514630 (Counterclockwise)

Preceding gene: 116516987

Following gene: 116515887

Centisome position: 74.08

GC content: 39.4

Gene sequence:

>1104_bases
ATGGTTAAATACGGTGTTGTTGGAGCAGGGTATTTTGGAGCTGAATTGGCTCGCTACATGCAAAAAAATGATGGAGCAGA
GATTACTCTTCTCTATGATCCAGATAATGCAGAGGCGATTGCAGAAGAATTGGGAGCAAAAGTAGCAAGTTCCTTAGATG
AGTTGGTTTCTAGCGATGAAGTAGACTGTGTTATCGTCGCAACTCCAAATAATCTTCATAAGGAACCGGTTATTAAGGCT
GCACAGCATGGTAAAAATGTTTTCTGTGAAAAACCAATTGCGCTTTCTTATCAAGATTGTCGCGAGATGGTAGATGCGTG
TAAAGAAAACAATGTAACCTTTATGGCAGGACATATTATGAATTTCTTTAATGGTGTTCATCATGCAAAAGAACTCATTA
ATCAAGGAGTTATCGGAGACGTTCTATATTGTCATACAGCTCGTAATGGTTGGGAAGAACAACAACCGTCAGTATCATGG
AAAAAAATTCGTGAAAAATCAGGTGGTCACTTGTATCACCACATCCATGAATTGGATTGCGTTCAATTCCTTATGGGGGG
CATGCCTGAAACTGTAACCATGACAGGTGGAAATGTGGCCCATGAAGGTGAACATTTCGGTGATGAAGATGATATGATTT
TTGTCAATATGGAATTTTCTAATAAGCGTTTTGCCTTGTTAGAATGGGGTTCAGCTTATCGTTGGGGTGAACATTATGTC
TTAATCCAAGGAAGCAAAGGTGCCATCCGCTTAGACTTATTCAACTGTAAAGGAACTCTTAAGCTAGATGGGCAAGAAAG
CTATTTCTTGATTCACGAATCGCAAGAAGAAGATGATGATCGGACTCGTATCTATCATAGTACAGAGATGGATGGAGCAA
TTGCTTATGGTAAACCAGGTAAACGTACTCCATTATGGCTATCATCTGTCATTGATAAAGAAATGCGCTATCTGCATGAG
ATTATGCAAGGAGCTCCAGTATCAGAAGAATTTGCAAAACTTTTGACAGGTGAAGCTGCCCTAGAAGCAATTGCTACTGC
AGATGCTTGTACCCAGTCTATGTTTGAAGATCGCAAAGTAAAATTGTCAGAAATTGTAAAATAA

Upstream 100 bases:

>100_bases
AAAAATATGATTCGTGGTCGAGAAATGAATTGCATTTAAGCAATGTAGTTCAGTATATAGATTTGGAAATTAATGATTTA
ACAAAATAAAGGAGAAAAAC

Downstream 100 bases:

>100_bases
ATTTTGGTATTCTCCTATTATAGGTCGACTTGCTCCTCTGAAAGTACTTTTAGAGGAGCTGTTTGACTTGGCTAGTTTTT
GAAACTGAAATCTATTATAC

Product: oxidoreductase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 367; Mature: 367

Protein sequence:

>367_residues
MVKYGVVGAGYFGAELARYMQKNDGAEITLLYDPDNAEAIAEELGAKVASSLDELVSSDEVDCVIVATPNNLHKEPVIKA
AQHGKNVFCEKPIALSYQDCREMVDACKENNVTFMAGHIMNFFNGVHHAKELINQGVIGDVLYCHTARNGWEEQQPSVSW
KKIREKSGGHLYHHIHELDCVQFLMGGMPETVTMTGGNVAHEGEHFGDEDDMIFVNMEFSNKRFALLEWGSAYRWGEHYV
LIQGSKGAIRLDLFNCKGTLKLDGQESYFLIHESQEEDDDRTRIYHSTEMDGAIAYGKPGKRTPLWLSSVIDKEMRYLHE
IMQGAPVSEEFAKLLTGEAALEAIATADACTQSMFEDRKVKLSEIVK

Sequences:

>Translated_367_residues
MVKYGVVGAGYFGAELARYMQKNDGAEITLLYDPDNAEAIAEELGAKVASSLDELVSSDEVDCVIVATPNNLHKEPVIKA
AQHGKNVFCEKPIALSYQDCREMVDACKENNVTFMAGHIMNFFNGVHHAKELINQGVIGDVLYCHTARNGWEEQQPSVSW
KKIREKSGGHLYHHIHELDCVQFLMGGMPETVTMTGGNVAHEGEHFGDEDDMIFVNMEFSNKRFALLEWGSAYRWGEHYV
LIQGSKGAIRLDLFNCKGTLKLDGQESYFLIHESQEEDDDRTRIYHSTEMDGAIAYGKPGKRTPLWLSSVIDKEMRYLHE
IMQGAPVSEEFAKLLTGEAALEAIATADACTQSMFEDRKVKLSEIVK
>Mature_367_residues
MVKYGVVGAGYFGAELARYMQKNDGAEITLLYDPDNAEAIAEELGAKVASSLDELVSSDEVDCVIVATPNNLHKEPVIKA
AQHGKNVFCEKPIALSYQDCREMVDACKENNVTFMAGHIMNFFNGVHHAKELINQGVIGDVLYCHTARNGWEEQQPSVSW
KKIREKSGGHLYHHIHELDCVQFLMGGMPETVTMTGGNVAHEGEHFGDEDDMIFVNMEFSNKRFALLEWGSAYRWGEHYV
LIQGSKGAIRLDLFNCKGTLKLDGQESYFLIHESQEEDDDRTRIYHSTEMDGAIAYGKPGKRTPLWLSSVIDKEMRYLHE
IMQGAPVSEEFAKLLTGEAALEAIATADACTQSMFEDRKVKLSEIVK

Specific function: Unknown

COG id: COG0673

COG function: function code R; Predicted dehydrogenases and related proteins

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the gfo/idh/mocA family [H]

Homologues:

Organism=Escherichia coli, GI87082405, Length=367, Percent_Identity=64.5776566757493, Blast_Score=509, Evalue=1e-146,
Organism=Escherichia coli, GI1787574, Length=264, Percent_Identity=25.3787878787879, Blast_Score=64, Evalue=2e-11,
Organism=Saccharomyces cerevisiae, GI6323975, Length=267, Percent_Identity=25.4681647940075, Blast_Score=72, Evalue=2e-13,
Organism=Drosophila melanogaster, GI24581117, Length=277, Percent_Identity=27.4368231046931, Blast_Score=87, Evalue=1e-17,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR016040
- InterPro:   IPR000683
- InterPro:   IPR004104 [H]

Pfam domain/function: PF01408 GFO_IDH_MocA; PF02894 GFO_IDH_MocA_C [H]

EC number: 1.-.-.- [C]

Molecular weight: Translated: 41065; Mature: 41065

Theoretical pI: Translated: 4.87; Mature: 4.87

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.2 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
6.0 %Cys+Met (Translated Protein)
2.2 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
6.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MVKYGVVGAGYFGAELARYMQKNDGAEITLLYDPDNAEAIAEELGAKVASSLDELVSSDE
CCEEEEEECCHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCC
VDCVIVATPNNLHKEPVIKAAQHGKNVFCEKPIALSYQDCREMVDACKENNVTFMAGHIM
CCEEEEECCCCCCHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHCCCEEEEHHHHH
NFFNGVHHAKELINQGVIGDVLYCHTARNGWEEQQPSVSWKKIREKSGGHLYHHIHELDC
HHHHHHHHHHHHHHCCCHHHHHEEECCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHH
VQFLMGGMPETVTMTGGNVAHEGEHFGDEDDMIFVNMEFSNKRFALLEWGSAYRWGEHYV
HHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEEECCCEEEEEECCCCCCCCCEEE
LIQGSKGAIRLDLFNCKGTLKLDGQESYFLIHESQEEDDDRTRIYHSTEMDGAIAYGKPG
EEECCCCEEEEEEEECCEEEEECCCCCEEEEECCCCCCCCHHEEEECCCCCCEEEECCCC
KRTPLWLSSVIDKEMRYLHEIMQGAPVSEEFAKLLTGEAALEAIATADACTQSMFEDRKV
CCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
KLSEIVK
HHHHHCC
>Mature Secondary Structure
MVKYGVVGAGYFGAELARYMQKNDGAEITLLYDPDNAEAIAEELGAKVASSLDELVSSDE
CCEEEEEECCHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCC
VDCVIVATPNNLHKEPVIKAAQHGKNVFCEKPIALSYQDCREMVDACKENNVTFMAGHIM
CCEEEEECCCCCCHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHCCCEEEEHHHHH
NFFNGVHHAKELINQGVIGDVLYCHTARNGWEEQQPSVSWKKIREKSGGHLYHHIHELDC
HHHHHHHHHHHHHHCCCHHHHHEEECCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHH
VQFLMGGMPETVTMTGGNVAHEGEHFGDEDDMIFVNMEFSNKRFALLEWGSAYRWGEHYV
HHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEEECCCEEEEEECCCCCCCCCEEE
LIQGSKGAIRLDLFNCKGTLKLDGQESYFLIHESQEEDDDRTRIYHSTEMDGAIAYGKPG
EEECCCCEEEEEEEECCEEEEECCCCCEEEEECCCCCCCCHHEEEECCCCCCEEEECCCC
KRTPLWLSSVIDKEMRYLHEIMQGAPVSEEFAKLLTGEAALEAIATADACTQSMFEDRKV
CCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
KLSEIVK
HHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11463916; 8759848 [H]