| Definition | Yersinia pestis KIM 10 chromosome, complete genome. |
|---|---|
| Accession | NC_004088 |
| Length | 4,600,755 |
The map label for this gene is deoC [H]
Identifier: 22127614
GI number: 22127614
Start: 4162486
End: 4163298
Strand: Reverse
Name: deoC [H]
Synonym: y3743
Alternate gene names: 22127614
Gene position: 4163298-4162486 (Counterclockwise)
Preceding gene: 22127615
Following gene: 22127613
Centisome position: 90.49
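The coordinate fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that the centisome position is the gene's 5' coordinate (the End value, since the gene is on the reverse strand) expressed as a percentage of the chromosome length. That convention is inferred from the numbers in this record, not stated by it, and the function names are illustrative.

```python
GENOME_LENGTH = 4_600_755          # chromosome length from the record header
START, END = 4_162_486, 4_163_298  # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))        # 813
print(centisome(END, GENOME_LENGTH))  # 90.49
```

Using the Start coordinate instead gives 90.47, so the reported 90.49 appears to be measured from the reverse-strand gene's 5' end.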
GC content: 50.06
Gene sequence:
>813_bases GTGTTGGAGAGTTTTATGACCGATTTAACCGCCTGCGCGCATTTGAACGATTATGCTAAACGCGCACTAAGTTTGATGGA TTTAACCACCCTGAATGACGACGATACTGATGAAAAAGTGATCGCGCTCTGTCATCAGGCTAAAAGCCCGGCCGGTAATA CCGCGGCAATTTGTATTTATCCTCGCTTTATCCCGGTTGCCCGTAAGGCGTTACGTGAGCAGGGCACTCCTGAGATCCGT ATCGCAACGGTGACCAACTTCCCTCATGGCAATGATGATGTTGCGATTGCGTTGGCTGAAACCCGTGCTGCCATTGCTTA TGGTGCCGATGAAGTGGATGTGGTTTTCCCTTATCGCGCGTTAATGGCCGGTAACGATAAAATTGGCTTCGAACTCGTGA AAACGTGTAAAGAAGCCTGCGCTGCGGCGAATGTATTACTGAAAGTCATCATTGAAACTGGCGAACTTAAGCAAGCGCAT TTAATTCGTCAGGCCTCTGAAATTGCAATCAAAGCGGGTGCCGATTTTATTAAAACTTCTACCGGTAAAGTGCCGGTGAA TGCCACGTTGGAAAGTGCAGACATTATGATCCGCACTATCCGTGAGTTAGGCGTAGGCGAAACGGTGGGCTTCAAACCGG CTGGTGGTGTCCGTACCGCAGAAGATGCCGCGCAATTTTTGCAATTGGCTGATCAACTGATGGGGGAAGGCTGGGCTGAT GCACGCCATTTCCGCTTTGGCGCTTCCAGTCTACTCGCCAGTTTGCTGACCACATTGGGTCACCAGAGTAACGCCAACAG CAGCGGCTACTAA
Upstream 100 bases:
>100_bases CTCGCCAACGTATTGAGCTTAACCACCGAGAGATACCGCCAAGGATCTTTTCTGCTCATAACTACATGCTCAGTTTTGGC AAACTGCCGGAGCTGAAACA
Downstream 100 bases:
>100_bases CGTTATTATCATCATTACACTGAATTGATGCACAGTAGGCTGAAGGCCTGCTGTGTCGCTATTTCCGAAAGCGCCTGTGG GCGGCTTTCATCCCATATTC
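The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence; `gc_content` is a hypothetical helper, and the ten-base input is only the start of the gene, used for illustration.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100 * gc / len(bases), 2)


# First ten bases of the gene sequence, for illustration only.
print(gc_content("GTGTTGGAGA"))  # 50.0
```

For an 813-base gene, a reported value of 50.06 corresponds to 407 G or C bases (407 / 813), provided the value was computed over the gene sequence itself.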
Product: deoxyribose-phosphate aldolase
Products: NA
Alternate protein names: Phosphodeoxyriboaldolase 2; DERA 2; Deoxyriboaldolase 2 [H]
Number of amino acids: Translated: 270; Mature: 270
Protein sequence:
>270_residues MLESFMTDLTACAHLNDYAKRALSLMDLTTLNDDDTDEKVIALCHQAKSPAGNTAAICIYPRFIPVARKALREQGTPEIR IATVTNFPHGNDDVAIALAETRAAIAYGADEVDVVFPYRALMAGNDKIGFELVKTCKEACAAANVLLKVIIETGELKQAH LIRQASEIAIKAGADFIKTSTGKVPVNATLESADIMIRTIRELGVGETVGFKPAGGVRTAEDAAQFLQLADQLMGEGWAD ARHFRFGASSLLASLLTTLGHQSNANSSGY
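The protein sequence follows from the gene sequence by translation with the standard codon table. The sketch below makes one assumption that is labeled in the code: the first codon is always read as methionine, which is how the GTG start codon of this gene yields the leading M. `translate` is an illustrative name, not a tool used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (third base varies fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence given 5' to 3' on the coding strand."""
    seq = "".join(cds.split()).upper()
    if not seq or len(seq) % 3:
        raise ValueError("sequence length must be a positive multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    # Assumption: the initiator codon (GTG here) is always read as Met.
    protein = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    if protein[-1] == "*":  # drop the terminal stop codon
        protein.pop()
    return "".join(protein)


# First six codons of the gene sequence.
print(translate("GTGTTGGAGAGTTTTATG"))  # MLESFM
```

The full 813-base gene contains 271 codons, one of which is the TAA stop, which is consistent with the 270 residues reported.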
Sequences:
>Translated_270_residues MLESFMTDLTACAHLNDYAKRALSLMDLTTLNDDDTDEKVIALCHQAKSPAGNTAAICIYPRFIPVARKALREQGTPEIR IATVTNFPHGNDDVAIALAETRAAIAYGADEVDVVFPYRALMAGNDKIGFELVKTCKEACAAANVLLKVIIETGELKQAH LIRQASEIAIKAGADFIKTSTGKVPVNATLESADIMIRTIRELGVGETVGFKPAGGVRTAEDAAQFLQLADQLMGEGWAD ARHFRFGASSLLASLLTTLGHQSNANSSGY
>Mature_270_residues MLESFMTDLTACAHLNDYAKRALSLMDLTTLNDDDTDEKVIALCHQAKSPAGNTAAICIYPRFIPVARKALREQGTPEIR IATVTNFPHGNDDVAIALAETRAAIAYGADEVDVVFPYRALMAGNDKIGFELVKTCKEACAAANVLLKVIIETGELKQAH LIRQASEIAIKAGADFIKTSTGKVPVNATLESADIMIRTIRELGVGETVGFKPAGGVRTAEDAAQFLQLADQLMGEGWAD ARHFRFGASSLLASLLTTLGHQSNANSSGY
Specific function: Nucleotide and deoxyribonucleotide catabolism. [C]
COG id: COG0274
COG function: function code F; Deoxyribose-phosphate aldolase
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the deoC/fbaB aldolase family. DeoC type 2 subfamily [H]
Homologues:
- Organism=Homo sapiens, GI116063554, Length=267, Percent_Identity=37.4531835205993, Blast_Score=167, Evalue=1e-41
- Organism=Escherichia coli, GI1790841, Length=265, Percent_Identity=78.4905660377359, Blast_Score=384, Evalue=1e-108
- Organism=Caenorhabditis elegans, GI17533015, Length=251, Percent_Identity=37.4501992031873, Blast_Score=157, Evalue=6e-39
- Organism=Drosophila melanogaster, GI19922098, Length=255, Percent_Identity=37.6470588235294, Blast_Score=137, Evalue=8e-33
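The homologue entries use a fixed `key=value` layout, so they can be parsed mechanically. This is a rough sketch, assuming every entry carries the same six fields in the same order as the entries in this record. The pattern and `parse_homologues` are illustrative, and the sample text holds only two of the four hits.

```python
import re

# One homologue entry; field order is assumed to match this record.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    return [
        {
            "organism": m["organism"],
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        }
        for m in RECORD.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI116063554, Length=267, "
    "Percent_Identity=37.4531835205993, Blast_Score=167, Evalue=1e-41, "
    "Organism=Escherichia coli, GI1790841, Length=265, "
    "Percent_Identity=78.4905660377359, Blast_Score=384, Evalue=1e-108,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["identity"])
print(best["organism"])  # Escherichia coli
```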
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013785
- InterPro: IPR011343
- InterPro: IPR002915 [H]
Pfam domain/function: PF01791 DeoC [H]
EC number: 4.1.2.4 [H]
Molecular weight: Translated: 28882; Mature: 28882
Theoretical pI: Translated: 5.36; Mature: 5.36
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 1.9 %Cys (Translated Protein)
- 2.2 %Met (Translated Protein)
- 4.1 %Cys+Met (Translated Protein)
- 1.9 %Cys (Mature Protein)
- 2.2 %Met (Mature Protein)
- 4.1 %Cys+Met (Mature Protein)
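The Cys/Met percentages can be reproduced by counting residues in the protein sequence. A minimal sketch follows, assuming each percentage is the residue count divided by the total length, rounded to one decimal. `residue_percent` is a hypothetical helper, and the sequence is copied from the Protein sequence field.

```python
# Protein sequence copied from the record (270 residues).
PROTEIN = (
    "MLESFMTDLTACAHLNDYAKRALSLMDLTTLNDDDTDEKVIALCHQAKSPAGNTAAICIYPRFIPVARKALREQGTPEIR"
    "IATVTNFPHGNDDVAIALAETRAAIAYGADEVDVVFPYRALMAGNDKIGFELVKTCKEACAAANVLLKVIIETGELKQAH"
    "LIRQASEIAIKAGADFIKTSTGKVPVNATLESADIMIRTIRELGVGETVGFKPAGGVRTAEDAAQFLQLADQLMGEGWAD"
    "ARHFRFGASSLLASLLTTLGHQSNANSSGY"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 1.9
print(residue_percent(PROTEIN, "M"))   # 2.2
print(residue_percent(PROTEIN, "CM"))  # 4.1
```

These values correspond to 5 cysteines and 6 methionines in 270 residues.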
Secondary structure:
>Translated Secondary Structure
MLESFMTDLTACAHLNDYAKRALSLMDLTTLNDDDTDEKVIALCHQAKSPAGNTAAICIY
CHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHCCCCCCEEEEEEE
PRFIPVARKALREQGTPEIRIATVTNFPHGNDDVAIALAETRAAIAYGADEVDVVFPYRA
CCHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECHHH
LMAGNDKIGFELVKTCKEACAAANVLLKVIIETGELKQAHLIRQASEIAIKAGADFIKTS
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHCC
TGKVPVNATLESADIMIRTIRELGVGETVGFKPAGGVRTAEDAAQFLQLADQLMGEGWAD
CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC
ARHFRFGASSLLASLLTTLGHQSNANSSGY
HHHHHCCHHHHHHHHHHHHCCCCCCCCCCC
>Mature Secondary Structure
MLESFMTDLTACAHLNDYAKRALSLMDLTTLNDDDTDEKVIALCHQAKSPAGNTAAICIY
CHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHCCCCCCEEEEEEE
PRFIPVARKALREQGTPEIRIATVTNFPHGNDDVAIALAETRAAIAYGADEVDVVFPYRA
CCHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECHHH
LMAGNDKIGFELVKTCKEACAAANVLLKVIIETGELKQAHLIRQASEIAIKAGADFIKTS
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHCC
TGKVPVNATLESADIMIRTIRELGVGETVGFKPAGGVRTAEDAAQFLQLADQLMGEGWAD
CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC
ARHFRFGASSLLASLLTTLGHQSNANSSGY
HHHHHCCHHHHHHHHHHHHCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11586360; 12142430 [H]