| Definition | Deinococcus geothermalis DSM 11300 plasmid pDGEO01, complete sequence. |
|---|---|
| Accession | NC_008010 |
| Length | 574,127 |
The map label for this gene is 94972291
Identifier: 94972291
GI number: 94972291
Start: 197949
End: 199481
Strand: Direct
Name: 94972291
Synonym: Dgeo_2828
Alternate gene names: NA
Gene position: 197949-199481 (Clockwise)
Preceding gene: 94972292
Following gene: 94972285
Centisome position: 34.48
GC content: 69.08
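The coordinate-derived fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates, a centisome position defined as the gene start expressed as a percentage of the replicon length, and an open reading frame that ends in a single stop codon. These assumptions are not stated by the record, although they are consistent with the values listed in it.

```python
# Values copied from the record above.
PLASMID_LENGTH = 574_127   # "Length" field
GENE_START = 197_949       # "Start" field
GENE_END = 199_481         # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Gene start as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


def translated_residues(n_bases: int) -> int:
    """Residue count, assuming the ORF ends in one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return n_bases // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    n = gene_length(GENE_START, GENE_END)
    print(n, round(centisome(GENE_START, PLASMID_LENGTH), 2), translated_residues(n))
```

Passing the gene sequence listed below to `gc_percent` should give a value comparable to the GC content field, provided the sequence is copied without the FASTA header.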
Gene sequence:
>1533_bases ATGACCTCTGAGCGGTCCTGGACCCTGCCCATCTCCCCTCATGAACTGCGGGTTCTCGTGAACGGCTACCAGTCGTGGAG CGAGGCGGAGCTGCGGGCGCTGCTGGACACGCCGCAGCGCGCGCTGTTTTCGTGGATGATCGAGCAGGGGCAAGATCCGG TCTTTCCCCCCAGCGGGGAGGCGGGCGTATGGCGTTCGCACACCCTGATCGCCCTGATTCGCCCGGATGGGGGCGGCTGG GTGGGCTGTCTTCAAGATGCCCGGCAGACCTTTGCTCACTGGGAGGCGCGGGCGGTGGGGGGGCAGGTGGAGCTGAGCTG CACCTTGGAGGGTCCCCCAGCGGAGGTGGCCTTTGAGGAGACCGAGGACGTGATCACAACCGTCGAAACCCTCACGGCAC GATTGGGAACGGTGATGCAGGCCCGGACCCCGCCGCCCCTGCGGGTGTGGTGCAGTTGGTACAGCTACTACCGGGACGTG ACGCTCGCCGCGATGCTGGACAATGCCCGGCTCGCCCGCGAGTTGGCGCTGCCTTTCGATGTGTTTCAGCTGGACGACGG CTTTCAGGCAGATTTGGGCGACTGGTTGGAGCCTAGCGCCTGGTTCGGGGGTCATGCGAAGGACCTTCCGGCGCAGCTCA AAGAACTCGGCTTCCGGGCTGGGTTGTGGTTGGCTCCCTTTTTGGTGGGTCCGCGCTCGCGGCTGCGGGCGCAGCAGCCA GAGTGGCTGCTGCGCGGTGCGGACGGGGAGCCGCTGCTGGCAGGGCACAACTGGGGCGGTCCCTACCACGTGCTGGACAC CACCCACCCCGAGGTGCTGGCGTGGCTGCGTGACCTGGCGGCCACTGTGCGGGGGTGGGGCTACACGTATCTGAAGCTGG ACTTTCTGTACGGGGCTGCGCTGCCCGGGGTGCGGTATGACCCAGCGGTGAGCCGGGCTGCGGCCTACCGGCAGGGTGTG CAGGCGCTGCGGGATGGGGCGGGCGAGGAGACCTTCCTGCTGGGGTGTGGGGCGCCCCTTGCGAGCAGCATCGGGCTGGT GGACGCGATGCGGACCGGGCCGGACGTGACGCCCTTTTGGGACGACGAGGCGCGGCGGGTGCTGCTGGGGGACGGGGCGG TGCCCAGCACGCGTAGCGTGCTGCACACGGCCCTGTCGCGCTGGTACCAGCATGCCTGGTACCAGCCTGACCCGGATGTG ATGATTGCCCGGCGGGAACTGAGTCTGCTGGGGGAACACGAGCGCGGCGCCCTGCTTGGCCTGCTCGACGTGATTGGTGG CCTGCGGGCGAGCAGCGATCCCATCCGGCTGCTGGACGAGGCGGGACGGGCGCTGCTGCGGCAGAGCCTGCAGCTCAGCC GCCCGGATCGGCCCCGGACGCTGACGACGAGTTACGGCGGAGCGGTGACCCACTTCACGCGGGGCACCTTTAACCTGCTG GATGTGCCTGCGGGTGGCCTGGCGCCGCACAGTTACAGCGCGGCCCAGGTTGGGGGTCTCCAGCCCCTTCTCGCCCGCCA CCAGCAGACCTAG
Upstream 100 bases:
>100_bases CTCAGTCGCCGCGCCGGGGTGACGCTGGTACAGAACTGGACCGACGGGCCGGTGACCTGGCAGGGCCAAACGCTGCCACC CGTGAGTTTCGAGGTGCGGC
Downstream 100 bases:
>100_bases GTCCGTCGGGCATGACCCGGCCAGCACATGTCCGGAAGGTGCCACGCGGCCGTGCGGGACGTGGGCGGCAGGTCTTGCGC GAACTGGAGGCCGCCGGAGG
Product: glycoside hydrolase, clan GH-D
Products: NA
Alternate protein names: Glycoside Hydrolase Clan GH-D; Alpha-1 6-Galactosidase; Alpha-Galactosidase-Like Protein; Melibiase Family/PKD Domain Protein; Alpha-Glucosidase; Glycoside Hydrolase/PKD; Alpha-Galactosidase SCF; Glycoside Hydrolase
Number of amino acids: Translated: 510; Mature: 509
Protein sequence:
>510_residues MTSERSWTLPISPHELRVLVNGYQSWSEAELRALLDTPQRALFSWMIEQGQDPVFPPSGEAGVWRSHTLIALIRPDGGGW VGCLQDARQTFAHWEARAVGGQVELSCTLEGPPAEVAFEETEDVITTVETLTARLGTVMQARTPPPLRVWCSWYSYYRDV TLAAMLDNARLARELALPFDVFQLDDGFQADLGDWLEPSAWFGGHAKDLPAQLKELGFRAGLWLAPFLVGPRSRLRAQQP EWLLRGADGEPLLAGHNWGGPYHVLDTTHPEVLAWLRDLAATVRGWGYTYLKLDFLYGAALPGVRYDPAVSRAAAYRQGV QALRDGAGEETFLLGCGAPLASSIGLVDAMRTGPDVTPFWDDEARRVLLGDGAVPSTRSVLHTALSRWYQHAWYQPDPDV MIARRELSLLGEHERGALLGLLDVIGGLRASSDPIRLLDEAGRALLRQSLQLSRPDRPRTLTTSYGGAVTHFTRGTFNLL DVPAGGLAPHSYSAAQVGGLQPLLARHQQT
Sequences:
>Translated_510_residues MTSERSWTLPISPHELRVLVNGYQSWSEAELRALLDTPQRALFSWMIEQGQDPVFPPSGEAGVWRSHTLIALIRPDGGGW VGCLQDARQTFAHWEARAVGGQVELSCTLEGPPAEVAFEETEDVITTVETLTARLGTVMQARTPPPLRVWCSWYSYYRDV TLAAMLDNARLARELALPFDVFQLDDGFQADLGDWLEPSAWFGGHAKDLPAQLKELGFRAGLWLAPFLVGPRSRLRAQQP EWLLRGADGEPLLAGHNWGGPYHVLDTTHPEVLAWLRDLAATVRGWGYTYLKLDFLYGAALPGVRYDPAVSRAAAYRQGV QALRDGAGEETFLLGCGAPLASSIGLVDAMRTGPDVTPFWDDEARRVLLGDGAVPSTRSVLHTALSRWYQHAWYQPDPDV MIARRELSLLGEHERGALLGLLDVIGGLRASSDPIRLLDEAGRALLRQSLQLSRPDRPRTLTTSYGGAVTHFTRGTFNLL DVPAGGLAPHSYSAAQVGGLQPLLARHQQT >Mature_509_residues TSERSWTLPISPHELRVLVNGYQSWSEAELRALLDTPQRALFSWMIEQGQDPVFPPSGEAGVWRSHTLIALIRPDGGGWV GCLQDARQTFAHWEARAVGGQVELSCTLEGPPAEVAFEETEDVITTVETLTARLGTVMQARTPPPLRVWCSWYSYYRDVT LAAMLDNARLARELALPFDVFQLDDGFQADLGDWLEPSAWFGGHAKDLPAQLKELGFRAGLWLAPFLVGPRSRLRAQQPE WLLRGADGEPLLAGHNWGGPYHVLDTTHPEVLAWLRDLAATVRGWGYTYLKLDFLYGAALPGVRYDPAVSRAAAYRQGVQ ALRDGAGEETFLLGCGAPLASSIGLVDAMRTGPDVTPFWDDEARRVLLGDGAVPSTRSVLHTALSRWYQHAWYQPDPDVM IARRELSLLGEHERGALLGLLDVIGGLRASSDPIRLLDEAGRALLRQSLQLSRPDRPRTLTTSYGGAVTHFTRGTFNLLD VPAGGLAPHSYSAAQVGGLQPLLARHQQT
Specific function: Unknown
COG id: COG3345
COG function: function code G; Alpha-galactosidase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: 3.2.1.22
Molecular weight: Translated: 56102; Mature: 55971
Theoretical pI: Translated: 5.60; Mature: 5.60
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 0.8 % Cys; 1.2 % Met; 2.0 % Cys+Met
Mature protein: 0.8 % Cys; 1.0 % Met; 1.8 % Cys+Met
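Percentages of this kind are simple residue counts divided by the protein length. The following is a minimal sketch of that calculation, not the method the database necessarily uses; `residue_percent` and `cys_met_summary` are hypothetical helper names, and rounding to one decimal place is an assumption based on how the values are displayed.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions whose one-letter code is in `residues`."""
    seq = "".join(protein.split()).upper()  # drop the spaces between sequence chunks
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues)
    return 100.0 * hits / len(seq)


def cys_met_summary(protein: str) -> dict:
    """Cys, Met and combined Cys+Met percentages, rounded to one decimal."""
    return {
        "Cys": round(residue_percent(protein, "C"), 1),
        "Met": round(residue_percent(protein, "M"), 1),
        "Cys+Met": round(residue_percent(protein, "CM"), 1),
    }
```

To compare against the record, call `cys_met_summary` once with the translated sequence and once with the mature sequence (the same sequence without the initial Met).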
Secondary structure:
>Translated Secondary Structure
MTSERSWTLPISPHELRVLVNGYQSWSEAELRALLDTPQRALFSWMIEQGQDPVFPPSGE
CCCCCCEECCCCHHHHEEEEECCCCHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCC
AGVWRSHTLIALIRPDGGGWVGCLQDARQTFAHWEARAVGGQVELSCTLEGPPAEVAFEE
CCCEECCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCHHH
TEDVITTVETLTARLGTVMQARTPPPLRVWCSWYSYYRDVTLAAMLDNARLARELALPFD
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCE
VFQLDDGFQADLGDWLEPSAWFGGHAKDLPAQLKELGFRAGLWLAPFLVGPRSRLRAQQP
EEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHCCCHHHHCCCCC
EWLLRGADGEPLLAGHNWGGPYHVLDTTHPEVLAWLRDLAATVRGWGYTYLKLDFLYGAA
CCEEECCCCCEEEECCCCCCCEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEEEHHHCC
LPGVRYDPAVSRAAAYRQGVQALRDGAGEETFLLGCGAPLASSIGLVDAMRTGPDVTPFW
CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCC
DDEARRVLLGDGAVPSTRSVLHTALSRWYQHAWYQPDPDVMIARRELSLLGEHERGALLG
CCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHCCCCCCHHHH
LLDVIGGLRASSDPIRLLDEAGRALLRQSLQLSRPDRPRTLTTSYGGAVTHFTRGTFNLL
HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEEEECCCCCEEE
DVPAGGLAPHSYSAAQVGGLQPLLARHQQT
ECCCCCCCCCCCCHHHHCCCHHHHHHCCCC
>Mature Secondary Structure
TSERSWTLPISPHELRVLVNGYQSWSEAELRALLDTPQRALFSWMIEQGQDPVFPPSGE
CCCCCEECCCCHHHHEEEEECCCCHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCC
AGVWRSHTLIALIRPDGGGWVGCLQDARQTFAHWEARAVGGQVELSCTLEGPPAEVAFEE
CCCEECCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCHHH
TEDVITTVETLTARLGTVMQARTPPPLRVWCSWYSYYRDVTLAAMLDNARLARELALPFD
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCE
VFQLDDGFQADLGDWLEPSAWFGGHAKDLPAQLKELGFRAGLWLAPFLVGPRSRLRAQQP
EEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHCCCHHHHCCCCC
EWLLRGADGEPLLAGHNWGGPYHVLDTTHPEVLAWLRDLAATVRGWGYTYLKLDFLYGAA
CCEEECCCCCEEEECCCCCCCEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEEEHHHCC
LPGVRYDPAVSRAAAYRQGVQALRDGAGEETFLLGCGAPLASSIGLVDAMRTGPDVTPFW
CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCC
DDEARRVLLGDGAVPSTRSVLHTALSRWYQHAWYQPDPDVMIARRELSLLGEHERGALLG
CCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHCCCCCCHHHH
LLDVIGGLRASSDPIRLLDEAGRALLRQSLQLSRPDRPRTLTTSYGGAVTHFTRGTFNLL
HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEEEECCCCCEEE
DVPAGGLAPHSYSAAQVGGLQPLLARHQQT
ECCCCCCCCCCCCHHHHCCCHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA