Definition | Deinococcus geothermalis DSM 11300 plasmid pDGEO01, complete sequence. |
---|---|
Accession | NC_008010 |
Length | 574,127 |
The map label for this gene is hutI [H]
Identifier: 94972198
GI number: 94972198
Start: 295489
End: 296724
Strand: Direct
Name: hutI [H]
Synonym: Dgeo_2731
Alternate gene names: 94972198
Gene position: 295489-296724 (Clockwise)
Preceding gene: 94972199
Following gene: 94972197
Centisome position: 51.47
GC content: 71.36
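The positional fields above are related by simple arithmetic. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the replicon length (the record does not state the formula, but this assumption reproduces the listed value), and that coordinates are 1-based and inclusive. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


# Values taken from this record: Start, End, and plasmid Length.
print(gene_length(295489, 296724))            # 1236
print(round(centisome(295489, 574127), 2))    # 51.47
```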
Gene sequence:
>1236_bases ATGGCTGAACTGCTCTTGACTGGCATCACCCAGCTCGTGACGCCCCCGCCGGGACCGCAAAGGGGCGCGGCCATGCGGAA GCTGACGGTGCTGCAGGACGCGGCGCTCCTCATGCGTGACGGGATGATCGCGTGGGTGGGATCACGGCAGGAGGCGCCCG CCGCCGCTCAGATCCGCGACTTGGGCGGCGTCGCCGTCGTTCCCGGCCTGGTCGACCCTCACACCCATGCGGTCTGGGCC GGGGACCGCCTGGCCGATTTCGAGGCACGGGTGGAAGGCGTGCCCTACGAGGAGCTGCTGGCGCGGGGGGGCGGCATCCG CTCCACCATGCAGGCGACGGCAACGGCAGGTGTGGAGGAACTTGCCCAGCTCGCCCACCCCCGCCTAGCGGCCCTGCTCC ATTCCGGCGCGACCACCATCGAGGTCAAGAGCGGCTACGGGCTGGACTTTGGGGCCGAGTTGAGGATGCTGAAGGCGGTG CGTGCGTTGCAGGAGAGCTTGCCGGCCACGCTCGTGCCCACGCTGCTGATTCACGTCCCGCCCACCGAAAGCCGCGCGGC GTACGTCCGGGCGGTCTGTGAGGCCCTCATTCCCGAGGTGGCGCGCAAGCGCCTGGCTGCCGCTGTGGACGTGTTCTGCG AGCGCGAAGCCTTCACGGTGGAGGAAACGCGTGCCCTCTTCGCGGCGGCCCGGTCGAATGGCCTGCAGGTCAAGCTGCAC GCCGACCAGTTCCACGCCCTCGGCGGCACCGAACTTGCCTGCGCGGTGGAGGCGCTCAGCGTGGACCACCTGGAAGCCAG CGGCGAGGCGCAGATCGAGGCGCTGGCCGCGTCGGAGACGGTGGCGACGGTCCTGCCCGGCGTCACGCTGCACCTGGGGC TGAGGGCAGCCCCGGCCCGCCGCCTCGTGGACGCGGGCGCCTGCGTGGCGGTCGGTACGGACCTGAACCCCGGCAGCTCT CCCCTCTTCAGCGCCCAGCTCGCGCTGGCCCTCGCGGTGCGGCTGAACGGCCTCACGCCCGCCGAGGCCCTCACCGCTTG CACCGTGAACGCCGCCGCCGCACTGGGGCTGAGGGACCGGGGGGCACTGGTGGCTGGGCAGCGGGCCGACTTGCTCGCCC TGCATGCCTCCGACTGGCGCGACCTGGCCTACACGCTGGGCGCAAACCCTGTCCGCGACGTGTTCGTGGGCGGGCAAAAC ATCAAGGAGACTCTGAGCAAGGAGAAGGCCCTGTGA
Upstream 100 bases:
>100_bases GGGATGGACCTCGTCGAACTCGCACCCAACCTCGACCCCAGCGGGCGCAGCGCCCTGATCGGCGCGCGGCTGGTGATGGA GACGCTCTGCGAGGCCTTCG
Downstream 100 bases:
>100_bases TCCTAGACCGGCAATTGACGCTTGACGACTTTATCCGCGTGGTGCGTGGCGGCGAGGAGGTGACCCTTGCTGATGCGGCG CGGACACGGATGGGACGAGC
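A GC content figure such as the one listed for this gene can be recomputed from any of the nucleotide sequences above. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over all bases, with the spaces that separate the sequence blocks ignored. The function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G+C in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence in this record (10 of 20 are G or C).
print(gc_content("ATGGCTGAAC TGCTCTTGAC"))  # 50.0
```

Passing the full 1236-base gene sequence instead of the 20-base excerpt gives the value for the whole gene.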
Product: imidazolonepropionase
Products: NA
Alternate protein names: Imidazolone-5-propionate hydrolase [H]
Number of amino acids: Translated: 411; Mature: 410
Protein sequence:
>411_residues MAELLLTGITQLVTPPPGPQRGAAMRKLTVLQDAALLMRDGMIAWVGSRQEAPAAAQIRDLGGVAVVPGLVDPHTHAVWA GDRLADFEARVEGVPYEELLARGGGIRSTMQATATAGVEELAQLAHPRLAALLHSGATTIEVKSGYGLDFGAELRMLKAV RALQESLPATLVPTLLIHVPPTESRAAYVRAVCEALIPEVARKRLAAAVDVFCEREAFTVEETRALFAAARSNGLQVKLH ADQFHALGGTELACAVEALSVDHLEASGEAQIEALAASETVATVLPGVTLHLGLRAAPARRLVDAGACVAVGTDLNPGSS PLFSAQLALALAVRLNGLTPAEALTACTVNAAAALGLRDRGALVAGQRADLLALHASDWRDLAYTLGANPVRDVFVGGQN IKETLSKEKAL
Sequences:
>Translated_411_residues MAELLLTGITQLVTPPPGPQRGAAMRKLTVLQDAALLMRDGMIAWVGSRQEAPAAAQIRDLGGVAVVPGLVDPHTHAVWA GDRLADFEARVEGVPYEELLARGGGIRSTMQATATAGVEELAQLAHPRLAALLHSGATTIEVKSGYGLDFGAELRMLKAV RALQESLPATLVPTLLIHVPPTESRAAYVRAVCEALIPEVARKRLAAAVDVFCEREAFTVEETRALFAAARSNGLQVKLH ADQFHALGGTELACAVEALSVDHLEASGEAQIEALAASETVATVLPGVTLHLGLRAAPARRLVDAGACVAVGTDLNPGSS PLFSAQLALALAVRLNGLTPAEALTACTVNAAAALGLRDRGALVAGQRADLLALHASDWRDLAYTLGANPVRDVFVGGQN IKETLSKEKAL
>Mature_410_residues AELLLTGITQLVTPPPGPQRGAAMRKLTVLQDAALLMRDGMIAWVGSRQEAPAAAQIRDLGGVAVVPGLVDPHTHAVWAG DRLADFEARVEGVPYEELLARGGGIRSTMQATATAGVEELAQLAHPRLAALLHSGATTIEVKSGYGLDFGAELRMLKAVR ALQESLPATLVPTLLIHVPPTESRAAYVRAVCEALIPEVARKRLAAAVDVFCEREAFTVEETRALFAAARSNGLQVKLHA DQFHALGGTELACAVEALSVDHLEASGEAQIEALAASETVATVLPGVTLHLGLRAAPARRLVDAGACVAVGTDLNPGSSP LFSAQLALALAVRLNGLTPAEALTACTVNAAAALGLRDRGALVAGQRADLLALHASDWRDLAYTLGANPVRDVFVGGQNI KETLSKEKAL
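The translated protein follows from the gene sequence codon by codon: 1236 bases give 412 codons, the last of which is a stop codon, leaving 411 residues. The mature form listed here is one residue shorter and lacks the initial methionine. The sketch below illustrates the translation step, assuming the standard codon assignments. The record does not name a translation table, and the `translate` helper is hypothetical rather than the tool used to build this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 12 bases of the gene sequence in this record.
print(translate("ATGGCTGAACTGCTCTTGACT"))  # MAELLLT
print(translate("AAGGCCCTGTGA"))           # KAL
```

Dropping the first residue of the translated sequence (`protein[1:]`) gives the mature form listed above.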
Specific function: Unknown
COG id: COG1228
COG function: function code Q; Imidazolonepropionase and related amidohydrolases
Gene ontology:
Cell location: Cytoplasm (Potential) [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the hutI family [H]
Homologues:
Organism=Homo sapiens, GI223972677, Length=398, Percent_Identity=32.9145728643216, Blast_Score=192, Evalue=6e-49
Organism=Caenorhabditis elegans, GI17555004, Length=405, Percent_Identity=29.3827160493827, Blast_Score=191, Evalue=6e-49
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013108
- InterPro: IPR005920
- InterPro: IPR011059 [H]
Pfam domain/function: PF07969 Amidohydro_3 [H]
EC number: 3.5.2.7 [H]
Molecular weight: Translated: 42854; Mature: 42723
Theoretical pI: Translated: 6.01; Mature: 6.01
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
2.7 %Cys+Met (Translated Protein)
1.2 %Cys (Mature Protein)
1.2 %Met (Mature Protein)
2.4 %Cys+Met (Mature Protein)
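The Cys/Met percentages above can be reproduced by counting residues in the protein sequence. This is a minimal sketch, assuming each percentage is the residue count divided by the sequence length and rounded to one decimal place, and that the mature protein is the translated sequence without its first residue. The `percent` helper is illustrative only.

```python
# Translated protein sequence as listed in this record (411 residues).
TRANSLATED = (
    "MAELLLTGITQLVTPPPGPQRGAAMRKLTVLQDAALLMRDGMIAWVGSRQEAPAAAQIRDLGGVAVVPGLVDPHTHAVWA"
    "GDRLADFEARVEGVPYEELLARGGGIRSTMQATATAGVEELAQLAHPRLAALLHSGATTIEVKSGYGLDFGAELRMLKAV"
    "RALQESLPATLVPTLLIHVPPTESRAAYVRAVCEALIPEVARKRLAAAVDVFCEREAFTVEETRALFAAARSNGLQVKLH"
    "ADQFHALGGTELACAVEALSVDHLEASGEAQIEALAASETVATVLPGVTLHLGLRAAPARRLVDAGACVAVGTDLNPGSS"
    "PLFSAQLALALAVRLNGLTPAEALTACTVNAAAALGLRDRGALVAGQRADLLALHASDWRDLAYTLGANPVRDVFVGGQN"
    "IKETLSKEKAL"
)
MATURE = TRANSLATED[1:]  # assumed: initiator Met removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(f"{label}: {percent(seq, 'C'):.1f} %Cys, "
          f"{percent(seq, 'M'):.1f} %Met, "
          f"{percent(seq, 'CM'):.1f} %Cys+Met")
```

The translated sequence contains 5 Cys and 6 Met; removing the initial Met leaves 5 of each in the mature form, which is why only the Met and Cys+Met figures change between the two.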
Secondary structure:
>Translated Secondary Structure
MAELLLTGITQLVTPPPGPQRGAAMRKLTVLQDAALLMRDGMIAWVGSRQEAPAAAQIRD
CCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHH
LGGVAVVPGLVDPHTHAVWAGDRLADFEARVEGVPYEELLARGGGIRSTMQATATAGVEE
CCCEEEECCCCCCCCCEEECCCCHHHHHHHHCCCCHHHHHHCCCCCHHHHHHHHHHHHHH
LAQLAHPRLAALLHSGATTIEVKSGYGLDFGAELRMLKAVRALQESLPATLVPTLLIHVP
HHHHHCHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHEEEECC
PTESRAAYVRAVCEALIPEVARKRLAAAVDVFCEREAFTVEETRALFAAARSNGLQVKLH
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCEEEEE
ADQFHALGGTELACAVEALSVDHLEASGEAQIEALAASETVATVLPGVTLHLGLRAAPAR
HHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHHHHCCCEEEECCCCCCHH
RLVDAGACVAVGTDLNPGSSPLFSAQLALALAVRLNGLTPAEALTACTVNAAAALGLRDR
HHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCC
GALVAGQRADLLALHASDWRDLAYTLGANPVRDVFVGGQNIKETLSKEKAL
CCEEECCCCCEEEEECCCHHHHHHHCCCCCHHHHHCCCCHHHHHHHHHCCC
>Mature Secondary Structure
AELLLTGITQLVTPPPGPQRGAAMRKLTVLQDAALLMRDGMIAWVGSRQEAPAAAQIRD
CHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHH
LGGVAVVPGLVDPHTHAVWAGDRLADFEARVEGVPYEELLARGGGIRSTMQATATAGVEE
CCCEEEECCCCCCCCCEEECCCCHHHHHHHHCCCCHHHHHHCCCCCHHHHHHHHHHHHHH
LAQLAHPRLAALLHSGATTIEVKSGYGLDFGAELRMLKAVRALQESLPATLVPTLLIHVP
HHHHHCHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHEEEECC
PTESRAAYVRAVCEALIPEVARKRLAAAVDVFCEREAFTVEETRALFAAARSNGLQVKLH
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCEEEEE
ADQFHALGGTELACAVEALSVDHLEASGEAQIEALAASETVATVLPGVTLHLGLRAAPAR
HHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHHHHCCCEEEECCCCCCHH
RLVDAGACVAVGTDLNPGSSPLFSAQLALALAVRLNGLTPAEALTACTVNAAAALGLRDR
HHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCC
GALVAGQRADLLALHASDWRDLAYTLGANPVRDVFVGGQNIKETLSKEKAL
CCEEECCCCCEEEEECCCHHHHHHHCCCCCHHHHHCCCCHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA