Definition | Candidatus Protochlamydia amoebophila UWE25, complete genome. |
---|---|
Accession | NC_005861 |
Length | 2,414,465 |
The map label for this gene is gcpE
Identifier: 46446374
GI number: 46446374
Start: 911982
End: 913946
Strand: Direct
Name: gcpE
Synonym: pc0740
Alternate gene names: 46446374
Gene position: 911982-913946 (Clockwise)
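The coordinates above can be cross-checked against the sequence length and residue count reported elsewhere in this record. The sketch below assumes the coordinates are 1-based and inclusive and that the coding region ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Coordinates as listed in this record (assumed 1-based, inclusive).
START = 911_982
END = 913_946


def gene_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START, END)
print(length_bp)                 # 1965
print(coding_codons(length_bp))  # 654
```

Both values agree with the 1965-base gene sequence and the 654-residue protein listed in this record.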
Preceding gene: 46446371
Following gene: 46446377
Centisome position: 37.77
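The centisome value appears to be the gene start expressed as a percentage of the genome length. A minimal sketch under that assumption, using the start coordinate and genome length given in this record (the function name is illustrative):

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length, to 2 decimals."""
    return round(100.0 * start / genome_length, 2)


# Start coordinate and genome length as listed in this record.
print(centisome(911_982, 2_414_465))  # 37.77
```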
GC content: 39.69
Gene sequence:
>1965_bases ATGGTCCAAAAAAAATATTGTGAGGCGATTCATCAAACAGAGAGGCGCCCGACGCGCATTGTTAATGTGGGAAATGTTGG CATTGGAGGCAACCATCCTATTCGTATCCAATCCATGACAACATCCAGTACACGAGATGTGGAAGCGACAATTGAACAAG TCATCCGTTTGGCTGATCAGGGATGCGAAATTGTTCGCGTGACAGTCCAGGGAATTAAAGAAGCGGATGCGTGTGAACAT ATCAAAAAGGGGTTGATTAAACGTGGTTATCAAATTCCTCTTGTGGCGGATATCCATTTTTATCCTCCTGCAGCCATGCG TGTTGTGGATTTTGTAGACAAAGTACGCATTAATCCGGGAAATTTTGTCGATAAACGTGCGAGCTTTAAGCAAATTGTTT ATGATGATGAGTCTTATGCCAGAGAAATTGAGAGGATTGAAGAAAAATTTACTCCCTTAGTGGAAAAGTGTAAACGGTTA AACCGTGCTATGAGAATTGGAACCAATCATGGCTCATTATCCGATCGAATCATGAACCGATATGGAGATACTCCTTTTGG AATGGTCGAGTCAGCTCTGGAATTTGCTCGCATTTGTCGAAAAAATGATTATCATAACTTTCTTTTTTCTATGAAAGCTT CGAATCCACAAGTCATGATTCAAGCTTATCGTTTATTGACGCAAGCCATGTATGCATTGGAGTGGGATTATCCCTTACAC TTAGGTGTGACAGAAGCCGGAGAAGGGGAAGATGGGCGAATCAAATCTGCGATGGGAATTGGATCTCTTTTGATTGATGG AATTGGGGATACCATTCGTGTTTCGTTAACAGAAGATCCTTGGCACGAGATAAATCCTTGCCAACGATTGATTAAACTGG CTTCTGCTTATCAACAGCAAGGTGTGGCTCCTTTTATAGAAAATTATCGTCAGATAGAAGCAATCGAACGACGTCAAGTG CACTTATCTTCAACCGTTCCCATGCATCGTGACGGAACAGTTTTTATTTCGTTACCGATTAATATGCTTAAAGAGGCTTC TCTCTATCAACAAATTGGTTGTGAAGGCCCTTTTGGGAAACCTAAATTAAAGACTGCCACAGCAGATAATTTAGTTTTAA AAAATCCAAATTCTGACTCTGAAGAAAAACGGCAGCTTCAAATTTTAAAAGATTTAGGAATTGGCCTTTTTTCAAAAGAT CCTTTTGAAATGAGTTTGGTTATTCACCCGTTAAAAAAATGGCTCCAGTCCCGTGCAGTGGATTCTTTTGCTTCCCGTTT TTCTTCATCATGGGCTAAATCCGCTGGGCAGCCCTTGATCATTCAAATAACTGATGAAACTGAAAAAGAGTGGAAAGAAG TTATTTCTTTAAAACCACAATTAATTATTTTATCTCCTTCAACTAATCGCTTACACTATTCAAGACAATTTTTTGAGTGG CTACAGCAAAATCAATTAAATTATCCAGTCATTTTAAACTTCACTTATCAAGGAGAAAATGAAGATACCATTCTTTTAGC TAGTATGGAATGTGGATCCCTTTTATGTGATGGACTTGGAGAAGGAGTTTGGTTAGAAGGTCCTTATGATATTCTTTTTC TTCGTCAATTGAGTTTTTCCATTTTACAGGCGGCTCGCCTTCGCATGTCTAAAACTGATTTTATTTCTTGCCCAAGCTGT GGACGGACATTATTTAATCTTCAAGATGTCACTAAACGAATTCAGTCTCGTACCTCTCATTTACCGGGCGTTAAAATTGC GATTATGGGATGCATTGTTAACGGGCCGGGGGAAATGGCCGATGCAGATTTTGGGTACGTTGGATCTAAGCCTGGTAAAA TCGATCTTTACGTTGGGAAAGAATGCGTGGAAAAAGATATCGATTTTGCAGATGCAGATGATCGTCTTGTCAATTTAATC 
AGAGCACATGGTCGTTGGATAGAACCTCAAACGGTCAATGCATAA
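The GC content figure for this gene can be recomputed from the sequence above. The following is a sketch of the usual definition (percentage of G and C bases); it strips whitespace first because the record breaks the sequence into space-separated blocks. The function name is illustrative, and only the first ten bases of the gene are used as a demonstration.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence, used as a short excerpt.
excerpt = "ATGGTCCAAA"
print(round(gc_content(excerpt), 2))  # 40.0
```

Passing the full 1965-base sequence to `gc_content` should give a value comparable to the figure listed in this record.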
Upstream 100 bases:
>100_bases AAAGATTCCCTTGGGATAAAGTCATTGGCATTTTGGCTAACTTCTGTTATATTTTTTTTCTTAGAGAACGAAAAGTTTTA AACTTGAAAAGGTGTTCGTC
Downstream 100 bases:
>100_bases CATTCGCCATCGATGTTAATTTAACGATGAGCATTTATGGGGTTCCCAGGCAGTTTCAGGCACATTTAAGTGAAAATTGA CAAATCTTGACACAACAAAT
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Products: NA
Alternate protein names: 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
Number of amino acids: Translated: 654; Mature: 654
Protein sequence:
>654_residues MVQKKYCEAIHQTERRPTRIVNVGNVGIGGNHPIRIQSMTTSSTRDVEATIEQVIRLADQGCEIVRVTVQGIKEADACEH IKKGLIKRGYQIPLVADIHFYPPAAMRVVDFVDKVRINPGNFVDKRASFKQIVYDDESYAREIERIEEKFTPLVEKCKRL NRAMRIGTNHGSLSDRIMNRYGDTPFGMVESALEFARICRKNDYHNFLFSMKASNPQVMIQAYRLLTQAMYALEWDYPLH LGVTEAGEGEDGRIKSAMGIGSLLIDGIGDTIRVSLTEDPWHEINPCQRLIKLASAYQQQGVAPFIENYRQIEAIERRQV HLSSTVPMHRDGTVFISLPINMLKEASLYQQIGCEGPFGKPKLKTATADNLVLKNPNSDSEEKRQLQILKDLGIGLFSKD PFEMSLVIHPLKKWLQSRAVDSFASRFSSSWAKSAGQPLIIQITDETEKEWKEVISLKPQLIILSPSTNRLHYSRQFFEW LQQNQLNYPVILNFTYQGENEDTILLASMECGSLLCDGLGEGVWLEGPYDILFLRQLSFSILQAARLRMSKTDFISCPSC GRTLFNLQDVTKRIQSRTSHLPGVKIAIMGCIVNGPGEMADADFGYVGSKPGKIDLYVGKECVEKDIDFADADDRLVNLI RAHGRWIEPQTVNA
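The protein sequence above corresponds to a direct translation of the gene sequence. The sketch below builds the standard genetic code table and translates a short excerpt; it assumes the standard table applies and that translation stops at the first stop codon. The helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTCCAAAAAAAATATTGTGAGGCGATT"))  # MVQKKYCEAI
# Last 15 bases, ending in the TAA stop codon.
print(translate("ACGGTCAATGCATAA"))                 # TVNA
```

The two outputs match the first ten and last four residues of the listed protein.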
Sequences:
>Translated_654_residues MVQKKYCEAIHQTERRPTRIVNVGNVGIGGNHPIRIQSMTTSSTRDVEATIEQVIRLADQGCEIVRVTVQGIKEADACEH IKKGLIKRGYQIPLVADIHFYPPAAMRVVDFVDKVRINPGNFVDKRASFKQIVYDDESYAREIERIEEKFTPLVEKCKRL NRAMRIGTNHGSLSDRIMNRYGDTPFGMVESALEFARICRKNDYHNFLFSMKASNPQVMIQAYRLLTQAMYALEWDYPLH LGVTEAGEGEDGRIKSAMGIGSLLIDGIGDTIRVSLTEDPWHEINPCQRLIKLASAYQQQGVAPFIENYRQIEAIERRQV HLSSTVPMHRDGTVFISLPINMLKEASLYQQIGCEGPFGKPKLKTATADNLVLKNPNSDSEEKRQLQILKDLGIGLFSKD PFEMSLVIHPLKKWLQSRAVDSFASRFSSSWAKSAGQPLIIQITDETEKEWKEVISLKPQLIILSPSTNRLHYSRQFFEW LQQNQLNYPVILNFTYQGENEDTILLASMECGSLLCDGLGEGVWLEGPYDILFLRQLSFSILQAARLRMSKTDFISCPSC GRTLFNLQDVTKRIQSRTSHLPGVKIAIMGCIVNGPGEMADADFGYVGSKPGKIDLYVGKECVEKDIDFADADDRLVNLI RAHGRWIEPQTVNA
>Mature_654_residues MVQKKYCEAIHQTERRPTRIVNVGNVGIGGNHPIRIQSMTTSSTRDVEATIEQVIRLADQGCEIVRVTVQGIKEADACEH IKKGLIKRGYQIPLVADIHFYPPAAMRVVDFVDKVRINPGNFVDKRASFKQIVYDDESYAREIERIEEKFTPLVEKCKRL NRAMRIGTNHGSLSDRIMNRYGDTPFGMVESALEFARICRKNDYHNFLFSMKASNPQVMIQAYRLLTQAMYALEWDYPLH LGVTEAGEGEDGRIKSAMGIGSLLIDGIGDTIRVSLTEDPWHEINPCQRLIKLASAYQQQGVAPFIENYRQIEAIERRQV HLSSTVPMHRDGTVFISLPINMLKEASLYQQIGCEGPFGKPKLKTATADNLVLKNPNSDSEEKRQLQILKDLGIGLFSKD PFEMSLVIHPLKKWLQSRAVDSFASRFSSSWAKSAGQPLIIQITDETEKEWKEVISLKPQLIILSPSTNRLHYSRQFFEW LQQNQLNYPVILNFTYQGENEDTILLASMECGSLLCDGLGEGVWLEGPYDILFLRQLSFSILQAARLRMSKTDFISCPSC GRTLFNLQDVTKRIQSRTSHLPGVKIAIMGCIVNGPGEMADADFGYVGSKPGKIDLYVGKECVEKDIDFADADDRLVNLI RAHGRWIEPQTVNA
Specific function: Converts 2C-methyl-D-erythritol 2,4-cyclodiphosphate (ME-2,4cPP) into 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate
COG id: COG0821
COG function: function code I; Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ispG family
Homologues:
Organism=Escherichia coli, GI1788863, Length=273, Percent_Identity=41.025641025641, Blast_Score=177, Evalue=2e-45,
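The long decimal in Percent_Identity is what results from dividing a count of identical positions by the alignment length. The sketch below assumes that Length=273 is the alignment length and that 112 positions were identical; the figure 112 is not stated in this record and is inferred only because 112/273 reproduces the listed value. The function name is illustrative.

```python
def percent_identity(identical: int, aligned_length: int) -> float:
    """Identical positions as a percentage of the alignment length."""
    return 100.0 * identical / aligned_length


# 112 identical positions is an assumption, not a value from the record.
print(round(percent_identity(112, 273), 6))  # 41.025641
```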
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): ISPG_PARUW (Q6MD85)
Other databases:
- EMBL: BX908798
- RefSeq: YP_007739.1
- ProteinModelPortal: Q6MD85
- STRING: Q6MD85
- GeneID: 2780947
- GenomeReviews: BX908798_GR
- KEGG: pcu:pc0740
- NMPDR: fig|264201.1.peg.740
- eggNOG: COG0821
- HOGENOM: HBG335271
- OMA: SMRIGTN
- PhylomeDB: Q6MD85
- BioCyc: CPRO264201:PC0740-MONOMER
- BioCyc: PCHL-E25-01:PCHL-E25-01-000740-MONOMER
- HAMAP: MF_00159
- InterPro: IPR017178
- InterPro: IPR004588
- PIRSF: PIRSF037336
- TIGRFAMs: TIGR00612
Pfam domain/function: PF04551 GcpE
EC number: 1.17.7.1
Molecular weight: Translated: 73896; Mature: 73896
Theoretical pI: Translated: 7.32; Mature: 7.32
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
4.6 %Cys+Met (Translated Protein)
2.0 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
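The Cys/Met percentages are residue counts divided by protein length. A minimal sketch of that calculation, assuming one-letter amino-acid codes and rounding to one decimal as in the record; the function name is illustrative, and the demonstration uses only the first ten residues of the protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` that are one of `residues`."""
    protein = "".join(protein.split()).upper()  # drop spaces and line breaks
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)


# First ten residues of the protein sequence, used as a short excerpt.
excerpt = "MVQKKYCEAI"
print(residue_percent(excerpt, "C"))   # 10.0
print(residue_percent(excerpt, "M"))   # 10.0
print(residue_percent(excerpt, "CM"))  # 20.0
```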
Secondary structure:
>Translated Secondary Structure MVQKKYCEAIHQTERRPTRIVNVGNVGIGGNHPIRIQSMTTSSTRDVEATIEQVIRLADQ CCCHHHHHHHHHCCCCCCEEEEECCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHC GCEIVRVTVQGIKEADACEHIKKGLIKRGYQIPLVADIHFYPPAAMRVVDFVDKVRINPG CCCEEEEEHHHCCCHHHHHHHHHHHHHCCCCCEEEEEEEECCHHHHHHHHHHHHHCCCCC NFVDKRASFKQIVYDDESYAREIERIEEKFTPLVEKCKRLNRAMRIGTNHGSLSDRIMNR CCHHHHHHHHHHEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH YGDTPFGMVESALEFARICRKNDYHNFLFSMKASNPQVMIQAYRLLTQAMYALEWDYPLH HCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCEE LGVTEAGEGEDGRIKSAMGIGSLLIDGIGDTIRVSLTEDPWHEINPCQRLIKLASAYQQQ ECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHC GVAPFIENYRQIEAIERRQVHLSSTVPMHRDGTVFISLPINMLKEASLYQQIGCEGPFGK CCCHHHHHHHHHHHHHHHHEEHHCCCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCCCCC PKLKTATADNLVLKNPNSDSEEKRQLQILKDLGIGLFSKDPFEMSLVIHPLKKWLQSRAV CCCEEECCCCEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHH DSFASRFSSSWAKSAGQPLIIQITDETEKEWKEVISLKPQLIILSPSTNRLHYSRQFFEW HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHH LQQNQLNYPVILNFTYQGENEDTILLASMECGSLLCDGLGEGVWLEGPYDILFLRQLSFS HHHCCCCCCEEEEEEECCCCCCEEEEEECCHHHHHHHHCCCCEEECCCHHHHHHHHHHHH ILQAARLRMSKTDFISCPSCGRTLFNLQDVTKRIQSRTSHLPGVKIAIMGCIVNGPGEMA HHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCC DADFGYVGSKPGKIDLYVGKECVEKDIDFADADDRLVNLIRAHGRWIEPQTVNA CCCCCCCCCCCCEEEEEECHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCC >Mature Secondary Structure MVQKKYCEAIHQTERRPTRIVNVGNVGIGGNHPIRIQSMTTSSTRDVEATIEQVIRLADQ CCCHHHHHHHHHCCCCCCEEEEECCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHC GCEIVRVTVQGIKEADACEHIKKGLIKRGYQIPLVADIHFYPPAAMRVVDFVDKVRINPG CCCEEEEEHHHCCCHHHHHHHHHHHHHCCCCCEEEEEEEECCHHHHHHHHHHHHHCCCCC NFVDKRASFKQIVYDDESYAREIERIEEKFTPLVEKCKRLNRAMRIGTNHGSLSDRIMNR CCHHHHHHHHHHEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH YGDTPFGMVESALEFARICRKNDYHNFLFSMKASNPQVMIQAYRLLTQAMYALEWDYPLH HCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCEE LGVTEAGEGEDGRIKSAMGIGSLLIDGIGDTIRVSLTEDPWHEINPCQRLIKLASAYQQQ ECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHC 
GVAPFIENYRQIEAIERRQVHLSSTVPMHRDGTVFISLPINMLKEASLYQQIGCEGPFGK CCCHHHHHHHHHHHHHHHHEEHHCCCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCCCCC PKLKTATADNLVLKNPNSDSEEKRQLQILKDLGIGLFSKDPFEMSLVIHPLKKWLQSRAV CCCEEECCCCEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHH DSFASRFSSSWAKSAGQPLIIQITDETEKEWKEVISLKPQLIILSPSTNRLHYSRQFFEW HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHH LQQNQLNYPVILNFTYQGENEDTILLASMECGSLLCDGLGEGVWLEGPYDILFLRQLSFS HHHCCCCCCEEEEEEECCCCCCEEEEEECCHHHHHHHHCCCCEEECCCHHHHHHHHHHHH ILQAARLRMSKTDFISCPSCGRTLFNLQDVTKRIQSRTSHLPGVKIAIMGCIVNGPGEMA HHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCC DADFGYVGSKPGKIDLYVGKECVEKDIDFADADDRLVNLIRAHGRWIEPQTVNA CCCCCCCCCCCCEEEEEECHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA