Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
The map label for this gene is shc
Identifier: 16519641
GI number: 16519641
Start: 56458
End: 58401
Strand: Direct
Name: shc
Synonym: NGR_a00460
Alternate gene names: 16519641
Gene position: 56458-58401 (Clockwise)
Preceding gene: 16519642
Following gene: 16520036
Centisome position: 10.53
GC content: 60.7%
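The positional fields above are linked by simple arithmetic. The following is a minimal sketch that recomputes the gene length and the centisome position from the listed coordinates. The centisome formula used here (start coordinate as a percentage of the replicon length) is an assumption that happens to reproduce the listed value of 10.53; the function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of replicon length (assumed definition)."""
    return round(100.0 * start / replicon_length, 2)


# Values taken from the record: Start 56458, End 58401, plasmid length 536,165.
print(gene_length(56458, 58401))   # 1944
print(centisome(56458, 536165))    # 10.53
```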
Gene sequence:
>1944_bases
GTGAATAAGCATTCCGGAAATCGCACTGCCATCGATCCCGCCGCGCTGGAAATGAGCATCGCCTCAGCCACTGAGGCGCT
GCTCGCTTATCGTCATGCCGACGGCCATTGGGCGTTCGAGCTTGAAGCGGATTCCACCATCCCTTCCGAATACATCCTGC
TACGTCATTACCTGGCCGAGCCCATTGACGTCGTGCTCGAAGCCAAAATCGGAAATTATCTGCGCCGCACTCAAGGCGCG
CACGGCGGCTGGCCGCTGGTGCATGACGGGCCCTTCGATATGAGCGCAAGCGTGAAGTCTTACTTCGCGCTCAAGATGAT
CGGTGATTCCGTCGACGCGGCTCATATGGTGAAGGCGCGCGAGGCGATCCGCGCGCGCGGCGGCGCCGCCAACAGCAACG
TACTCACGCGCTTCCTGCTCGCGCTCTATGGCGTAGTTAGCTGGCGCGCGGTGCCGGTTCTGCCAATCGAGATCGTGCTA
CTGCCAATCTGGTCGCCGTTCCACCTCTACAAGATCTCCTACTGGGCGCGCACCACTATTGTGCCGCTGATGGTTCTTGC
AGTGCTGAAGCCGCGTGCGAAGAATCCTAAGGGCGTCGGCATCGAAGAACTGTTCCTTCAGGATACCAAGAGCGTAGGCA
TGAACCCGAAGGCGCCGCACCAGAGCTGGGGCTGGTTCTTGCTGTTCCGCGGCATCGACGGCATCCTGCGGGTTATTGAG
CCGCATCTTCCGAAGAAACTTCGTGAGCGCGCGATCGCGAGTGCGCTCGCCTTCACTGAAGAGCGGCTCAACGGTGAAGA
TGGCATGGGCGCGATCTATCCGTCGATGGCCAATATCGTGATGATGTATGACGCGCTCGGTAAAGACGATCATTTTCCGC
CGCGCGCGATAGCACGTCGCGCTATTGACAAGCTCTTGGTGATCGGCGAGGAAGAGGCCTATTGCCAGCCCTGCCTGTCG
CCGGTCTGGGACACCGCGCTGACGTGCCACGCACTTCAGGAGGTGGGCGGCGCTAACGCCGTGGCGAAAGCGAAGCAGGG
CCTGGACTGGCTGAAGCCGCGGCAGGTACTGGACGTGAAGGGCGACTGGGCAGTGAAGGCGCCCAACATCAGGCCCGGCG
GCTGGCCGTTTCAATACAACAACGCGCACTATCCCGATCTCGATGACACTGCAGTGGTGGTCATGGCCATGGACCGCGCG
CAGCGGCACGCCGGCAGCAAGGAATACGCCACCGCGATCGCACGTGGCCGGGAATGGATCGAAGGCATGCAAAGCCGGGA
TGGCGGCTGGGCCGCCTTCGACGTCAACAATCTCGAATACTACCTGAATAACCTCCCATTCGCGGACCATGGCGCGCTGC
TCGACCCGCCGACCGAAGACGTTACCGCACGGTGCGTCTCGATGTTGGCCCAGGTCGGCGAATTCACCCAAAGGAGCAAG
GCAGTCGCCGAGGGGATTGCATATTTGCGCCGCACCCAACACGCGGAAGGATCATGGTACGGCCGCTGGGGCCTGAACTA
CATCTACGGTACCTGGTCGGTGTTGTGCGCGCTGAATGCCGCAGGGATCGATCACCAAGACCCCATGATAAGGAAGGCGG
TGGAATGGCTAGTATCCATCCAGAGTTGGGACGGCGGTTGGGGCGAGGATGCAATCAGCTACCGACTCGACTATAGCGGA
TATGAGCAGGCGCCTTCCACGTCCTCCCAAACGGCGTGGGCCTTGCTTGGACTGATGGCGGCCGGCGAGGTGGAGCATCC
GGCCGTCGCACGCGGGGTGAACTACCTAAAAAACGCACAAACCGAGAACGGTCTGTGGGATGAGCAGCGCTACACCGCCA
CGGGTTTTCCGCGGGTGTTTTATTTGCGATATCACGGCTACTCCAAGTTCTTTCCGCTCTGGGCATTAGCACGGTACCGG
AACTTGAGGAGCACGAACGTTTGA
Upstream 100 bases:
>100_bases
GCGGTCGGGTAACCGTGCCGCCGATCTGGCTCTTTCAGAGACAAACACTTGATTGATAAATCCAGACGGTTTTGCACGCT
GCCGAACACGCAGGAAGTCA
Downstream 100 bases:
>100_bases
TGAGAAGAGTTTGGTCGTTAGGAGAACTCGTGAGGGTGAATGTCGACTCGATTGCATTTAAGTTTTCACCTGCGCCCATT
TGCCTGACTTTATGGAGGTG
Product: squalene-hopene cyclase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 647; Mature: 647
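The residue count follows from the gene length: 1944 bases make 648 codons, the last of which (TGA) is a stop, leaving 647 residues. The gene also begins with GTG rather than ATG, yet the protein begins with M, which is consistent with GTG acting as an initiation codon. The sketch below illustrates both points with a hand-built standard codon table. The function name, the choice of start codons (a common bacterial subset), and the shortened test sequence (the first seven codons of the gene joined directly to its stop codon) are illustrative assumptions, not part of the record.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumed subset of bacterial initiation codons; all are read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is translated as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene followed by its TGA stop codon (truncated example).
print(translate_cds("GTGAATAAGCATTCCGGAAATTGA"))  # MNKHSGN
print(1944 // 3 - 1)                               # 647
```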
Protein sequence:
>647_residues
MNKHSGNRTAIDPAALEMSIASATEALLAYRHADGHWAFELEADSTIPSEYILLRHYLAEPIDVVLEAKIGNYLRRTQGA
HGGWPLVHDGPFDMSASVKSYFALKMIGDSVDAAHMVKAREAIRARGGAANSNVLTRFLLALYGVVSWRAVPVLPIEIVL
LPIWSPFHLYKISYWARTTIVPLMVLAVLKPRAKNPKGVGIEELFLQDTKSVGMNPKAPHQSWGWFLLFRGIDGILRVIE
PHLPKKLRERAIASALAFTEERLNGEDGMGAIYPSMANIVMMYDALGKDDHFPPRAIARRAIDKLLVIGEEEAYCQPCLS
PVWDTALTCHALQEVGGANAVAKAKQGLDWLKPRQVLDVKGDWAVKAPNIRPGGWPFQYNNAHYPDLDDTAVVVMAMDRA
QRHAGSKEYATAIARGREWIEGMQSRDGGWAAFDVNNLEYYLNNLPFADHGALLDPPTEDVTARCVSMLAQVGEFTQRSK
AVAEGIAYLRRTQHAEGSWYGRWGLNYIYGTWSVLCALNAAGIDHQDPMIRKAVEWLVSIQSWDGGWGEDAISYRLDYSG
YEQAPSTSSQTAWALLGLMAAGEVEHPAVARGVNYLKNAQTENGLWDEQRYTATGFPRVFYLRYHGYSKFFPLWALARYR
NLRSTNV
Sequences:
>Translated_647_residues
MNKHSGNRTAIDPAALEMSIASATEALLAYRHADGHWAFELEADSTIPSEYILLRHYLAEPIDVVLEAKIGNYLRRTQGA
HGGWPLVHDGPFDMSASVKSYFALKMIGDSVDAAHMVKAREAIRARGGAANSNVLTRFLLALYGVVSWRAVPVLPIEIVL
LPIWSPFHLYKISYWARTTIVPLMVLAVLKPRAKNPKGVGIEELFLQDTKSVGMNPKAPHQSWGWFLLFRGIDGILRVIE
PHLPKKLRERAIASALAFTEERLNGEDGMGAIYPSMANIVMMYDALGKDDHFPPRAIARRAIDKLLVIGEEEAYCQPCLS
PVWDTALTCHALQEVGGANAVAKAKQGLDWLKPRQVLDVKGDWAVKAPNIRPGGWPFQYNNAHYPDLDDTAVVVMAMDRA
QRHAGSKEYATAIARGREWIEGMQSRDGGWAAFDVNNLEYYLNNLPFADHGALLDPPTEDVTARCVSMLAQVGEFTQRSK
AVAEGIAYLRRTQHAEGSWYGRWGLNYIYGTWSVLCALNAAGIDHQDPMIRKAVEWLVSIQSWDGGWGEDAISYRLDYSG
YEQAPSTSSQTAWALLGLMAAGEVEHPAVARGVNYLKNAQTENGLWDEQRYTATGFPRVFYLRYHGYSKFFPLWALARYR
NLRSTNV
>Mature_647_residues
MNKHSGNRTAIDPAALEMSIASATEALLAYRHADGHWAFELEADSTIPSEYILLRHYLAEPIDVVLEAKIGNYLRRTQGA
HGGWPLVHDGPFDMSASVKSYFALKMIGDSVDAAHMVKAREAIRARGGAANSNVLTRFLLALYGVVSWRAVPVLPIEIVL
LPIWSPFHLYKISYWARTTIVPLMVLAVLKPRAKNPKGVGIEELFLQDTKSVGMNPKAPHQSWGWFLLFRGIDGILRVIE
PHLPKKLRERAIASALAFTEERLNGEDGMGAIYPSMANIVMMYDALGKDDHFPPRAIARRAIDKLLVIGEEEAYCQPCLS
PVWDTALTCHALQEVGGANAVAKAKQGLDWLKPRQVLDVKGDWAVKAPNIRPGGWPFQYNNAHYPDLDDTAVVVMAMDRA
QRHAGSKEYATAIARGREWIEGMQSRDGGWAAFDVNNLEYYLNNLPFADHGALLDPPTEDVTARCVSMLAQVGEFTQRSK
AVAEGIAYLRRTQHAEGSWYGRWGLNYIYGTWSVLCALNAAGIDHQDPMIRKAVEWLVSIQSWDGGWGEDAISYRLDYSG
YEQAPSTSSQTAWALLGLMAAGEVEHPAVARGVNYLKNAQTENGLWDEQRYTATGFPRVFYLRYHGYSKFFPLWALARYR
NLRSTNV
Specific function: Catalyzes the cyclization of squalene into hopene. Probably part of the y4aABCD operon, which is involved in the synthesis of an isoprenoid compound.
COG id: COG1657
COG function: function code I; Squalene cyclase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 3 PFTB repeats
Homologues:
Organism=Homo sapiens, GI47933395, Length=647, Percent_Identity=26.4296754250386, Blast_Score=182, Evalue=7e-46
Organism=Homo sapiens, GI47933397, Length=647, Percent_Identity=26.4296754250386, Blast_Score=182, Evalue=7e-46
Organism=Homo sapiens, GI224177558, Length=647, Percent_Identity=26.4296754250386, Blast_Score=182, Evalue=8e-46
Organism=Homo sapiens, GI224177556, Length=647, Percent_Identity=25.6568778979907, Blast_Score=169, Evalue=1e-41
Organism=Saccharomyces cerevisiae, GI6321863, Length=164, Percent_Identity=34.1463414634146, Blast_Score=94, Evalue=7e-20
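The hit list above repeats a fixed set of fields for each homologue, so it can be split into structured records with a single regular expression. This is a minimal sketch under the assumption that every hit follows the field order seen here (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue); the `Hit` type and `parse_hits` function are hypothetical names introduced for illustration.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One hit = six comma-separated fields in the order used by the record.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue hit found in a block of text."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_PATTERN.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI47933395, Length=647, "
    "Percent_Identity=26.4296754250386, Blast_Score=182, Evalue=7e-46, "
    "Organism=Saccharomyces cerevisiae, GI6321863, Length=164, "
    "Percent_Identity=34.1463414634146, Blast_Score=94, Evalue=7e-20,"
)
for hit in parse_hits(sample):
    print(hit.organism, hit.gi, round(hit.percent_identity, 1), hit.evalue)
```

Because the pattern tolerates any whitespace after each comma, it works whether the hits are run together on one line or listed one per line.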
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): SQHC_RHISN (P55348)
Other databases:
- EMBL: U00090
- RefSeq: NP_443761.1
- ProteinModelPortal: P55348
- GeneID: 962250
- GenomeReviews: U00090_GR
- KEGG: rhi:NGR_a00460
- HOGENOM: HBG535807
- ProtClustDB: CLSK2393506
- InterPro: IPR006400
- InterPro: IPR001330
- InterPro: IPR018333
- InterPro: IPR002365
- InterPro: IPR008930
- TIGRFAMs: TIGR01507
- TIGRFAMs: TIGR01787
Pfam domain/function: PF00432 Prenyltrans; SSF48239 Terp_cyc_toroid
EC number: 5.4.99.17
Molecular weight: Translated: 72131; Mature: 72131
Theoretical pI: Translated: 7.18; Mature: 7.18
Prosite motif: PS01074 TERPENE_SYNTHASES
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
3.4 %Cys+Met (Mature Protein)
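The Cys/Met figures above are residue counts expressed as a percentage of protein length. A minimal sketch of that calculation follows; rounding to one decimal place is an assumption based on how the record prints the values, and `residue_percent` is an illustrative name. The toy sequence is invented for the example and is not the shc protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("protein sequence is empty")
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)


toy = "MACKM"  # invented five-residue example: 2 Met, 1 Cys
print(residue_percent(toy, "C"))   # 20.0
print(residue_percent(toy, "M"))   # 40.0
print(residue_percent(toy, "CM"))  # 60.0
```

Applied to the 647-residue sequence in this record, the same function would be called with the full protein string in place of `toy`.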
Secondary structure:
>Translated Secondary Structure
MNKHSGNRTAIDPAALEMSIASATEALLAYRHADGHWAFELEADSTIPSEYILLRHYLAE
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHEECCCCEEEEEECCCCCCHHHHHHHHHHHC
PIDVVLEAKIGNYLRRTQGAHGGWPLVHDGPFDMSASVKSYFALKMIGDSVDAAHMVKAR
HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHH
EAIRARGGAANSNVLTRFLLALYGVVSWRAVPVLPIEIVLLPIWSPFHLYKISYWARTTI
HHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHEEEHHHHHHHH
VPLMVLAVLKPRAKNPKGVGIEELFLQDTKSVGMNPKAPHQSWGWFLLFRGIDGILRVIE
HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEEEHHHHHHHHHHC
PHLPKKLRERAIASALAFTEERLNGEDGMGAIYPSMANIVMMYDALGKDDHFPPRAIARR
CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHH
AIDKLLVIGEEEAYCQPCLSPVWDTALTCHALQEVGGANAVAKAKQGLDWLKPRQVLDVK
HHHHHEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCHHCCCCCEEECC
GDWAVKAPNIRPGGWPFQYNNAHYPDLDDTAVVVMAMDRAQRHAGSKEYATAIARGREWI
CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHHHCCCHHHHHHHHHHHHHH
EGMQSRDGGWAAFDVNNLEYYLNNLPFADHGALLDPPTEDVTARCVSMLAQVGEFTQRSK
HHHHCCCCCEEEEECCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
AVAEGIAYLRRTQHAEGSWYGRWGLNYIYGTWSVLCALNAAGIDHQDPMIRKAVEWLVSI
HHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH
QSWDGGWGEDAISYRLDYSGYEQAPSTSSQTAWALLGLMAAGEVEHPAVARGVNYLKNAQ
HCCCCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC
TENGLWDEQRYTATGFPRVFYLRYHGYSKFFPLWALARYRNLRSTNV
CCCCCCCCCCEEECCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure
MNKHSGNRTAIDPAALEMSIASATEALLAYRHADGHWAFELEADSTIPSEYILLRHYLAE
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHEECCCCEEEEEECCCCCCHHHHHHHHHHHC
PIDVVLEAKIGNYLRRTQGAHGGWPLVHDGPFDMSASVKSYFALKMIGDSVDAAHMVKAR
HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHH
EAIRARGGAANSNVLTRFLLALYGVVSWRAVPVLPIEIVLLPIWSPFHLYKISYWARTTI
HHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHEEEHHHHHHHH
VPLMVLAVLKPRAKNPKGVGIEELFLQDTKSVGMNPKAPHQSWGWFLLFRGIDGILRVIE
HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEEEHHHHHHHHHHC
PHLPKKLRERAIASALAFTEERLNGEDGMGAIYPSMANIVMMYDALGKDDHFPPRAIARR
CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHH
AIDKLLVIGEEEAYCQPCLSPVWDTALTCHALQEVGGANAVAKAKQGLDWLKPRQVLDVK
HHHHHEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCHHCCCCCEEECC
GDWAVKAPNIRPGGWPFQYNNAHYPDLDDTAVVVMAMDRAQRHAGSKEYATAIARGREWI
CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHHHCCCHHHHHHHHHHHHHH
EGMQSRDGGWAAFDVNNLEYYLNNLPFADHGALLDPPTEDVTARCVSMLAQVGEFTQRSK
HHHHCCCCCEEEEECCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
AVAEGIAYLRRTQHAEGSWYGRWGLNYIYGTWSVLCALNAAGIDHQDPMIRKAVEWLVSI
HHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH
QSWDGGWGEDAISYRLDYSGYEQAPSTSSQTAWALLGLMAAGEVEHPAVARGVNYLKNAQ
HCCCCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC
TENGLWDEQRYTATGFPRVFYLRYHGYSKFFPLWALARYRNLRSTNV
CCCCCCCCCCEEECCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424