| Definition | Acidiphilium cryptum JF-5 plasmid pACRY04, complete sequence. |
|---|---|
| Accession | NC_009470 |
| Length | 37,415 |
The map label for this gene is ecoRIIR [H]
Identifier: 148244083
GI number: 148244083
Start: 31098
End: 32336
Strand: Reverse
Name: ecoRIIR [H]
Synonym: Acry_3581
Alternate gene names: 148244083
Gene position: 32336-31098 (Counterclockwise)
Preceding gene: 148244084
Following gene: 148244080
Centisome position: 86.43
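The coordinate fields above are mutually consistent, which a short calculation can confirm. This is a minimal Python sketch, not the database's own method. The function names are hypothetical, and it assumes the centisome position is the gene's first transcribed coordinate (32336 on the reverse strand) expressed as a percentage of the 37,415-base plasmid; that assumption reproduces the listed 86.43.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive, 1-based coordinates."""
    return end - start + 1


def centisome(position: int, replicon_length: int) -> float:
    """Position as a percentage of replicon length, rounded to 2 decimals.

    Assumption: the record reports the gene's first transcribed base.
    """
    return round(100.0 * position / replicon_length, 2)


length = gene_length(31098, 32336)
print(length)                    # 1239
print(length // 3 - 1)           # 412 codons once the stop codon is excluded
print(centisome(32336, 37415))   # 86.43
```

The 1239-base length matches the gene sequence header, and 412 matches the translated residue count reported further down.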
GC content: 61.9
Gene sequence:
>1239_bases
ATGGCGCTGACTGATCTTGTGGGCTGGATGGACGAGTACGGCGTCCCCGGCGCTGTCTGGTTCGCCAAGCGGCTGGCAGC
CAACGATACCCTGGCAACCGGAGCCCATGGAGCGGGGCCCTATATTCCAAAGGAGTTCCTGTTCAGCGTTTTCCCGGACC
TCAACCACCCAGAGGAGGAGAATCCGGACTGCTGGTTCGACCTGTACATCGACTCTCACGCGGACCGCCGGAGGATCCGG
GCGATCTGGTACAACAACAGGCTTCGTGGAGGGACGAGGAACGAAGCCCGACTGACGAATTTCGGCGGTGGCCGGTCAGC
ACTTCTGGATCCGGACAGTACCGGTGCCCTGGCCGTGTTTGCCTTCCTCCCTCACGGGCCGGGCAGGCAGATGGAATGTC
ATGTCTGGGTCTGCGATCAGGGGGCGGAGGCAGATCTTCTTGAAGAGCGCATTGGTCCGGTAGAGCCGAAGGCTCCTGTG
GTCTGGCAGCCGGGCGAAATCGCCCGCGACCTGTTCATGCACACGGATGCAGGACCGGGGTCGTGCCGGCTCCTGCCCGG
TGAAATTCCTCCTGCCTGGCTGACCAGGTTCCCGACGGGCGAGGAAATTATCAGGGAGACGGTCAAACGACGCCAGCGAA
ACGGTCTGGAGCCCGACCGGCTTCTCATGCGCCGCCGTGAATGCGAATTTGAGATATTTCTCAGCGTCGAGGAGGCCGTC
TATCTCCCACGTATCAGGCAAGGCTTCTCCGCGATCGGCGATTTTCTCTCGCTTGCCCAGACAATCCTGCAGAGCCGGAA
GACCCGTTCAGGCACTTCGCTGGAACTGCATGTCCGCGAGATCATGACCGAGGAAGGTCTGCTGGCTGATACCTCCTTCA
CCTACCGGCCAGTAATCGAGAACGGCAAGCGGCCGGATTTCCTGTTCCCGTCCAAGGCCGCCTACGACAATCCGGCGTTT
CCGGCAGAGCGGTTGCGCATGCTGGCCGCGAAAACGACCTGCAAGGACCGCTGGCGGCAGGTGCTCAACGAGGCCGACAG
GATCGCAACCAAGCACCTCCTCACCCTCCAGGAAGGGGTTTCGGAGGGGCAGTTCCGGGAAATGAGCGAGGCCGGCGTCC
GTCTGGTGGTGCCTGCTGAACTCCACCGTTCCTATCCGGCGAGTGTCCGGCCGCATCTTGTCACGTTCGAGAACTTCATT
GGTGACGTGAGACCGCTGTCCGCCATTCAGGGGCCATAG
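The sequence record above carries a `>N_bases` header and whitespace between 80-base blocks, so it needs light cleaning before any calculation. The sketch below shows one way to strip the header and whitespace and compute a GC percentage like the one listed. The helper names and the toy record are hypothetical, and the rounding to one decimal is an assumption based on how the GC content field is displayed.

```python
def clean_sequence(record: str) -> str:
    """Drop a leading '>header' token and all whitespace from a sequence record."""
    tokens = record.split()
    if tokens and tokens[0].startswith(">"):
        tokens = tokens[1:]
    return "".join(tokens).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, rounded to one decimal place."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)


toy_record = ">8_bases ATGC GGCC"   # hypothetical toy record, not from this entry
seq = clean_sequence(toy_record)
print(len(seq), gc_percent(seq))    # 8 75.0
```

Pasting the full gene record in place of `toy_record` should give a length of 1239 and, if the listed value was computed the same way, a GC percentage close to 61.9.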
Upstream 100 bases:
>100_bases
GCGATCAGGCGCAGGCGCGGCATGGTGCCGGAACCTGTCATTGACGAAGCCGTCTCGTGGCTGAGATCCGGAGATGGAAA
CCTGGAAATTCGGGGGAACA
Downstream 100 bases:
>100_bases
TCCCACAGGATCAGGCTGCCCGGTCTATCGGGTATGCTTTCACATCGGCCGGCCGCTTGCCCGCATACCAGGCGGGATCC
GGCAGCCGCCCCGATGTGAC
Product: hypothetical protein
Products: NA
Alternate protein names: R.EcoRII; Endonuclease EcoRII; Type II restriction enzyme EcoRII [H]
Number of amino acids: Translated: 412; Mature: 411
Protein sequence:
>412_residues
MALTDLVGWMDEYGVPGAVWFAKRLAANDTLATGAHGAGPYIPKEFLFSVFPDLNHPEEENPDCWFDLYIDSHADRRRIR
AIWYNNRLRGGTRNEARLTNFGGGRSALLDPDSTGALAVFAFLPHGPGRQMECHVWVCDQGAEADLLEERIGPVEPKAPV
VWQPGEIARDLFMHTDAGPGSCRLLPGEIPPAWLTRFPTGEEIIRETVKRRQRNGLEPDRLLMRRRECEFEIFLSVEEAV
YLPRIRQGFSAIGDFLSLAQTILQSRKTRSGTSLELHVREIMTEEGLLADTSFTYRPVIENGKRPDFLFPSKAAYDNPAF
PAERLRMLAAKTTCKDRWRQVLNEADRIATKHLLTLQEGVSEGQFREMSEAGVRLVVPAELHRSYPASVRPHLVTFENFI
GDVRPLSAIQGP
Sequences:
>Translated_412_residues
MALTDLVGWMDEYGVPGAVWFAKRLAANDTLATGAHGAGPYIPKEFLFSVFPDLNHPEEENPDCWFDLYIDSHADRRRIR
AIWYNNRLRGGTRNEARLTNFGGGRSALLDPDSTGALAVFAFLPHGPGRQMECHVWVCDQGAEADLLEERIGPVEPKAPV
VWQPGEIARDLFMHTDAGPGSCRLLPGEIPPAWLTRFPTGEEIIRETVKRRQRNGLEPDRLLMRRRECEFEIFLSVEEAV
YLPRIRQGFSAIGDFLSLAQTILQSRKTRSGTSLELHVREIMTEEGLLADTSFTYRPVIENGKRPDFLFPSKAAYDNPAF
PAERLRMLAAKTTCKDRWRQVLNEADRIATKHLLTLQEGVSEGQFREMSEAGVRLVVPAELHRSYPASVRPHLVTFENFI
GDVRPLSAIQGP

>Mature_411_residues
ALTDLVGWMDEYGVPGAVWFAKRLAANDTLATGAHGAGPYIPKEFLFSVFPDLNHPEEENPDCWFDLYIDSHADRRRIRA
IWYNNRLRGGTRNEARLTNFGGGRSALLDPDSTGALAVFAFLPHGPGRQMECHVWVCDQGAEADLLEERIGPVEPKAPVV
WQPGEIARDLFMHTDAGPGSCRLLPGEIPPAWLTRFPTGEEIIRETVKRRQRNGLEPDRLLMRRRECEFEIFLSVEEAVY
LPRIRQGFSAIGDFLSLAQTILQSRKTRSGTSLELHVREIMTEEGLLADTSFTYRPVIENGKRPDFLFPSKAAYDNPAFP
AERLRMLAAKTTCKDRWRQVLNEADRIATKHLLTLQEGVSEGQFREMSEAGVRLVVPAELHRSYPASVRPHLVTFENFIG
DVRPLSAIQGP
Specific function: Recognizes the double-stranded sequence CCWGG and cleaves before C-1 [H]
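The recognition sequence above uses the IUPAC ambiguity code W, which stands for A or T, so CCWGG covers both CCAGG and CCTGG. A minimal Python sketch of locating such sites with a regular expression follows; the function name and the toy sequence are hypothetical and not taken from this entry.

```python
import re

# IUPAC code W means A or T, so CCWGG expands to CCAGG or CCTGG.
CCWGG = re.compile(r"CC[AT]GG")


def find_ccwgg(seq: str) -> list[int]:
    """Return 1-based start positions of CCWGG sites on the given strand."""
    return [m.start() + 1 for m in CCWGG.finditer(seq.upper())]


toy = "AACCAGGTTCCTGGAA"   # hypothetical toy sequence
print(find_ccwgg(toy))     # [3, 10]
```

Because CCAGG and CCTGG are reverse complements of each other, scanning one strand is enough to find every site in double-stranded DNA.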
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011335
- InterPro: IPR015109
- InterPro: IPR015300 [H]
Pfam domain/function: PF09019 EcoRII-C; PF09217 EcoRII-N [H]
EC number: 3.1.21.4 [H]
Molecular weight: Translated: 46432; Mature: 46301
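The two molecular weights differ by 131, which is what one would expect if the mature form is the translated protein without its initiator methionine, as the 412 versus 411 residue counts suggest. The sketch below checks that arithmetic. The constant is an approximate average mass for a methionine residue (free methionine minus one water), and the function name is hypothetical.

```python
# Approximate average mass of one methionine residue in Da
# (free methionine, about 149.2, minus one water, about 18.0).
MET_RESIDUE_MASS = 131.2


def mature_mass(translated_mass: float) -> int:
    """Estimate the mass after removing the N-terminal methionine."""
    return round(translated_mass - MET_RESIDUE_MASS)


print(mature_mass(46432))   # 46301
```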
Theoretical pI: Translated: 6.20; Mature: 6.20
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
1.9 %Met (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
1.7 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
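These percentages are simple residue counts divided by sequence length. The listed values are consistent with 6 Cys and 8 Met in the 412-residue translation, dropping to 7 Met once the initiator methionine is removed. A minimal Python sketch of the calculation follows; the function name and the toy peptide are hypothetical, and rounding to one decimal is an assumption based on the displayed values.

```python
def cys_met_percent(seq: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        # Computed from the raw counts, not by adding the rounded values.
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


toy = "MKCAMGGSLC"                # hypothetical 10-residue peptide
print(cys_met_percent(toy))       # {'Cys': 20.0, 'Met': 20.0, 'Cys+Met': 40.0}
print(cys_met_percent(toy[1:]))   # {'Cys': 22.2, 'Met': 11.1, 'Cys+Met': 33.3}
```

Passing `toy[1:]` mimics the mature form, which lacks the leading methionine.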
Secondary structure:
>Translated Secondary Structure
MALTDLVGWMDEYGVPGAVWFAKRLAANDTLATGAHGAGPYIPKEFLFSVFPDLNHPEEE
CCHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCC
NPDCWFDLYIDSHADRRRIRAIWYNNRLRGGTRNEARLTNFGGGRSALLDPDSTGALAVF
CCCEEEEEEECCCCCHHHEEEEEECCCCCCCCCCCCEEECCCCCCCEEECCCCCCCEEEE
AFLPHGPGRQMECHVWVCDQGAEADLLEERIGPVEPKAPVVWQPGEIARDLFMHTDAGPG
EECCCCCCCCEEEEEEEECCCCCHHHHHHHCCCCCCCCCEEECCHHHHHHHHHCCCCCCC
SCRLLPGEIPPAWLTRFPTGEEIIRETVKRRQRNGLEPDRLLMRRRECEFEIFLSVEEAV
CEEECCCCCCHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCEEEEEEHHHHH
YLPRIRQGFSAIGDFLSLAQTILQSRKTRSGTSLELHVREIMTEEGLLADTSFTYRPVIE
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHCCCCEECCCCEECHHHC
NGKRPDFLFPSKAAYDNPAFPAERLRMLAAKTTCKDRWRQVLNEADRIATKHLLTLQEGV
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
SEGQFREMSEAGVRLVVPAELHRSYPASVRPHLVTFENFIGDVRPLSAIQGP
CCCHHHHHHHCCCEEEEEHHHHCCCCCCCCCCEEEHHHHHCCCCHHHHCCCC

>Mature Secondary Structure
ALTDLVGWMDEYGVPGAVWFAKRLAANDTLATGAHGAGPYIPKEFLFSVFPDLNHPEEE
CHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCC
NPDCWFDLYIDSHADRRRIRAIWYNNRLRGGTRNEARLTNFGGGRSALLDPDSTGALAVF
CCCEEEEEEECCCCCHHHEEEEEECCCCCCCCCCCCEEECCCCCCCEEECCCCCCCEEEE
AFLPHGPGRQMECHVWVCDQGAEADLLEERIGPVEPKAPVVWQPGEIARDLFMHTDAGPG
EECCCCCCCCEEEEEEEECCCCCHHHHHHHCCCCCCCCCEEECCHHHHHHHHHCCCCCCC
SCRLLPGEIPPAWLTRFPTGEEIIRETVKRRQRNGLEPDRLLMRRRECEFEIFLSVEEAV
CEEECCCCCCHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCEEEEEEHHHHH
YLPRIRQGFSAIGDFLSLAQTILQSRKTRSGTSLELHVREIMTEEGLLADTSFTYRPVIE
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHCCCCEECCCCEECHHHC
NGKRPDFLFPSKAAYDNPAFPAERLRMLAAKTTCKDRWRQVLNEADRIATKHLLTLQEGV
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
SEGQFREMSEAGVRLVVPAELHRSYPASVRPHLVTFENFIGDVRPLSAIQGP
CCCHHHHHHHCCCEEEEEHHHHCCCCCCCCCCEEEHHHHHCCCCHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2104830; 2597679; 2612358; 8392701 [H]