Definition Nocardioides sp. JS614 chromosome, complete genome.
Accession NC_008699
Length 4,985,871

The map label for this gene is 119715343

Identifier: 119715343

GI number: 119715343

Start: 1167904

End: 1168581

Strand: Direct

Name: 119715343

Synonym: Noca_1105

Alternate gene names: NA

Gene position: 1167904-1168581 (Clockwise)

Preceding gene: 119715342

Following gene: 119715344

Centisome position: 23.42
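
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the gene length is counted inclusively (End - Start + 1) and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Neither formula is stated in this record, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_985_871          # Length
START, END = 1_167_904, 1_168_581  # Start / End

def gene_length(start, end):
    """Inclusive length of a feature given 1-based coordinates."""
    return end - start + 1

def centisome(start, genome_length):
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                    # 678
print(round(centisome(START, GENOME_LENGTH), 2))  # 23.42
```

Both results agree with the 678-base gene sequence and the centisome value listed in this record.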

GC content: 60.91

Gene sequence:

>678_bases
GTGAGTCGCATCGATCGCCTCATCGCGGAACTCGCGCCGCAGGGGGTGCCGCTTATGCCGCTGGGTCAGCTTGGTGAGTT
CATACGTGGCCGTCGATTCACGAAAGCCGACTACGTCGACTCCGGACTCGGCTCAATTCACTACGGCGAGATTTACACCG
ACTACGGCACGACCGCGTCATCGGTACATCGGTTCGTTCGTCCCGAGTTGAAGGGAAGCCTCCGCCTGGCTCGCCCAGGT
GACTTGGTCATCGCGGCAACGGGCGAGAACGTACAGGAAGTATGCAAGGCGGTCGCTTGGCTGGGCGACGAAGAGGTTGC
CATCCACGATGATTGCTACATCTTCCGTCACCAGATGGATCCGACGTTTGTCTCGTACTTCTTTCAGACCGCCCACTTCC
ACGAGCAAAAGGCACGGCTGGCCTCCGAGTCGAAGCTTGCGCGGGTGTCCGGTGCGAACCTGGCCAGGATCGTTGCTCCC
GCTCCGCCACTGGAGGTGCAGCGAGAGATCGTCAGCGTCCTGGACAAGTTCAGAGCTCTTGAAGCCGAGCTCAAGGCCGA
GCTTGAAGCACGGCGTGAGCAGTACAGGTACTACCGCGACGCGCTGGTGGCGTTCGACGCGCCTGACTCTCTCTCTCTCT
CTCTCTCTCGCAGAGCAGGCTCAGATGGGCGAGACTGA
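
A GC content figure such as the one reported above is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch under that assumption; the function name is hypothetical and the record does not say which tool produced its value.

```python
def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# Toy input; paste the wrapped gene sequence to check the reported value.
print(gc_content("ATGC"))
```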

Upstream 100 bases:

>100_bases
TCCGCGAGCTCAACGCGGAGATCGGGCGGATTGTGAGCCGTCAGTGTGAGCTGCGTGCGGAGATCGATGCTGTCGTTGCG
GACCTTGAGGAGCGCCGGTC

Downstream 100 bases:

>100_bases
GTGATGTTGCGACACTTCGGCGCGGCTCAGCCATGACAGCGACGTCTGCAGCTCTCGGCGACGTGCCCGTGGTCGCGAAT
GCTCCTGAGCCTGCGTACTT

Product: restriction modification system DNA specificity subunit

Products: NA

Alternate protein names: Restriction System DNA Specificity Domain Protein; Type I Restriction- System Specificity Determinant; Restriction System DNA Specificity Subunit; Type I Restriction- System Specificity Subunit; Restriction System DNA Specificity Domain; Type I Restriction- System; Type I Restriction; Type I Restriction- System S Protein; LOW QUALITY PROTEIN Restriction Endonuclease S; Type I Restriction- Specificity Subunit S; Type I Restriction Specificity Protein; Type I Restriction Subunit S

Number of amino acids: Translated: 225; Mature: 224

Protein sequence:

>225_residues
MSRIDRLIAELAPQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTASSVHRFVRPELKGSLRLARPG
DLVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMDPTFVSYFFQTAHFHEQKARLASESKLARVSGANLARIVAP
APPLEVQREIVSVLDKFRALEAELKAELEARREQYRYYRDALVAFDAPDSLSLSLSRRAGSDGRD
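
The relationship between the gene sequence and this protein sequence can be sketched as a plain codon-by-codon translation. The example assumes the standard codon assignments and that the GTG initiation codon is read as methionine, which is consistent with the sequence starting with M. The helper names are illustrative and are not taken from the source database.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (first, second, third base).
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    The first codon is always emitted as M (assumption: bacterial
    alternative start codons such as GTG initiate with methionine).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)

# First 30 bases of the gene sequence listed in this record.
print(translate("GTGAGTCGCATCGATCGCCTCATCGCGGAA"))  # MSRIDRLIAE
```

A 678-base gene holds 226 codons; one of them is the stop codon, which leaves the 225 translated residues, and removing the initiator methionine gives the 224-residue mature form.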

Sequences:

>Translated_225_residues
MSRIDRLIAELAPQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTASSVHRFVRPELKGSLRLARPG
DLVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMDPTFVSYFFQTAHFHEQKARLASESKLARVSGANLARIVAP
APPLEVQREIVSVLDKFRALEAELKAELEARREQYRYYRDALVAFDAPDSLSLSLSRRAGSDGRD
>Mature_224_residues
SRIDRLIAELAPQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTASSVHRFVRPELKGSLRLARPGD
LVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMDPTFVSYFFQTAHFHEQKARLASESKLARVSGANLARIVAPA
PPLEVQREIVSVLDKFRALEAELKAELEARREQYRYYRDALVAFDAPDSLSLSLSRRAGSDGRD

Specific function: Unknown

COG id: COG0732

COG function: function code V; Restriction endonuclease S subunits

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 25172; Mature: 25041

Theoretical pI: Translated: 6.52; Mature: 6.52

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
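
The percentages above can be reproduced by counting C and M residues in the protein sequence from this record. This is a minimal sketch; it assumes the mature protein is the translated sequence minus its initiator methionine, and the function name is hypothetical.

```python
# Translated protein sequence copied from this record (225 residues).
TRANSLATED = (
    "MSRIDRLIAELAPQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTASSVHRFVRPELKGSLRLARPG"
    "DLVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMDPTFVSYFFQTAHFHEQKARLASESKLARVSGANLARIVAP"
    "APPLEVQREIVSVLDKFRALEAELKAELEARREQYRYYRDALVAFDAPDSLSLSLSRRAGSDGRD"
)
MATURE = TRANSLATED[1:]  # assumption: initiator Met removed

def cys_met_percent(protein):
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys, met = protein.count("C"), protein.count("M")
    return (round(100.0 * cys / n, 1),
            round(100.0 * met / n, 1),
            round(100.0 * (cys + met) / n, 1))

print(cys_met_percent(TRANSLATED))  # (0.9, 1.3, 2.2)
print(cys_met_percent(MATURE))      # (0.9, 0.9, 1.8)
```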

Secondary structure:

>Translated Secondary Structure
MSRIDRLIAELAPQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTAS
CCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCCCEEECEEEECCCCCHH
SVHRFVRPELKGSLRLARPGDLVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMD
HHHHHHCHHHCCCEEEECCCCEEEEECCCCHHHHHHHHHHCCCCCEEEECCEEEEEECCC
PTFVSYFFQTAHFHEQKARLASESKLARVSGANLARIVAPAPPLEVQREIVSVLDKFRAL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHH
EAELKAELEARREQYRYYRDALVAFDAPDSLSLSLSRRAGSDGRD
HHHHHHHHHHHHHHHHHHHHHHHEECCCCCCEEEECCCCCCCCCC
>Mature Secondary Structure 
SRIDRLIAELAPQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTAS
CHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCCCEEECEEEECCCCCHH
SVHRFVRPELKGSLRLARPGDLVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMD
HHHHHHCHHHCCCEEEECCCCEEEEECCCCHHHHHHHHHHCCCCCEEEECCEEEEEECCC
PTFVSYFFQTAHFHEQKARLASESKLARVSGANLARIVAPAPPLEVQREIVSVLDKFRAL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHH
EAELKAELEARREQYRYYRDALVAFDAPDSLSLSLSRRAGSDGRD
HHHHHHHHHHHHHHHHHHHHHHHEECCCCCCEEEECCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA