Definition | Dinoroseobacter shibae DFL 12 plasmid pDSHI03, complete sequence. |
---|---|
Accession | NC_009957 |
Length | 126,304 |
The map label for this gene is xerD [C]
Identifier: 159046542
GI number: 159046542
Start: 67841
End: 69367
Strand: Direct
Name: xerD [C]
Synonym: Dshi_4004
Alternate gene names: 159046542
Gene position: 67841-69367 (Clockwise)
Preceding gene: 159046536
Following gene: 159046543
Centisome position: 53.71
GC content: 61.17
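The positional fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes inclusive coordinates, assumes the centisome position is the start coordinate expressed as a percentage of the plasmid length (a definition the record does not state), and uses hypothetical function names. `gc_percent` can be applied to the gene sequence listed below.

```python
# Values copied from the record; the function names are hypothetical.
PLASMID_LENGTH = 126_304   # Length field
GENE_START = 67_841        # Start field
GENE_END = 69_367          # End field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming both coordinates are inclusive."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    n_bases = gene_length(GENE_START, GENE_END)
    print("gene length:", n_bases)
    # One codon is the stop codon, so the protein is one residue shorter.
    print("residues:", n_bases // 3 - 1)
    print("centisome:", round(centisome(GENE_START, PLASMID_LENGTH), 2))
```

Under these assumptions the computed length (1527 bases), residue count (508), and centisome position (53.71) agree with the values reported in the record.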
Gene sequence:
>1527_bases ATGAACTCCCCCTCGAACTCTCAACGCCGTGCGCGCGACCGCGCTGACAACCTCGCTCTGGTCGACACGTACCACGCCGA CAATCCCGGGCTTTCCCAGGGTGCGCTGAGCGCAGCCCGCCATTTTCTGAAGTGGGCGCAGGCACGGAAAGTCTCTGTCC GTGACCTCGACGCGTCGGTTGTAGACAGTTTTCTTCGTCACCACTGCCGCTGCGGTCGCTACAGCCCCAATCAGTTGCGG AGCCCGATGTATGCCACCTATACTCGCCGGTTTTTTCGATATCTCGAAGACACCGGCATAGTTGCGATCCCAAATGACAC TGCCCGGCTAAGACAGCACCTGGAGGCCTTTGCCAAGAAGCTCGAGAGCGCTGGCTACAGCGAGGTATCGCGCGCATCAT TGCTCAGCCATGCCGCCCATTTTGCTGAATGGGTCCTACAACAGCGAGTCCCGTTGACCGCAATCGGTGAAGGAACCATC GATCAGTTCGCGTGGCACGAATGCCGATGCGGGGAGATGACCAAGCATGGCAACAGGGTTCTGGCGTCTCACTATAAGAA CCGTAAGCGAGGTGCCCATGCGCTTGTCCGGCACCTGATCGACGAAGGTCTGCTCCCGCCCCAAGCACCTGACGACGTCT CCGCAGAAGACCCGCGTCTGATCAGCTTTTCGGAATGGTTGCGCCGCGAGCGCGGGGTGGCACCCGAGACCGTACGGCGC TATTTGAACGAAGTGGGCCGCTGGCTGGACAGCCTGGGGGCAACGCCGGAGGATTACGATGCGGCGGCCATCCGTTCGAT CATATTGGATCAAGGTGAAGAACGGTCCCAATCATCTGTGCGCAAGACGGTCACCGTGTTGCGAGCCTTCCTGCGCTTTA CGATCGTCCAAGGCGCATGCGCCCCGTCGCTTCTGCATGCGGTGCCATCTGCTGTTCGGCGCAAGCTTTCTACAGTCCCC CGCACGATCCCCACGGCGAAGATCGAAGAGATCCTTGCCTCCTGTCGCACCGATACGCCGGTTGAGATCCGAGACCGCGC AATACTTCTCCTCCTCGCCCGGTTGGCCTTGCGCGCGGGAGACATCTGGCAGTTGCACCTGTCAGATATCGACTGGCGCA CGAGTCGGTTGCGATTGCACGGCAAGGGCAGGCGTGGCGTTATGATGCCGCTGCCGCAGGATGTAGGCGATGCGCTGCTG GTCTATATCGAAGATGCGCGTCCAGTGGTCGCGTCGAACAGGGTCTTTCTCCGAGTCCAGGCTCCTTTCACCCCGCTGCG ATCCTCTGCCGAGATCGCCGGGATCGTTTCCCGCGTCCTGAGCCGGGGAGGTTTTACCGACTTGCCGACTGGTTCGCATG TCTTTCGTCATTCACTGGCCTCCGCCTGGCTGCGTGGCGGTGCGGACCTTGACCTGATCGGTGCAGCGCTGCGCCACACC TCGCGCGATACGACCGCGATCTACGCTAAGGTCGATGTTGGGATGCTGGAAGAAGTGGCTCAGCCGTGGCCGGGAGACGC GTCATGA
Upstream 100 bases:
>100_bases AGTTTTGGGTTTCAAGCTGTTGTTATGACGATTCATTTTTGGGTTTCCGCTTTGGATAATCATGATGCCTCTGCTCTCAA CCGCAGAAGGACATCCCCTC
Downstream 100 bases:
>100_bases TACATCACCATGTAGACCGCTTCGTCCAACTCAACCGCACACTCGGAAAGAAGTTCGCCGCACAGGAGACATCCTTGCGC GCCTTCGCGGATTTCGCTGC
Product: integrase family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 508; Mature: 508
Protein sequence:
>508_residues MNSPSNSQRRARDRADNLALVDTYHADNPGLSQGALSAARHFLKWAQARKVSVRDLDASVVDSFLRHHCRCGRYSPNQLR SPMYATYTRRFFRYLEDTGIVAIPNDTARLRQHLEAFAKKLESAGYSEVSRASLLSHAAHFAEWVLQQRVPLTAIGEGTI DQFAWHECRCGEMTKHGNRVLASHYKNRKRGAHALVRHLIDEGLLPPQAPDDVSAEDPRLISFSEWLRRERGVAPETVRR YLNEVGRWLDSLGATPEDYDAAAIRSIILDQGEERSQSSVRKTVTVLRAFLRFTIVQGACAPSLLHAVPSAVRRKLSTVP RTIPTAKIEEILASCRTDTPVEIRDRAILLLLARLALRAGDIWQLHLSDIDWRTSRLRLHGKGRRGVMMPLPQDVGDALL VYIEDARPVVASNRVFLRVQAPFTPLRSSAEIAGIVSRVLSRGGFTDLPTGSHVFRHSLASAWLRGGADLDLIGAALRHT SRDTTAIYAKVDVGMLEEVAQPWPGDAS
Sequences:
>Translated_508_residues MNSPSNSQRRARDRADNLALVDTYHADNPGLSQGALSAARHFLKWAQARKVSVRDLDASVVDSFLRHHCRCGRYSPNQLR SPMYATYTRRFFRYLEDTGIVAIPNDTARLRQHLEAFAKKLESAGYSEVSRASLLSHAAHFAEWVLQQRVPLTAIGEGTI DQFAWHECRCGEMTKHGNRVLASHYKNRKRGAHALVRHLIDEGLLPPQAPDDVSAEDPRLISFSEWLRRERGVAPETVRR YLNEVGRWLDSLGATPEDYDAAAIRSIILDQGEERSQSSVRKTVTVLRAFLRFTIVQGACAPSLLHAVPSAVRRKLSTVP RTIPTAKIEEILASCRTDTPVEIRDRAILLLLARLALRAGDIWQLHLSDIDWRTSRLRLHGKGRRGVMMPLPQDVGDALL VYIEDARPVVASNRVFLRVQAPFTPLRSSAEIAGIVSRVLSRGGFTDLPTGSHVFRHSLASAWLRGGADLDLIGAALRHT SRDTTAIYAKVDVGMLEEVAQPWPGDAS
>Mature_508_residues MNSPSNSQRRARDRADNLALVDTYHADNPGLSQGALSAARHFLKWAQARKVSVRDLDASVVDSFLRHHCRCGRYSPNQLR SPMYATYTRRFFRYLEDTGIVAIPNDTARLRQHLEAFAKKLESAGYSEVSRASLLSHAAHFAEWVLQQRVPLTAIGEGTI DQFAWHECRCGEMTKHGNRVLASHYKNRKRGAHALVRHLIDEGLLPPQAPDDVSAEDPRLISFSEWLRRERGVAPETVRR YLNEVGRWLDSLGATPEDYDAAAIRSIILDQGEERSQSSVRKTVTVLRAFLRFTIVQGACAPSLLHAVPSAVRRKLSTVP RTIPTAKIEEILASCRTDTPVEIRDRAILLLLARLALRAGDIWQLHLSDIDWRTSRLRLHGKGRRGVMMPLPQDVGDALL VYIEDARPVVASNRVFLRVQAPFTPLRSSAEIAGIVSRVLSRGGFTDLPTGSHVFRHSLASAWLRGGADLDLIGAALRHT SRDTTAIYAKVDVGMLEEVAQPWPGDAS
Specific function: Site-specific tyrosine recombinase, which acts by catalyzing the cutting and rejoining of the recombining DNA molecules. Binds cooperatively to specific DNA consensus sequences that are separated from XerC binding sites by a short central region, forming
COG id: COG0582
COG function: function code L; Integrase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family [H]
Homologues:
Organism=Escherichia coli, GI1789261, Length=301, Percent_Identity=28.2392026578073, Blast_Score=86, Evalue=5e-18
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011010
- InterPro: IPR013762
- InterPro: IPR002104
- InterPro: IPR023109
- InterPro: IPR004107 [H]
Pfam domain/function: PF02899 Phage_integr_N; PF00589 Phage_integrase [H]
EC number: NA
Molecular weight: Translated: 56662; Mature: 56662
Theoretical pI: Translated: 10.05; Mature: 10.05
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2% Cys (Translated Protein)
1.2% Met (Translated Protein)
2.4% Cys+Met (Translated Protein)
1.2% Cys (Mature Protein)
1.2% Met (Mature Protein)
2.4% Cys+Met (Mature Protein)
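For readers who want to reproduce composition figures like these from the protein sequence, the following is a minimal sketch. It assumes the percentage is simply the residue count divided by the sequence length, rounded to one decimal place; the record does not state its method, and `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence.

    Whitespace is ignored so sequences pasted in spaced chunks work as-is.
    """
    protein = "".join(protein.split()).upper()
    if not protein:
        return 0.0
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    # Toy sequence for illustration; paste the 508-residue sequence
    # from the record to compare against the reported values.
    toy = "MCAAA AAAAA"
    print(residue_percent(toy, "C"))
    print(residue_percent(toy, "M"))
    print(residue_percent(toy, "CM"))
```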
Secondary structure:
>Translated Secondary Structure
MNSPSNSQRRARDRADNLALVDTYHADNPGLSQGALSAARHFLKWAQARKVSVRDLDASV
CCCCCCHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHCCHHH
VDSFLRHHCRCGRYSPNQLRSPMYATYTRRFFRYLEDTGIVAIPNDTARLRQHLEAFAKK
HHHHHHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHH
LESAGYSEVSRASLLSHAAHFAEWVLQQRVPLTAIGEGTIDQFAWHECRCGEMTKHGNRV
HHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCHHHHHHHHCCCCCHHHCCCHH
LASHYKNRKRGAHALVRHLIDEGLLPPQAPDDVSAEDPRLISFSEWLRRERGVAPETVRR
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECHHHHHHHHCCCCHHHHHH
YLNEVGRWLDSLGATPEDYDAAAIRSIILDQGEERSQSSVRKTVTVLRAFLRFTIVQGAC
HHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCH
APSLLHAVPSAVRRKLSTVPRTIPTAKIEEILASCRTDTPVEIRDRAILLLLARLALRAG
HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC
DIWQLHLSDIDWRTSRLRLHGKGRRGVMMPLPQDVGDALLVYIEDARPVVASNRVFLRVQ
CEEEEECCCCCCCHHHEEEECCCCCCEECCCCCCCCCEEEEEECCCCCEEECCEEEEEEE
APFTPLRSSAEIAGIVSRVLSRGGFTDLPTGSHVFRHSLASAWLRGGADLDLIGAALRHT
CCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHC
SRDTTAIYAKVDVGMLEEVAQPWPGDAS
CCCCEEEEEECCHHHHHHHCCCCCCCCC
>Mature Secondary Structure
MNSPSNSQRRARDRADNLALVDTYHADNPGLSQGALSAARHFLKWAQARKVSVRDLDASV
CCCCCCHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHCCHHH
VDSFLRHHCRCGRYSPNQLRSPMYATYTRRFFRYLEDTGIVAIPNDTARLRQHLEAFAKK
HHHHHHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHH
LESAGYSEVSRASLLSHAAHFAEWVLQQRVPLTAIGEGTIDQFAWHECRCGEMTKHGNRV
HHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCHHHHHHHHCCCCCHHHCCCHH
LASHYKNRKRGAHALVRHLIDEGLLPPQAPDDVSAEDPRLISFSEWLRRERGVAPETVRR
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECHHHHHHHHCCCCHHHHHH
YLNEVGRWLDSLGATPEDYDAAAIRSIILDQGEERSQSSVRKTVTVLRAFLRFTIVQGAC
HHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCH
APSLLHAVPSAVRRKLSTVPRTIPTAKIEEILASCRTDTPVEIRDRAILLLLARLALRAG
HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC
DIWQLHLSDIDWRTSRLRLHGKGRRGVMMPLPQDVGDALLVYIEDARPVVASNRVFLRVQ
CEEEEECCCCCCCHHHEEEECCCCCCEECCCCCCCCCEEEEEECCCCCEEECCEEEEEEE
APFTPLRSSAEIAGIVSRVLSRGGFTDLPTGSHVFRHSLASAWLRGGADLDLIGAALRHT
CCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHC
SRDTTAIYAKVDVGMLEEVAQPWPGDAS
CCCCEEEEEECCHHHHHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424 [H]