Definition Dinoroseobacter shibae DFL 12 plasmid pDSHI03, complete sequence.
Accession NC_009957
Length 126,304

The map label for this gene is xerD [C]

Identifier: 159046544

GI number: 159046544

Start: 70272

End: 71276

Strand: Direct

Name: xerD [C]

Synonym: Dshi_4006

Alternate gene names: 159046544

Gene position: 70272-71276 (Clockwise)

Preceding gene: 159046543

Following gene: 159046548

Centisome position: 55.64
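
The centisome value is consistent with the gene start expressed as a percentage of the plasmid length. The record does not state the formula, so the following is a minimal sketch under that assumption; the function name `centisome` is illustrative.

```python
def centisome(start: int, replicon_length: int) -> float:
    """Gene start as a percentage of the replicon length (assumed formula)."""
    if replicon_length <= 0:
        raise ValueError("replicon length must be positive")
    return round(100.0 * start / replicon_length, 2)


# Start 70272 on the 126,304 bp plasmid
print(centisome(70272, 126304))  # 55.64
```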

GC content: 60.9
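
GC content is normally the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute it from a FASTA body; the helper name `gc_content` and the one-decimal rounding are assumptions, not details from this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 1)


# Toy input; the full 1005-base gene sequence can be passed the same way.
print(gc_content("ATGACCCCGC"))  # 70.0
```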

Gene sequence:

>1005_bases
ATGACCCCGCTCGCCCCCGACCTTTCAGCCTTCCTGCAAACCCATCTTCCCCATGAATGCGGCGCAAGCCGACATACCAT
CGCAGCCTATGCCTGTGCGTTCACCCTCTTGCTGCGGTTTGTCACAGGGCGGGTCAAGCGAACCCCATCGGAACTTTTCA
TTGAAGACCTGGACATTCAGACGATCAGGGCGTTCCTCGAACATATCGAAGAAGGACGCGCCAATTCCGTCCGGTCCCGC
AATGCGCGACTGGCGGCGATTAAATCGTTTTTCCGTTTCGTCGAACATCGCCAACCGGCCTGCCTTGAGCAGGCGATGAT
GATCCGGGCCATGCCGACCAAGCGAACAGACGCCAAGCTGATCGATTATCTTACAAAGGAAGAAGTCCGCGCCTTGCTTG
CCGCGCCGAACCGCCACACGCCAGGCGGATTGCGGGACCGCGCCATGCTGCATCTGACCTATGCCGCAGGGTTGAGAGCG
TCCGAACTTCTGGCTGTGCGGATGGACGATTTTCCCGACGGGTCGTTTTCCAACGTACGGATATTGGGCAAGGGGCGCCG
GGAGCGTGTGCTGCCGCTCTGGAAGGAGACTCAATGTGCCATTCGCGCGTGGTTGGCCGTCAGGCCCGGTCGAGTAGGCC
CGGAACTGTTCCTGAACCGTGATGGGCGCCGAATGACGCGGGACGGATTTGCATATCGGCTAAGGCAACATGTGGCGACA
GCCGAGCGCTCGACGCCATCGATTGCCGCAAAGCAGGTCACTCCGCATGTGTTGCGGCACAGCTGCGCCATGCACACGCT
TCAGGCCACCGGTGATATCCGCAAGGTCGCGCTCTGGCTTGGCCATGCCAGCATCCAGACGACCGAGATGTATCTGCGCG
CGGACCCGACCGAGAAGCTAGCGCTTCTGGAGGCGCACCACGCACCTCTGATCCAACCAGGCAAGTTCCGGGAGCCGTCG
GACAAGCTGATGCAGATCCTAGCCGTCGCCGCGCAACGTGCTTGA

Upstream 100 bases:

>100_bases
ATGTCGATATTGCCAACACCTACTGGTACCTTGAGGCTACGCCGGTTCTTCTGAAGATGATTGCGGCAACCGCCGAGGAG
ACCTGGATCGGAGGTGCGGC

Downstream 100 bases:

>100_bases
CCTTTGCGGTAAAATGGCGTCCCGCCCTCTGTGGGGCGCTATTCCATCACATAGCCGAGAGCCCGGTAACCGGTTCTTCT
TTTGGCCGCCATTCTGGCGA

Product: integrase family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 334; Mature: 333
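
Both residue counts follow from the gene coordinates: 1005 bases make 335 codons, of which the last is the stop codon. The sketch below assumes the mature form differs only by removal of the initiator methionine, which matches the two sequences listed in this record; `protein_lengths` is an illustrative name.

```python
def protein_lengths(start: int, end: int) -> tuple:
    """Return (gene length, translated residues, mature residues)."""
    gene_len = end - start + 1          # coordinates are inclusive
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    translated = gene_len // 3 - 1      # drop the stop codon
    mature = translated - 1             # assumes initiator Met is removed
    return gene_len, translated, mature


print(protein_lengths(70272, 71276))  # (1005, 334, 333)
```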

Protein sequence:

>334_residues
MTPLAPDLSAFLQTHLPHECGASRHTIAAYACAFTLLLRFVTGRVKRTPSELFIEDLDIQTIRAFLEHIEEGRANSVRSR
NARLAAIKSFFRFVEHRQPACLEQAMMIRAMPTKRTDAKLIDYLTKEEVRALLAAPNRHTPGGLRDRAMLHLTYAAGLRA
SELLAVRMDDFPDGSFSNVRILGKGRRERVLPLWKETQCAIRAWLAVRPGRVGPELFLNRDGRRMTRDGFAYRLRQHVAT
AERSTPSIAAKQVTPHVLRHSCAMHTLQATGDIRKVALWLGHASIQTTEMYLRADPTEKLALLEAHHAPLIQPGKFREPS
DKLMQILAVAAQRA

Sequences:

>Translated_334_residues
MTPLAPDLSAFLQTHLPHECGASRHTIAAYACAFTLLLRFVTGRVKRTPSELFIEDLDIQTIRAFLEHIEEGRANSVRSR
NARLAAIKSFFRFVEHRQPACLEQAMMIRAMPTKRTDAKLIDYLTKEEVRALLAAPNRHTPGGLRDRAMLHLTYAAGLRA
SELLAVRMDDFPDGSFSNVRILGKGRRERVLPLWKETQCAIRAWLAVRPGRVGPELFLNRDGRRMTRDGFAYRLRQHVAT
AERSTPSIAAKQVTPHVLRHSCAMHTLQATGDIRKVALWLGHASIQTTEMYLRADPTEKLALLEAHHAPLIQPGKFREPS
DKLMQILAVAAQRA
>Mature_333_residues
TPLAPDLSAFLQTHLPHECGASRHTIAAYACAFTLLLRFVTGRVKRTPSELFIEDLDIQTIRAFLEHIEEGRANSVRSRN
ARLAAIKSFFRFVEHRQPACLEQAMMIRAMPTKRTDAKLIDYLTKEEVRALLAAPNRHTPGGLRDRAMLHLTYAAGLRAS
ELLAVRMDDFPDGSFSNVRILGKGRRERVLPLWKETQCAIRAWLAVRPGRVGPELFLNRDGRRMTRDGFAYRLRQHVATA
ERSTPSIAAKQVTPHVLRHSCAMHTLQATGDIRKVALWLGHASIQTTEMYLRADPTEKLALLEAHHAPLIQPGKFREPSD
KLMQILAVAAQRA

Specific function: Site-specific tyrosine recombinase, which acts by catalyzing the cutting and rejoining of the recombining DNA molecules. Binds cooperatively to specific DNA consensus sequences that are separated from XerC binding sites by a short central region, forming

COG id: COG0582

COG function: function code L; Integrase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the 'phage' integrase family [H]

Homologues:

Organism=Escherichia coli, GI1789261, Length=193, Percent_Identity=37.3056994818653, Blast_Score=106, Evalue=2e-24,
Organism=Escherichia coli, GI1790244, Length=326, Percent_Identity=29.4478527607362, Blast_Score=103, Evalue=2e-23,
Organism=Escherichia coli, GI1790767, Length=182, Percent_Identity=29.1208791208791, Blast_Score=68, Evalue=8e-13,
Organism=Escherichia coli, GI1790768, Length=187, Percent_Identity=31.0160427807487, Blast_Score=65, Evalue=4e-12,
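
Each homologue line is a comma-separated list of `key=value` pairs with a trailing comma. A minimal parsing sketch follows; the function name `parse_homologue` and the choice of numeric types are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into typed fields."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        key, _, value = part.strip().partition("=")
        record[key] = value
    # Convert the numeric fields; Organism stays a string.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789261, Length=193, "
    "Percent_Identity=37.3056994818653, Blast_Score=106, Evalue=2e-24,"
)
print(hit["Organism"], hit["Length"], round(hit["Percent_Identity"], 1))
```

The bare `GI1789261` token has no `=` sign, so it ends up as a key with an empty value; callers that need the GI number can look for keys starting with `GI`.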

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011010
- InterPro:   IPR013762
- InterPro:   IPR002104
- InterPro:   IPR010998
- InterPro:   IPR023109
- InterPro:   IPR004107 [H]

Pfam domain/function: PF02899 Phage_integr_N; PF00589 Phage_integrase [H]

EC number: NA

Molecular weight: Translated: 37642; Mature: 37511
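
The two molecular weights differ by 131, which is what removing one methionine residue would give. This is a consistency check under that assumption, not the method used to produce the record; the residue mass of about 131.2 Da is the commonly quoted average value.

```python
MET_RESIDUE_MASS = 131.2  # approximate average mass of a Met residue, in Da


def mature_mass(translated_mass: float) -> int:
    """Mass after removing the N-terminal Met (assumed processing step)."""
    return round(translated_mass - MET_RESIDUE_MASS)


print(mature_mass(37642))  # 37511
```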

Theoretical pI: Translated: 10.81; Mature: 10.81

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
4.2 %Cys+Met (Mature Protein)
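
These percentages can be reproduced by counting C and M residues in the protein sequence given above. The sketch assumes the mature protein is the translated sequence without its first residue; `cys_met_percent` is an illustrative name.

```python
# Translated protein sequence copied from this record (334 residues).
PROTEIN = (
    "MTPLAPDLSAFLQTHLPHECGASRHTIAAYACAFTLLLRFVTGRVKRTPSELFIEDLDIQTIRAFLEHIEEGRANSVRSR"
    "NARLAAIKSFFRFVEHRQPACLEQAMMIRAMPTKRTDAKLIDYLTKEEVRALLAAPNRHTPGGLRDRAMLHLTYAAGLRA"
    "SELLAVRMDDFPDGSFSNVRILGKGRRERVLPLWKETQCAIRAWLAVRPGRVGPELFLNRDGRRMTRDGFAYRLRQHVAT"
    "AERSTPSIAAKQVTPHVLRHSCAMHTLQATGDIRKVALWLGHASIQTTEMYLRADPTEKLALLEAHHAPLIQPGKFREPS"
    "DKLMQILAVAAQRA"
)


def cys_met_percent(seq: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal."""
    n = len(seq)
    cys = seq.count("C")
    met = seq.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


print(cys_met_percent(PROTEIN))      # translated: (1.5, 3.0, 4.5)
print(cys_met_percent(PROTEIN[1:]))  # mature:     (1.5, 2.7, 4.2)
```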

Secondary structure:

>Translated Secondary Structure
MTPLAPDLSAFLQTHLPHECGASRHTIAAYACAFTLLLRFVTGRVKRTPSELFIEDLDIQ
CCCCCHHHHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCHH
TIRAFLEHIEEGRANSVRSRNARLAAIKSFFRFVEHRQPACLEQAMMIRAMPTKRTDAKL
HHHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHH
IDYLTKEEVRALLAAPNRHTPGGLRDRAMLHLTYAAGLRASELLAVRMDDFPDGSFSNVR
HHHHHHHHHHHHHHCCCCCCCCCCCCHHEEEEHHHCCCCHHHEEEEEECCCCCCCCCCEE
ILGKGRRERVLPLWKETQCAIRAWLAVRPGRVGPELFLNRDGRRMTRDGFAYRLRQHVAT
EEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHH
AERSTPSIAAKQVTPHVLRHSCAMHTLQATGDIRKVALWLGHASIQTTEMYLRADPTEKL
HHCCCCCHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHCCHHEEEEEEEECCCHHHH
ALLEAHHAPLIQPGKFREPSDKLMQILAVAAQRA
HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCC
>Mature Secondary Structure 
TPLAPDLSAFLQTHLPHECGASRHTIAAYACAFTLLLRFVTGRVKRTPSELFIEDLDIQ
CCCCHHHHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCHH
TIRAFLEHIEEGRANSVRSRNARLAAIKSFFRFVEHRQPACLEQAMMIRAMPTKRTDAKL
HHHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHH
IDYLTKEEVRALLAAPNRHTPGGLRDRAMLHLTYAAGLRASELLAVRMDDFPDGSFSNVR
HHHHHHHHHHHHHHCCCCCCCCCCCCHHEEEEHHHCCCCHHHEEEEEECCCCCCCCCCEE
ILGKGRRERVLPLWKETQCAIRAWLAVRPGRVGPELFLNRDGRRMTRDGFAYRLRQHVAT
EEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHH
AERSTPSIAAKQVTPHVLRHSCAMHTLQATGDIRKVALWLGHASIQTTEMYLRADPTEKL
HHCCCCCHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHCCHHEEEEEEEECCCHHHH
ALLEAHHAPLIQPGKFREPSDKLMQILAVAAQRA
HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCC
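
The per-residue state strings above can be summarised as a composition. The sketch assumes the common convention H = helix, E = strand, C = coil, which this record does not define; `ss_composition` is an illustrative name.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split())
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy input; concatenate the state lines of one block to summarise it.
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```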

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9163424 [H]