Definition: Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence.
Accession: NC_000914
Length: 536,165

The map label for this gene is Not Available

Identifier: 16519924

GI number: 16519924

Start: 230024

End: 231301

Strand: Direct

Name: Not Available

Synonym: NGR_a01810

Alternate gene names: 16519924

Gene position: 230024-231301 (Clockwise)

Preceding gene: 16519925

Following gene: 16519923

Centisome position: 42.9
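
The centisome position appears to be the gene start expressed as a percentage of the replicon length. That definition is an assumption, but it reproduces the value listed in this record. A minimal Python sketch using the coordinates above (the function and constant names are illustrative, not part of the database):

```python
REPLICON_LENGTH = 536165  # length of pNGR234a, from this record
GENE_START = 230024       # 1-based start coordinate, from this record
GENE_END = 231301         # 1-based, inclusive end coordinate, from this record


def centisome(start: int, replicon_length: int) -> float:
    """Gene start as a percentage of replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


print(round(centisome(GENE_START, REPLICON_LENGTH), 1))  # 42.9
print(gene_length(GENE_START, GENE_END))                 # 1278
```

The computed length of 1278 bases agrees with the header of the gene sequence block.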

GC content: 62.21
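
GC content is conventionally the percentage of G and C bases in a nucleotide sequence; it is assumed here that the value above was computed that way over the gene sequence. A minimal sketch, with an illustrative function name and a toy input taken from the first 10 bases of the gene:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input: the first 10 bases of the gene sequence in this record.
print(gc_content("GTGCGGGATC"))  # 70.0
```

Passing the full 1278-base sequence to the same function would give the figure for the whole gene.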

Gene sequence:

>1278_bases
GTGCGGGATCCCCCCGCGCCGCTATTCCGGAGGGACAACATGCTCGAGTTCTATTTCTCGTATCGTGGGGTGCTCAAGCG
CCTACGTAGCGGTGCGCTCGGTGGCGAGATGGATCGCCTCGCCGAGCATTTTCTTACGCTCGGTTATAAGCGAGCGTCCG
CCAAGATTTACCTGAGCCGGGTTGCGCGTTTTAGCCAGTTTGCCGCGACGCGCTGTGGCTTGATGCCAATCCATCAAGAT
GTCATCGACAGCTATTTGGGCACGTTTACTACAGATACTCCGCGGATCGGGGCCGTATCGGCTCTGGCACACGCACGGCG
AGTGGCTCCGGAGCGGTTCATTATTCCCGTTCCAAGTGAAGAGGCTGACCCGGATGCTCTGCTTCTGGCCTCCTTTTCGG
ACTATCTGCGCACGGTACGGGGCCTGGAGCCGAAGACCCGCGAAGGTATTCTTCTGGGCGGCCGCCGCTTTTTGGATTGG
TTCCGCCATCGCCACCCCGGCCAAAATCTTGAGGCGTTGACGGCCGAGCACGTGCTCGCTGCTGTCGAGCATCGGCTGTC
GCTATCGGCGACCTCCGGCACCCGCACGGCAGCGACCTCTCACATTCGAACATTTCTTCGGTTCTTATGTTGGGCTGGCC
ACCATCGCCAAGATCTTGCCCGCATCGTCCCGAGGACGCCCTATTGGCGTTTGGCGCATCTGCCGCCGCGCCTTGCATGG
GGTGACGTTCGGCGCGCAATCGATGCGATCGGCGCAACGACGCCGGTCGCTATCCGCGATCGAGCCGTCCTGCTGCTGCT
CGCCACCACGGGCATTCGCAACGGCGAGTTACGCGCCATTCGGCTGCAGGATATCGACTGGCGTACTGGCGAGGTTTTTA
TCCGGCGCACCAAGGGCAAGCGTGATCGGGTGGTGCCACTCCTTGAGGAGACCGGCGCCGCACTCGTCGACTACATCCTG
CGCGCTCGACCGAAGGTGGACAGTCCGTATCTGTTCCTGTCGTTCACGCCGCCGGTGGGAGCGTTCAAGTCTGCGGCGCC
TGTATCAAGGATCGTGAGGAAGCGATTGCGACATGGCGGGGTCGAACTCGGGCGGGTCGCAGGTGCACATCTCCTGCGCC
ACAGCCTCGCCACCCAGCTTGTCGGGCAGCGAAGGCCAATCAACGAGGTCGCCGATCTTCTTGGTCACCGGAGCATCAAC
ACAACGGCGCTGTACGTAAAGGTTGCGGCCTCGCAACTCGCCGAGGTCGCACTCCCCTTTCCGGGAGGCGCTGCATGA

Upstream 100 bases:

>100_bases
TGTGCCGGGCGAGATTATGTGGCGGAGCCGTCCCAGCGCCAGCCGAGGCCCGCCTAATCTGGAATTCTCCGCCACATAAT
ATTTCGTGCCTCTTGGATGA

Downstream 100 bases:

>100_bases
CCGCCTTTGCCCGATTCCTCGGCGAGAAGGTCGAGCGTTACATCGACCTGCGTCACTCGCTCGGCTATGCCTTCAGCAAA
CAAGCCGGCACGTTGCGGGC

Product: DNA integration/recombination/inversion protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 425; Mature: 425

Protein sequence:

>425_residues
MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD
VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW
FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW
GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL
RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN
TTALYVKVAASQLAEVALPFPGGAA
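
The protein sequence corresponds to a translation of the gene sequence with the standard genetic code, where the GTG initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. The helper name is hypothetical, and treating the first codon as Met regardless of its identity is a simplifying assumption.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon (GTG here) is read as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCGGGATCCCCCCGCGCCGCTATTCCGG"))  # MRDPPAPLFR
```

The length is consistent as well: 1278 bases give 426 codons, and removing the terminal TGA stop codon leaves 425 residues.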

Sequences:

>Translated_425_residues
MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD
VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW
FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW
GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL
RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN
TTALYVKVAASQLAEVALPFPGGAA
>Mature_425_residues
MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD
VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW
FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW
GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL
RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN
TTALYVKVAASQLAEVALPFPGGAA

Specific function: Site-specific tyrosine recombinase, which acts by catalyzing the cutting and rejoining of the recombining DNA molecules. Binds cooperatively to specific DNA consensus sequences that are separated from XerD binding sites by a short central region, forming

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the 'phage' integrase family

Homologues:

Organism=Escherichia coli, GI1790244, Length=222, Percent_Identity=30.63, Blast_Score=79, Evalue=4e-16
Organism=Escherichia coli, GI1789261, Length=290, Percent_Identity=28.28, Blast_Score=79, Evalue=7e-16

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y4RF_RHISN (P55639)

Other databases:

- EMBL:   U00090
- RefSeq:   NP_444044.1
- ProteinModelPortal:   P55639
- SMR:   P55639
- GeneID:   962435
- GenomeReviews:   U00090_GR
- KEGG:   rhi:NGR_a01810
- HOGENOM:   HBG484411
- ProtClustDB:   CLSK809016
- InterPro:   IPR011010
- InterPro:   IPR013762
- InterPro:   IPR002104
- InterPro:   IPR023109
- InterPro:   IPR004107
- Gene3D:   G3DSA:1.10.150.130
- Gene3D:   G3DSA:1.10.443.10

Pfam domain/function: PF02899 Phage_integr_N; PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz

EC number: NA

Molecular weight: Translated: 47385; Mature: 47385

Theoretical pI: Translated: 11.51; Mature: 11.51

Prosite motif: NA

Important sites: ACT_SITE 405-405

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
0.9 %Met     (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.4 %Cys+Met (Mature Protein)
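
The percentages above can be reproduced by counting residues in the 425-residue protein sequence from this record. A minimal sketch, assuming the values are plain residue counts divided by sequence length (the function name is illustrative):

```python
# Protein sequence copied from this record (425 residues).
PROTEIN = (
    "MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD"
    "VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW"
    "FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW"
    "GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL"
    "RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN"
    "TTALYVKVAASQLAEVALPFPGGAA"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{residue_percent(PROTEIN, residues):.1f} %{label}")
```

With 2 Cys and 4 Met in 425 residues, this prints 0.5, 0.9 and 1.4, matching the listed values.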

Secondary structure:

>Translated Secondary Structure
MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSR
CCCCCCCHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
VARFSQFAATRCGLMPIHQDVIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSE
HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCC
EADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDWFRHRHPGQNLEALTAEHVLA
CCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHCCCCCCHHHHHHHHHHH
AVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW
HHHHHHHEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCEEECCCCCCCHH
GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGK
HHHHHHHHHHCCCCCEEECCCEEEEEEEECCCCCCCEEEEEEECCCCCCCCEEEEECCCC
RDRVVPLLEETGAALVDYILRARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGG
CCCCCHHHHHCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCHHCCHHHHHHHHHHHHHCC
VELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSINTTALYVKVAASQLAEVALPF
CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHCCCC
PGGAA
CCCCC
>Mature Secondary Structure
MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSR
CCCCCCCHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
VARFSQFAATRCGLMPIHQDVIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSE
HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCC
EADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDWFRHRHPGQNLEALTAEHVLA
CCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHCCCCCCHHHHHHHHHHH
AVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW
HHHHHHHEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCEEECCCCCCCHH
GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGK
HHHHHHHHHHCCCCCEEECCCEEEEEEEECCCCCCCEEEEEEECCCCCCCCEEEEECCCC
RDRVVPLLEETGAALVDYILRARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGG
CCCCCHHHHHCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCHHCCHHHHHHHHHHHHHCC
VELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSINTTALYVKVAASQLAEVALPF
CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHCCCC
PGGAA
CCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9163424