Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
Click here to switch to the map view.
The map label for this gene is Not Available
Identifier: 16519924
GI number: 16519924
Start: 230024
End: 231301
Strand: Direct
Name: Not Available
Synonym: NGR_a01810
Alternate gene names: 16519924
Gene position: 230024-231301 (Clockwise)
Preceding gene: 16519925
Following gene: 16519923
Centisome position: 42.9
GC content: 62.21
Gene sequence:
>1278_bases GTGCGGGATCCCCCCGCGCCGCTATTCCGGAGGGACAACATGCTCGAGTTCTATTTCTCGTATCGTGGGGTGCTCAAGCG CCTACGTAGCGGTGCGCTCGGTGGCGAGATGGATCGCCTCGCCGAGCATTTTCTTACGCTCGGTTATAAGCGAGCGTCCG CCAAGATTTACCTGAGCCGGGTTGCGCGTTTTAGCCAGTTTGCCGCGACGCGCTGTGGCTTGATGCCAATCCATCAAGAT GTCATCGACAGCTATTTGGGCACGTTTACTACAGATACTCCGCGGATCGGGGCCGTATCGGCTCTGGCACACGCACGGCG AGTGGCTCCGGAGCGGTTCATTATTCCCGTTCCAAGTGAAGAGGCTGACCCGGATGCTCTGCTTCTGGCCTCCTTTTCGG ACTATCTGCGCACGGTACGGGGCCTGGAGCCGAAGACCCGCGAAGGTATTCTTCTGGGCGGCCGCCGCTTTTTGGATTGG TTCCGCCATCGCCACCCCGGCCAAAATCTTGAGGCGTTGACGGCCGAGCACGTGCTCGCTGCTGTCGAGCATCGGCTGTC GCTATCGGCGACCTCCGGCACCCGCACGGCAGCGACCTCTCACATTCGAACATTTCTTCGGTTCTTATGTTGGGCTGGCC ACCATCGCCAAGATCTTGCCCGCATCGTCCCGAGGACGCCCTATTGGCGTTTGGCGCATCTGCCGCCGCGCCTTGCATGG GGTGACGTTCGGCGCGCAATCGATGCGATCGGCGCAACGACGCCGGTCGCTATCCGCGATCGAGCCGTCCTGCTGCTGCT CGCCACCACGGGCATTCGCAACGGCGAGTTACGCGCCATTCGGCTGCAGGATATCGACTGGCGTACTGGCGAGGTTTTTA TCCGGCGCACCAAGGGCAAGCGTGATCGGGTGGTGCCACTCCTTGAGGAGACCGGCGCCGCACTCGTCGACTACATCCTG CGCGCTCGACCGAAGGTGGACAGTCCGTATCTGTTCCTGTCGTTCACGCCGCCGGTGGGAGCGTTCAAGTCTGCGGCGCC TGTATCAAGGATCGTGAGGAAGCGATTGCGACATGGCGGGGTCGAACTCGGGCGGGTCGCAGGTGCACATCTCCTGCGCC ACAGCCTCGCCACCCAGCTTGTCGGGCAGCGAAGGCCAATCAACGAGGTCGCCGATCTTCTTGGTCACCGGAGCATCAAC ACAACGGCGCTGTACGTAAAGGTTGCGGCCTCGCAACTCGCCGAGGTCGCACTCCCCTTTCCGGGAGGCGCTGCATGA
Upstream 100 bases:
>100_bases TGTGCCGGGCGAGATTATGTGGCGGAGCCGTCCCAGCGCCAGCCGAGGCCCGCCTAATCTGGAATTCTCCGCCACATAAT ATTTCGTGCCTCTTGGATGA
Downstream 100 bases:
>100_bases CCGCCTTTGCCCGATTCCTCGGCGAGAAGGTCGAGCGTTACATCGACCTGCGTCACTCGCTCGGCTATGCCTTCAGCAAA CAAGCCGGCACGTTGCGGGC
Product: DNA integration/recombination/inversion protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 425; Mature: 425
Protein sequence:
>425_residues MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN TTALYVKVAASQLAEVALPFPGGAA
Sequences:
>Translated_425_residues MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN TTALYVKVAASQLAEVALPFPGGAA >Mature_425_residues MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSRVARFSQFAATRCGLMPIHQD VIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSEEADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDW FRHRHPGQNLEALTAEHVLAAVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGKRDRVVPLLEETGAALVDYIL RARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGGVELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSIN TTALYVKVAASQLAEVALPFPGGAA
Specific function: Site-Specific Tyrosine Recombinase, Which Acts By Catalyzing The Cutting And Rejoining Of The Recombining DNA Molecules. Binds Cooperatively To Specific DNA Consensus Sequences That Are Separated From Xerd Binding Sites By A Short Central Region, Forming
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family
Homologues:
Organism=Escherichia coli, GI1790244, Length=222, Percent_Identity=30.6306306306306, Blast_Score=79, Evalue=4e-16, Organism=Escherichia coli, GI1789261, Length=290, Percent_Identity=28.2758620689655, Blast_Score=79, Evalue=7e-16,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y4RF_RHISN (P55639)
Other databases:
- EMBL: U00090 - RefSeq: NP_444044.1 - ProteinModelPortal: P55639 - SMR: P55639 - GeneID: 962435 - GenomeReviews: U00090_GR - KEGG: rhi:NGR_a01810 - HOGENOM: HBG484411 - ProtClustDB: CLSK809016 - InterPro: IPR011010 - InterPro: IPR013762 - InterPro: IPR002104 - InterPro: IPR023109 - InterPro: IPR004107 - Gene3D: G3DSA:1.10.150.130 - Gene3D: G3DSA:1.10.443.10
Pfam domain/function: PF02899 Phage_integr_N; PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz
EC number: NA
Molecular weight: Translated: 47385; Mature: 47385
Theoretical pI: Translated: 11.51; Mature: 11.51
Prosite motif: NA
Important sites: ACT_SITE 405-405
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein) 0.9 %Met (Translated Protein) 1.4 %Cys+Met (Translated Protein) 0.5 %Cys (Mature Protein) 0.9 %Met (Mature Protein) 1.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSR CCCCCCCHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH VARFSQFAATRCGLMPIHQDVIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSE HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCC EADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDWFRHRHPGQNLEALTAEHVLA CCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHCCCCCCHHHHHHHHHHH AVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW HHHHHHHEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCEEECCCCCCCHH GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGK HHHHHHHHHHCCCCCEEECCCEEEEEEEECCCCCCCEEEEEEECCCCCCCCEEEEECCCC RDRVVPLLEETGAALVDYILRARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGG CCCCCHHHHHCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCHHCCHHHHHHHHHHHHHCC VELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSINTTALYVKVAASQLAEVALPF CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHCCCC PGGAA CCCCC >Mature Secondary Structure MRDPPAPLFRRDNMLEFYFSYRGVLKRLRSGALGGEMDRLAEHFLTLGYKRASAKIYLSR CCCCCCCHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH VARFSQFAATRCGLMPIHQDVIDSYLGTFTTDTPRIGAVSALAHARRVAPERFIIPVPSE HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCC EADPDALLLASFSDYLRTVRGLEPKTREGILLGGRRFLDWFRHRHPGQNLEALTAEHVLA CCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHCCCCCCHHHHHHHHHHH AVEHRLSLSATSGTRTAATSHIRTFLRFLCWAGHHRQDLARIVPRTPYWRLAHLPPRLAW HHHHHHHEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCEEECCCCCCCHH GDVRRAIDAIGATTPVAIRDRAVLLLLATTGIRNGELRAIRLQDIDWRTGEVFIRRTKGK HHHHHHHHHHCCCCCEEECCCEEEEEEEECCCCCCCEEEEEEECCCCCCCCEEEEECCCC RDRVVPLLEETGAALVDYILRARPKVDSPYLFLSFTPPVGAFKSAAPVSRIVRKRLRHGG CCCCCHHHHHCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCHHCCHHHHHHHHHHHHHCC VELGRVAGAHLLRHSLATQLVGQRRPINEVADLLGHRSINTTALYVKVAASQLAEVALPF CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHCCCC PGGAA CCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424