Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
The map label for this gene is Not Available
Identifier: 16519919
GI number: 16519919
Start: 235065
End: 236294
Strand: Reverse
Name: Not Available
Synonym: NGR_a01860
Alternate gene names: 16519919
Gene position: 236294-235065 (Counterclockwise)
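Because the gene lies on the reverse strand, the listed gene sequence corresponds to the reverse complement of plasmid positions 235065 to 236294. A minimal sketch of that extraction follows, assuming 1-based inclusive coordinates; the function names and the toy replicon string are illustrative and not part of this record.

```python
# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(replicon: str, start: int, end: int, strand: str) -> str:
    """Extract a gene from a replicon using 1-based inclusive coordinates.

    `strand` follows the record's wording ("Forward" or "Reverse").
    """
    region = replicon[start - 1:end]
    if strand.lower() == "reverse":
        return reverse_complement(region)
    return region


# Toy replicon (hypothetical), not the pNGR234a sequence.
toy = "AAACCCGGGTTT"
print(extract_gene(toy, 2, 5, "Reverse"))  # GGTT

# Gene length implied by the coordinates in this record.
print(236294 - 235065 + 1)  # 1230
```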
Preceding gene: 16519912
Following gene: 16519920
Centisome position: 44.07
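The centisome value is consistent with the first coordinate of the gene position (236294) expressed as a percentage of the plasmid length (536,165). The sketch below reproduces the listed figure under that assumption; how the database itself computes the value is not stated in this record.

```python
def centisome(position: int, replicon_length: int) -> float:
    """Position on the replicon as a percentage of its length, to 2 decimals."""
    return round(100.0 * position / replicon_length, 2)


# Coordinates taken from this record; the formula itself is an assumption.
print(centisome(236294, 536165))  # 44.07
```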
GC content: 58.37
Gene sequence:
>1230_bases ATGAAGTATTTCGTCAATGCCGATTTTGCGCTGTCGCGGCCGCCAGAGGGCCCGGTGGCGATTTACATTATTCCTTTTGC CGAATGGCTCGTTGATCGAGGCTACGGCCTTGTTTCTACAAGGAACCAGGTGCTGATGGCTGCCGGCTTCAGCAGTTGGC TTCGGCAAAAGGGGATTGGACTCAGCGACATAAATGGAGAGCATGCTGGGCGTTATTTGCTCGACCGAGTGCAGCGCCCA AAGCTTGGAGATGACGCCGCTCTTCGACATCTATTGGCTTTTCTTCGAAGTCAAAACGCGATCGCCGAAGAGATTGAGGT CGATCACAACCCGTCAGCGGTAGAACAACATGTGCAGGCATATGAGCGGCATCTGCGAGACGCCCGTGCCCTGTCGCGTC AAACGATCATAAATTACCGACCTGTTGTCCGGGATTTTCTCAGCTTCCGCTTCGGTGATGGCGAGATCTCGCTCGCACAA TTGCGCGCCGCCGACGTGACCGATTTCGTGCAAAAGAAGGTATCGCGCCTCAATATGCGACGCGCAAAGATTGTGACCAC GGCACTGCGGTCATTTCTCTCCTATGCGCGTTATCGGGGGGACATCACGTCGGACCTCGCGGCCGCGGTCCCGATCGTGG CTAATTGGTCGCTCTCATCCATTCCTCGTGCAATCGGCCGCGATGACGTGAGCCGATTGCTCTCCAGCATCGATCGGGAT ACGCCCATCGGATGTCGCGATTATGCGATGATCCTCGCATTGGCGCGACTGGGATTGCGGTCGAGCGAGGTGGTGACGCT CGAGCTCGACGATATTGACTGGGTAGCCGGACGGATCCGGGTGCGCGGTAAACACGGACGTAACGAACTTCCGTTGCCGG CGGACGTTGGCGAGGCGATTGCCGACTACCTGTGGAGGGCGCGTCCGCGCAATGCCAGTCGCCGTGTTTTTCTACGCGAC AAGGCCCCGATCCGAGGCTTCGTGGGCCCGAGCGGACTCGGGTCAATTGTCAGACGCTCACTCAAGAGGACCGGCATCGA CTCTCCAACAAAGGGAACGCACCAATTCCGACATGGGCTTGCCTCGGAGATGCTGCGTGGCGGCGCGTCGCTGGGTGAGA TCGGCGAAGTCCTGGGACACCGTCATGTGCAGACAACGGCAATTTACGCCAAGGTCGATCTCGACGCGTTGCGAACACTG GCTTTGCCATGGCCGGGAGAAGCCCAATGA
Upstream 100 bases:
>100_bases GTGTAACTAGATTTCGATCAAAGATTTCAATGCTTTATAGATAGCGTGTAGAGATAGTTCGTAGCCCTTCCAGCAATCCA GCTTGGGAAGGGAGCGAAGA
Downstream 100 bases:
>100_bases GCACATTCCGACAGGCTGTTCAGGAGTACATCGAGATGCGGCGAGGGCTGGGGTTCAAGCTGCGAGAGACAGAACGGGGA TTGATCGATTTCGCCGCCTT
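The GC content field can be recomputed from any of the sequences above. A minimal sketch, assuming the value is the percentage of G and C bases over the whole sequence; the function name is illustrative, and the spaces that separate the sequence chunks in this record are removed before counting.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, to 2 decimals.

    Whitespace (such as the chunk separators in this record) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)


# Toy inputs (hypothetical), not the gene sequence itself.
print(gc_content("ATGC"))     # 50.0
print(gc_content("GGCC AT"))  # 66.67
```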
Product: DNA integration/recombination/inversion protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 409; Mature: 409
Protein sequence:
>409_residues MKYFVNADFALSRPPEGPVAIYIIPFAEWLVDRGYGLVSTRNQVLMAAGFSSWLRQKGIGLSDINGEHAGRYLLDRVQRP KLGDDAALRHLLAFLRSQNAIAEEIEVDHNPSAVEQHVQAYERHLRDARALSRQTIINYRPVVRDFLSFRFGDGEISLAQ LRAADVTDFVQKKVSRLNMRRAKIVTTALRSFLSYARYRGDITSDLAAAVPIVANWSLSSIPRAIGRDDVSRLLSSIDRD TPIGCRDYAMILALARLGLRSSEVVTLELDDIDWVAGRIRVRGKHGRNELPLPADVGEAIADYLWRARPRNASRRVFLRD KAPIRGFVGPSGLGSIVRRSLKRTGIDSPTKGTHQFRHGLASEMLRGGASLGEIGEVLGHRHVQTTAIYAKVDLDALRTL ALPWPGEAQ
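The 1230-base gene sequence corresponds to 410 codons: 409 amino acids plus the terminal stop codon (TGA). The sketch below translates DNA with the standard codon assignments, which the bacterial genetic code shares for all sense codons; the helper names are illustrative, and only the first 27 bases of the gene are used as input.

```python
from itertools import product

# Standard codon table, built in TCAG order; '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the gene sequence listed in this record.
print(translate("ATGAAGTATTTCGTCAATGCCGATTTT"))  # MKYFVNADF

# Codons in 1230 bases, minus the stop codon.
print(1230 // 3 - 1)  # 409
```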
Sequences:
>Translated_409_residues MKYFVNADFALSRPPEGPVAIYIIPFAEWLVDRGYGLVSTRNQVLMAAGFSSWLRQKGIGLSDINGEHAGRYLLDRVQRP KLGDDAALRHLLAFLRSQNAIAEEIEVDHNPSAVEQHVQAYERHLRDARALSRQTIINYRPVVRDFLSFRFGDGEISLAQ LRAADVTDFVQKKVSRLNMRRAKIVTTALRSFLSYARYRGDITSDLAAAVPIVANWSLSSIPRAIGRDDVSRLLSSIDRD TPIGCRDYAMILALARLGLRSSEVVTLELDDIDWVAGRIRVRGKHGRNELPLPADVGEAIADYLWRARPRNASRRVFLRD KAPIRGFVGPSGLGSIVRRSLKRTGIDSPTKGTHQFRHGLASEMLRGGASLGEIGEVLGHRHVQTTAIYAKVDLDALRTL ALPWPGEAQ >Mature_409_residues MKYFVNADFALSRPPEGPVAIYIIPFAEWLVDRGYGLVSTRNQVLMAAGFSSWLRQKGIGLSDINGEHAGRYLLDRVQRP KLGDDAALRHLLAFLRSQNAIAEEIEVDHNPSAVEQHVQAYERHLRDARALSRQTIINYRPVVRDFLSFRFGDGEISLAQ LRAADVTDFVQKKVSRLNMRRAKIVTTALRSFLSYARYRGDITSDLAAAVPIVANWSLSSIPRAIGRDDVSRLLSSIDRD TPIGCRDYAMILALARLGLRSSEVVTLELDDIDWVAGRIRVRGKHGRNELPLPADVGEAIADYLWRARPRNASRRVFLRD KAPIRGFVGPSGLGSIVRRSLKRTGIDSPTKGTHQFRHGLASEMLRGGASLGEIGEVLGHRHVQTTAIYAKVDLDALRTL ALPWPGEAQ
Specific function: Site-specific tyrosine recombinase, which acts by catalyzing the cutting and rejoining of the recombining DNA molecules. Binds cooperatively to specific DNA consensus sequences that are separated from XerD binding sites by a short central region, forming
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family
Homologues:
Organism=Escherichia coli, GI1790244, Length=288, Percent_Identity=28.82, Blast_Score=88, Evalue=1e-18
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y4RA_RHISN (P55634)
Other databases:
- EMBL: U00090
- RefSeq: NP_444039.1
- ProteinModelPortal: P55634
- SMR: P55634
- GeneID: 962535
- GenomeReviews: U00090_GR
- KEGG: rhi:NGR_a01860
- HOGENOM: HBG484411
- ProtClustDB: CLSK516123
- InterPro: IPR011010
- InterPro: IPR013762
- InterPro: IPR002104
- InterPro: IPR023109
- InterPro: IPR004107
- Gene3D: G3DSA:1.10.150.130
- Gene3D: G3DSA:1.10.443.10
Pfam domain/function: PF02899 Phage_integr_N; PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz
EC number: NA
Molecular weight: Translated: 45529; Mature: 45529
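A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below follows that approach; the mass table and function name are assumptions rather than the database's documented method, so the result for the 409-residue protein may differ slightly from the listed 45529.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528


def molecular_weight(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified protein chain."""
    residues = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in residues) + WATER_MASS


# Toy peptide (hypothetical): glycylglycine.
print(round(molecular_weight("GG"), 2))  # 132.12
```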
Theoretical pI: Translated: 10.50; Mature: 10.50
Prosite motif: NA
Important sites: ACT_SITE 389-389
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
1.2 %Met (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
1.2 %Met (Mature Protein)
1.5 %Cys+Met (Mature Protein)
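These percentages follow from residue counts over the 409-residue protein, which contains 1 Cys and 5 Met. A minimal sketch of the calculation, with an illustrative function name and a toy sequence:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given residue types in a protein, to 1 decimal."""
    seq = "".join(protein.split()).upper()
    hits = sum(seq.count(aa) for aa in set(residues.upper()))
    return round(100.0 * hits / len(seq), 1)


# Toy sequence (hypothetical): 1 Met and 1 Cys in 10 residues.
print(residue_percent("MCAAAAAAAA", "C"))   # 10.0
print(residue_percent("MCAAAAAAAA", "CM"))  # 20.0

# Counts for the protein in this record: 1 Cys and 5 Met in 409 residues.
print(round(100 * 1 / 409, 1), round(100 * 5 / 409, 1), round(100 * 6 / 409, 1))
# 0.2 1.2 1.5
```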
Secondary structure:
>Translated Secondary Structure
MKYFVNADFALSRPPEGPVAIYIIPFAEWLVDRGYGLVSTRNQVLMAAGFSSWLRQKGIG
CCEEECCCCEECCCCCCCEEEEEECHHHHHHHCCCCEEECCCCEEEEHHHHHHHHHCCCC
LSDINGEHAGRYLLDRVQRPKLGDDAALRHLLAFLRSQNAIAEEIEVDHNPSAVEQHVQA
CCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHH
YERHLRDARALSRQTIINYRPVVRDFLSFRFGDGEISLAQLRAADVTDFVQKKVSRLNMR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHH
RAKIVTTALRSFLSYARYRGDITSDLAAAVPIVANWSLSSIPRAIGRDDVSRLLSSIDRD
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCHHCCCCHHHHHHHCCHHHHHHHHHHCCCC
TPIGCRDYAMILALARLGLRSSEVVTLELDDIDWVAGRIRVRGKHGRNELPLPADVGEAI
CCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCEEECEEEEECCCCCCCCCCCHHHHHHH
ADYLWRARPRNASRRVFLRDKAPIRGFVGPSGLGSIVRRSLKRTGIDSPTKGTHQFRHGL
HHHHHHCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHH
ASEMLRGGASLGEIGEVLGHRHVQTTAIYAKVDLDALRTLALPWPGEAQ
HHHHHHCCCCHHHHHHHHCCCCHHHEEEEEEECHHHHHHHCCCCCCCCC
>Mature Secondary Structure
MKYFVNADFALSRPPEGPVAIYIIPFAEWLVDRGYGLVSTRNQVLMAAGFSSWLRQKGIG
CCEEECCCCEECCCCCCCEEEEEECHHHHHHHCCCCEEECCCCEEEEHHHHHHHHHCCCC
LSDINGEHAGRYLLDRVQRPKLGDDAALRHLLAFLRSQNAIAEEIEVDHNPSAVEQHVQA
CCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHH
YERHLRDARALSRQTIINYRPVVRDFLSFRFGDGEISLAQLRAADVTDFVQKKVSRLNMR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHH
RAKIVTTALRSFLSYARYRGDITSDLAAAVPIVANWSLSSIPRAIGRDDVSRLLSSIDRD
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCHHCCCCHHHHHHHCCHHHHHHHHHHCCCC
TPIGCRDYAMILALARLGLRSSEVVTLELDDIDWVAGRIRVRGKHGRNELPLPADVGEAI
CCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCEEECEEEEECCCCCCCCCCCHHHHHHH
ADYLWRARPRNASRRVFLRDKAPIRGFVGPSGLGSIVRRSLKRTGIDSPTKGTHQFRHGL
HHHHHHCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHH
ASEMLRGGASLGEIGEVLGHRHVQTTAIYAKVDLDALRTLALPWPGEAQ
HHHHHHCCCCHHHHHHHHCCCCHHHEEEEEEECHHHHHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424