Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
Click here to switch to the map view.
The map label for this gene is Not Available
Identifier: 16519920
GI number: 16519920
Start: 234124
End: 235068
Strand: Reverse
Name: Not Available
Synonym: NGR_a01850
Alternate gene names: 16519920
Gene position: 235068-234124 (Counterclockwise)
Preceding gene: 16519919
Following gene: 16519921
Centisome position: 43.84
GC content: 60.11
Gene sequence:
>945_bases ATGAGCACATTCCGACAGGCTGTTCAGGAGTACATCGAGATGCGGCGAGGGCTGGGGTTCAAGCTGCGAGAGACAGAACG GGGATTGATCGATTTCGCCGCCTTCCTGGAGGCCAACGACACGCCACACATCACGACGGAACTGGCCCTTGCCTGGGCTC AGCGACCGTCGCGGGCGCAGCCTTCGCATTGGGCGACACGGCTGGGCTATGTCCGCGTATTCGCCCGTTATCGGGCCGCC GCCGATCCGCGAACTCAGATTCCTCCAAGCGGCTTGCTTCCCTTTCGCCCGAAGCGGGCTCGACCATATCTCTATTCGAA GGAAGACATCCAACGCCTCCTGTCGGCCGCTCTGGAGATGCCGTGTCGATATACCCGCTGCAAGCTCAGGCCATGGACAT ATTATTGCCTGTTCGGGCTGCTGAGCGTTTCCGGCTTGCGGCTCGGCGAGGCGCGCAACCTCAAGCTCGCGGACGTTGAT TTCGACGCTGCGGTGTTGACGATCCGCGGAACGAAGTTCGGAAAGTCCCGTCTTGTACCGATGCACGCATCGACATGCGC AGTGCTCCGCGATTATCTCAAACGCCGACGACAGCATTGTGCAGCCCAGGCGGCATCTCCCTATTTATTCACTTCGCAAC TGGGCAATCGCCTTGATGTCGGAGACATTCACCGAACATTCTATGCTCTGTCTCGCCAAATCGGCCTGCGCGGCGCAACT GACAGCCACGGTCCGCGGCTGCATGACATGCGGCATGTGTTCGCCACGAACACGCTGGTGCGCTGGTACGAAGCCGAGCA AGATCCCGAGCGGCTCCTGCCCATTCTGTCCACCTATCTCGGTCATGTCCACGTGGCCGATACCCAATGGTACCTCACCG GTTCACCCGAGCTGATGAAAGAAGCAATGCGCCGCCTTGAACGTCGCTGGGAGGATCGGACATGA
Upstream 100 bases:
>100_bases AAGTCCTGGGACACCGTCATGTGCAGACAACGGCAATTTACGCCAAGGTCGATCTCGACGCGTTGCGAACACTGGCTTTG CCATGGCCGGGAGAAGCCCA
Downstream 100 bases:
>100_bases CCAAGCACGCCAGCCTCGCGCCACTCTTGGAGAGCTTTTTCCTCCAACGCCTGATGCAACAGCGGCAGGCAAGCCCCCAT ACGATCAGTTCCTATCGCGA
Product: DNA integration/recombination/inversion protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 314; Mature: 313
Protein sequence:
>314_residues MSTFRQAVQEYIEMRRGLGFKLRETERGLIDFAAFLEANDTPHITTELALAWAQRPSRAQPSHWATRLGYVRVFARYRAA ADPRTQIPPSGLLPFRPKRARPYLYSKEDIQRLLSAALEMPCRYTRCKLRPWTYYCLFGLLSVSGLRLGEARNLKLADVD FDAAVLTIRGTKFGKSRLVPMHASTCAVLRDYLKRRRQHCAAQAASPYLFTSQLGNRLDVGDIHRTFYALSRQIGLRGAT DSHGPRLHDMRHVFATNTLVRWYEAEQDPERLLPILSTYLGHVHVADTQWYLTGSPELMKEAMRRLERRWEDRT
Sequences:
>Translated_314_residues MSTFRQAVQEYIEMRRGLGFKLRETERGLIDFAAFLEANDTPHITTELALAWAQRPSRAQPSHWATRLGYVRVFARYRAA ADPRTQIPPSGLLPFRPKRARPYLYSKEDIQRLLSAALEMPCRYTRCKLRPWTYYCLFGLLSVSGLRLGEARNLKLADVD FDAAVLTIRGTKFGKSRLVPMHASTCAVLRDYLKRRRQHCAAQAASPYLFTSQLGNRLDVGDIHRTFYALSRQIGLRGAT DSHGPRLHDMRHVFATNTLVRWYEAEQDPERLLPILSTYLGHVHVADTQWYLTGSPELMKEAMRRLERRWEDRT >Mature_313_residues STFRQAVQEYIEMRRGLGFKLRETERGLIDFAAFLEANDTPHITTELALAWAQRPSRAQPSHWATRLGYVRVFARYRAAA DPRTQIPPSGLLPFRPKRARPYLYSKEDIQRLLSAALEMPCRYTRCKLRPWTYYCLFGLLSVSGLRLGEARNLKLADVDF DAAVLTIRGTKFGKSRLVPMHASTCAVLRDYLKRRRQHCAAQAASPYLFTSQLGNRLDVGDIHRTFYALSRQIGLRGATD SHGPRLHDMRHVFATNTLVRWYEAEQDPERLLPILSTYLGHVHVADTQWYLTGSPELMKEAMRRLERRWEDRT
Specific function: Site-Specific Tyrosine Recombinase, Which Acts By Catalyzing The Cutting And Rejoining Of The Recombining DNA Molecules. Binds Cooperatively To Specific DNA Consensus Sequences That Are Separated From Xerc Binding Sites By A Short Central Region, Forming
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y4RB_RHISN (P55635)
Other databases:
- EMBL: U00090 - RefSeq: NP_444040.1 - ProteinModelPortal: P55635 - GeneID: 962532 - GenomeReviews: U00090_GR - KEGG: rhi:NGR_a01850 - HOGENOM: HBG373304 - ProtClustDB: CLSK809014 - InterPro: IPR011010 - InterPro: IPR013762 - InterPro: IPR002104 - Gene3D: G3DSA:1.10.443.10
Pfam domain/function: PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz
EC number: NA
Molecular weight: Translated: 36311; Mature: 36180
Theoretical pI: Translated: 10.39; Mature: 10.39
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.6 %Cys (Translated Protein) 2.2 %Met (Translated Protein) 3.8 %Cys+Met (Translated Protein) 1.6 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 3.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSTFRQAVQEYIEMRRGLGFKLRETERGLIDFAAFLEANDTPHITTELALAWAQRPSRAQ CCHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCC PSHWATRLGYVRVFARYRAAADPRTQIPPSGLLPFRPKRARPYLYSKEDIQRLLSAALEM CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC PCRYTRCKLRPWTYYCLFGLLSVSGLRLGEARNLKLADVDFDAAVLTIRGTKFGKSRLVP CCCHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCEEEEEEECCCCCCCCCCC MHASTCAVLRDYLKRRRQHCAAQAASPYLFTSQLGNRLDVGDIHRTFYALSRQIGLRGAT CCHHHHHHHHHHHHHHHHHHHHHHCCCEEEHHHCCCCCCHHHHHHHHHHHHHHHCCCCCC DSHGPRLHDMRHVFATNTLVRWYEAEQDPERLLPILSTYLGHVHVADTQWYLTGSPELMK CCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCEEEECCEEEECCCHHHHH EAMRRLERRWEDRT HHHHHHHHHHCCCC >Mature Secondary Structure STFRQAVQEYIEMRRGLGFKLRETERGLIDFAAFLEANDTPHITTELALAWAQRPSRAQ CHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCC PSHWATRLGYVRVFARYRAAADPRTQIPPSGLLPFRPKRARPYLYSKEDIQRLLSAALEM CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC PCRYTRCKLRPWTYYCLFGLLSVSGLRLGEARNLKLADVDFDAAVLTIRGTKFGKSRLVP CCCHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCEEEEEEECCCCCCCCCCC MHASTCAVLRDYLKRRRQHCAAQAASPYLFTSQLGNRLDVGDIHRTFYALSRQIGLRGAT CCHHHHHHHHHHHHHHHHHHHHHHCCCEEEHHHCCCCCCHHHHHHHHHHHHHHHCCCCCC DSHGPRLHDMRHVFATNTLVRWYEAEQDPERLLPILSTYLGHVHVADTQWYLTGSPELMK CCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCEEEECCEEEECCCHHHHH EAMRRLERRWEDRT HHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9163424