Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
The map label for this gene is Not Available
Identifier: 16519921
GI number: 16519921
Start: 233129
End: 234127
Strand: Reverse
Name: Not Available
Synonym: NGR_a01840
Alternate gene names: 16519921
Gene position: 234127-233129 (Counterclockwise)
Preceding gene: 16519920
Following gene: 16519928
Centisome position: 43.67
GC content: 60.26
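The coordinate fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming coordinates are 1-based and inclusive and that the centisome value is the position of the gene's first coding base expressed as a percentage of the replicon length (the record does not state its formula). The function names are illustrative only.

```python
# Values copied from the record above.
REPLICON_LENGTH = 536_165        # pNGR234a length in bp
START, END = 233_129, 234_127    # assumed 1-based, inclusive coordinates


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end - start + 1


def centisome(position: int, replicon_length: int) -> float:
    """Assumed definition: position as a percentage of replicon length."""
    return position / replicon_length * 100


length = gene_length(START, END)
# For a reverse-strand gene the first coding base lies at End, not Start.
position = centisome(END, REPLICON_LENGTH)
print(length, round(position, 2))
```

Under these assumptions the reverse-strand start (234127) reproduces the listed centisome position, whereas using 233129 would give about 43.48.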
Gene sequence:
>999_bases ATGACCAAGCACGCCAGCCTCGCGCCACTCTTGGAGAGCTTTTTCCTCCAACGCCTGATGCAACAGCGGCAGGCAAGCCC CCATACGATCAGTTCCTATCGCGATACATTTCGGCAACTGCTGAAGTTCGCAGAACGAAGATTGCGCAAGCCGCCCTCTC GCCTGAACTTCGAGGAGATCGACGCGCCGCTGATCGTTGCCTTCCTTGATGACCTGGAGAACCGCCAGGGTATCAGCGTC CGCAGCCGCAACCTGCGCCTCACGGCGATTCATTCCTTCTTTCGCTATGCGGCCTTCGAGATACCCGAGCATTCCGCCCA AATCCAACGCGTGCTTGCGATTCCCAGCAAGCGCTTCACCCGAACCCTCGTTAATTTCCTGACCCGTCCGGAGGTCGATG CCTTGCTGGCCGCACCGGATCGATCGACCTGGTCCGGCCGCCGCGACCACGCGTTCCTCCTGGTCGCGGTGCAGACCGGA TTGCGCCTATCGGAGATCACCGGTCTCAAGCGGGATGATCTGTTCTTCGGCACGGGGGCCCACCTGCGCGTCATTGGTAA AGGGCGCAAGGAACGCTGCACCCCGTTCGCCAAGTCTACGACCGCCGTCTTGAGAAACTGGCTGAAAGAGCCGCAGCGCG GAGACCAAGGTATCCTGTTTCCCAGCGCCAGAGGTGAGCGGCTGAGCGTTCATGGCGTTCAGTATATGCTGAACAAACAC CGTCAGATCGCTTCTGCCATGAGCCCGTCGCTGGAGGGAAAGCGCGTCACTGTTCATCGTCTGAGGCACACGATGGCCAT GGACCTCCTACAGGCCGGCGTCGATCGCGCCGTCATCGCCCTGTGGCTCGGCCATGAATCGGTCGAGACGACACAAATCT ATCTCGAGGCGACGTTGGCGATGAAGGAGGCGGCGTTGGCAAAGACATCCCCATATTCCGGGAAGTCATCCCGATTCCGG CCCGACGACAATCTGCTGGCGTTCTTGAACAGCCTGTAG
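The GC content field can be recomputed from any nucleotide string such as the gene sequence above. A minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, and whitespace is stripped because the record breaks the sequence into space-separated chunks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100 * gc / len(cleaned)


# Toy input: the first ten bases of the gene sequence above.
print(gc_content("ATGACCAAGC"))
```

Pasting the full 999-base sequence into the call gives a value to compare against the listed GC content.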
Upstream 100 bases:
>100_bases GTCATGTCCACGTGGCCGATACCCAATGGTACCTCACCGGTTCACCCGAGCTGATGAAAGAAGCAATGCGCCGCCTTGAA CGTCGCTGGGAGGATCGGAC
Downstream 100 bases:
>100_bases ATCCCGCTGACTATGTCGTGTGGCTCGGCCGATCCTGTAGGAAATCTCGTTGTTCCTCAGACGCTTGCGTTTTCGGTACC CGACATAGTCGGGTGGGGTA
Product: DNA integration/recombination/inversion protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 332; Mature: 331
Protein sequence:
>332_residues MTKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEIDAPLIVAFLDDLENRQGISV RSRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFTRTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTG LRLSEITGLKRDDLFFGTGAHLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKH RQIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLAMKEAALAKTSPYSGKSSRFR PDDNLLAFLNSL
Sequences:
>Translated_332_residues MTKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEIDAPLIVAFLDDLENRQGISV RSRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFTRTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTG LRLSEITGLKRDDLFFGTGAHLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKH RQIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLAMKEAALAKTSPYSGKSSRFR PDDNLLAFLNSL >Mature_331_residues TKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEIDAPLIVAFLDDLENRQGISVR SRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFTRTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTGL RLSEITGLKRDDLFFGTGAHLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKHR QIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLAMKEAALAKTSPYSGKSSRFRP DDNLLAFLNSL
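The residue counts above follow from the gene length. A minimal sketch, assuming the 999-base gene includes its stop codon (the sequence ends in TAG) and that the mature form differs only by removal of the initiator methionine, as the Mature sequence suggests; `residue_counts` is a hypothetical helper.

```python
def residue_counts(gene_length_bp: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a gene of given length."""
    codons = gene_length_bp // 3   # includes the stop codon
    translated = codons - 1        # the stop codon encodes no residue
    mature = translated - 1        # assumption: initiator Met is removed
    return translated, mature


print(residue_counts(999))
```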
Specific function: Site-specific tyrosine recombinase, which acts by catalyzing the cutting and rejoining of the recombining DNA molecules. Binds cooperatively to specific DNA consensus sequences that are separated from XerC binding sites by a short central region, forming
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family
Homologues:
- Organism=Escherichia coli, GI1789261, Length=296, Percent_Identity=28.72, Blast_Score=89, Evalue=4e-19
- Organism=Escherichia coli, GI1790244, Length=283, Percent_Identity=28.27, Blast_Score=87, Evalue=2e-18
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y4RC_RHISN (P55636)
Other databases:
- EMBL: U00090
- RefSeq: NP_444041.1
- ProteinModelPortal: P55636
- GeneID: 962416
- GenomeReviews: U00090_GR
- KEGG: rhi:NGR_a01840
- HOGENOM: HBG727654
- ProtClustDB: CLSK893868
- InterPro: IPR011010
- InterPro: IPR013762
- InterPro: IPR002104
- InterPro: IPR010998
- InterPro: IPR023109
- InterPro: IPR004107
- Gene3D: G3DSA:1.10.150.130
- Gene3D: G3DSA:1.10.443.10
Pfam domain/function: PF02899 Phage_integr_N; PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz; SSF47823 L_intgrse_like_N
EC number: NA
Molecular weight: Translated: 37710; Mature: 37579
Theoretical pI: Translated: 11.26; Mature: 11.26
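The gap between the translated and mature molecular weights can be checked against the loss of one methionine. A minimal sketch, assuming the listed weights are average masses in daltons; the residue mass of about 131.19 Da for Met (C5H9NOS) is a standard value, not a figure from this record.

```python
MET_RESIDUE_MASS = 131.19   # average mass of a Met residue in Da (standard value)

translated_mw = 37710       # from the record above
mature_mw = 37579           # from the record above

difference = translated_mw - mature_mw
# True when the difference is within 1 Da of a single Met residue.
consistent = abs(difference - MET_RESIDUE_MASS) < 1.0
print(difference, consistent)
```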
Prosite motif: NA
Important sites: ACT_SITE 294-294
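The active-site position can be looked up in the translated protein sequence. A minimal sketch, assuming ACT_SITE uses 1-based numbering on the translated (332-residue) sequence; `residue_at` is a hypothetical helper, and the sequence is copied from the Protein sequence field.

```python
# Translated protein sequence, copied from the record in its 80-residue chunks.
PROTEIN = (
    "MTKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEIDAPLIVAFLDDLENRQGISV"
    "RSRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFTRTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTG"
    "LRLSEITGLKRDDLFFGTGAHLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKH"
    "RQIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLAMKEAALAKTSPYSGKSSRFR"
    "PDDNLLAFLNSL"
)


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position."""
    if not 1 <= position <= len(seq):
        raise IndexError(f"position {position} outside 1..{len(seq)}")
    return seq[position - 1]


print(residue_at(PROTEIN, 294))
```

With this numbering, position 294 is a tyrosine (Y), which agrees with the tyrosine recombinase annotation in the Specific function field.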
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
2.1 %Cys+Met (Mature Protein)
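The Cys/Met percentages can be recomputed by counting residues. A minimal sketch, assuming each percentage is the residue count over the sequence length, rounded to one decimal, and that the mature protein is the translated sequence without its first residue; `cys_met_percent` is a hypothetical helper.

```python
# Translated protein sequence, copied from the record in its 80-residue chunks.
PROTEIN = (
    "MTKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEIDAPLIVAFLDDLENRQGISV"
    "RSRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFTRTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTG"
    "LRLSEITGLKRDDLFFGTGAHLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKH"
    "RQIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLAMKEAALAKTSPYSGKSSRFR"
    "PDDNLLAFLNSL"
)


def cys_met_percent(seq: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return (
        round(100 * cys / n, 1),
        round(100 * met / n, 1),
        round(100 * (cys + met) / n, 1),
    )


translated = cys_met_percent(PROTEIN)
mature = cys_met_percent(PROTEIN[1:])   # assumption: initiator Met removed
print(translated, mature)
```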
Secondary structure:
>Translated Secondary Structure
MTKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEI
CCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHC
DAPLIVAFLDDLENRQGISVRSRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFT
CCHHHHHHHHHHCCCCCCCEECCCEEHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCHHHH
RTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTGLRLSEITGLKRDDLFFGTGA
HHHHHHHCCCCHHHEEECCCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCEEEECCC
HLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKH
EEEEEECCCHHHCCCHHHHHHHHHHHHHHCCCCCCCCEECCCCCCCEEEHHHHHHHHHHH
RQIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLA
HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCCCCHHHHHHHHHHH
MKEAALAKTSPYSGKSSRFRPDDNLLAFLNSL
HHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCC
>Mature Secondary Structure
TKHASLAPLLESFFLQRLMQQRQASPHTISSYRDTFRQLLKFAERRLRKPPSRLNFEEI
CCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHC
DAPLIVAFLDDLENRQGISVRSRNLRLTAIHSFFRYAAFEIPEHSAQIQRVLAIPSKRFT
CCHHHHHHHHHHCCCCCCCEECCCEEHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCHHHH
RTLVNFLTRPEVDALLAAPDRSTWSGRRDHAFLLVAVQTGLRLSEITGLKRDDLFFGTGA
HHHHHHHCCCCHHHEEECCCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCEEEECCC
HLRVIGKGRKERCTPFAKSTTAVLRNWLKEPQRGDQGILFPSARGERLSVHGVQYMLNKH
EEEEEECCCHHHCCCHHHHHHHHHHHHHHCCCCCCCCEECCCCCCCEEEHHHHHHHHHHH
RQIASAMSPSLEGKRVTVHRLRHTMAMDLLQAGVDRAVIALWLGHESVETTQIYLEATLA
HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCCCCHHHHHHHHHHH
MKEAALAKTSPYSGKSSRFRPDDNLLAFLNSL
HHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9163424