Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
Click here to switch to the map view.
The map label for this gene is Not Available
Identifier: 16519679
GI number: 16519679
Start: 6654
End: 7571
Strand: Reverse
Name: Not Available
Synonym: NGR_a00070
Alternate gene names: 16519679
Gene position: 7571-6654 (Counterclockwise)
Preceding gene: 16519672
Following gene: NA
Centisome position: 1.41
GC content: 65.9
Gene sequence:
>918_bases ATGGCAACGGCTGCATACCTCAATTCTCCTTCCCTGCAGCAAAGGCTGATCGGATACGCCCGTGTTTCGACCGAGGATCA GCTCAACGACGCTCAGGTCGACGAATTGCGGGCGGCTGGTTGCCACCGCATCCACCAGGAGCACGGATCCGGCGCATCAC GCGCGCGGCCGGTGCTTGCGAAGCTTCTTAAAGACCTTGCCATGGGCGATGTCCTCGTCGTCGTTCGCCTCGACCGTCTG GCCCGATCGGTCAGCCACCTGCTCGACGTCATCGAAGACCTCGAGAAGCGCGGCGTCCATTTCCGCTCGCTGCGTGATCC GATCGATACCTCGACGCCGCACGGAATGTTTTCCCTGCAGGTGCTCGGCGCCGTCGCCCAGCTCGAGCGCGCGCTGATCG CGGAGCGGACCAAGTCCGGTATGCAGGCCGCCAAGGCGCGCGGCCGGCTTGCCGGCAATCCCGGGCTTCGAGAACGCCGG CCAGAAGCCATCCGTGCGGTCTCAGCGGCGCGCGAGCGGGCCTACCTCGATGAACTGATTGTGTCAGCGCAGACCTGGCT GCCGACAGTCCGGCGACTGCGCCCGCGACACAGTTGGGACAATGTCGTGCGGATCCTCAATCGCAGGGGGCACGACTGGA CCGTCGAACGGTTGCGGCGGGCGGTCCACCGGCTAGTGCGCGAAAAGCTCGCGGAACCGGAACTGCTTGCCCGATCGCTG CGCCGGCCGCCCGAGGATCATCTGATGCGGCTGGTTGCCGGGATCGCCATCGCCGATCCCAATCTGTCGCTGCGCGACAT TGCCGCCCAGTTGGACCAGATGCAGGAGCGACCGCCACGTGGCGGCCGCAAATGGCAACCGTCTTCCGTCCGAGCACTAC TGGACGAGGCGAGTCGCATCGGGCTGGTTCGCGCTTGA
Upstream 100 bases:
>100_bases GTACGGCCTTCCATTTTGAACATAAAACACGCTTGCAAAGGAGCCTTTTCTCACACATGTTTCATTTGTACAAACCGCCC CGAAACGACCGAAAAATTCG
Downstream 100 bases:
>100_bases GAGGCTTGGTTCATGAATTCAAGAGTGTTCACGCGGCGTCGAAAGACGGGTGCGGCACTGTTTCCGGTCGCCGCATGCTT CCAAATGGAAAAAGACCGAG
Product: DNA integration/recombination/inversion protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 305; Mature: 304
Protein sequence:
>305_residues MATAAYLNSPSLQQRLIGYARVSTEDQLNDAQVDELRAAGCHRIHQEHGSGASRARPVLAKLLKDLAMGDVLVVVRLDRL ARSVSHLLDVIEDLEKRGVHFRSLRDPIDTSTPHGMFSLQVLGAVAQLERALIAERTKSGMQAAKARGRLAGNPGLRERR PEAIRAVSAARERAYLDELIVSAQTWLPTVRRLRPRHSWDNVVRILNRRGHDWTVERLRRAVHRLVREKLAEPELLARSL RRPPEDHLMRLVAGIAIADPNLSLRDIAAQLDQMQERPPRGGRKWQPSSVRALLDEASRIGLVRA
Sequences:
>Translated_305_residues MATAAYLNSPSLQQRLIGYARVSTEDQLNDAQVDELRAAGCHRIHQEHGSGASRARPVLAKLLKDLAMGDVLVVVRLDRL ARSVSHLLDVIEDLEKRGVHFRSLRDPIDTSTPHGMFSLQVLGAVAQLERALIAERTKSGMQAAKARGRLAGNPGLRERR PEAIRAVSAARERAYLDELIVSAQTWLPTVRRLRPRHSWDNVVRILNRRGHDWTVERLRRAVHRLVREKLAEPELLARSL RRPPEDHLMRLVAGIAIADPNLSLRDIAAQLDQMQERPPRGGRKWQPSSVRALLDEASRIGLVRA >Mature_304_residues ATAAYLNSPSLQQRLIGYARVSTEDQLNDAQVDELRAAGCHRIHQEHGSGASRARPVLAKLLKDLAMGDVLVVVRLDRLA RSVSHLLDVIEDLEKRGVHFRSLRDPIDTSTPHGMFSLQVLGAVAQLERALIAERTKSGMQAAKARGRLAGNPGLRERRP EAIRAVSAARERAYLDELIVSAQTWLPTVRRLRPRHSWDNVVRILNRRGHDWTVERLRRAVHRLVREKLAEPELLARSLR RPPEDHLMRLVAGIAIADPNLSLRDIAAQLDQMQERPPRGGRKWQPSSVRALLDEASRIGLVRA
Specific function: This Protein Catalyzes The Inversion Of An 1800-Bp E.Coli DNA Fragment, The P Region, Which Can Exist In Either Orientation. The Function Of The Inversion Is Not Yet Clear. [C]
COG id: COG1961
COG function: function code L; Site-specific recombinases, DNA invertase Pin homologs
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the site-specific recombinase resolvase family
Homologues:
Organism=Escherichia coli, GI1787404, Length=147, Percent_Identity=52.3809523809524, Blast_Score=152, Evalue=3e-38, Organism=Escherichia coli, GI1787638, Length=183, Percent_Identity=35.5191256830601, Blast_Score=91, Evalue=9e-20, Organism=Escherichia coli, GI1787827, Length=183, Percent_Identity=35.5191256830601, Blast_Score=91, Evalue=1e-19,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y4CG_RHISN (P55389)
Other databases:
- EMBL: U00090 - RefSeq: NP_443799.1 - ProteinModelPortal: P55389 - SMR: P55389 - GeneID: 962300 - GenomeReviews: U00090_GR - KEGG: rhi:NGR_a00070 - HOGENOM: HBG294912 - ProtClustDB: CLSK861953 - InterPro: IPR006118 - InterPro: IPR006119 - Gene3D: G3DSA:3.40.50.1390 - SMART: SM00857
Pfam domain/function: PF00239 Resolvase; SSF53041 Resolv_N
EC number: NA
Molecular weight: Translated: 34278; Mature: 34147
Theoretical pI: Translated: 11.60; Mature: 11.60
Prosite motif: PS00397 RECOMBINASES_1; PS00398 RECOMBINASES_2
Important sites: ACT_SITE 23-23
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein) 2.0 %Met (Translated Protein) 2.3 %Cys+Met (Translated Protein) 0.3 %Cys (Mature Protein) 1.6 %Met (Mature Protein) 2.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MATAAYLNSPSLQQRLIGYARVSTEDQLNDAQVDELRAAGCHRIHQEHGSGASRARPVLA CCCCCCCCCHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH KLLKDLAMGDVLVVVRLDRLARSVSHLLDVIEDLEKRGVHFRSLRDPIDTSTPHGMFSLQ HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCCCHHHHH VLGAVAQLERALIAERTKSGMQAAKARGRLAGNPGLRERRPEAIRAVSAARERAYLDELI HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH VSAQTWLPTVRRLRPRHSWDNVVRILNRRGHDWTVERLRRAVHRLVREKLAEPELLARSL HHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHH RRPPEDHLMRLVAGIAIADPNLSLRDIAAQLDQMQERPPRGGRKWQPSSVRALLDEASRI CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHC GLVRA CCCCC >Mature Secondary Structure ATAAYLNSPSLQQRLIGYARVSTEDQLNDAQVDELRAAGCHRIHQEHGSGASRARPVLA CCCCCCCCHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH KLLKDLAMGDVLVVVRLDRLARSVSHLLDVIEDLEKRGVHFRSLRDPIDTSTPHGMFSLQ HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCCCHHHHH VLGAVAQLERALIAERTKSGMQAAKARGRLAGNPGLRERRPEAIRAVSAARERAYLDELI HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH VSAQTWLPTVRRLRPRHSWDNVVRILNRRGHDWTVERLRRAVHRLVREKLAEPELLARSL HHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHH RRPPEDHLMRLVAGIAIADPNLSLRDIAAQLDQMQERPPRGGRKWQPSSVRALLDEASRI CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHC GLVRA CCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9163424