Definition: Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence.
Accession: NC_000914
Length: 536,165 bp

The map label for this gene is Not Available

Identifier: 16519923

GI number: 16519923

Start: 231298

End: 232230

Strand: Direct

Name: Not Available

Synonym: NGR_a01820

Alternate gene names: 16519923

Gene position: 231298-232230 (Clockwise)

Preceding gene: 16519924

Following gene: 16519922

Centisome position: 43.14
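
The centisome value appears to be the gene start expressed as a percentage of the replicon length. The formula below is inferred from the numbers in this record rather than stated by it, and `centisome_position` is an illustrative name, so treat this as a sketch under that assumption:

```python
def centisome_position(start: int, replicon_length: int) -> float:
    """Return a start coordinate as a percentage of the replicon length.

    Assumption: centisome = 100 * start / replicon_length.
    """
    if replicon_length <= 0:
        raise ValueError("replicon_length must be positive")
    return 100.0 * start / replicon_length


# Values from this record: gene start 231298, plasmid length 536,165.
print(round(centisome_position(231298, 536165), 2))  # 43.14
```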

GC content: 62.27

Gene sequence:

>933_bases
ATGACCGCCTTTGCCCGATTCCTCGGCGAGAAGGTCGAGCGTTACATCGACCTGCGTCACTCGCTCGGCTATGCCTTCAG
CAAACAAGCCGGCACGTTGCGGGCTTTCGTCCGCTACGTTGAACGCGCTCAATTCGATGCGCCCGCCACCCGGACTATGG
CGCTGGACTTCGTCCTGTCGTTCGGCGGCGCCGCCAACAGCCGCGCCACTCGTCACGGCGTGCTCCGCCGATTCTACGAG
TATCTCGCTGTCTATGACGCCCAAACCGAAGCCCTGAAGCGCAGAGCCTTTCCTAGATCCAGGGCGATTCCGCCTCCACG
TATCCTCAGCGAGGCAGAGTTGGCGTCGCTCATCGACGCATGCGCCCGCATTTCGCCAGGCATCCCTCTCAGGGGGCTGA
CGATGGCGACGCTGATCGGACTGTTAGCAAGCTCGGGACTGCGGTCGGGCGAAGTGGTCAGGCTTGATCGTTCCGATGTC
GATCTGACCAACGGGGTTCTTCTGGTTCGGAAGACGAAGTTCCGCAAGGATCGTCTCGTTCCGGTTCACGCGACGACCCA
AACAGCGCTTTGTCGCTATGCCCGTGAGCGCGATGCCGCTTTTCCCAGTCCCAAGGACCAGGCCTTCTTCCTCAGCTCTC
GTGGCAACCGCCTCTCGGCGACTGGCTTGCAATGCGGATTTGCTCAGGTCCGCAAGTTCGCCGGCCTTGATGACGGCAAG
ACGTTGCGGCCGCACGATCTGCGGCACCGGTTTGCCGTGACCCGCATGAGCCTTTGGCATCAACAACGCGCCAACGTCCA
GGCGCTACTCCCGGTGCTCGCCACCTATCTCGGCCACGCCAATTACAGTGACACAGCCTACTACCTCACTGGCTCGGTGG
ATCTTCTCGCCATGGCGGCGGAGCGCGCCTTCCTCGATGGAGGCGCAGCATGA
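
The GC content field can be recomputed from the gene sequence above. A minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length (`gc_content` is an illustrative helper, not part of the record):

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks are ignored, so a wrapped FASTA body
    can be passed in directly (without its '>' header line).
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First nine bases of the gene sequence: 6 of 9 are G or C.
print(round(gc_content("ATGACCGCC"), 2))  # 66.67
```

Passing the complete 933-base sequence should give a value comparable to the GC content field.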

Upstream 100 bases:

>100_bases
ATCTTCTTGGTCACCGGAGCATCAACACAACGGCGCTGTACGTAAAGGTTGCGGCCTCGCAACTCGCCGAGGTCGCACTC
CCCTTTCCGGGAGGCGCTGC

Downstream 100 bases:

>100_bases
GCGAACACTTACTCCTGGCGCCGCTCCTGGAGTCCTACTTTCGCCGGCGCTTGACCAAACAACGCAACGCGACTCCCGCG
ACCATGGCCAGCTATCGCGA

Product: DNA integration/recombination/inversion protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 310; Mature: 309
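
The two residue counts follow from the gene coordinates. The sketch below assumes inclusive coordinates, one terminal stop codon, and that the mature form differs only by removal of the initiator methionine, which is consistent with the sequences listed in this record (`protein_lengths` is an illustrative name):

```python
def protein_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (gene length in bases, translated residues, mature residues)."""
    gene_length = end - start + 1          # coordinates are inclusive
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    translated = gene_length // 3 - 1      # the stop codon encodes no residue
    mature = translated - 1                # assumed: initiator Met removed
    return gene_length, translated, mature


# Start and End values from this record.
print(protein_lengths(231298, 232230))  # (933, 310, 309)
```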

Protein sequence:

>310_residues
MTAFARFLGEKVERYIDLRHSLGYAFSKQAGTLRAFVRYVERAQFDAPATRTMALDFVLSFGGAANSRATRHGVLRRFYE
YLAVYDAQTEALKRRAFPRSRAIPPPRILSEAELASLIDACARISPGIPLRGLTMATLIGLLASSGLRSGEVVRLDRSDV
DLTNGVLLVRKTKFRKDRLVPVHATTQTALCRYARERDAAFPSPKDQAFFLSSRGNRLSATGLQCGFAQVRKFAGLDDGK
TLRPHDLRHRFAVTRMSLWHQQRANVQALLPVLATYLGHANYSDTAYYLTGSVDLLAMAAERAFLDGGAA
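
The protein sequence is the conceptual translation of the gene sequence. A minimal sketch of that step, assuming the standard codon assignments (which the bacterial translation table shares) and a sequence that contains only A, C, G, and T; `translate` and `CODON_TABLE` are illustrative names:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence give the first seven residues.
print(translate("ATGACCGCCTTTGCCCGATTC"))  # MTAFARF
```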

Sequences:

>Mature_309_residues
TAFARFLGEKVERYIDLRHSLGYAFSKQAGTLRAFVRYVERAQFDAPATRTMALDFVLSFGGAANSRATRHGVLRRFYEY
LAVYDAQTEALKRRAFPRSRAIPPPRILSEAELASLIDACARISPGIPLRGLTMATLIGLLASSGLRSGEVVRLDRSDVD
LTNGVLLVRKTKFRKDRLVPVHATTQTALCRYARERDAAFPSPKDQAFFLSSRGNRLSATGLQCGFAQVRKFAGLDDGKT
LRPHDLRHRFAVTRMSLWHQQRANVQALLPVLATYLGHANYSDTAYYLTGSVDLLAMAAERAFLDGGAA

Specific function: Site-specific tyrosine recombinase, which acts by catalyzing the cutting and rejoining of the recombining DNA molecules. Binds cooperatively to specific DNA consensus sequences that are separated from XerD binding sites by a short central region, forming

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the 'phage' integrase family

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y4RE_RHISN (P55638)

Other databases:

- EMBL:   U00090
- RefSeq:   NP_444043.1
- ProteinModelPortal:   P55638
- GeneID:   962419
- GenomeReviews:   U00090_GR
- KEGG:   rhi:NGR_a01820
- HOGENOM:   HBG373304
- ProtClustDB:   CLSK809015
- InterPro:   IPR011010
- InterPro:   IPR013762
- InterPro:   IPR002104
- Gene3D:   G3DSA:1.10.443.10

Pfam domain/function: PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz

EC number: NA

Molecular weight: Translated: 34359; Mature: 34228

Theoretical pI: Translated: 10.77; Mature: 10.77

Prosite motif: NA

Important sites: ACT_SITE 282-282

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
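
These percentages can be recomputed from a protein sequence by counting residues. A minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place (`cys_met_content` is an illustrative helper):

```python
def cys_met_content(protein: str) -> dict[str, float]:
    """Return Cys, Met, and combined Cys+Met percentages for a protein."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        # The combined value is computed from the combined count,
        # not by adding the two rounded percentages.
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# First ten residues of the translated protein: one Met, no Cys.
print(cys_met_content("MTAFARFLGE"))
# {'Cys': 0.0, 'Met': 10.0, 'Cys+Met': 10.0}
```

Running it on the full translated and mature sequences should give values comparable to the table above.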

Secondary structure:

>Translated Secondary Structure
MTAFARFLGEKVERYIDLRHSLGYAFSKQAGTLRAFVRYVERAQFDAPATRTMALDFVLS
CCHHHHHHHHHHHHHHHHHHHHCHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH
FGGAANSRATRHGVLRRFYEYLAVYDAQTEALKRRAFPRSRAIPPPRILSEAELASLIDA
HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH
CARISPGIPLRGLTMATLIGLLASSGLRSGEVVRLDRSDVDLTNGVLLVRKTKFRKDRLV
HHHCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCEEEEEECCCCCCCEE
PVHATTQTALCRYARERDAAFPSPKDQAFFLSSRGNRLSATGLQCGFAQVRKFAGLDDGK
EEECHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCEEECCCHHHHHHHHHHHCCCCCCC
TLRPHDLRHRFAVTRMSLWHQQRANVQALLPVLATYLGHANYSDTAYYLTGSVDLLAMAA
CCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHH
ERAFLDGGAA
HHHHHCCCCC
>Mature Secondary Structure 
TAFARFLGEKVERYIDLRHSLGYAFSKQAGTLRAFVRYVERAQFDAPATRTMALDFVLS
CHHHHHHHHHHHHHHHHHHHHCHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH
FGGAANSRATRHGVLRRFYEYLAVYDAQTEALKRRAFPRSRAIPPPRILSEAELASLIDA
HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH
CARISPGIPLRGLTMATLIGLLASSGLRSGEVVRLDRSDVDLTNGVLLVRKTKFRKDRLV
HHHCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCEEEEEECCCCCCCEE
PVHATTQTALCRYARERDAAFPSPKDQAFFLSSRGNRLSATGLQCGFAQVRKFAGLDDGK
EEECHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCEEECCCHHHHHHHHHHHCCCCCCC
TLRPHDLRHRFAVTRMSLWHQQRANVQALLPVLATYLGHANYSDTAYYLTGSVDLLAMAA
CCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHH
ERAFLDGGAA
HHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9163424