Definition: Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence.
Accession: NC_000914
Length: 536,165 bp

Map label: Not Available

Identifier: 16520013

GI number: 16520013

Start: 111961

End: 113952

Strand: Direct

Name: Not Available

Synonym: NGR_a00920

Alternate gene names: 16520013

Gene position: 111961-113952 (Clockwise)

Preceding gene: 16520014

Following gene: 16520012

Centisome position: 20.88
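
The centisome position appears to be the gene start expressed as a percentage of the replicon length. The record does not state the formula, so this definition is an assumption, and the function name is illustrative. A minimal sketch using the Start and Length values from this record:

```python
def centisome(start: int, replicon_length: int) -> float:
    """Gene start as a percentage of the replicon length (assumed definition)."""
    return round(100.0 * start / replicon_length, 2)


# Values taken from this record: Start 111961, Length 536,165.
position = centisome(111961, 536165)
print(position)
```

Under that assumption the result agrees with the value listed in the record.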

GC content: 59.39
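
GC content is conventionally the percentage of G and C bases in a sequence. The record does not say which span the figure was computed over, so the sketch below only shows the calculation itself. The function name and the short example input (the first 21 bases of the gene sequence) are chosen for illustration:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Illustrative input only: the first 21 bases of the gene sequence.
print(gc_content("ATGCTGGACATCGGCGTCATT"))
```

Because whitespace is stripped, a multi-line FASTA body can be passed in directly once its header line has been removed.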

Gene sequence:

>1992_bases
ATGCTGGACATCGGCGTCATTGGAAGACTGAAGTTTGCGACCGCGTTCATGGCCATGTCCTTACTACTGGTGCCGGCTGC
GGAGGCTCAAGAGCAGCCGGTCTGGCACTACGGACTGTCGCTCGTCGATGATCTCAAATATCCGCCCGGCTTCAAAAAGT
TCGACTATGTGAATCCCGAGGCACCTAAGGGCGGAGATCTCAGGCTTTCACAGACAGGCACCTTCGACACGTTCAATCCC
CTGCTGGTGAAGGGCGAAACGGCCGTCGGCCTGGATTTCGTCTTCGACACGCTCATGAAACCGTCAGAGGACGAGATTTC
GACGGCCTACGGGCTTCTGGCCGAAGGCGTTTCCTTCCCCGACGACATCTCCTCGGCCACGTTCCGGCTGAGGCAGGAAG
CGAAATGGGCCGACGGCAAGCCGGTGACGCCGGAAGACGTCGTCTTCAGCTTCGACAAGGCGAAGGAACTGAACCCGCTC
TATCAGAGCTACTATCGGCATGTCGTGAAGGCGGAAAAGACAGGGGATCGGGACGTCACCTTTCACTTCGACGACAAAAA
CAATCACGAACTTCCTCATATCCTCGGGCAGATCCGGATTGTGCCGAAGCACTGGTGGGAGGGCACTGGACCGGACGGCA
AGCCGCGCGACATTTCGCGAACGACGCTTGAACCGGTGATGGGGTCAGGTCCCTACCGGATCGCTTCGTTCGCCCCCGGC
GGGACCATTCGTTATGAGCGCCGGCCCGATTACTGGGGCGTCGCGCTCAACGTCAATGTCGGGCAGAACAATTTCGATTC
GATCACCTATTCCTTTTTTGGCGATCGCGACGTCGAGTTCGAGGCCTTCCGCTCCGGCAACACCGATTATTGGCGGGAGA
ACCAGGCGATGCGCTGGGCTACGGCCTTCGATTTTCCGGCGGTGAAGGATGGCCGCGTTAAACGCGAGGAAATTCCCAAC
CCCTTCCGGGCAACGGCCGTGATGCAGGCGATGGTGCCGAACATGCGCCGCAAGCCCTTCGACGACGAGAGGGTGCGCCA
GGCATTGAACTATGCGCTCGACTTCGAAGAACTGAACCGGACCATCTTCTACAATCAGTATCAGCGCGTGAACAGCTTCT
TCTTCGCCACAGAACTCGCCTCCTCCGGTCTGCCGGAAGGCAAGGAACTGAAGAACCTCAACGAGGTCAAGGACCTCGTG
CCGCCCGAAGTGTTCACCACCCCTTATAGCAACCCGGTCGGCGGCACGCCGCAAAAGGCGCGCGAAAACCTGCGCAAGGC
GATCGAGCTCCTGAACAAGGCGGGGTTCGAGCTCAACGGTAACCGGATGGTGAATACTGAGACGGGCAAGCCGTTCTCCT
TCGAGATCATGTTGAGCAGCCCGTCATTCGAGCGCGTCGCCTTGCCCTATGCGCAGAACCTTAAGCGGATCGGTATCGAA
GCGCGCGTGCGCACGGTGGACCCATCGCAATATACCAATCGCAAGCGTGCCTTCGACTACGATGTGACCTGGGAAGTTTG
GGGTCAGTCCTTGAGCCCCGGAAACGAACAGGCGGACTACTGGGGATCGGCAGCCGCCACACGCCAGGGCTCCAGGAATT
ATGCCGGCATCTCCGACCCTGGCGTCGACGCCTTGATTGAGCGCGTGATCTTCGCAAAGGATCGCGAAACGCTGGTTGCC
GCAACGAAGGCACTCGATCGCGTCCTGCTCGCCCATAATTACGTCATTCCGCTCTATTACAAGCTGGCCGCCCAAATCGC
CTATTGGGACGCGCTGGCCCGGCCGAAAGAGCTGCCGAAATACGGACTGGGCTTCCCCGAGGTGTGGTGGTCGAAGAGCG
CTGCCTGTCATTTGCCCGCGGGCGTTCGCTGTAGTTGCATTTGCGGGAGCTCTGGCGAAGCCGGGCGTCACCAGTTCACG
CCCGACCGTCGCCTCATCCGAAATGCGCAACTTCTGGACCAAATCTGCGCCGCCAGTGAATTTAAAGGATAG

Upstream 100 bases:

>100_bases
TAGGGACGATCGGCCACTGACATGCCGATCCTGCAACGACTTGGAGCCGCCCCACAAACACCGTGGCGGCGAATGGATAA
AAACAAGGAGAAAGGACGAG

Downstream 100 bases:

>100_bases
CCTGCTTTCGACGTCCTTTGCCAGCCCCGGGCGCTGAGCGTCATTCAGTGTCGCTCGGTCGCCGCGGTGCACCGTTCGTT
GGGGACCCAACACTGCAGCA

Product: ABC transporter substrate-binding protein

Products: ADP; phosphate; oligopeptides [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 663; Mature: 663
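
The residue count can be cross-checked against the gene coordinates. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon (the gene sequence above ends in TAG); the helper name is hypothetical:

```python
def expected_residues(start: int, end: int) -> int:
    """Residues encoded by an inclusive coordinate range with one stop codon."""
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("coding length is not a multiple of three")
    return n_bases // 3 - 1  # subtract the stop codon


# Coordinates taken from this record: Start 111961, End 113952.
n_bases = 113952 - 111961 + 1
residues = expected_residues(111961, 113952)
print(n_bases, residues)
```

Both numbers match the record: 1992 bases and 663 residues.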

Protein sequence:

>663_residues
MLDIGVIGRLKFATAFMAMSLLLVPAAEAQEQPVWHYGLSLVDDLKYPPGFKKFDYVNPEAPKGGDLRLSQTGTFDTFNP
LLVKGETAVGLDFVFDTLMKPSEDEISTAYGLLAEGVSFPDDISSATFRLRQEAKWADGKPVTPEDVVFSFDKAKELNPL
YQSYYRHVVKAEKTGDRDVTFHFDDKNNHELPHILGQIRIVPKHWWEGTGPDGKPRDISRTTLEPVMGSGPYRIASFAPG
GTIRYERRPDYWGVALNVNVGQNNFDSITYSFFGDRDVEFEAFRSGNTDYWRENQAMRWATAFDFPAVKDGRVKREEIPN
PFRATAVMQAMVPNMRRKPFDDERVRQALNYALDFEELNRTIFYNQYQRVNSFFFATELASSGLPEGKELKNLNEVKDLV
PPEVFTTPYSNPVGGTPQKARENLRKAIELLNKAGFELNGNRMVNTETGKPFSFEIMLSSPSFERVALPYAQNLKRIGIE
ARVRTVDPSQYTNRKRAFDYDVTWEVWGQSLSPGNEQADYWGSAAATRQGSRNYAGISDPGVDALIERVIFAKDRETLVA
ATKALDRVLLAHNYVIPLYYKLAAQIAYWDALARPKELPKYGLGFPEVWWSKSAACHLPAGVRCSCICGSSGEAGRHQFT
PDRRLIRNAQLLDQICAASEFKG

Sequences:

>Translated_663_residues
MLDIGVIGRLKFATAFMAMSLLLVPAAEAQEQPVWHYGLSLVDDLKYPPGFKKFDYVNPEAPKGGDLRLSQTGTFDTFNP
LLVKGETAVGLDFVFDTLMKPSEDEISTAYGLLAEGVSFPDDISSATFRLRQEAKWADGKPVTPEDVVFSFDKAKELNPL
YQSYYRHVVKAEKTGDRDVTFHFDDKNNHELPHILGQIRIVPKHWWEGTGPDGKPRDISRTTLEPVMGSGPYRIASFAPG
GTIRYERRPDYWGVALNVNVGQNNFDSITYSFFGDRDVEFEAFRSGNTDYWRENQAMRWATAFDFPAVKDGRVKREEIPN
PFRATAVMQAMVPNMRRKPFDDERVRQALNYALDFEELNRTIFYNQYQRVNSFFFATELASSGLPEGKELKNLNEVKDLV
PPEVFTTPYSNPVGGTPQKARENLRKAIELLNKAGFELNGNRMVNTETGKPFSFEIMLSSPSFERVALPYAQNLKRIGIE
ARVRTVDPSQYTNRKRAFDYDVTWEVWGQSLSPGNEQADYWGSAAATRQGSRNYAGISDPGVDALIERVIFAKDRETLVA
ATKALDRVLLAHNYVIPLYYKLAAQIAYWDALARPKELPKYGLGFPEVWWSKSAACHLPAGVRCSCICGSSGEAGRHQFT
PDRRLIRNAQLLDQICAASEFKG
>Mature_663_residues
MLDIGVIGRLKFATAFMAMSLLLVPAAEAQEQPVWHYGLSLVDDLKYPPGFKKFDYVNPEAPKGGDLRLSQTGTFDTFNP
LLVKGETAVGLDFVFDTLMKPSEDEISTAYGLLAEGVSFPDDISSATFRLRQEAKWADGKPVTPEDVVFSFDKAKELNPL
YQSYYRHVVKAEKTGDRDVTFHFDDKNNHELPHILGQIRIVPKHWWEGTGPDGKPRDISRTTLEPVMGSGPYRIASFAPG
GTIRYERRPDYWGVALNVNVGQNNFDSITYSFFGDRDVEFEAFRSGNTDYWRENQAMRWATAFDFPAVKDGRVKREEIPN
PFRATAVMQAMVPNMRRKPFDDERVRQALNYALDFEELNRTIFYNQYQRVNSFFFATELASSGLPEGKELKNLNEVKDLV
PPEVFTTPYSNPVGGTPQKARENLRKAIELLNKAGFELNGNRMVNTETGKPFSFEIMLSSPSFERVALPYAQNLKRIGIE
ARVRTVDPSQYTNRKRAFDYDVTWEVWGQSLSPGNEQADYWGSAAATRQGSRNYAGISDPGVDALIERVIFAKDRETLVA
ATKALDRVLLAHNYVIPLYYKLAAQIAYWDALARPKELPKYGLGFPEVWWSKSAACHLPAGVRCSCICGSSGEAGRHQFT
PDRRLIRNAQLLDQICAASEFKG

Specific function: Possible binding protein with either transport or enzymatic activity

COG id: COG4166

COG function: function code E; ABC-type oligopeptide transport system, periplasmic component

Gene ontology:

Cell location: Periplasm (Potential)

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the bacterial solute-binding protein 5 family

Homologues:

Organism=Escherichia coli, GI87082063, Length=611, Percent_Identity=34.860883797054, Blast_Score=350, Evalue=2e-97,
Organism=Escherichia coli, GI1789966, Length=454, Percent_Identity=23.7885462555066, Blast_Score=82, Evalue=1e-16,
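
Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch, assuming that layout holds for every entry; the function name and the output key `"GI"` are illustrative choices rather than part of the record format:

```python
def parse_homologue(line: str) -> dict:
    """Turn one homologue line into a dict with numeric fields converted."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", float), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI87082063, Length=611, "
    "Percent_Identity=34.860883797054, Blast_Score=350, Evalue=2e-97,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

The trailing comma on each entry is stripped before splitting, so it does not produce an empty field.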

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y4WM_RHISN (P55691)

Other databases:

- EMBL:   U00090
- RefSeq:   NP_444133.1
- ProteinModelPortal:   P55691
- GeneID:   962314
- GenomeReviews:   U00090_GR
- KEGG:   rhi:NGR_a00920
- HOGENOM:   HBG289570
- ProtClustDB:   CLSK800052
- InterPro:   IPR000914

Pfam domain/function: PF00496 SBP_bac_5

EC number: NA

Molecular weight: Translated: 74809; Mature: 74809

Theoretical pI: Translated: 6.62; Mature: 6.62

Prosite motif: PS01040 SBP_BACTERIAL_5

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
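
These percentages are residue counts divided by the protein length. A minimal sketch of that calculation, with a hypothetical function name and a toy input; rounding to one decimal place is assumed from the figures shown:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)


# Toy sequence for illustration only.
toy = "MCMA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

Because each figure is rounded separately, the combined Cys+Met value need not equal the sum of the two rounded values.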

Secondary structure:

>Translated Secondary Structure
MLDIGVIGRLKFATAFMAMSLLLVPAAEAQEQPVWHYGLSLVDDLKYPPGFKKFDYVNPE
CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC
APKGGDLRLSQTGTFDTFNPLLVKGETAVGLDFVFDTLMKPSEDEISTAYGLLAEGVSFP
CCCCCCEEEECCCCCCCCCCEEEECCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCC
DDISSATFRLRQEAKWADGKPVTPEDVVFSFDKAKELNPLYQSYYRHVVKAEKTGDRDVT
CHHHHHHHHHHHHCCCCCCCCCCHHHHEEECHHHHHCCHHHHHHHHHHHHHCCCCCCEEE
FHFDDKNNHELPHILGQIRIVPKHWWEGTGPDGKPRDISRTTLEPVMGSGPYRIASFAPG
EEECCCCCCCHHHHHHHEEEEEHHCCCCCCCCCCCCCCCHHHHHHHHCCCCEEEEEECCC
GTIRYERRPDYWGVALNVNVGQNNFDSITYSFFGDRDVEFEAFRSGNTDYWRENQAMRWA
CEEEECCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCCHHHCCCCCCHHHCCCCEEEE
TAFDFPAVKDGRVKREEIPNPFRATAVMQAMVPNMRRKPFDDERVRQALNYALDFEELNR
EECCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHCH
TIFYNQYQRVNSFFFATELASSGLPEGKELKNLNEVKDLVPPEVFTTPYSNPVGGTPQKA
HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHH
RENLRKAIELLNKAGFELNGNRMVNTETGKPFSFEIMLSSPSFERVALPYAQNLKRIGIE
HHHHHHHHHHHHHCCCEECCCEEEECCCCCCEEEEEEECCCCCCEEECCHHHHHHHCCCE
ARVRTVDPSQYTNRKRAFDYDVTWEVWGQSLSPGNEQADYWGSAAATRQGSRNYAGISDP
EEEEECCCHHHCCCCCCCCCCCCHHHCCCCCCCCCCCHHHCCCHHHHCCCCCCCCCCCCC
GVDALIERVIFAKDRETLVAATKALDRVLLAHNYVIPLYYKLAAQIAYWDALARPKELPK
CHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHCCH
YGLGFPEVWWSKSAACHLPAGVRCSCICGSSGEAGRHQFTPDRRLIRNAQLLDQICAASE
HCCCCCHHHHCCCCCEECCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC
FKG
CCC
>Mature Secondary Structure
MLDIGVIGRLKFATAFMAMSLLLVPAAEAQEQPVWHYGLSLVDDLKYPPGFKKFDYVNPE
CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC
APKGGDLRLSQTGTFDTFNPLLVKGETAVGLDFVFDTLMKPSEDEISTAYGLLAEGVSFP
CCCCCCEEEECCCCCCCCCCEEEECCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCC
DDISSATFRLRQEAKWADGKPVTPEDVVFSFDKAKELNPLYQSYYRHVVKAEKTGDRDVT
CHHHHHHHHHHHHCCCCCCCCCCHHHHEEECHHHHHCCHHHHHHHHHHHHHCCCCCCEEE
FHFDDKNNHELPHILGQIRIVPKHWWEGTGPDGKPRDISRTTLEPVMGSGPYRIASFAPG
EEECCCCCCCHHHHHHHEEEEEHHCCCCCCCCCCCCCCCHHHHHHHHCCCCEEEEEECCC
GTIRYERRPDYWGVALNVNVGQNNFDSITYSFFGDRDVEFEAFRSGNTDYWRENQAMRWA
CEEEECCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCCHHHCCCCCCHHHCCCCEEEE
TAFDFPAVKDGRVKREEIPNPFRATAVMQAMVPNMRRKPFDDERVRQALNYALDFEELNR
EECCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHCH
TIFYNQYQRVNSFFFATELASSGLPEGKELKNLNEVKDLVPPEVFTTPYSNPVGGTPQKA
HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHH
RENLRKAIELLNKAGFELNGNRMVNTETGKPFSFEIMLSSPSFERVALPYAQNLKRIGIE
HHHHHHHHHHHHHCCCEECCCEEEECCCCCCEEEEEEECCCCCCEEECCHHHHHHHCCCE
ARVRTVDPSQYTNRKRAFDYDVTWEVWGQSLSPGNEQADYWGSAAATRQGSRNYAGISDP
EEEEECCCHHHCCCCCCCCCCCCHHHCCCCCCCCCCCHHHCCCHHHHCCCCCCCCCCCCC
GVDALIERVIFAKDRETLVAATKALDRVLLAHNYVIPLYYKLAAQIAYWDALARPKELPK
CHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHCCH
YGLGFPEVWWSKSAACHLPAGVRCSCICGSSGEAGRHQFTPDRRLIRNAQLLDQICAASE
HCCCCCHHHHCCCCCEECCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC
FKG
CCC
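
The state strings above can be summarized as a composition. The sketch assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not define explicitly; the function name is illustrative:

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a predicted-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state symbols: {sorted(unknown)}")
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy input for illustration only.
print(ss_composition("HHEC"))
```

To apply it to this record, concatenate only the state lines (every second line of a block), not the residue lines.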

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: ATP; oligopeptides [Periplasm]; H2O [C]

Specific reaction: ATP + oligopeptides [Periplasm] + H2O = ADP + phosphate + oligopeptides [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: PubMed 9163424