| Definition | Rhodopseudomonas palustris HaA2, complete genome. |
|---|---|
| Accession | NC_007778 |
| Length | 5,331,656 |
The map label for this gene is yliB [C]
Identifier: 86747259
GI number: 86747259
Start: 142384
End: 143982
Strand: Reverse
Name: yliB [C]
Synonym: RPB_0132
Alternate gene names: 86747259
Gene position: 143982-142384 (Counterclockwise)
Preceding gene: 86747260
Following gene: 86747258
Centisome position: 2.7
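The centisome value can be reproduced from the fields above as the gene coordinate expressed as a percentage of the genome length. The sketch below assumes that definition; the record does not say which coordinate the database uses, and the function name `centisome` is only illustrative. For this gene both coordinates round to the same value.

```python
# Values copied from the Length, Start and End fields of this record.
GENOME_LENGTH = 5_331_656
GENE_START = 142_384
GENE_END = 143_982


def centisome(position: int, genome_length: int) -> float:
    """Return a coordinate as a percentage of the genome length.

    Assumption: this is the definition behind the "Centisome position" field.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * position / genome_length


# Either end of the gene gives the same value at one decimal place.
print(round(centisome(GENE_START, GENOME_LENGTH), 1))  # 2.7
print(round(centisome(GENE_END, GENOME_LENGTH), 1))    # 2.7
```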
GC content: 64.85
Gene sequence:
>1599_bases ATGAAGTTGACGAAGCGGTCCTTTGTTATCGGCTCCCTCGGAGGCCTGGCGATGCTCGGCCTGCCGGCCGATCTGCGCGC GCAGCAGGCCGGCGGCGGCACGTTGGTGATCGGCTCGACGCAGGTGCCGCGGCACTTCAACGGCGCGGTGCAGTCGGGCA TCGCCACCGCGCTGCCGAGCACGCAGATTTTCGCCAGCCCGCTGCGCTTCGACGAGAACTGGAATCCGCAGCCGTATCTC GCCAAATCCTGGGAGGTCGCGCCCGACGGTCTGTCGATTACCCTGAAGCTGGTCGACGACGCGGTGTTTCACGACGGCAA GCCGGTGACCTCCGAGGACGTCGCGTTCTCGATCATGACGATCAAGGCCAACCACCCGTTCAAGACCATGCTGGCCGCGG TCGACAAGGTGGAGACGCCCGATCCGAAGACCGCGGTGATCAAGCTGGCGCATCCGCATCCGGCGCTGCTGCTGGCGATG TCGCCGGCGCTGATGCCGATCCTGCCGAAGCACGTCTACGGCGACGGCCAGGACGTCAAGGCGCATCCGGCCAACCTCAA GCCGATCGGTTCCGGCCCGTACAAGCTCGCCGAATACAAGCAGGGCGAGTACTACACGCTGGAGAAGTTCGACAAATTCT TCATCCCGGGCCGTCCGAAGCTCGACAAGATCGTGGTGCGGCTGATCTCGGATCCGAACGCGCTGATGGTCTCGGCCGAG CGCGGCGAGGTCCACGCCGTGCCGTTCGTCACCGGCGTGCGCGACATCGACCGGCTGGAGAAGTCGAAGAACCTCAAGGT CGTCGACAAGGGCTTCGCCGGTCTCGGCGCGCTGAACTGGCTCGCCTTCAACACCAAGAAGAAGCCGCTCGACGACGTCC GCGTCCGCCAGGCGATCGCCTACGCGGCCAATCGCGACTTCATCGTCAACAAGCTGATGGGCGGCAAGGCGATGCCGTCG ACTGGGCCGATCGCGCCGGGCTCGCCGTTCGAGGAGAAGAACGTCCAGCTCTACAAGTTCGACGTCGCCAAGGCCAAAAA GCTGCTCGACGAGGCCGGCCTCAAGCCGGACGGCAACGGCGTCCGCGCCACGCTGACGATCGACTACCTCCCGGGCAGCG ACGAGCAGCAGCGCAACGTCGCCGAATACATGCGCTCGGCGCTGAAGCGTGTCGGCCTGAACCTCGAAGTCCGCGCCGCG CCCGACTTCCCGACCTGGGCGCAGCGGGTCTCGAACTTCGACTTCGATCTGACCATGGACTCGGTCTACAATTGGGCCGA TCCGGTGATCGGCGTCGACCGGACCTATCTGACCTCGAATATCCGCAAGGGCATCATCTGGTCGAACACGCAGCAATATT CCAACCCGAAGGTCGACGAGATCCTCGGCAAGGCCGCTGTGGAGACCTCGGCGGAGAAGCGCAAGGCGCTTTATTCGGAG TTCCAGAAGATCGTCGTCGACGAGGTGCCGGTGTTCTTCATCAACGCCGTGCCGTTCCACAACGCCTTCGCCAACGGCCT CGGCGGGCTGCCGACCACGATCTGGGGCGTCGTCTCGCCGCTCGACGAAGTGCACTGGGTCACGCCGCCGAAGACCTGA
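A GC content figure such as the one listed for this gene is conventionally the share of G and C bases in the sequence. The following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, not part of the database, and it strips the spaces that separate the 80-base chunks shown above.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between 80-base chunks) is ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Toy inputs; paste the full gene sequence to check the record's value.
print(gc_content("ATGC"))      # 50.0
print(gc_content("GGCC GGCC"))  # 100.0
```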
Upstream 100 bases:
>100_bases CGGAGCCTCGTCATGAGCGCCGGTCTCGTCGTTTTGCCGATACGCACGCCGTGCGCATCGATCATCCTCGCCCTCGAACC TTCACAGCGGAGTTGAATCC
Downstream 100 bases:
>100_bases ACCCAACCGGGTAGACAGACAGCACCATGACGCTCCTCACCCATCTGTTCGGAAAACTCGCCAACGCCGCGGCGCTGCTG CTCGCGGTGCTGGTGCTGAA
Product: extracellular solute-binding protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 532; Mature: 532
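The residue count is consistent with the coordinates given earlier: the gene spans 1599 bases, which is 533 codons, and the final codon (TGA in the gene sequence) is a stop codon that is not translated. A short sketch of that arithmetic, with `residues_from_coordinates` as an illustrative name:

```python
def residues_from_coordinates(start: int, end: int) -> int:
    """Return the number of amino acids encoded between two inclusive coordinates.

    Assumes the span is a complete coding sequence ending in one stop codon.
    """
    gene_length = end - start + 1  # coordinates are inclusive
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    codons = gene_length // 3
    return codons - 1  # the stop codon does not add a residue


# Start and End fields of this record.
print(residues_from_coordinates(142_384, 143_982))  # 532
```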
Protein sequence:
>532_residues MKLTKRSFVIGSLGGLAMLGLPADLRAQQAGGGTLVIGSTQVPRHFNGAVQSGIATALPSTQIFASPLRFDENWNPQPYL AKSWEVAPDGLSITLKLVDDAVFHDGKPVTSEDVAFSIMTIKANHPFKTMLAAVDKVETPDPKTAVIKLAHPHPALLLAM SPALMPILPKHVYGDGQDVKAHPANLKPIGSGPYKLAEYKQGEYYTLEKFDKFFIPGRPKLDKIVVRLISDPNALMVSAE RGEVHAVPFVTGVRDIDRLEKSKNLKVVDKGFAGLGALNWLAFNTKKKPLDDVRVRQAIAYAANRDFIVNKLMGGKAMPS TGPIAPGSPFEEKNVQLYKFDVAKAKKLLDEAGLKPDGNGVRATLTIDYLPGSDEQQRNVAEYMRSALKRVGLNLEVRAA PDFPTWAQRVSNFDFDLTMDSVYNWADPVIGVDRTYLTSNIRKGIIWSNTQQYSNPKVDEILGKAAVETSAEKRKALYSE FQKIVVDEVPVFFINAVPFHNAFANGLGGLPTTIWGVVSPLDEVHWVTPPKT
Sequences:
>Translated_532_residues MKLTKRSFVIGSLGGLAMLGLPADLRAQQAGGGTLVIGSTQVPRHFNGAVQSGIATALPSTQIFASPLRFDENWNPQPYL AKSWEVAPDGLSITLKLVDDAVFHDGKPVTSEDVAFSIMTIKANHPFKTMLAAVDKVETPDPKTAVIKLAHPHPALLLAM SPALMPILPKHVYGDGQDVKAHPANLKPIGSGPYKLAEYKQGEYYTLEKFDKFFIPGRPKLDKIVVRLISDPNALMVSAE RGEVHAVPFVTGVRDIDRLEKSKNLKVVDKGFAGLGALNWLAFNTKKKPLDDVRVRQAIAYAANRDFIVNKLMGGKAMPS TGPIAPGSPFEEKNVQLYKFDVAKAKKLLDEAGLKPDGNGVRATLTIDYLPGSDEQQRNVAEYMRSALKRVGLNLEVRAA PDFPTWAQRVSNFDFDLTMDSVYNWADPVIGVDRTYLTSNIRKGIIWSNTQQYSNPKVDEILGKAAVETSAEKRKALYSE FQKIVVDEVPVFFINAVPFHNAFANGLGGLPTTIWGVVSPLDEVHWVTPPKT >Mature_532_residues MKLTKRSFVIGSLGGLAMLGLPADLRAQQAGGGTLVIGSTQVPRHFNGAVQSGIATALPSTQIFASPLRFDENWNPQPYL AKSWEVAPDGLSITLKLVDDAVFHDGKPVTSEDVAFSIMTIKANHPFKTMLAAVDKVETPDPKTAVIKLAHPHPALLLAM SPALMPILPKHVYGDGQDVKAHPANLKPIGSGPYKLAEYKQGEYYTLEKFDKFFIPGRPKLDKIVVRLISDPNALMVSAE RGEVHAVPFVTGVRDIDRLEKSKNLKVVDKGFAGLGALNWLAFNTKKKPLDDVRVRQAIAYAANRDFIVNKLMGGKAMPS TGPIAPGSPFEEKNVQLYKFDVAKAKKLLDEAGLKPDGNGVRATLTIDYLPGSDEQQRNVAEYMRSALKRVGLNLEVRAA PDFPTWAQRVSNFDFDLTMDSVYNWADPVIGVDRTYLTSNIRKGIIWSNTQQYSNPKVDEILGKAAVETSAEKRKALYSE FQKIVVDEVPVFFINAVPFHNAFANGLGGLPTTIWGVVSPLDEVHWVTPPKT
Specific function: Probably part of an ABC transporter complex that could be involved in peptide import [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Periplasm (Probable) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the bacterial solute-binding protein 5 family [H]
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Escherichia coli | 1787052 | 523 | 25.8126195028681 | 133 | 3e-32 |
| Escherichia coli | 1789966 | 512 | 25.1953125 | 129 | 7e-31 |
| Escherichia coli | 1787762 | 429 | 25.4079254079254 | 112 | 8e-26 |
| Escherichia coli | 1789887 | 470 | 23.8297872340426 | 100 | 2e-22 |
| Escherichia coli | 1787551 | 540 | 22.4074074074074 | 96 | 8e-21 |
| Escherichia coli | 1787495 | 536 | 24.8134328358209 | 92 | 1e-19 |
| Escherichia coli | 87081878 | 512 | 24.4140625 | 86 | 6e-18 |
| Escherichia coli | 87082063 | 399 | 23.8095238095238 | 74 | 3e-14 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000914 [H]
Pfam domain/function: PF00496 SBP_bac_5 [H]
EC number: NA
Molecular weight: Translated: 58264; Mature: 58264
Theoretical pI: Translated: 9.62; Mature: 9.62
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.0 %Cys (Translated Protein)
- 2.1 %Met (Translated Protein)
- 2.1 %Cys+Met (Translated Protein)
- 0.0 %Cys (Mature Protein)
- 2.1 %Met (Mature Protein)
- 2.1 %Cys+Met (Mature Protein)
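Percentages like these can be obtained by counting the one-letter codes C and M in the protein sequence and dividing by its length. The sketch below assumes that is how the figures were derived and that they are rounded to one decimal place; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Return the percentage of the given one-letter residues in a protein.

    Whitespace in the sequence is ignored; the result is rounded to one decimal.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("protein sequence is empty")
    hits = sum(seq.count(code) for code in residues.upper())
    return round(100.0 * hits / len(seq), 1)


# First ten residues of this protein, used only as a small worked input.
fragment = "MKLTKRSFVI"
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 10.0
```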
Secondary structure:
>Translated Secondary Structure
MKLTKRSFVIGSLGGLAMLGLPADLRAQQAGGGTLVIGSTQVPRHFNGAVQSGIATALPS
CCCCCCEEEEECCCCCEEECCCHHHHHHCCCCCEEEECCCCCCCHHCHHHHHCHHHHCCC
TQIFASPLRFDENWNPQPYLAKSWEVAPDGLSITLKLVDDAVFHDGKPVTSEDVAFSIMT
HHHHHCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEHHHHHCCCCCCCCCCEEEEEEE
IKANHPFKTMLAAVDKVETPDPKTAVIKLAHPHPALLLAMSPALMPILPKHVYGDGQDVK
EECCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEECCHHHHHCHHHHCCCCCCCC
AHPANLKPIGSGPYKLAEYKQGEYYTLEKFDKFFIPGRPKLDKIVVRLISDPNALMVSAE
CCCCCCCCCCCCCCCHHHCCCCCEEEHHHHHHCCCCCCCCHHHHHHHHHCCCCEEEEEEC
RGEVHAVPFVTGVRDIDRLEKSKNLKVVDKGFAGLGALNWLAFNTKKKPLDDVRVRQAIA
CCCEEEEEHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHEEEECCCCCCHHHHHHHHHHH
YAANRDFIVNKLMGGKAMPSTGPIAPGSPFEEKNVQLYKFDVAKAKKLLDEAGLKPDGNG
HHCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEECHHHHHHHHHHCCCCCCCCC
VRATLTIDYLPGSDEQQRNVAEYMRSALKRVGLNLEVRAAPDFPTWAQRVSNFDFDLTMD
EEEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCEEEHH
SVYNWADPVIGVDRTYLTSNIRKGIIWSNTQQYSNPKVDEILGKAAVETSAEKRKALYSE
HHHCCCCCCCCCCHHHHHHHHHCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
FQKIVVDEVPVFFINAVPFHNAFANGLGGLPTTIWGVVSPLDEVHWVTPPKT
HHHHHHHCCCEEEEECCCCHHHHHHCCCCCHHHHHHHHCCHHHCCCCCCCCC
>Mature Secondary Structure
MKLTKRSFVIGSLGGLAMLGLPADLRAQQAGGGTLVIGSTQVPRHFNGAVQSGIATALPS
CCCCCCEEEEECCCCCEEECCCHHHHHHCCCCCEEEECCCCCCCHHCHHHHHCHHHHCCC
TQIFASPLRFDENWNPQPYLAKSWEVAPDGLSITLKLVDDAVFHDGKPVTSEDVAFSIMT
HHHHHCCCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEHHHHHCCCCCCCCCCEEEEEEE
IKANHPFKTMLAAVDKVETPDPKTAVIKLAHPHPALLLAMSPALMPILPKHVYGDGQDVK
EECCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEECCHHHHHCHHHHCCCCCCCC
AHPANLKPIGSGPYKLAEYKQGEYYTLEKFDKFFIPGRPKLDKIVVRLISDPNALMVSAE
CCCCCCCCCCCCCCCHHHCCCCCEEEHHHHHHCCCCCCCCHHHHHHHHHCCCCEEEEEEC
RGEVHAVPFVTGVRDIDRLEKSKNLKVVDKGFAGLGALNWLAFNTKKKPLDDVRVRQAIA
CCCEEEEEHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHEEEECCCCCCHHHHHHHHHHH
YAANRDFIVNKLMGGKAMPSTGPIAPGSPFEEKNVQLYKFDVAKAKKLLDEAGLKPDGNG
HHCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEECHHHHHHHHHHCCCCCCCCC
VRATLTIDYLPGSDEQQRNVAEYMRSALKRVGLNLEVRAAPDFPTWAQRVSNFDFDLTMD
EEEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCEEEHH
SVYNWADPVIGVDRTYLTSNIRKGIIWSNTQQYSNPKVDEILGKAAVETSAEKRKALYSE
HHHCCCCCCCCCCHHHHHHHHHCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
FQKIVVDEVPVFFINAVPFHNAFANGLGGLPTTIWGVVSPLDEVHWVTPPKT
HHHHHHHCCCEEEEECCCCHHHHHHCCCCCHHHHHHHHCCHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA