Definition | Pelobacter propionicus DSM 2379 chromosome, complete genome. |
---|---|
Accession | NC_008609 |
Length | 4,008,000 |
The map label for this gene is 118578955
Identifier: 118578955
GI number: 118578955
Start: 542371
End: 544392
Strand: Direct
Name: 118578955
Synonym: Ppro_0516
Alternate gene names: NA
Gene position: 542371-544392 (Clockwise)
Preceding gene: 118578950
Following gene: 118578956
Centisome position: 13.53
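The centisome value can be reproduced from the fields above. The sketch below assumes it is the gene start coordinate expressed as a percentage of the chromosome length; the record does not state the formula, and the function name `centisome_position` is hypothetical.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return the gene start as a percentage of the chromosome length.

    Assumption: centisome = start / genome_length * 100, rounded to two
    decimals. The record itself does not state the formula.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(start / genome_length * 100, 2)


# Start and Length values taken from this record.
print(centisome_position(542371, 4008000))  # 13.53
```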
GC content: 59.5
Gene sequence:
>2022_bases ATGCGTATGATTCGAACATATGGTGTACTGATGTCCGCGGCGCTGTTGCTATGCGGTTTTGACTGGGGTTTTTCCGCTGA CAAGTGCAAGGAGGCCCTGAATCTGGTGGACTCCCTGGAGTCTAGCCGTGATGAAGGTGCGATGCGGCAGACCGAGGCCA GGATACTCGGCCTCTGCCCCGACGGCGCCCCGGGGCACTATGTCAGCGCCCTGATGCTGGAGAGGATCGGCAATGTGGAT GGGGCCATCAAGGAGTATCGCCAGGCCCTGCGGCAGAATCCGCAGTTTACCAGGGCCAGCGGCAACCTGGGCCTGCTCTA TGCACAGACCGGCAGGAACAGCGAAGCCTCGGTGGAGTTAAGCCGTGGTCTGGCGGCAACGTCCGATCCCCGCTACCACA AGGCGCTGGGGCATGTGCTGGCGGAGATGAAGGTGTACCCGCTGGCGATCCACCACCTGAGCGAGGCGGGAAACACTCTC ACCAGTGATGCCGAGGTGTTCAATGATCTGGCCGGGGTGTATCTGGCCATGGGAGATCAGGGCAAGGCCCTGGATGAATA CGGCCGGGCGCTGAACGCTGATCCGGGCAACGAGAAGGCCCATACCGGCATCGCTTCCATCCATCTGGAGCGCAAAGACC TGGATAAGGCATTGGATGAGCTGAAGAAGGGTGAGGCCACCAATCCCCAGAACCGCACGATCCACCTGATGATGGCCGAG ATCTACGAAAAGAAGGGTGATACCCGGCAGGCCAACTACCAGTACCTGTTGGGCGGCAAGGGCAAGGGACTGGCTCAGGT TGCCGATGGTGTGCCGGCTGCCGCCAAGTCCTCCCCCGCCGCGCCGCTCTTCGTGCCTGATTTTCAGAAGAGCGAGGAGT CGCTCAAGGAAATCATCGCTGAGTCTCCCGACAAGGCGGTGGACGCCTACGGAAAACTGGGCGACCTCTACCGTTCCGCG GGCAGGGACAGGGAGGCCATGGCGGCCTACCGGGAGGCGGTGCACCGCAACAGCGCTAACAGCGACGTCTACCTGAATCT GGGTATCCTGCATGAAAAAATGAACAACCTGGACGAGGCGGTGGTGGCCTACAAACAGGCCATACGGGTCAAGCCGGACA ATGCCGATGCCCGCCTGCGTCTCGCCGATATACGCTATGAGCGCGGCTTCTATCAGGAGGCGGTGGAACAGTACAGCGAG TTCCTCAAGCTGAAACCCGACAGCCCGGACATTCAACTCAAGCTGGCCCGCATTCTCGCCAAAAAGAAGGAGACCAGCCT GGCCATCGATGCCTACGATGCCGTTCTGAAGAGCGCTCCCGACAATCCCGAGGCAAACCGGGAAATCGCCGCGCTGTACA AGGCCAAGGGGATGAACGACCGGGCAGTGGCGCATTACCGTAAGGCCTTGGAGTTACGGAAGGATGACGCGGATACCCGC AGCGCGTTGGTGTCGCTATATGTCAAAAACAGGCAGTACGATGAGATAACCGAACTGCTCAAGGGGGCGGTGGAACTGTT CCCGGAAGATGCCAACAACCACTACAAGCTGGGGCTGATCCACGAATTCAAGAAGGAGTATGGCAGCGCCATCGCCTGCT ACCAGAAGGCGGCCGAGCTGAGGCCGGACCATGCCCGCGCCCTCAATGCGCTGGGCCGCATGTACATGAAGACCGACCGC ATCAGCGAAGCCAGGGAGGCGCTGGAGGCCGCCAGGAAGGCGGACCCGACCCTTGAGGAGACCGCGGTTCTGCTGAACAA CATCCGCGACGAATTCAATCCCGAGCCGCGCAGGATCAGCAGGAGCGTGAAGAAATCTTCCTCACGGACCTCGCGCAGGT CGGCGTCCGCATCCGGGAAATCCAAGTCCAAATCATCAGCCAAGGCCAAAAAAACGGACAAGCGCAAGAAGAAGGGCACA 
TCTTCCGCCAAAACAAAGTCCAAAAGCAAGTCATCGGCCAAGAGCAAGACAACATCCAAAAGCAAGTCAACGGCTAAGAG CAAGACAAAGAAAAAGAAGTAA
Upstream 100 bases:
>100_bases TTGGCATATAATCAAGATTTTAGCCTAAATAATTGAATTTTTAGCGGATCGTCTGTATAATCCGCGGCTTCACAAGATTT TTTTCAAGGAAACCTGCCCT
Downstream 100 bases:
>100_bases AACCCGCGATCGTCGTACCGTTTCCCCCTCTTTGCATCAGGGATTTCACGGGGGCAGGTCAGCTGCGTGTCCTGCCCCCT CTTTCCCTGCCTCACCCTGG
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 673; Mature: 673
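The residue count agrees with the gene coordinates. As a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends with a single stop codon (the gene sequence above ends in TAA), the arithmetic is shown below. The helper names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(n_bases: int) -> int:
    """Number of amino acids encoded by a CDS that includes one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # subtract the stop codon


n = gene_length(542371, 544392)
print(n)                     # 2022
print(residues_from_cds(n))  # 673
```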
Protein sequence:
>673_residues MRMIRTYGVLMSAALLLCGFDWGFSADKCKEALNLVDSLESSRDEGAMRQTEARILGLCPDGAPGHYVSALMLERIGNVD GAIKEYRQALRQNPQFTRASGNLGLLYAQTGRNSEASVELSRGLAATSDPRYHKALGHVLAEMKVYPLAIHHLSEAGNTL TSDAEVFNDLAGVYLAMGDQGKALDEYGRALNADPGNEKAHTGIASIHLERKDLDKALDELKKGEATNPQNRTIHLMMAE IYEKKGDTRQANYQYLLGGKGKGLAQVADGVPAAAKSSPAAPLFVPDFQKSEESLKEIIAESPDKAVDAYGKLGDLYRSA GRDREAMAAYREAVHRNSANSDVYLNLGILHEKMNNLDEAVVAYKQAIRVKPDNADARLRLADIRYERGFYQEAVEQYSE FLKLKPDSPDIQLKLARILAKKKETSLAIDAYDAVLKSAPDNPEANREIAALYKAKGMNDRAVAHYRKALELRKDDADTR SALVSLYVKNRQYDEITELLKGAVELFPEDANNHYKLGLIHEFKKEYGSAIACYQKAAELRPDHARALNALGRMYMKTDR ISEAREALEAARKADPTLEETAVLLNNIRDEFNPEPRRISRSVKKSSSRTSRRSASASGKSKSKSSAKAKKTDKRKKKGT SSAKTKSKSKSSAKSKTTSKSKSTAKSKTKKKK
Sequences:
>Translated_673_residues MRMIRTYGVLMSAALLLCGFDWGFSADKCKEALNLVDSLESSRDEGAMRQTEARILGLCPDGAPGHYVSALMLERIGNVD GAIKEYRQALRQNPQFTRASGNLGLLYAQTGRNSEASVELSRGLAATSDPRYHKALGHVLAEMKVYPLAIHHLSEAGNTL TSDAEVFNDLAGVYLAMGDQGKALDEYGRALNADPGNEKAHTGIASIHLERKDLDKALDELKKGEATNPQNRTIHLMMAE IYEKKGDTRQANYQYLLGGKGKGLAQVADGVPAAAKSSPAAPLFVPDFQKSEESLKEIIAESPDKAVDAYGKLGDLYRSA GRDREAMAAYREAVHRNSANSDVYLNLGILHEKMNNLDEAVVAYKQAIRVKPDNADARLRLADIRYERGFYQEAVEQYSE FLKLKPDSPDIQLKLARILAKKKETSLAIDAYDAVLKSAPDNPEANREIAALYKAKGMNDRAVAHYRKALELRKDDADTR SALVSLYVKNRQYDEITELLKGAVELFPEDANNHYKLGLIHEFKKEYGSAIACYQKAAELRPDHARALNALGRMYMKTDR ISEAREALEAARKADPTLEETAVLLNNIRDEFNPEPRRISRSVKKSSSRTSRRSASASGKSKSKSSAKAKKTDKRKKKGT SSAKTKSKSKSSAKSKTTSKSKSTAKSKTKKKK >Mature_673_residues MRMIRTYGVLMSAALLLCGFDWGFSADKCKEALNLVDSLESSRDEGAMRQTEARILGLCPDGAPGHYVSALMLERIGNVD GAIKEYRQALRQNPQFTRASGNLGLLYAQTGRNSEASVELSRGLAATSDPRYHKALGHVLAEMKVYPLAIHHLSEAGNTL TSDAEVFNDLAGVYLAMGDQGKALDEYGRALNADPGNEKAHTGIASIHLERKDLDKALDELKKGEATNPQNRTIHLMMAE IYEKKGDTRQANYQYLLGGKGKGLAQVADGVPAAAKSSPAAPLFVPDFQKSEESLKEIIAESPDKAVDAYGKLGDLYRSA GRDREAMAAYREAVHRNSANSDVYLNLGILHEKMNNLDEAVVAYKQAIRVKPDNADARLRLADIRYERGFYQEAVEQYSE FLKLKPDSPDIQLKLARILAKKKETSLAIDAYDAVLKSAPDNPEANREIAALYKAKGMNDRAVAHYRKALELRKDDADTR SALVSLYVKNRQYDEITELLKGAVELFPEDANNHYKLGLIHEFKKEYGSAIACYQKAAELRPDHARALNALGRMYMKTDR ISEAREALEAARKADPTLEETAVLLNNIRDEFNPEPRRISRSVKKSSSRTSRRSASASGKSKSKSSAKAKKTDKRKKKGT SSAKTKSKSKSSAKSKTTSKSKSTAKSKTKKKK
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 9 TPR repeats [H]
Homologues:
- Organism=Homo sapiens, GI32307148, Length=417, Percent_Identity=24.220623501199, Blast_Score=112, Evalue=1e-24
- Organism=Homo sapiens, GI32307150, Length=417, Percent_Identity=24.220623501199, Blast_Score=112, Evalue=2e-24
- Organism=Homo sapiens, GI301336134, Length=285, Percent_Identity=27.719298245614, Blast_Score=92, Evalue=2e-18
- Organism=Homo sapiens, GI83415184, Length=285, Percent_Identity=27.719298245614, Blast_Score=91, Evalue=3e-18
- Organism=Homo sapiens, GI224809432, Length=588, Percent_Identity=21.7687074829932, Blast_Score=74, Evalue=5e-13
- Organism=Caenorhabditis elegans, GI115532692, Length=349, Percent_Identity=27.5071633237822, Blast_Score=103, Evalue=2e-22
- Organism=Caenorhabditis elegans, GI115532690, Length=349, Percent_Identity=27.5071633237822, Blast_Score=103, Evalue=3e-22
- Organism=Saccharomyces cerevisiae, GI6319589, Length=301, Percent_Identity=25.9136212624585, Blast_Score=64, Evalue=1e-10
- Organism=Drosophila melanogaster, GI17647755, Length=413, Percent_Identity=24.455205811138, Blast_Score=115, Evalue=7e-26
- Organism=Drosophila melanogaster, GI24585827, Length=413, Percent_Identity=24.455205811138, Blast_Score=115, Evalue=7e-26
- Organism=Drosophila melanogaster, GI24585829, Length=413, Percent_Identity=24.455205811138, Blast_Score=115, Evalue=7e-26
- Organism=Drosophila melanogaster, GI161076610, Length=278, Percent_Identity=27.6978417266187, Blast_Score=74, Evalue=4e-13
- Organism=Drosophila melanogaster, GI19920486, Length=278, Percent_Identity=27.6978417266187, Blast_Score=74, Evalue=5e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001440
- InterPro: IPR013026
- InterPro: IPR011990
- InterPro: IPR013105
- InterPro: IPR019734 [H]
Pfam domain/function: PF00515 TPR_1; PF07719 TPR_2 [H]
EC number: NA
Molecular weight: Translated: 74243; Mature: 74243
Theoretical pI: Translated: 9.92; Mature: 9.92
Prosite motif: PS50005 TPR L=RR ; PS50293 TPR_REGION
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.6 %Cys (Translated Protein)
- 2.1 %Met (Translated Protein)
- 2.7 %Cys+Met (Translated Protein)
- 0.6 %Cys (Mature Protein)
- 2.1 %Met (Mature Protein)
- 2.7 %Cys+Met (Mature Protein)
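Percentages of this kind are typically simple residue counts divided by the protein length. The sketch below assumes that definition, which the record does not state, and uses a hypothetical helper `cys_met_content`. For brevity it is run on the first 20 residues of the protein sequence rather than the full 673.

```python
def cys_met_content(protein: str) -> dict:
    """Return %Cys, %Met and %Cys+Met for a one-letter protein sequence.

    Assumption: each value is count / length * 100, rounded to one decimal.
    """
    seq = protein.upper().replace(" ", "")
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C") / len(seq) * 100
    met = seq.count("M") / len(seq) * 100
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# First 20 residues of the protein in this record (1 Cys, 3 Met).
print(cys_met_content("MRMIRTYGVLMSAALLLCGF"))
# {'Cys': 5.0, 'Met': 15.0, 'Cys+Met': 20.0}
```

Passing the full sequence from the "Protein sequence" field instead of the 20-residue prefix gives the whole-protein values.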
Secondary structure:
>Translated Secondary Structure MRMIRTYGVLMSAALLLCGFDWGFSADKCKEALNLVDSLESSRDEGAMRQTEARILGLCP CCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCCEEEECC DGAPGHYVSALMLERIGNVDGAIKEYRQALRQNPQFTRASGNLGLLYAQTGRNSEASVEL CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCEEECCCCEEEEEEECCCCCCCCHHH SRGLAATSDPRYHKALGHVLAEMKVYPLAIHHLSEAGNTLTSDAEVFNDLAGVYLAMGDQ HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHEEEECCC GKALDEYGRALNADPGNEKAHTGIASIHLERKDLDKALDELKKGEATNPQNRTIHLMMAE CCHHHHHCCCCCCCCCCCHHHCCHHHEEECHHHHHHHHHHHHCCCCCCCCCCEEEHHHHH IYEKKGDTRQANYQYLLGGKGKGLAQVADGVPAAAKSSPAAPLFVPDFQKSEESLKEIIA HHHHCCCCCCCCCEEEECCCCCCHHHHHCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHH ESPDKAVDAYGKLGDLYRSAGRDREAMAAYREAVHRNSANSDVYLNLGILHEKMNNLDEA CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHH VVAYKQAIRVKPDNADARLRLADIRYERGFYQEAVEQYSEFLKLKPDSPDIQLKLARILA HHHHHHHHCCCCCCCCCEEEEEHHHHHCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH KKKETSLAIDAYDAVLKSAPDNPEANREIAALYKAKGMNDRAVAHYRKALELRKDDADTR HHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHH SALVSLYVKNRQYDEITELLKGAVELFPEDANNHYKLGLIHEFKKEYGSAIACYQKAAEL HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHCCHHHHHHHHHHC RPDHARALNALGRMYMKTDRISEAREALEAARKADPTLEETAVLLNNIRDEFNPEPRRIS CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHH RSVKKSSSRTSRRSASASGKSKSKSSAKAKKTDKRKKKGTSSAKTKSKSKSSAKSKTTSK HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH SKSTAKSKTKKKK HHHHHHHHHCCCC >Mature Secondary Structure MRMIRTYGVLMSAALLLCGFDWGFSADKCKEALNLVDSLESSRDEGAMRQTEARILGLCP CCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCCEEEECC DGAPGHYVSALMLERIGNVDGAIKEYRQALRQNPQFTRASGNLGLLYAQTGRNSEASVEL CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCEEECCCCEEEEEEECCCCCCCCHHH SRGLAATSDPRYHKALGHVLAEMKVYPLAIHHLSEAGNTLTSDAEVFNDLAGVYLAMGDQ HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHEEEECCC GKALDEYGRALNADPGNEKAHTGIASIHLERKDLDKALDELKKGEATNPQNRTIHLMMAE CCHHHHHCCCCCCCCCCCHHHCCHHHEEECHHHHHHHHHHHHCCCCCCCCCCEEEHHHHH IYEKKGDTRQANYQYLLGGKGKGLAQVADGVPAAAKSSPAAPLFVPDFQKSEESLKEIIA 
HHHHCCCCCCCCCEEEECCCCCCHHHHHCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHH ESPDKAVDAYGKLGDLYRSAGRDREAMAAYREAVHRNSANSDVYLNLGILHEKMNNLDEA CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHH VVAYKQAIRVKPDNADARLRLADIRYERGFYQEAVEQYSEFLKLKPDSPDIQLKLARILA HHHHHHHHCCCCCCCCCEEEEEHHHHHCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH KKKETSLAIDAYDAVLKSAPDNPEANREIAALYKAKGMNDRAVAHYRKALELRKDDADTR HHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHH SALVSLYVKNRQYDEITELLKGAVELFPEDANNHYKLGLIHEFKKEYGSAIACYQKAAEL HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHCCHHHHHHHHHHC RPDHARALNALGRMYMKTDRISEAREALEAARKADPTLEETAVLLNNIRDEFNPEPRRIS CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHH RSVKKSSSRTSRRSASASGKSKSKSSAKAKKTDKRKKKGTSSAKTKSKSKSSAKSKTTSK HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH SKSTAKSKTKKKK HHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2105307 [H]