Definition | Candidatus Blochmannia pennsylvanicus str. BPEN, complete genome. |
---|---|
Accession | NC_007292 |
Length | 791,654 |
The map label for this gene is yoaE [H]
Identifier: 71892220
GI number: 71892220
Start: 544268
End: 545824
Strand: Reverse
Name: yoaE [H]
Synonym: BPEN_458
Alternate gene names: 71892220
Gene position: 545824-544268 (Counterclockwise)
Preceding gene: 71892224
Following gene: 71892218
Centisome position: 68.95
GC content: 32.88
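The coordinate-derived fields above can be recomputed with a few lines of Python. This is a minimal sketch, not the database's own pipeline: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the centisome is assumed to be the gene's start coordinate (545824 on the reverse strand) expressed as a percentage of the genome length.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so a sequence
    copied with embedded spaces or line breaks can be passed as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)
```

With the values in this record, `gene_length(544268, 545824)` gives 1557 and `round(centisome(545824, 791654), 2)` gives 68.95, matching the listed length and centisome position. `gc_percent` can be applied to the gene sequence below to check the listed GC content.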
Gene sequence:
>1557_bases ATGGAATTTTTAACAGACATATCAATGTGGATAGGATTCTTAACGTTAATAATTTTAGAAATTGTACTTGGCGTCGATAA TTTGGTTTTTATTGCTATTTTAACAGATAAACTTCCAAAAAAACAACGTGAACGTGCATGTATTATTGGGTTAACGTTAG CATTAATAATGCGTATCGCATTATTATCTTTGATTTCATGGTTTGTTACTCTTACAAAACCATTGTGCAAAATTGCTACT TTTTCATTTTCTGGTAGAGATTTAATTTTATTATTTGGTGGCATGTTTCTTTTGTTTAAAGCAACTACTGAATTACATCA ACAATTAGAACATAAAGTACATAATCACACTAGTCGTGGATATGCTAGTTTTTGGGTAGTAGTGATACAAATTGTAGTTT TTGATGCTATTTTTTCTCTCGATGCAGTAATAACGGCTGTTGGTACAGTAGAGAATTTGACAATAATGATAATAGCAGTT GTTATAGCAGTACTAATAATGTCGCTATCATCTCGTTTATTAACTAATTTCATTAATAGTCATCAGACTGTAGTAGTTTT GTGTTTAAGTTTTTTATTAATTATTGGTTTGAGTTTAATTGCAGAGGGAATTGGATTTTATATACCAAAAGGTTATTTAT ATGTTGCTATTGGATTTTCGGTATTAATTGAATTATTTAATCAAATTGCTCACTGTAATTCCATGAAGGGTCAATCAACT AAATCAATGCGTGAACGCACAGCAGAGGCGATTATGAGGTTAATGGGTGGTAATACAGCTCAATGGGATTTTGATACAGA GAAAAATTCTTCACTTCTTTTATCAAAAACACATTTTGCTGAAGAGGAGCGCCATATGATTACTGGAGTATTATCGTTAG CTTCACGTACCTTACGAAGCATAATGACTCCTCGAAATGAAATTTCGTGGTTAGATTCTCAAAAACCCGTGCAAGAGTTA TATTCTACGTTAATGAATACTCCTCATAATATGTTTCCAGTATGTAACGGTGAATTGGATCAATTAATAGGCATTGTTCG TGCTAAAGATTTAATGGCGGCTATTGCTAATGGAGAGCAATTAGAAACACATGCTTCAGAAAATTTGCCCATTGTAGTTC CAGAAACTTTGGATGTCTTAAATTTATTAAAAGAATTGCGGCGTGCAAAAGGAAGTATGGTTGTTGTATCTAATGAATTT GGTATTATTCAAGGATTAATAACCCCTTTGGATGTATTAGAAGCCATAGCTGGTGAATTTCCTGATGAAGATGAAACACC AGAAATTGAGATAATTAATAATGGGACAGGTTGGTTAGCAAAAGGTAGTATGGATTTGCATGCATTGCAACAGGCGTTGC AGGCTCATGATTTGGTGCATGTTTGTGATCATGTAGCTTCTTTAGCTGGCCTGTTATTATCCCGTTGCGATCGTATACCA AAAGAGGGCGATGTGTTAACAATTAATAGATGGCGTTTTATAATTCGAAAAATGATAGAGTATCGTATTGAATTAGTAGA AATAGAATGCCTTTTATTTTTTAATGATACGCATTAA
Upstream 100 bases:
>100_bases TTTATTGCTGACAACTTCATTATGTAATCAGATATTTTAATATGTTTAATATAAATATGCCATGAGACGATTTGTTATCT AATGTTCAATGGAGATTATC
Downstream 100 bases:
>100_bases TAAATAGTATTGTTACATATACACATATGTATGTGTGATACTATGAATTGTTGATTTAAAGCAGTTATTTTAAATAAAAT TTTTTTAGTAGTGGTAGCAA
Product: putative transmembrane protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 518; Mature: 518
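The residue count follows from the gene length if the coding sequence ends in a single stop codon, as it does here (the gene sequence ends in TAA). A minimal sketch of that arithmetic, with a hypothetical function name:

```python
def expected_residues(n_bases: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of n_bases.

    Assumes the sequence is a whole number of codons; the terminal
    stop codon, if present, does not contribute a residue.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    return codons - 1 if has_stop_codon else codons
```

For this gene, `expected_residues(1557)` gives 518, which agrees with the translated length.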
Protein sequence:
>518_residues MEFLTDISMWIGFLTLIILEIVLGVDNLVFIAILTDKLPKKQRERACIIGLTLALIMRIALLSLISWFVTLTKPLCKIAT FSFSGRDLILLFGGMFLLFKATTELHQQLEHKVHNHTSRGYASFWVVVIQIVVFDAIFSLDAVITAVGTVENLTIMIIAV VIAVLIMSLSSRLLTNFINSHQTVVVLCLSFLLIIGLSLIAEGIGFYIPKGYLYVAIGFSVLIELFNQIAHCNSMKGQST KSMRERTAEAIMRLMGGNTAQWDFDTEKNSSLLLSKTHFAEEERHMITGVLSLASRTLRSIMTPRNEISWLDSQKPVQEL YSTLMNTPHNMFPVCNGELDQLIGIVRAKDLMAAIANGEQLETHASENLPIVVPETLDVLNLLKELRRAKGSMVVVSNEF GIIQGLITPLDVLEAIAGEFPDEDETPEIEIINNGTGWLAKGSMDLHALQQALQAHDLVHVCDHVASLAGLLLSRCDRIP KEGDVLTINRWRFIIRKMIEYRIELVEIECLLFFNDTH
Sequences:
>Translated_518_residues MEFLTDISMWIGFLTLIILEIVLGVDNLVFIAILTDKLPKKQRERACIIGLTLALIMRIALLSLISWFVTLTKPLCKIAT FSFSGRDLILLFGGMFLLFKATTELHQQLEHKVHNHTSRGYASFWVVVIQIVVFDAIFSLDAVITAVGTVENLTIMIIAV VIAVLIMSLSSRLLTNFINSHQTVVVLCLSFLLIIGLSLIAEGIGFYIPKGYLYVAIGFSVLIELFNQIAHCNSMKGQST KSMRERTAEAIMRLMGGNTAQWDFDTEKNSSLLLSKTHFAEEERHMITGVLSLASRTLRSIMTPRNEISWLDSQKPVQEL YSTLMNTPHNMFPVCNGELDQLIGIVRAKDLMAAIANGEQLETHASENLPIVVPETLDVLNLLKELRRAKGSMVVVSNEF GIIQGLITPLDVLEAIAGEFPDEDETPEIEIINNGTGWLAKGSMDLHALQQALQAHDLVHVCDHVASLAGLLLSRCDRIP KEGDVLTINRWRFIIRKMIEYRIELVEIECLLFFNDTH
>Mature_518_residues MEFLTDISMWIGFLTLIILEIVLGVDNLVFIAILTDKLPKKQRERACIIGLTLALIMRIALLSLISWFVTLTKPLCKIAT FSFSGRDLILLFGGMFLLFKATTELHQQLEHKVHNHTSRGYASFWVVVIQIVVFDAIFSLDAVITAVGTVENLTIMIIAV VIAVLIMSLSSRLLTNFINSHQTVVVLCLSFLLIIGLSLIAEGIGFYIPKGYLYVAIGFSVLIELFNQIAHCNSMKGQST KSMRERTAEAIMRLMGGNTAQWDFDTEKNSSLLLSKTHFAEEERHMITGVLSLASRTLRSIMTPRNEISWLDSQKPVQEL YSTLMNTPHNMFPVCNGELDQLIGIVRAKDLMAAIANGEQLETHASENLPIVVPETLDVLNLLKELRRAKGSMVVVSNEF GIIQGLITPLDVLEAIAGEFPDEDETPEIEIINNGTGWLAKGSMDLHALQQALQAHDLVHVCDHVASLAGLLLSRCDRIP KEGDVLTINRWRFIIRKMIEYRIELVEIECLLFFNDTH
Specific function: Unknown
COG id: COG0861
COG function: function code P; Membrane protein TerC, possibly involved in tellurium resistance
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 CBS domains [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | 310128564 | 193 | 24.8704663212435 | 68 | 2e-11 |
Escherichia coli | 1788119 | 510 | 61.3725490196078 | 619 | 1e-178 |
Escherichia coli | 87082033 | 516 | 46.5116279069767 | 411 | 1e-116 |
Escherichia coli | 1789197 | 227 | 44.9339207048458 | 175 | 6e-45 |
Escherichia coli | 1790664 | 226 | 26.1061946902655 | 103 | 3e-23 |
Escherichia coli | 1786879 | 235 | 26.3829787234043 | 89 | 7e-19 |
Escherichia coli | 145693175 | 246 | 25.2032520325203 | 72 | 1e-13 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016169
- InterPro: IPR000644
- InterPro: IPR005496
- InterPro: IPR005170 [H]
Pfam domain/function: PF00571 CBS; PF03471 CorC_HlyC; PF03741 TerC [H]
EC number: NA
Molecular weight: Translated: 57944; Mature: 57944
Theoretical pI: Translated: 5.72; Mature: 5.72
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
3.5 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
3.5 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
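These percentages are simple residue frequencies. The sketch below shows one way to recompute them from a protein sequence; the function name is hypothetical, and it assumes the percentage is taken over all residues of the sequence, which may differ from how the database computed its values.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence.

    Whitespace in the sequence is removed first, so a sequence copied
    with embedded spaces or line breaks can be passed directly.
    """
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        return 0.0
    hits = sum(cleaned.count(r) for r in residues.upper())
    return 100.0 * hits / len(cleaned)
```

For example, `residue_percent(seq, "C")`, `residue_percent(seq, "M")`, and `residue_percent(seq, "CM")` correspond to the %Cys, %Met, and %Cys+Met rows, where `seq` is the 518-residue protein sequence from this record.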
Secondary structure:
>Translated Secondary Structure
MEFLTDISMWIGFLTLIILEIVLGVDNLVFIAILTDKLPKKQRERACIIGLTLALIMRIA
CCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH
LLSLISWFVTLTKPLCKIATFSFSGRDLILLFGGMFLLFKATTELHQQLEHKVHNHTSRG
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH
YASFWVVVIQIVVFDAIFSLDAVITAVGTVENLTIMIIAVVIAVLIMSLSSRLLTNFINS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
HQTVVVLCLSFLLIIGLSLIAEGIGFYIPKGYLYVAIGFSVLIELFNQIAHCNSMKGQST
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCH
KSMRERTAEAIMRLMGGNTAQWDFDTEKNSSLLLSKTHFAEEERHMITGVLSLASRTLRS
HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHH
IMTPRNEISWLDSQKPVQELYSTLMNTPHNMFPVCNGELDQLIGIVRAKDLMAAIANGEQ
HHCCCHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCH
LETHASENLPIVVPETLDVLNLLKELRRAKGSMVVVSNEFGIIQGLITPLDVLEAIAGEF
HHHHCCCCCCEECCCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCC
PDEDETPEIEIINNGTGWLAKGSMDLHALQQALQAHDLVHVCDHVASLAGLLLSRCDRIP
CCCCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
KEGDVLTINRWRFIIRKMIEYRIELVEIECLLFFNDTH
CCCCEEEEHHHHHHHHHHHHHHHHHEEEEEEEEECCCC
>Mature Secondary Structure
MEFLTDISMWIGFLTLIILEIVLGVDNLVFIAILTDKLPKKQRERACIIGLTLALIMRIA
CCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH
LLSLISWFVTLTKPLCKIATFSFSGRDLILLFGGMFLLFKATTELHQQLEHKVHNHTSRG
HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH
YASFWVVVIQIVVFDAIFSLDAVITAVGTVENLTIMIIAVVIAVLIMSLSSRLLTNFINS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
HQTVVVLCLSFLLIIGLSLIAEGIGFYIPKGYLYVAIGFSVLIELFNQIAHCNSMKGQST
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCH
KSMRERTAEAIMRLMGGNTAQWDFDTEKNSSLLLSKTHFAEEERHMITGVLSLASRTLRS
HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHH
IMTPRNEISWLDSQKPVQELYSTLMNTPHNMFPVCNGELDQLIGIVRAKDLMAAIANGEQ
HHCCCHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCH
LETHASENLPIVVPETLDVLNLLKELRRAKGSMVVVSNEFGIIQGLITPLDVLEAIAGEF
HHHHCCCCCCEECCCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCC
PDEDETPEIEIINNGTGWLAKGSMDLHALQQALQAHDLVHVCDHVASLAGLLLSRCDRIP
CCCCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
KEGDVLTINRWRFIIRKMIEYRIELVEIECLLFFNDTH
CCCCEEEEHHHHHHHHHHHHHHHHHEEEEEEEEECCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]