Definition | Candidatus Blochmannia pennsylvanicus str. BPEN, complete genome. |
---|---|
Accession | NC_007292 |
Length | 791,654 |
Click here to switch to the map view.
The map label for this gene is tolA [H]
Identifier: 71892115
GI number: 71892115
Start: 410688
End: 411872
Strand: Direct
Name: tolA [H]
Synonym: BPEN_347
Alternate gene names: 71892115
Gene position: 410688-411872 (Clockwise)
Preceding gene: 71892114
Following gene: 71892116
Centisome position: 51.88
GC content: 33.25
Gene sequence:
>1185_bases TTGTTTAATAGATATTACAAAAAATTTAGATCTGCTGTTATGATATCAATCATTGTACATGTGATCGTCTCTTTATTGCT ATATAAACATATAATAAAAAAACAACAACAATACACTGCTAAAATAGCTAATAACAAAAATTCAATTAATACCTTCATTC ACAACTCTAACCCGAAAAAAGAAAAAAACAACAACATAAAAGATCAATCAAATCGATTCAAAATAACAGAAGCACAGCCT CCATTGGCAGGACCAACAGCGCCCAACACCCCTAAAAAGAAAAGTCAAGATCAATCAAATCGATTCAAAATAACAGAAGC ACAGCCTCCATTGGCAGGACCAACAGCGCCCAACACCCCTAAAAAGAAAAGTCAAGATCAATCAAATCGATTCAAAATAA CAGAAGCACAGCCTCCATTGGCAGGACCAACAGCGCCCAACACCCCTAAAAAGAAAAGTCAAGATCAATCAAATCGATTC AAAATAACAGAAGCACAGCCTCCATTGGCAGGACCAACAGCGCCCAACACCCCTAAAAAGAAAAGTCAAGATCAATCAAA TCGATTCAAAATAACAGAAGCACAGCCTCCATTGGCAGGACCAACAGCGCCCAACACCCCTAAAAAGAAAAGTCAAGATC AATCAAATCGATTCAAAATAACAGAAGCACAGCCTCCATTGGCAGGACCAACAGCGCCCAACACCCCTAAAAAGAAAAGT CAGAATACTATAGAAAAAAATATACTAATACATGATAAATTAAATTATCAATCCCATGACTCTAACGAACTCAATAATTT ACTGAATACATTAATTAACAAAAAAAAATCTATAGAAAATAACAAAAAATCTAAATTAGACTTGACAACAAAAAAAAATG ATAATGAAAATAATATTTCAAACGAAATGATTAAAAGTAATGAAATTAATATATATAAGCGTATGATCAGCGAATCAATA CAACAAAAATTTTATAACTTTTCGTATTATGTTGGTAAACAATGCAATCTACGCATCAAATTAGCACCTGACGGCACGTT ATTATCAGTAACCGCTATATCTGGAGATTACGATTTATGCCAAGCAGCAATTATCGCCGCCAAATTAGCAAAAATTCCTA AACCACCAAATTCAGATATTTACGAAATATTTAAAAACACAATATTAAATTTTTCTCCTCAATAA
Upstream 100 bases:
>100_bases GTTAAATAAGATCGGTATAAATTCAATTGGAATGATAACAAATCCTTCCGTTTGAATACGGAACGATATTCTTACACAAC TTGGATGATAATTATTAAAT
Downstream 100 bases:
>100_bases ACAAAATATTTACACATTTATAAAATAATAATAATATATTGGTCCATTAAATAATCTATTTTCATTATAAAATTTTCATG TATTTAATAATTAAAAAAAC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 394; Mature: 394
Protein sequence:
>394_residues MFNRYYKKFRSAVMISIIVHVIVSLLLYKHIIKKQQQYTAKIANNKNSINTFIHNSNPKKEKNNNIKDQSNRFKITEAQP PLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRF KITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKS QNTIEKNILIHDKLNYQSHDSNELNNLLNTLINKKKSIENNKKSKLDLTTKKNDNENNISNEMIKSNEINIYKRMISESI QQKFYNFSYYVGKQCNLRIKLAPDGTLLSVTAISGDYDLCQAAIIAAKLAKIPKPPNSDIYEIFKNTILNFSPQ
Sequences:
>Translated_394_residues MFNRYYKKFRSAVMISIIVHVIVSLLLYKHIIKKQQQYTAKIANNKNSINTFIHNSNPKKEKNNNIKDQSNRFKITEAQP PLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRF KITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKS QNTIEKNILIHDKLNYQSHDSNELNNLLNTLINKKKSIENNKKSKLDLTTKKNDNENNISNEMIKSNEINIYKRMISESI QQKFYNFSYYVGKQCNLRIKLAPDGTLLSVTAISGDYDLCQAAIIAAKLAKIPKPPNSDIYEIFKNTILNFSPQ >Mature_394_residues MFNRYYKKFRSAVMISIIVHVIVSLLLYKHIIKKQQQYTAKIANNKNSINTFIHNSNPKKEKNNNIKDQSNRFKITEAQP PLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRF KITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKS QNTIEKNILIHDKLNYQSHDSNELNNLLNTLINKKKSIENNKKSKLDLTTKKNDNENNISNEMIKSNEINIYKRMISESI QQKFYNFSYYVGKQCNLRIKLAPDGTLLSVTAISGDYDLCQAAIIAAKLAKIPKPPNSDIYEIFKNTILNFSPQ
Specific function: Involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3, and K). Necessary for the colicins to reach their respective targets after initial binding to the bacteria. Also involved in the translocation of bacteriophage DNA [H]
COG id: COG3064
COG function: function code M; Membrane protein involved in colicin uptake
Gene ontology:
Cell location: Cell inner membrane; Single-pass type II membrane protein [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Escherichia coli, GI1786960, Length=97, Percent_Identity=53.6082474226804, Blast_Score=91, Evalue=1e-19,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014161 [H]
Pfam domain/function: PF06519 TolA [H]
EC number: NA
Molecular weight: Translated: 44268; Mature: 44268
Theoretical pI: Translated: 10.68; Mature: 10.68
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein) 1.0 %Met (Translated Protein) 1.5 %Cys+Met (Translated Protein) 0.5 %Cys (Mature Protein) 1.0 %Met (Mature Protein) 1.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MFNRYYKKFRSAVMISIIVHVIVSLLLYKHIIKKQQQYTAKIANNKNSINTFIHNSNPKK CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCH EKNNNIKDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTP HCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCC KKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKK CCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCC KSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKS CCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCHHH QNTIEKNILIHDKLNYQSHDSNELNNLLNTLINKKKSIENNKKSKLDLTTKKNDNENNIS HHHHHHCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHH NEMIKSNEINIYKRMISESIQQKFYNFSYYVGKQCNLRIKLAPDGTLLSVTAISGDYDLC HHHHHCCCHHHHHHHHHHHHHHHHHCCCEEECCCCCEEEEECCCCCEEEEEEECCCHHHH QAAIIAAKLAKIPKPPNSDIYEIFKNTILNFSPQ HHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC >Mature Secondary Structure MFNRYYKKFRSAVMISIIVHVIVSLLLYKHIIKKQQQYTAKIANNKNSINTFIHNSNPKK CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCH EKNNNIKDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTP HCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCC KKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKK CCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCC KSQDQSNRFKITEAQPPLAGPTAPNTPKKKSQDQSNRFKITEAQPPLAGPTAPNTPKKKS CCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCHHH QNTIEKNILIHDKLNYQSHDSNELNNLLNTLINKKKSIENNKKSKLDLTTKKNDNENNIS HHHHHHCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHH NEMIKSNEINIYKRMISESIQQKFYNFSYYVGKQCNLRIKLAPDGTLLSVTAISGDYDLC HHHHHCCCHHHHHHHHHHHHHHHHHCCCEEECCCCCEEEEECCCCCEEEEEEECCCHHHH QAAIIAAKLAKIPKPPNSDIYEIFKNTILNFSPQ HHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 2687247; 8905232; 9278503; 2068069; 8978668; 10404600 [H]