| Definition | Prochlorococcus marinus str. MIT 9312, complete genome. |
|---|---|
| Accession | NC_007577 |
| Length | 1,709,204 |
The map label for this gene is epsE [H]
Identifier: 78779572
GI number: 78779572
Start: 1108511
End: 1110295
Strand: Reverse
Name: epsE [H]
Synonym: PMT9312_1189
Alternate gene names: 78779572
Gene position: 1110295-1108511 (Counterclockwise)
Preceding gene: 78779578
Following gene: 78779571
Centisome position: 64.96
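The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that the centisome position is the gene's start coordinate (1110295 on the reverse strand) expressed as a percentage of the genome length; this reproduces the listed value but is not stated in the record, and the function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 1_709_204                 # bp, from the "Length" field
GENE_LOW, GENE_HIGH = 1_108_511, 1_110_295  # "Start" and "End" fields


def gene_length(low: int, high: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return high - low + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length_bp = gene_length(GENE_LOW, GENE_HIGH)
# One codon is the stop codon, so it does not contribute a residue.
residues = length_bp // 3 - 1
# The gene is on the reverse strand, so its start is the higher coordinate.
centi = round(centisome(GENE_HIGH, GENOME_LENGTH), 2)

print(length_bp, residues, centi)  # 1785 594 64.96
```

The results agree with the 1785-base gene sequence, the 594-residue protein and the centisome position given in this record.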
GC content: 31.88 %
Gene sequence:
>1785_bases ATGGTTAATAAAGAAATTATAGTCAGTGATATTCAAAAGAGTTTGAATAATAGAGTTTGTGAAGATGCTGGAATAATTCC TGTGAGTATTGAGTTTGGGGTAATTGAAATAGGAGCAATGAATCCAGATTTTATTAGAGTTAAAGAAGTAATAACTGATA TTAAGAGACGTTTTAATGTTGAAGTTGTATTAAAACAAATAACTGCATCTGAATGGGAAGAATGGTTTGAAAATAATTCA TCAATATCATTGACTGATGAAAATCAATTAAAAAATGGAATAGAAGATACTAGTGATGAAATAAATATCCAATCAAAATC AACTGAAGATCTTAAGAGTGATAATCTAAATGATGAAATAGATAATTTTGATATTGTAGAAAATGAGGAAAACAATAATT CAATAGATATTAATTATTCAAATGAAACTAATAACTTTCATTATGAATCCAAAGAAATTAAAGAAAAGCCATTTGAAGAA GATGTTGAAAACGATAATGCTTTAATTAAACTCGCACAAACTAATTTAGGTGAAGATTTTGAAAATGATGTAGATAGTTT TTTTGGAGATGAAATTACAAAATCTAAGGATCCAGTGATATCTGGTGTTGCATCATTGTTAAGTAAGTGTTTCTCTCTTA ATGCTTCAGATGTACACATAGAACCCCTTGAAGACAGATTAAGAATACGCTATAGAATTGATGGAATGCTCGAAGAGGTA TTCGCTTTCCCAAGATCTCATATCAGTCCAATTGTAAGCAGACTTAAGATAATGAGTAATTTAGACATAGCTGAAAAAAG AATACCGCAAGATGGAAGAATAAGATGTTTGCTAAGAGGAAGAAAGTCTGATTTTAGAGTTAGCACACTACCAGGTAAAT GGGGTGAAAAAGTTGTTCTCAGAGTTTTAGAAAGCGATAGTTCAGTACTTAATCTTTCAAAGTTAATAACTGAAAAAAGT GAATTAGATTTAATAAGAATAATGTCTAAAACTCCCTATGGAATTGTCGTTGTAGTAGGTCCAACTGGAAGTGGAAAATC CACTACTTTATACTCTATGTTGAGTGAATTAAACTCTCCTGGAGTCAATATTAGTACTGTTGAGGATCCAGTTGAATACA CTTTGGATGGTATTCATCAAGTGCAAGTAATTAGAGAGAAAGGTCTTGATTTCTCAAGAGCTTTGAGATCGTTGATGAGG CAAGATCCAGATATAATTTTAGTAGGTGAGACTAGAGATAAAGAAACTGCCCAAGCCGCCATGGAAGCAGCATTAACTGG ACATATGGTTTTTACTACTCTTCATGCAAATGATACTGCGACCGCAATTACCCGTCTCTCTGAAATGAATATTCCTCCAT ATTTGATTGGAGCATCAATCATAGGAGTTGTTGCTCAAAGATTAGTTAGAAAAGTTTGTTTGTCATGTAGTTCTGTTAAA TCTTTAAGTAAAGGGAAAGATGACAGAGCTATAAAATATGGATTAAATAAAGCAAGAATAATAAATGAAACTAGCAATGA CTCAAAGAGTAGATCATGTCCAGTTTGTGGTGGGAGCGGATATAAAGGTAGAGTTGGGATTTATGAAGTTATGAAAATAA ATGAAAATTTGCGAGAATTAATTATGAAAGAAAGTACCGCAGATGTAATAAGGTCAAAAGCTTTTTTAGTAGAAGGAAGG AGTTTATTAGATTATGGAATGGAACTTGTAAAAAAAGAATTAACTACTATTGAAGAAGTAGAAAGAGTATGCCTGCTTGA GGAGCCATTACCTGAGGAATCATGA
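The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming that GC content means the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, and whitespace is stripped because the sequence is printed in space-separated blocks.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces or line breaks used to wrap the sequence.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 15 bases of the gene: 3 of 15 are G or C.
print(gc_content("ATGGTTAATAAAGAA"))  # 20.0
```

Passing the complete 1785-base sequence instead of the short fragment should give a value comparable to the one listed above.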
Upstream 100 bases:
>100_bases ATGATTATTATAGCTGTATAAATTCTATAAGCCAGAAAATATTTTTTTAATATAATTTATAGTATGGCTTGAAAATTTTT AATGATACTTTTCACTAATT
Downstream 100 bases:
>100_bases AATTAACACAATTAATGTCTGATTTAGTTAAAAGAAATGGGTCAGATTTACATCTGACTGGCAATTCTGTTCCCTTTTTT AGGGTGCAAGGTCAAATAGT
Product: Type II secretory pathway ATPase PulE/Tfp pilus assembly pathway ATPase PilB-like
Products: NA
Alternate protein names: Cholera toxin secretion protein epsE; Type II traffic warden ATPase [H]
Number of amino acids: Translated: 594; Mature: 594
Protein sequence:
>594_residues MVNKEIIVSDIQKSLNNRVCEDAGIIPVSIEFGVIEIGAMNPDFIRVKEVITDIKRRFNVEVVLKQITASEWEEWFENNS SISLTDENQLKNGIEDTSDEINIQSKSTEDLKSDNLNDEIDNFDIVENEENNNSIDINYSNETNNFHYESKEIKEKPFEE DVENDNALIKLAQTNLGEDFENDVDSFFGDEITKSKDPVISGVASLLSKCFSLNASDVHIEPLEDRLRIRYRIDGMLEEV FAFPRSHISPIVSRLKIMSNLDIAEKRIPQDGRIRCLLRGRKSDFRVSTLPGKWGEKVVLRVLESDSSVLNLSKLITEKS ELDLIRIMSKTPYGIVVVVGPTGSGKSTTLYSMLSELNSPGVNISTVEDPVEYTLDGIHQVQVIREKGLDFSRALRSLMR QDPDIILVGETRDKETAQAAMEAALTGHMVFTTLHANDTATAITRLSEMNIPPYLIGASIIGVVAQRLVRKVCLSCSSVK SLSKGKDDRAIKYGLNKARIINETSNDSKSRSCPVCGGSGYKGRVGIYEVMKINENLRELIMKESTADVIRSKAFLVEGR SLLDYGMELVKKELTTIEEVERVCLLEEPLPEES
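The protein sequence corresponds to a direct translation of the gene sequence. The sketch below assumes the standard codon assignments and reads the coding strand as printed in this record (already in the 5' to 3' direction of the gene); `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 11 codons of the gene sequence listed in this record.
print(translate("ATGGTTAATAAAGAAATTATAGTCAGTGATATT"))  # MVNKEIIVSDI
```

The fragment reproduces the first eleven residues of the protein sequence, and the final codons of the gene (`GAG GAA TCA TGA`) give the C-terminal `EES` followed by a stop.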
Sequences:
>Translated_594_residues MVNKEIIVSDIQKSLNNRVCEDAGIIPVSIEFGVIEIGAMNPDFIRVKEVITDIKRRFNVEVVLKQITASEWEEWFENNS SISLTDENQLKNGIEDTSDEINIQSKSTEDLKSDNLNDEIDNFDIVENEENNNSIDINYSNETNNFHYESKEIKEKPFEE DVENDNALIKLAQTNLGEDFENDVDSFFGDEITKSKDPVISGVASLLSKCFSLNASDVHIEPLEDRLRIRYRIDGMLEEV FAFPRSHISPIVSRLKIMSNLDIAEKRIPQDGRIRCLLRGRKSDFRVSTLPGKWGEKVVLRVLESDSSVLNLSKLITEKS ELDLIRIMSKTPYGIVVVVGPTGSGKSTTLYSMLSELNSPGVNISTVEDPVEYTLDGIHQVQVIREKGLDFSRALRSLMR QDPDIILVGETRDKETAQAAMEAALTGHMVFTTLHANDTATAITRLSEMNIPPYLIGASIIGVVAQRLVRKVCLSCSSVK SLSKGKDDRAIKYGLNKARIINETSNDSKSRSCPVCGGSGYKGRVGIYEVMKINENLRELIMKESTADVIRSKAFLVEGR SLLDYGMELVKKELTTIEEVERVCLLEEPLPEES
>Mature_594_residues MVNKEIIVSDIQKSLNNRVCEDAGIIPVSIEFGVIEIGAMNPDFIRVKEVITDIKRRFNVEVVLKQITASEWEEWFENNS SISLTDENQLKNGIEDTSDEINIQSKSTEDLKSDNLNDEIDNFDIVENEENNNSIDINYSNETNNFHYESKEIKEKPFEE DVENDNALIKLAQTNLGEDFENDVDSFFGDEITKSKDPVISGVASLLSKCFSLNASDVHIEPLEDRLRIRYRIDGMLEEV FAFPRSHISPIVSRLKIMSNLDIAEKRIPQDGRIRCLLRGRKSDFRVSTLPGKWGEKVVLRVLESDSSVLNLSKLITEKS ELDLIRIMSKTPYGIVVVVGPTGSGKSTTLYSMLSELNSPGVNISTVEDPVEYTLDGIHQVQVIREKGLDFSRALRSLMR QDPDIILVGETRDKETAQAAMEAALTGHMVFTTLHANDTATAITRLSEMNIPPYLIGASIIGVVAQRLVRKVCLSCSSVK SLSKGKDDRAIKYGLNKARIINETSNDSKSRSCPVCGGSGYKGRVGIYEVMKINENLRELIMKESTADVIRSKAFLVEGR SLLDYGMELVKKELTTIEEVERVCLLEEPLPEES
Specific function: Required for secretion of cholera toxin through the outer membrane [H]
COG id: COG2804
COG function: function code NU; Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
Gene ontology:
Cell location: Cytoplasm (Probable) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the GSP E family [H]
Homologues:
- Organism=Escherichia coli, GI1789723, Length=441, Percent_Identity=40.3628117913832, Blast_Score=317, Evalue=2e-87
- Organism=Escherichia coli, GI1786296, Length=382, Percent_Identity=39.0052356020942, Blast_Score=273, Evalue=3e-74
- Organism=Escherichia coli, GI87082188, Length=166, Percent_Identity=40.9638554216867, Blast_Score=105, Evalue=7e-24
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003593
- InterPro: IPR013369
- InterPro: IPR001482 [H]
Pfam domain/function: PF00437 GSPII_E [H]
EC number: NA
Molecular weight (Da): Translated: 66570; Mature: 66570
Theoretical pI: Translated: 4.60; Mature: 4.60
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
2.2 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
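The Cys/Met percentages can be recomputed from the protein sequence. This is a minimal sketch, assuming that each value is the count of the named residues divided by the total number of residues; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    # Remove the spaces used to wrap the sequence in this record.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return 100.0 * count / len(seq)


# Toy sequence of 8 residues with one Cys and one Met.
toy = "MVNKCIIV"
print(residue_percent(toy, "C"))   # 12.5
print(residue_percent(toy, "M"))   # 12.5
print(residue_percent(toy, "CM"))  # 25.0
```

Applied to the 594-residue sequence in this record, the same function should give values comparable to those listed above after rounding to one decimal place.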
Secondary structure:
>Translated Secondary Structure
MVNKEIIVSDIQKSLNNRVCEDAGIIPVSIEFGVIEIGAMNPDFIRVKEVITDIKRRFNV
CCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECEEEECCCCCCHHHHHHHHHHHHHHCCH
EVVLKQITASEWEEWFENNSSISLTDENQLKNGIEDTSDEINIQSKSTEDLKSDNLNDEI
HHHHHHHHHHHHHHHHCCCCEEEECCHHHHHCCCCCCCCCEEECCCCCHHHHCCCCCCCC
DNFDIVENEENNNSIDINYSNETNNFHYESKEIKEKPFEEDVENDNALIKLAQTNLGEDF
CCCCEEECCCCCCEEEEEECCCCCCCEECCHHHHCCCCHHHCCCCCEEEEEECCCCCCHH
ENDVDSFFGDEITKSKDPVISGVASLLSKCFSLNASDVHIEPLEDRLRIRYRIDGMLEEV
HHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHCCCCCCEEECCCCCCCEEEHHHHHHHHHH
FAFPRSHISPIVSRLKIMSNLDIAEKRIPQDGRIRCLLRGRKSDFRVSTLPGKWGEKVVL
HHCCHHHHHHHHHHHHHHHCCCHHHHHCCCCCCEEEEEECCCCCCEEEECCCCCHHHHHH
RVLESDSSVLNLSKLITEKSELDLIRIMSKTPYGIVVVVGPTGSGKSTTLYSMLSELNSP
HHHHCCCHHHHHHHHHHCHHHHHHHHHHCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCC
GVNISTVEDPVEYTLDGIHQVQVIREKGLDFSRALRSLMRQDPDIILVGETRDKETAQAA
CCEEEECCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHH
MEAALTGHMVFTTLHANDTATAITRLSEMNIPPYLIGASIIGVVAQRLVRKVCLSCSSVK
HHHHHHCCEEEEEEECCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
SLSKGKDDRAIKYGLNKARIINETSNDSKSRSCPVCGGSGYKGRVGIYEVMKINENLREL
HHHCCCCCCHHHHCCCHHEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH
IMKESTADVIRSKAFLVEGRSLLDYGMELVKKELTTIEEVERVCLLEEPLPEES
HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure
MVNKEIIVSDIQKSLNNRVCEDAGIIPVSIEFGVIEIGAMNPDFIRVKEVITDIKRRFNV
CCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECEEEECCCCCCHHHHHHHHHHHHHHCCH
EVVLKQITASEWEEWFENNSSISLTDENQLKNGIEDTSDEINIQSKSTEDLKSDNLNDEI
HHHHHHHHHHHHHHHHCCCCEEEECCHHHHHCCCCCCCCCEEECCCCCHHHHCCCCCCCC
DNFDIVENEENNNSIDINYSNETNNFHYESKEIKEKPFEEDVENDNALIKLAQTNLGEDF
CCCCEEECCCCCCEEEEEECCCCCCCEECCHHHHCCCCHHHCCCCCEEEEEECCCCCCHH
ENDVDSFFGDEITKSKDPVISGVASLLSKCFSLNASDVHIEPLEDRLRIRYRIDGMLEEV
HHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHCCCCCCEEECCCCCCCEEEHHHHHHHHHH
FAFPRSHISPIVSRLKIMSNLDIAEKRIPQDGRIRCLLRGRKSDFRVSTLPGKWGEKVVL
HHCCHHHHHHHHHHHHHHHCCCHHHHHCCCCCCEEEEEECCCCCCEEEECCCCCHHHHHH
RVLESDSSVLNLSKLITEKSELDLIRIMSKTPYGIVVVVGPTGSGKSTTLYSMLSELNSP
HHHHCCCHHHHHHHHHHCHHHHHHHHHHCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCC
GVNISTVEDPVEYTLDGIHQVQVIREKGLDFSRALRSLMRQDPDIILVGETRDKETAQAA
CCEEEECCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHH
MEAALTGHMVFTTLHANDTATAITRLSEMNIPPYLIGASIIGVVAQRLVRKVCLSCSSVK
HHHHHHCCEEEEEEECCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
SLSKGKDDRAIKYGLNKARIINETSNDSKSRSCPVCGGSGYKGRVGIYEVMKINENLREL
HHHCCCCCCHHHHCCCHHEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH
IMKESTADVIRSKAFLVEGRSLLDYGMELVKKELTTIEEVERVCLLEEPLPEES
HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
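The predicted secondary structure can be summarised as the fraction of residues in each state. The sketch below assumes the conventional three-state encoding (H = helix, E = strand, C = coil), which this record does not define explicitly; `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of residues in each of the H, E and C states."""
    # Remove any spaces or line breaks between blocks of the prediction.
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty state string")
    counts = Counter(s)
    return {state: 100.0 * counts[state] / len(s) for state in "HEC"}


# Toy prediction: two helix, one strand and one coil residue.
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}

# First 60-residue block of the prediction listed in this record.
first_block = "CCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECEEEECCCCCCHHHHHHHHHHHHHHCCH"
print(ss_composition(first_block))
```

Only the state lines of the prediction should be passed to the function, not the interleaved residue lines.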
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8423007; 10952301 [H]