Definition Prochlorococcus marinus str. AS9601, complete genome.
Accession NC_008816
Length 1,669,886

Map label: 123968759

Identifier: 123968759

GI number: 123968759

Start: 1043975

End: 1045819

Strand: Reverse

Name: 123968759

Synonym: A9601_12261

Alternate gene names: NA

Gene position: 1045819-1043975 (Counterclockwise)

Preceding gene: 123968760

Following gene: 123968758

Centisome position: 62.63
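
The coordinate fields above are enough to re-derive two values in this record. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's 5' coordinate (1045819, since the gene is on the reverse strand) expressed as a percentage of the genome length. The function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 1_669_886  # "Length" field of NC_008816


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


start, end = 1_043_975, 1_045_819
five_prime = end  # reverse strand: the gene begins at the higher coordinate

print(gene_length(start, end))                            # 1845
print(round(centisome(five_prime, GENOME_LENGTH), 2))     # 62.63
```

Under these assumptions the sketch reproduces the 1845-base gene length and the reported centisome position of 62.63. Using the Start coordinate (1043975) instead would give 62.52, which suggests the record measures from the 5' end of the gene.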

GC content: 24.93

Gene sequence:

>1845_bases
TTGAAAAAAATTGACTTATTAAAAAAAATTGAAGAAGCAAAAATAAAACATAAAGCAGGGAACTCTTTAGAGGCAAATCA
GATATTTCAAGAGTTATTAAAATCAAATAATGATTCTTTTGATTTACTTTACGCTTATGGGTTGTTTTGTAGAGATTTAA
AAAATTTTAATTTAGCAAAAAGAGTATTTCTCAATCTAATTAATAAATTCTCATCATCAATTAATTCTTATATTTTATTA
GCTGAAATATTAAAAATTGAGAACAAATTCAATGATGCAGAAAGGGTACTGCAAAAGGCAATAAAAATTAATCCTAATCA
TGGAGATTTACTTTATAATCTTTCTCTTTTGTACTTTACATTGAGAAACTTTGATTATGCACTAGATTATATAGATAAAG
CTATTAAAGTATCGATAAATAATGATATTTATAAACTTTTAAAGTCTGAGATTTATATCAATAAATTCAATATTGATGAA
GCATTGTATATCTTGGAAAATCTAAATAATAAAAATAGAATTAAAAAAGATAAAAATAAAGAAATAAGAATAAATATTCT
TCTAGCCAATGCATTCCTAAAAAAAAGGAAGTACGAAGAAGCAGAAACAATTCTTTTAAAATTGACCAAAAAATATCAAG
GATTGGAATTGGCTTATTTAAATTTAAGTATTCTGTATAAGGATAAGAATCAATTAAGTAAAAGTATACAAATACTAAAA
AAGGGAATAAACCTATCTCCCAATTTCATGCCTTTTTATAAAAATTTAGCAAGTTTCTATAGAAATTCAGGACAGCTTAA
ACTTGCTATTGAGACTAACTTATTTATTATTTCTAGAAATAAATTTGACTTCAATAGTTTTTATGAATTATCTGGGATTT
ATGATTTTAAGAATCATAAAAATGAATTAGATTTTTTATTAAATACTAAACTTGAGAATCTTAATCCAAACTCAAAGATA
TACGCAGCTTTTGCAATCTCAAATTTGCTGCACAAACAAGGAAAATTTAAAGAAAGTGCAAAATATCTAAAAATCGCCAA
TGACGAAGGCATGAAGTATAAAAAATCTGACTCAAGTTTGAAGATTAAACATACTGAATCTTATAGATCACTAAAAATCA
AAAAATCAAAAAATAAATATTTGAAGAATTCTTCTAATTATGTCTTTATTGTTGGCATGCCAAGATCAGGAAGTACTTTA
CTGGAAAACATATTAAGTTTAAATTCTGAAGTAACTGATATGGGCGAGGTTAGCTTTTTAGAGGAATCCATCAAGGAAGC
TAAAGATTTTGAAGAAATATATGATTTATATGAAAAAAAAGTTATTAATCAATTTAAATCCGCTACCTTTTACACCGATA
AAAGTTTATTTAATTATATGTATATTGCCATTATTTCTAATTTTTTTCCTAAAGCAAAAATAATAAATTGCATAAGAAAC
CCTCTCGATAATATTTTATCCATTTATAGAGCAAACTTTTTAAATCAGTCATTCTCTTTCTCTTTATCTGATATTTCTTG
TTTATATAAACACTATTTTGAAACTATGGAGGAATATAAAATTAAATATGGTGTAAATATTTATGATTATTACTATGAAG
ACTTAATTGAAAATCCCAATAATGTAATACCTAGGATAATAAATTGGCTTGGTTGGGATTGGGACGAAAAATATCTTTCT
CCCCATCAAAACAAAAGAAATGTACACACCGCAAGTAGCGCTCAAATAAGAAAGAAATTTTATTCTTCTTCTATAGGAAT
TTGGAAAGAATATAAGGAACTTTTGGAACCTGCAATAGAAATTATTAAAACAAATAAACTTCTTGCAGAAAAGATTTCTA
GGTGA
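
The GC content field can be recomputed from the gene sequence. A minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length (the function name `gc_content` is illustrative):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: one G out of four bases.
print(gc_content("TTGA"))  # 25.0
```

For a sequence of 1845 bases, a reported value of 24.93 corresponds to 460 G or C bases (460 / 1845 = 24.93%). Pasting the full gene sequence above into `gc_content` is the way to check that figure.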

Upstream 100 bases:

>100_bases
AAAGATCTGCTATAAAGCAGATCTTTTTTTTTTAATTTCTAAAATTAATATTTTTTTTTAATACTTTCATTAACTATAAT
GTATAAAAAATTTAAATATA

Downstream 100 bases:

>100_bases
AGAGAGTGTTTTAGTAATAACCGGCTCGGATTTTGTCTACTGGAGGTTGTAAATTCTTGGCAAATAGATTACCTTTAAAA
TACCTTAAATTGTGTGAGGA

Product: TPR repeat-containing sulfotransferase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 614; Mature: 614

Protein sequence:

>614_residues
MKKIDLLKKIEEAKIKHKAGNSLEANQIFQELLKSNNDSFDLLYAYGLFCRDLKNFNLAKRVFLNLINKFSSSINSYILL
AEILKIENKFNDAERVLQKAIKINPNHGDLLYNLSLLYFTLRNFDYALDYIDKAIKVSINNDIYKLLKSEIYINKFNIDE
ALYILENLNNKNRIKKDKNKEIRINILLANAFLKKRKYEEAETILLKLTKKYQGLELAYLNLSILYKDKNQLSKSIQILK
KGINLSPNFMPFYKNLASFYRNSGQLKLAIETNLFIISRNKFDFNSFYELSGIYDFKNHKNELDFLLNTKLENLNPNSKI
YAAFAISNLLHKQGKFKESAKYLKIANDEGMKYKKSDSSLKIKHTESYRSLKIKKSKNKYLKNSSNYVFIVGMPRSGSTL
LENILSLNSEVTDMGEVSFLEESIKEAKDFEEIYDLYEKKVINQFKSATFYTDKSLFNYMYIAIISNFFPKAKIINCIRN
PLDNILSIYRANFLNQSFSFSLSDISCLYKHYFETMEEYKIKYGVNIYDYYYEDLIENPNNVIPRIINWLGWDWDEKYLS
PHQNKRNVHTASSAQIRKKFYSSSIGIWKEYKELLEPAIEIIKTNKLLAEKISR
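
The relationship between the 1845-base gene and the 614-residue protein can be sketched as a translation: 1845 / 3 = 615 codons, of which the last is a stop codon. The sketch below assumes the standard bacterial genetic code and treats an initial TTG, GTG, or ATG as methionine. That start-codon set is a simplified subset chosen for illustration, and the names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: simplified set of starts


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 13 codons of the gene, written codon by codon, plus a stop codon.
codons = ["TTG", "AAA", "AAA", "ATT", "GAC", "TTA", "TTA",
          "AAA", "AAA", "ATT", "GAA", "GAA", "GCA", "TGA"]
print(translate_cds("".join(codons)))  # MKKIDLLKKIEEA
```

The TTG start codon explains why the protein begins with M even though TTG normally encodes leucine.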

Sequences:

>Translated_614_residues
MKKIDLLKKIEEAKIKHKAGNSLEANQIFQELLKSNNDSFDLLYAYGLFCRDLKNFNLAKRVFLNLINKFSSSINSYILL
AEILKIENKFNDAERVLQKAIKINPNHGDLLYNLSLLYFTLRNFDYALDYIDKAIKVSINNDIYKLLKSEIYINKFNIDE
ALYILENLNNKNRIKKDKNKEIRINILLANAFLKKRKYEEAETILLKLTKKYQGLELAYLNLSILYKDKNQLSKSIQILK
KGINLSPNFMPFYKNLASFYRNSGQLKLAIETNLFIISRNKFDFNSFYELSGIYDFKNHKNELDFLLNTKLENLNPNSKI
YAAFAISNLLHKQGKFKESAKYLKIANDEGMKYKKSDSSLKIKHTESYRSLKIKKSKNKYLKNSSNYVFIVGMPRSGSTL
LENILSLNSEVTDMGEVSFLEESIKEAKDFEEIYDLYEKKVINQFKSATFYTDKSLFNYMYIAIISNFFPKAKIINCIRN
PLDNILSIYRANFLNQSFSFSLSDISCLYKHYFETMEEYKIKYGVNIYDYYYEDLIENPNNVIPRIINWLGWDWDEKYLS
PHQNKRNVHTASSAQIRKKFYSSSIGIWKEYKELLEPAIEIIKTNKLLAEKISR
>Mature_614_residues
MKKIDLLKKIEEAKIKHKAGNSLEANQIFQELLKSNNDSFDLLYAYGLFCRDLKNFNLAKRVFLNLINKFSSSINSYILL
AEILKIENKFNDAERVLQKAIKINPNHGDLLYNLSLLYFTLRNFDYALDYIDKAIKVSINNDIYKLLKSEIYINKFNIDE
ALYILENLNNKNRIKKDKNKEIRINILLANAFLKKRKYEEAETILLKLTKKYQGLELAYLNLSILYKDKNQLSKSIQILK
KGINLSPNFMPFYKNLASFYRNSGQLKLAIETNLFIISRNKFDFNSFYELSGIYDFKNHKNELDFLLNTKLENLNPNSKI
YAAFAISNLLHKQGKFKESAKYLKIANDEGMKYKKSDSSLKIKHTESYRSLKIKKSKNKYLKNSSNYVFIVGMPRSGSTL
LENILSLNSEVTDMGEVSFLEESIKEAKDFEEIYDLYEKKVINQFKSATFYTDKSLFNYMYIAIISNFFPKAKIINCIRN
PLDNILSIYRANFLNQSFSFSLSDISCLYKHYFETMEEYKIKYGVNIYDYYYEDLIENPNNVIPRIINWLGWDWDEKYLS
PHQNKRNVHTASSAQIRKKFYSSSIGIWKEYKELLEPAIEIIKTNKLLAEKISR

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 8 TPR repeats [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000863
- InterPro:   IPR001440
- InterPro:   IPR011717
- InterPro:   IPR013026
- InterPro:   IPR011990
- InterPro:   IPR019734 [H]

Pfam domain/function: PF00685 Sulfotransfer_1; PF00515 TPR_1; PF07721 TPR_4 [H]

EC number: NA

Molecular weight: Translated: 72212; Mature: 72212

Theoretical pI: Translated: 9.83; Mature: 9.83

Prosite motif: PS50005 TPR; PS50293 TPR_REGION

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
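
The percentages above follow from simple residue counts. A minimal sketch, assuming each value is the residue count divided by the protein length and rounded to one decimal place (the function name `residue_percent` is illustrative):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    count = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * count / len(seq), 1)


# Toy 614-residue sequence with 3 Cys and 7 Met, standing in for the protein.
toy = "M" * 7 + "C" * 3 + "A" * 604
print(residue_percent(toy, "C"))   # 0.5
print(residue_percent(toy, "M"))   # 1.1
print(residue_percent(toy, "CM"))  # 1.6
```

For a 614-residue protein, the reported values are consistent with 3 cysteines and 7 methionines. The toy sequence only reproduces the arithmetic, so the real protein sequence should be passed in to confirm the counts.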

Secondary structure:

>Translated Secondary Structure
MKKIDLLKKIEEAKIKHKAGNSLEANQIFQELLKSNNDSFDLLYAYGLFCRDLKNFNLAK
CCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCEEHHHHHHHHHHHHCCCCHHH
RVFLNLINKFSSSINSYILLAEILKIENKFNDAERVLQKAIKINPNHGDLLYNLSLLYFT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHH
LRNFDYALDYIDKAIKVSINNDIYKLLKSEIYINKFNIDEALYILENLNNKNRIKKDKNK
HHHHHHHHHHHCHHEEEEECHHHHHHHHHHHEEEEECHHHHHHHHHCCCCCCCCCCCCCC
EIRINILLANAFLKKRKYEEAETILLKLTKKYQGLELAYLNLSILYKDKNQLSKSIQILK
EEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCHHHHHHHHHHHH
KGINLSPNFMPFYKNLASFYRNSGQLKLAIETNLFIISRNKFDFNSFYELSGIYDFKNHK
HCCCCCCCCHHHHHHHHHHHCCCCCEEEEEEEEEEEEECCCCCCCCHHHHCCEEHHHCCC
NELDFLLNTKLENLNPNSKIYAAFAISNLLHKQGKFKESAKYLKIANDEGMKYKKSDSSL
CHHHHHHHCHHCCCCCCCCEEHHHHHHHHHHHCCCHHHCCCEEEEECCCCCEEECCCCCE
KIKHTESYRSLKIKKSKNKYLKNSSNYVFIVGMPRSGSTLLENILSLNSEVTDMGEVSFL
EEEECCCHHHEEEECHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCCCCCCHHHHHH
EESIKEAKDFEEIYDLYEKKVINQFKSATFYTDKSLFNYMYIAIISNFFPKAKIINCIRN
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEHHHHHHHHHHHHHHHHCCHHHHHHHHHH
PLDNILSIYRANFLNQSFSFSLSDISCLYKHYFETMEEYKIKYGVNIYDYYYEDLIENPN
HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC
NVIPRIINWLGWDWDEKYLSPHQNKRNVHTASSAQIRKKFYSSSIGIWKEYKELLEPAIE
HHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
IIKTNKLLAEKISR
HHHHHHHHHHHHCC
>Mature Secondary Structure
MKKIDLLKKIEEAKIKHKAGNSLEANQIFQELLKSNNDSFDLLYAYGLFCRDLKNFNLAK
CCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCEEHHHHHHHHHHHHCCCCHHH
RVFLNLINKFSSSINSYILLAEILKIENKFNDAERVLQKAIKINPNHGDLLYNLSLLYFT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHH
LRNFDYALDYIDKAIKVSINNDIYKLLKSEIYINKFNIDEALYILENLNNKNRIKKDKNK
HHHHHHHHHHHCHHEEEEECHHHHHHHHHHHEEEEECHHHHHHHHHCCCCCCCCCCCCCC
EIRINILLANAFLKKRKYEEAETILLKLTKKYQGLELAYLNLSILYKDKNQLSKSIQILK
EEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCHHHHHHHHHHHH
KGINLSPNFMPFYKNLASFYRNSGQLKLAIETNLFIISRNKFDFNSFYELSGIYDFKNHK
HCCCCCCCCHHHHHHHHHHHCCCCCEEEEEEEEEEEEECCCCCCCCHHHHCCEEHHHCCC
NELDFLLNTKLENLNPNSKIYAAFAISNLLHKQGKFKESAKYLKIANDEGMKYKKSDSSL
CHHHHHHHCHHCCCCCCCCEEHHHHHHHHHHHCCCHHHCCCEEEEECCCCCEEECCCCCE
KIKHTESYRSLKIKKSKNKYLKNSSNYVFIVGMPRSGSTLLENILSLNSEVTDMGEVSFL
EEEECCCHHHEEEECHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCCCCCCHHHHHH
EESIKEAKDFEEIYDLYEKKVINQFKSATFYTDKSLFNYMYIAIISNFFPKAKIINCIRN
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEHHHHHHHHHHHHHHHHCCHHHHHHHHHH
PLDNILSIYRANFLNQSFSFSLSDISCLYKHYFETMEEYKIKYGVNIYDYYYEDLIENPN
HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC
NVIPRIINWLGWDWDEKYLSPHQNKRNVHTASSAQIRKKFYSSSIGIWKEYKELLEPAIE
HHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
IIKTNKLLAEKISR
HHHHHHHHHHHHCC
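
The prediction strings above can be summarised as per-state fractions. This sketch assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not define. The function name is illustrative.

```python
from collections import Counter


def state_fractions(prediction: str) -> dict:
    """Fraction of positions in each assumed state: H, E, and C."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {state: counts[state] / len(states) for state in "HEC"}


# Final prediction line of the record (14 positions, aligned to IIKTNKLLAEKISR).
fractions = state_fractions("HHHHHHHHHHHHCC")
print(fractions["H"], fractions["E"], fractions["C"])
```

To summarise the whole protein, concatenate only the prediction lines (every second line of the block) before calling the function. The interleaved residue lines must be left out because the letters H, E, and C are also amino-acid codes.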

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9537320 [H]