Definition: Prochlorococcus marinus str. NATL1A, complete genome.
Accession: NC_008819
Length: 1,864,731

The map label for this gene is vacB [H]

Identifier: 124025749

GI number: 124025749

Start: 959193

End: 961502

Strand: Direct

Name: vacB [H]

Synonym: NATL1_10421

Alternate gene names: 124025749

Gene position: 959193-961502 (Clockwise)

Preceding gene: 124025748

Following gene: 124025750

Centisome position: 51.44
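
The centisome value is consistent with the gene start expressed as a percentage of the genome length. A minimal sketch under that assumption follows; the function name `centisome` is illustrative and not part of the database.

```python
def centisome(start: int, genome_length: int) -> float:
    """Return the gene start as a percentage of the genome length, to 2 decimals."""
    return round(100.0 * start / genome_length, 2)

# Values taken from this record: Start 959193, genome Length 1,864,731.
print(centisome(959193, 1864731))  # 51.44
```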

GC content: 31.0
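
GC content is the percentage of G and C bases in a nucleotide sequence. The sketch below assumes the reported 31.0 was computed over the 2310-base gene sequence of this record; the function name `gc_content` is illustrative, and the usage line applies it only to the first ten bases of the gene as a small worked case.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq, to 1 decimal."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)

# First ten bases of the gene sequence: 2 G + 2 C out of 10 bases.
print(gc_content("ATGACATCAG"))  # 40.0
```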

Gene sequence:

>2310_bases
ATGACATCAGTTCTAAAAATACTAGAAAGTCTAGATTGTGAGGATGGATTAGAAGTTTCAAAACTTGAGAGATCCTTAAA
AATCACAAAAAAAATCGATAAAGATAATTTAAACGTAGCTATTAAAGCTCTTACAAAACTAGGAATCGTACAAACTATTA
TTGATGATAAACTGACGATAAACAATGATATAGATTTTTTGCGAGGTAGGGTGCGTTGTAGTAGTAAAGGCTATTGTTTT
GTTGTTAGAGAAGATCAAGGTGAAGATATTTACATTAGAGAGAGTAATTTAAATAATGCCTGGCATGGCGATTCTGTATT
TGTATTAATAACTAAGCATGGAGTAAGGAGGAGAGCACCTGAAGGGAAAATACAATGTGTTTTACAAAGATATAATGATA
CTTTGCTGGCAAAAGTTGAACCTGACAAGACCACTGGAGAATTAAAGGCCTACCCTTTAGACGACAGAATACCAGTAACA
ATTGAACTTGAAGATGTGATAGAAACTAATAAGAAATATCCCGATAAAGATTTAATTCATGAAATAAAGATAACTAAATA
TCCAATTGCTCAATTTAATGCTAAGGGTTCAATTGTTAGAGAATTATCGATTAATTCAGGAGTGGAGGGAGATATTGAAT
TATTACTTTCCAAGAATAATATATCTAAGAGTTTGGTAGCACCTAAAGTTGCTCCCAAGAAGATTTCCTCAAAAGGAAGA
TCAGATTTAACTGCTCAACCTTCTTTATTATTTGAAAGTTGGGAATCTACTAATTCACCTTCTCTTCCTGGATTATATGC
GCAACCCTATGAAGGGGGTAATAGAATATGGGTTCATGCACCTACTATCTCTGAAAGAATCAATTTAGGCGGGAAGCTAG
ATAGATATCTTAAAGATAAAGGTGAAGTGATTTGCTTAGGTAACAACTGGCTAGAGTTTCTTAATGAATCACTTAGATCA
GCATCTCAATTTAAAATCAATGAGGAATGCGAAGCTATTTCATTGATGATAGATATTAATGCCGATGGCAATATAACTGA
TTGGAAATTCACACTCAGTATTATTAAGCCTGTGAAAATCATTACTCCAAAACACCTTACAGCTATCAACAACAGAAAAG
CTAAATCGAAGTCAGTTCCTATAGCTCTTAAAGGTATTAAAGACAATCTAGAAGTCCTTTATACTCTTTTACATTCAGCC
AAAATAATTAATAATATGAACAATAATTCTATTAAATTAGATGAGTATATTCCTAAATTAGAGAGATTAAGTGAATTATC
AAAAACTTTTCCTGGCAGGGACTTTCACGGATGGTCTAAAACTTACGATTGTTGCGATCCACAATCTCTAATAGATATTT
ATATAAGACTTTCAAATAATATTCTTTCAAAACATTTAATAGGCTATAGATTACCATTTATTTATAAAGAGCACGAAGAA
ATTGATCAATCATCTATTAACGAATTAACTAAATCAGCATTGGCATTAGATAAAAAAATAACCGTTAATGTTGATGGGAC
AATAACATCTAGTGAATTAATAAAGTCATTTGAATCAAGTAGTGAAAAGAAAATACTTCATAAACTTGTTAAACATATAA
TTCCTGGAGTTCATTTAAAACTATACGAGATAAATACTCATGAAGACATTACGCATTTAAATCATGATGCAGCACAGACT
AATATTGAATCTCCTTGGTGTTGCCCATCCTTAAACTACTGGAATATATTTAATCAGTTTATATTATCTTTACTTTTGTC
TGAAGGTAAAAGTAAATCTTCAAGCCGAAGTAAACAAACTGTCGAATTAGGTAAAAATGATAGCTGGAGAGAAGTTGATT
GGGAAATATTTCCATCTAAGCTTAAAGAAATTATTGGTAATCATTCAAATTCAAGATTAATTAATAATTTAAATGAAATT
AGAAAAAAATCTAAATCATTTAGAAACAACATCATTTCAATTGCTCAGGGTAGAGAGGCTCAGAAAATTATAGGTAAAGA
GGTAACAGCTATTATAACTGGTGTACAAAGCTATGGTTTTTTTGCTGAGATAGATGACTTAACTGCTGAAGGCTTAGTCC
ATGTAAGTACTTTGGGTGACGATTGGTACGAATATAGATCTAGGCAAAACCTTTTAGTAGGTAGAAAAAGCAAAAAAACA
TATCAACTTGCGCAAAAAATCAATGTCAGAGTTTTAAAAGTTGACATTCTAAAAAATCAAATTGACTTGGAACTTGTAAA
AGAAACAGGAACAGAAACAGAAACAGAAACAATAAATACTATTAAAGTTGAATCTGATAATGAAGATTAA

Upstream 100 bases:

>100_bases
TTCTAATGATAGCTTCTAAACTAAACTTATTTTTGAAATATTTAGTCTTTTATTTTTAAAGACATAATCCAATGAAATTA
AATTAATTCCCCCTTTTATT

Downstream 100 bases:

>100_bases
ATGAAGAGTATTGTGATTGCAATTACTGGTGCTTCGGCTATGCAAATAGCAGAGCGTTCAATCCAGGTATTGCTAGAGAA
TGATCAATCAGTAGATTTAA

Product: putative acetazolamide conferring resistance protein Zam

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 769; Mature: 768
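
These counts follow from the gene coordinates: 961502 - 959193 + 1 = 2310 bases, or 770 codons, of which the last (TAA) is the stop codon, leaving 769 translated residues. The mature sequence in this record lacks the initiator methionine, giving 768. A short sketch of that arithmetic, with an illustrative function name:

```python
def protein_lengths(start, end):
    """Return (gene length in bases, translated residues, mature residues).

    Assumes inclusive 1-based coordinates, a single terminal stop codon,
    and removal of the initiator methionine in the mature protein.
    """
    gene_len = end - start + 1
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    translated = gene_len // 3 - 1  # drop the stop codon
    mature = translated - 1         # drop the initiator Met
    return gene_len, translated, mature

# Start and End taken from this record.
print(protein_lengths(959193, 961502))  # (2310, 769, 768)
```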

Protein sequence:

>769_residues
MTSVLKILESLDCEDGLEVSKLERSLKITKKIDKDNLNVAIKALTKLGIVQTIIDDKLTINNDIDFLRGRVRCSSKGYCF
VVREDQGEDIYIRESNLNNAWHGDSVFVLITKHGVRRRAPEGKIQCVLQRYNDTLLAKVEPDKTTGELKAYPLDDRIPVT
IELEDVIETNKKYPDKDLIHEIKITKYPIAQFNAKGSIVRELSINSGVEGDIELLLSKNNISKSLVAPKVAPKKISSKGR
SDLTAQPSLLFESWESTNSPSLPGLYAQPYEGGNRIWVHAPTISERINLGGKLDRYLKDKGEVICLGNNWLEFLNESLRS
ASQFKINEECEAISLMIDINADGNITDWKFTLSIIKPVKIITPKHLTAINNRKAKSKSVPIALKGIKDNLEVLYTLLHSA
KIINNMNNNSIKLDEYIPKLERLSELSKTFPGRDFHGWSKTYDCCDPQSLIDIYIRLSNNILSKHLIGYRLPFIYKEHEE
IDQSSINELTKSALALDKKITVNVDGTITSSELIKSFESSSEKKILHKLVKHIIPGVHLKLYEINTHEDITHLNHDAAQT
NIESPWCCPSLNYWNIFNQFILSLLLSEGKSKSSSRSKQTVELGKNDSWREVDWEIFPSKLKEIIGNHSNSRLINNLNEI
RKKSKSFRNNIISIAQGREAQKIIGKEVTAIITGVQSYGFFAEIDDLTAEGLVHVSTLGDDWYEYRSRQNLLVGRKSKKT
YQLAQKINVRVLKVDILKNQIDLELVKETGTETETETINTIKVESDNED

Sequences:

>Translated_769_residues
MTSVLKILESLDCEDGLEVSKLERSLKITKKIDKDNLNVAIKALTKLGIVQTIIDDKLTINNDIDFLRGRVRCSSKGYCF
VVREDQGEDIYIRESNLNNAWHGDSVFVLITKHGVRRRAPEGKIQCVLQRYNDTLLAKVEPDKTTGELKAYPLDDRIPVT
IELEDVIETNKKYPDKDLIHEIKITKYPIAQFNAKGSIVRELSINSGVEGDIELLLSKNNISKSLVAPKVAPKKISSKGR
SDLTAQPSLLFESWESTNSPSLPGLYAQPYEGGNRIWVHAPTISERINLGGKLDRYLKDKGEVICLGNNWLEFLNESLRS
ASQFKINEECEAISLMIDINADGNITDWKFTLSIIKPVKIITPKHLTAINNRKAKSKSVPIALKGIKDNLEVLYTLLHSA
KIINNMNNNSIKLDEYIPKLERLSELSKTFPGRDFHGWSKTYDCCDPQSLIDIYIRLSNNILSKHLIGYRLPFIYKEHEE
IDQSSINELTKSALALDKKITVNVDGTITSSELIKSFESSSEKKILHKLVKHIIPGVHLKLYEINTHEDITHLNHDAAQT
NIESPWCCPSLNYWNIFNQFILSLLLSEGKSKSSSRSKQTVELGKNDSWREVDWEIFPSKLKEIIGNHSNSRLINNLNEI
RKKSKSFRNNIISIAQGREAQKIIGKEVTAIITGVQSYGFFAEIDDLTAEGLVHVSTLGDDWYEYRSRQNLLVGRKSKKT
YQLAQKINVRVLKVDILKNQIDLELVKETGTETETETINTIKVESDNED
>Mature_768_residues
TSVLKILESLDCEDGLEVSKLERSLKITKKIDKDNLNVAIKALTKLGIVQTIIDDKLTINNDIDFLRGRVRCSSKGYCFV
VREDQGEDIYIRESNLNNAWHGDSVFVLITKHGVRRRAPEGKIQCVLQRYNDTLLAKVEPDKTTGELKAYPLDDRIPVTI
ELEDVIETNKKYPDKDLIHEIKITKYPIAQFNAKGSIVRELSINSGVEGDIELLLSKNNISKSLVAPKVAPKKISSKGRS
DLTAQPSLLFESWESTNSPSLPGLYAQPYEGGNRIWVHAPTISERINLGGKLDRYLKDKGEVICLGNNWLEFLNESLRSA
SQFKINEECEAISLMIDINADGNITDWKFTLSIIKPVKIITPKHLTAINNRKAKSKSVPIALKGIKDNLEVLYTLLHSAK
IINNMNNNSIKLDEYIPKLERLSELSKTFPGRDFHGWSKTYDCCDPQSLIDIYIRLSNNILSKHLIGYRLPFIYKEHEEI
DQSSINELTKSALALDKKITVNVDGTITSSELIKSFESSSEKKILHKLVKHIIPGVHLKLYEINTHEDITHLNHDAAQTN
IESPWCCPSLNYWNIFNQFILSLLLSEGKSKSSSRSKQTVELGKNDSWREVDWEIFPSKLKEIIGNHSNSRLINNLNEIR
KKSKSFRNNIISIAQGREAQKIIGKEVTAIITGVQSYGFFAEIDDLTAEGLVHVSTLGDDWYEYRSRQNLLVGRKSKKTY
QLAQKINVRVLKVDILKNQIDLELVKETGTETETETINTIKVESDNED

Specific function: Not known; controls resistance to the carbonic anhydrase inhibitor acetazolamide [H]

COG id: COG0557

COG function: function code K; Exoribonuclease R

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 S1 motif domain [H]

Homologues:

Organism=Escherichia coli, GI87082383, Length=83, Percent_Identity=37.3493975903614, Blast_Score=70, Evalue=5e-13

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011129
- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR003029
- InterPro:   IPR022967
- InterPro:   IPR013223
- InterPro:   IPR001900
- InterPro:   IPR022966
- InterPro:   IPR011805 [H]

Pfam domain/function: PF08206 OB_RNB; PF00773 RNB; PF00575 S1 [H]

EC number: 3.1.-.- [C]

Molecular weight: Translated: 87155; Mature: 87023

Theoretical pI: Translated: 7.94; Mature: 7.94

Prosite motif: PS50126 S1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
0.4 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
0.3 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
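
The percentages above appear to be residue counts divided by sequence length, rounded to one decimal. A minimal sketch under that assumption follows; the function name `residue_percent` is illustrative, and the usage lines apply it only to the first 20 residues of the translated protein rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Return the percentage of positions in protein matching any letter in residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(protein), 1)

# First 20 residues of the translated protein: one Met and one Cys.
fragment = "MTSVLKILESLDCEDGLEVS"
print(residue_percent(fragment, "C"))   # 5.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 10.0
```

Dropping the initiator methionine changes both the Met count and the sequence length, which would explain why the Met and Cys+Met figures differ between the translated and mature protein.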

Secondary structure:

>Translated Secondary Structure
MTSVLKILESLDCEDGLEVSKLERSLKITKKIDKDNLNVAIKALTKLGIVQTIIDDKLTI
CCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCEEE
NNDIDFLRGRVRCSSKGYCFVVREDQGEDIYIRESNLNNAWHGDSVFVLITKHGVRRRAP
CCCHHHHHCEEEECCCCEEEEEEECCCCEEEEEECCCCCCCCCCEEEEEEECCCCHHCCC
EGKIQCVLQRYNDTLLAKVEPDKTTGELKAYPLDDRIPVTIELEDVIETNKKYPDKDLIH
CCHHHHHHHHCCCEEEEEECCCCCCCCEEEECCCCCCEEEEEEHHHHHCCCCCCCHHHHH
EIKITKYPIAQFNAKGSIVRELSINSGVEGDIELLLSKNNISKSLVAPKVAPKKISSKGR
HEEEEECCHHHCCCCCCEEEEEECCCCCCCCEEEEEECCCCCHHHCCCCCCCHHHHCCCC
SDLTAQPSLLFESWESTNSPSLPGLYAQPYEGGNRIWVHAPTISERINLGGKLDRYLKDK
CCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHCCCCHHHHHHHCC
GEVICLGNNWLEFLNESLRSASQFKINEECEAISLMIDINADGNITDWKFTLSIIKPVKI
CCEEEECCHHHHHHHHHHCCHHCCCCCCCCCEEEEEEEECCCCCEEEEEEEEEECCCEEE
ITPKHLTAINNRKAKSKSVPIALKGIKDNLEVLYTLLHSAKIINNMNNNSIKLDEYIPKL
ECCCHHHHCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHH
ERLSELSKTFPGRDFHGWSKTYDCCDPQSLIDIYIRLSNNILSKHLIGYRLPFIYKEHEE
HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHCCCEEECCHHH
IDQSSINELTKSALALDKKITVNVDGTITSSELIKSFESSSEKKILHKLVKHIIPGVHLK
HCHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCEEE
LYEINTHEDITHLNHDAAQTNIESPWCCPSLNYWNIFNQFILSLLLSEGKSKSSSRSKQT
EEEECCCCCHHHCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHH
VELGKNDSWREVDWEIFPSKLKEIIGNHSNSRLINNLNEIRKKSKSFRNNIISIAQGREA
HCCCCCCCCCCCCCEECHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHEEHHCCCHH
QKIIGKEVTAIITGVQSYGFFAEIDDLTAEGLVHVSTLGDDWYEYRSRQNLLVGRKSKKT
HHHHHHHHHHHHHHHHHCCCEEECCCCCCCCEEEEEECCCHHHHHHCCCCEEEECCCHHH
YQLAQKINVRVLKVDILKNQIDLELVKETGTETETETINTIKVESDNED
HHHHHHCCEEEEEEEECCCHHHHHHHHHCCCCCCCCCEEEEEECCCCCC
>Mature Secondary Structure 
TSVLKILESLDCEDGLEVSKLERSLKITKKIDKDNLNVAIKALTKLGIVQTIIDDKLTI
CHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCEEE
NNDIDFLRGRVRCSSKGYCFVVREDQGEDIYIRESNLNNAWHGDSVFVLITKHGVRRRAP
CCCHHHHHCEEEECCCCEEEEEEECCCCEEEEEECCCCCCCCCCEEEEEEECCCCHHCCC
EGKIQCVLQRYNDTLLAKVEPDKTTGELKAYPLDDRIPVTIELEDVIETNKKYPDKDLIH
CCHHHHHHHHCCCEEEEEECCCCCCCCEEEECCCCCCEEEEEEHHHHHCCCCCCCHHHHH
EIKITKYPIAQFNAKGSIVRELSINSGVEGDIELLLSKNNISKSLVAPKVAPKKISSKGR
HEEEEECCHHHCCCCCCEEEEEECCCCCCCCEEEEEECCCCCHHHCCCCCCCHHHHCCCC
SDLTAQPSLLFESWESTNSPSLPGLYAQPYEGGNRIWVHAPTISERINLGGKLDRYLKDK
CCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHCCCCHHHHHHHCC
GEVICLGNNWLEFLNESLRSASQFKINEECEAISLMIDINADGNITDWKFTLSIIKPVKI
CCEEEECCHHHHHHHHHHCCHHCCCCCCCCCEEEEEEEECCCCCEEEEEEEEEECCCEEE
ITPKHLTAINNRKAKSKSVPIALKGIKDNLEVLYTLLHSAKIINNMNNNSIKLDEYIPKL
ECCCHHHHCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHH
ERLSELSKTFPGRDFHGWSKTYDCCDPQSLIDIYIRLSNNILSKHLIGYRLPFIYKEHEE
HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHCCCEEECCHHH
IDQSSINELTKSALALDKKITVNVDGTITSSELIKSFESSSEKKILHKLVKHIIPGVHLK
HCHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCEEE
LYEINTHEDITHLNHDAAQTNIESPWCCPSLNYWNIFNQFILSLLLSEGKSKSSSRSKQT
EEEECCCCCHHHCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHH
VELGKNDSWREVDWEIFPSKLKEIIGNHSNSRLINNLNEIRKKSKSFRNNIISIAQGREA
HCCCCCCCCCCCCCEECHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHEEHHCCCHH
QKIIGKEVTAIITGVQSYGFFAEIDDLTAEGLVHVSTLGDDWYEYRSRQNLLVGRKSKKT
HHHHHHHHHHHHHHHHHCCCEEECCCCCCCCEEEEEECCCHHHHHHCCCCEEEECCCHHH
YQLAQKINVRVLKVDILKNQIDLELVKETGTETETETINTIKVESDNED
HHHHHHCCEEEEEEEECCCHHHHHHHHHCCCCCCCCCEEEEEECCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7727754; 8905231 [H]