| Definition | Clostridium botulinum A str. ATCC 19397, complete genome. |
|---|---|
| Accession | NC_009697 |
| Length | 3,863,450 |
The map label for this gene is ntpI [H]
Identifier: 153933624
GI number: 153933624
Start: 2691808
End: 2693769
Strand: Reverse
Name: ntpI [H]
Synonym: CLB_2572
Alternate gene names: 153933624
Gene position: 2693769-2691808 (Counterclockwise)
Preceding gene: 153930885
Following gene: 153932798
Centisome position: 69.72
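The coordinate fields above are related by simple arithmetic, which the following minimal sketch checks. The function names are hypothetical, and the centisome formula is an assumption: measuring the position at the gene's 5' end (the larger coordinate for a reverse-strand gene) as a percentage of genome length happens to reproduce the reported value, but the record does not state how it was computed.

```python
# Values copied from the record: genome length and 1-based, inclusive coordinates.
GENOME_LENGTH = 3_863_450
START, END = 2_691_808, 2_693_769
STRAND = "reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome length.

    Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
    """
    five_prime = end if strand == "reverse" else start
    return round(100 * five_prime / genome_length, 2)


print(gene_length(START, END))                       # 1962
print(centisome(START, END, STRAND, GENOME_LENGTH))  # 69.72
```

The length of 1962 agrees with the `>1962_bases` header of the gene sequence.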
GC content: 29.92
Gene sequence:
>1962_bases ATGGGAATAGCTAAGATGAAAAGATTTACTTTATTAGCCCTTAAATCACATAAGGAATCTCTTTTTGAAGCTATGCAAAA GTTTCAAGAAGTACAATTTGTTAACCTACAAGAAGAAAAGTCAGAAAAGCTAGAGTTTATGCAAAATGATTGTCAATCAG AATTGATATCAGACTTGGAAGGTAAACAAGCAAGACTTAAATTTTGTTTAGATATTTTAGAAAGATATGTAGAAAAAGAA AAAGGACTTAAGGCTCTTATGCAAGGTAAGAAGTCCATGAACTATAATGAGCTCAAAAATTTAGGCGAGAAGATAGAGTG GATAAGTATTTATAATGCTCTAAAAGAAAAGGATACAAAGCTTAGTGCCTTAAAAAATGAAATATCTAAACCAAAAGGTG AAATAAAAGCATTGGAGCCTTGGACTAATTTTGATGAAAAAATTAGTAAGGCTAAGTTTAATACTAGTACTTCCTATTTA GGGGTATTACCTTTAAATTACAAAGATGAGTTTAGGGAATCTTTTGATTCAGAGATACCAGTTTCTTATGTAGAAACTAT AGGGGAAAATAAAGATGGAGTTTATTTGTTTATAGTATTTCACAATAATTACTTTAAAGAAGCTTCAGAATTATTAAAAA GATATGGTTTTAGTAAAATAGCTTTTAACTATGATGATTCTCCTAAAGAGACTATAAAAGCTTTAGAAGAAGAAATAAAA TCTATTAAAAAAGAAGAAATAAAAACTATACAAGAAATTAAAGCCTTTGTTGATAAGGCAGAAGATCTTCAAATAGCTTA TGAGTATATTAGTTTACAAGTTGATAGGGCAAAAGCATCTATAAATATTTTAAAGACTAATAAAGTAGTGGCTATGGAAG GTTGGGTACCAGAGGACTCTATGAAAGAATTAGAAGGTTTAATAAGACAATCCGAAGGTGAGTTATATTATATAGAATTT AATGATCCTTTAGATGAAGAAGAAGAAAAGGTTCCAATAATGCTAAAAAATAATAAGATAGTTGAGCCTTTTGAATCTAT TACAGCTATGTATAGCCTTCCGAAGTATAAGGAAATAGATCCTACACCAGCCTTAGTACCTTTTTATCTAATATTCTTTG GAATGATGTTATCAGATGCAGGTTATGGTTTAGTTATGCTGATAGCTACTTTGCTAGCTCTTAAAACTATACCAATGGAA AAAGCAACTAAACAAATGATTAAGTTATTATATTACTTAAGTTATCCAACCATAGTTTGGGGCGCGCTTTATGGCAGTTA TTTTGGAGGTATTATAAACATACCGGCCATATGGGTAAAACCAGAGGATAGTGTATCATCAATTTTAATTATATCTATAG TTTTTGGTATAATTCACCTATATACAGGACTTGGTGTAAAAGCTTATATGTTAATTAGAAACAAAAGATATAAAGATGCA TTTTACGATGTAGGTTTATGGTATATAACTTTAACTACAGCTATAATTATAGTAGCTGCAAATTTCGGAAATATAACTGC ATTAAATCCATACACAAACCCTTGTAAATATATAATGTATGCAGGTATGGTAGGTTTAGTTTTAACTCAAGGTAGAGAAA ACAAAACAATAGGAGCAAAACTTGGTGCAGGTTTATATGGATTATATGGTATTACAAGTTATGTAGGAGATATAGTTTCT TATTCAAGGCTTATGGCTTTGGGGCTTGCTACAGGATTTATAGGTGGAGCTTTTAACTTAATGATAAGCCTTTTAGGTAA TGGAGTTAAAGCCTGGGTATTTGGAACATTAATATTTGTTATAGGTCACGTATTTAATTTGCTTATAAATGCATTAGGTG CTTATGTTCACACTTGTAGATTACAGTATGTTGAATATTTCGGTAAGTTTTATGAAGGGGGAGGAAAACCATTTACTCCT TTCAAACCAAATAATAAATACATTAACATAATAAAAGATTAG
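The GC content field can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (such as the spaces between 80-base chunks in the record)
    is removed before counting.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# First ten bases of the gene, used only as a small illustration.
print(gc_content("ATGGGAATAG"))  # 40.0
```

Passing the full 1962-base sequence to `gc_content` should give a value close to the reported figure if the database used the same definition.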
Upstream 100 bases:
>100_bases CTGAAAGAGGGGGAAAAATCCTTAGAAATTATAAAAAATATTTCTAAGGATAAATTTGAAAAGGCTGCAAATATAGTGAT TGAGAGGATAGTGAAAGTTA
Downstream 100 bases:
>100_bases GAGGAGTCTATTATGGAAAATACAACATTAATGAATTTTTTAACACAAAACGGAGGAGCAATATTTGCTGCCATGGGTGT AGCATTAGCGGCTATAATGC
Product: V-type ATP synthase subunit I
Products: ADP; phosphate; H+
Alternate protein names: Na(+)-translocating ATPase subunit I; V-type sodium pump subunit I [H]
Number of amino acids: Translated: 653; Mature: 652
Protein sequence:
>653_residues MGIAKMKRFTLLALKSHKESLFEAMQKFQEVQFVNLQEEKSEKLEFMQNDCQSELISDLEGKQARLKFCLDILERYVEKE KGLKALMQGKKSMNYNELKNLGEKIEWISIYNALKEKDTKLSALKNEISKPKGEIKALEPWTNFDEKISKAKFNTSTSYL GVLPLNYKDEFRESFDSEIPVSYVETIGENKDGVYLFIVFHNNYFKEASELLKRYGFSKIAFNYDDSPKETIKALEEEIK SIKKEEIKTIQEIKAFVDKAEDLQIAYEYISLQVDRAKASINILKTNKVVAMEGWVPEDSMKELEGLIRQSEGELYYIEF NDPLDEEEEKVPIMLKNNKIVEPFESITAMYSLPKYKEIDPTPALVPFYLIFFGMMLSDAGYGLVMLIATLLALKTIPME KATKQMIKLLYYLSYPTIVWGALYGSYFGGIINIPAIWVKPEDSVSSILIISIVFGIIHLYTGLGVKAYMLIRNKRYKDA FYDVGLWYITLTTAIIIVAANFGNITALNPYTNPCKYIMYAGMVGLVLTQGRENKTIGAKLGAGLYGLYGITSYVGDIVS YSRLMALGLATGFIGGAFNLMISLLGNGVKAWVFGTLIFVIGHVFNLLINALGAYVHTCRLQYVEYFGKFYEGGGKPFTP FKPNNKYINIIKD
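The protein sequence follows from the gene sequence by codon translation: 1962 bases are 654 codons, that is 653 residues plus one stop codon. A minimal sketch, assuming the standard genetic code (which agrees with the bacterial table for every codon, and this gene starts with ATG); `translate` is a hypothetical helper.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGGGAATAGCTAAGATGAAAAGATTTACT"))  # MGIAKMKRFT
print(translate("AAAGATTAG"))                       # KD
```

Both fragments match the start (`MGIAKMKRFT`) and end (`KD`) of the protein sequence listed above.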
Sequences:
>Translated_653_residues MGIAKMKRFTLLALKSHKESLFEAMQKFQEVQFVNLQEEKSEKLEFMQNDCQSELISDLEGKQARLKFCLDILERYVEKE KGLKALMQGKKSMNYNELKNLGEKIEWISIYNALKEKDTKLSALKNEISKPKGEIKALEPWTNFDEKISKAKFNTSTSYL GVLPLNYKDEFRESFDSEIPVSYVETIGENKDGVYLFIVFHNNYFKEASELLKRYGFSKIAFNYDDSPKETIKALEEEIK SIKKEEIKTIQEIKAFVDKAEDLQIAYEYISLQVDRAKASINILKTNKVVAMEGWVPEDSMKELEGLIRQSEGELYYIEF NDPLDEEEEKVPIMLKNNKIVEPFESITAMYSLPKYKEIDPTPALVPFYLIFFGMMLSDAGYGLVMLIATLLALKTIPME KATKQMIKLLYYLSYPTIVWGALYGSYFGGIINIPAIWVKPEDSVSSILIISIVFGIIHLYTGLGVKAYMLIRNKRYKDA FYDVGLWYITLTTAIIIVAANFGNITALNPYTNPCKYIMYAGMVGLVLTQGRENKTIGAKLGAGLYGLYGITSYVGDIVS YSRLMALGLATGFIGGAFNLMISLLGNGVKAWVFGTLIFVIGHVFNLLINALGAYVHTCRLQYVEYFGKFYEGGGKPFTP FKPNNKYINIIKD
>Mature_652_residues GIAKMKRFTLLALKSHKESLFEAMQKFQEVQFVNLQEEKSEKLEFMQNDCQSELISDLEGKQARLKFCLDILERYVEKEK GLKALMQGKKSMNYNELKNLGEKIEWISIYNALKEKDTKLSALKNEISKPKGEIKALEPWTNFDEKISKAKFNTSTSYLG VLPLNYKDEFRESFDSEIPVSYVETIGENKDGVYLFIVFHNNYFKEASELLKRYGFSKIAFNYDDSPKETIKALEEEIKS IKKEEIKTIQEIKAFVDKAEDLQIAYEYISLQVDRAKASINILKTNKVVAMEGWVPEDSMKELEGLIRQSEGELYYIEFN DPLDEEEEKVPIMLKNNKIVEPFESITAMYSLPKYKEIDPTPALVPFYLIFFGMMLSDAGYGLVMLIATLLALKTIPMEK ATKQMIKLLYYLSYPTIVWGALYGSYFGGIINIPAIWVKPEDSVSSILIISIVFGIIHLYTGLGVKAYMLIRNKRYKDAF YDVGLWYITLTTAIIIVAANFGNITALNPYTNPCKYIMYAGMVGLVLTQGRENKTIGAKLGAGLYGLYGITSYVGDIVSY SRLMALGLATGFIGGAFNLMISLLGNGVKAWVFGTLIFVIGHVFNLLINALGAYVHTCRLQYVEYFGKFYEGGGKPFTPF KPNNKYINIIKD
Specific function: Involved in ATP-driven sodium extrusion [H]
COG id: COG1269
COG function: function code C; Archaeal/vacuolar-type H+-ATPase subunit I
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the V-ATPase 116 kDa subunit family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002490 [H]
Pfam domain/function: PF01496 V_ATPase_I [H]
EC number: 3.6.3.14
Molecular weight: Translated: 74222; Mature: 74091
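The two molecular weights differ by 131 Da, which is the mass of one methionine residue and is consistent with the mature form lacking the N-terminal Met. The sketch below shows how such a weight is typically estimated: sum the average residue masses and add one water. The mass table and the `molecular_weight` helper are assumptions for illustration; the database may have used slightly different values, so small deviations from the reported figures would not be surprising.

```python
# Average residue masses in daltons (amino acid minus one water); assumed values.
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.015  # one water for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified protein chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Removing an N-terminal Met lowers the weight by one Met residue mass.
with_met = molecular_weight("MGIAK")
without_met = molecular_weight("GIAK")
print(round(with_met - without_met, 2))  # 131.19
```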
Theoretical pI: Translated: 7.04; Mature: 7.04
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 3.7 %Cys+Met (Translated Protein) 0.6 %Cys (Mature Protein) 2.9 %Met (Mature Protein) 3.5 %Cys+Met (Mature Protein)
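These percentages are residue counts divided by sequence length. The reported values are consistent with 4 Cys and 20 Met in the 653-residue translated protein, and 19 Met in the 652-residue mature protein once the initiator Met is removed. A minimal sketch, with hypothetical helper names and the assumption that the mature form differs only by the N-terminal Met:

```python
def percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    protein = "".join(protein.split()).upper()
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


def mature(protein: str) -> str:
    """Drop the initiator Met, if present (assumed to be the only processing step)."""
    return protein[1:] if protein.startswith("M") else protein


# Arithmetic check against the reported percentages.
print(round(100 * 4 / 653, 1))   # 0.6  (Cys, translated)
print(round(100 * 20 / 653, 1))  # 3.1  (Met, translated)
print(round(100 * 19 / 652, 1))  # 2.9  (Met, mature)
```

Calling `percent(seq, "C")`, `percent(seq, "M")`, and `percent(seq, "CM")` on the translated sequence, and again on `mature(seq)`, would recompute all six figures.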
Secondary structure:
>Translated Secondary Structure
MGIAKMKRFTLLALKSHKESLFEAMQKFQEVQFVNLQEEKSEKLEFMQNDCQSELISDLE
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCC
GKQARLKFCLDILERYVEKEKGLKALMQGKKSMNYNELKNLGEKIEWISIYNALKEKDTK
CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
LSALKNEISKPKGEIKALEPWTNFDEKISKAKFNTSTSYLGVLPLNYKDEFRESFDSEIP
HHHHHHHHCCCCCCEEEECCCCCHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHCCCCCC
VSYVETIGENKDGVYLFIVFHNNYFKEASELLKRYGFSKIAFNYDDSPKETIKALEEEIK
HHHHHHHCCCCCCEEEEEEEECCHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHH
SIKKEEIKTIQEIKAFVDKAEDLQIAYEYISLQVDRAKASINILKTNKVVAMEGWVPEDS
HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHEEECHHCEEEEEECCEEEEEECCCCCHH
MKELEGLIRQSEGELYYIEFNDPLDEEEEKVPIMLKNNKIVEPFESITAMYSLPKYKEID
HHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCC
PTPALVPFYLIFFGMMLSDAGYGLVMLIATLLALKTIPMEKATKQMIKLLYYLSYPTIVW
CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHH
GALYGSYFGGIINIPAIWVKPEDSVSSILIISIVFGIIHLYTGLGVKAYMLIRNKRYKDA
HHHHHHHCCCCEECCEEEECCCCCHHHHHHHHHHHHHHHHHHCCCHHEEHHHHCCCHHHH
FYDVGLWYITLTTAIIIVAANFGNITALNPYTNPCKYIMYAGMVGLVLTQGRENKTIGAK
HHHHHHHHHHHHHHHHHHEECCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHH
LGAGLYGLYGITSYVGDIVSYSRLMALGLATGFIGGAFNLMISLLGNGVKAWVFGTLIFV
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
IGHVFNLLINALGAYVHTCRLQYVEYFGKFYEGGGKPFTPFKPNNKYINIIKD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC

>Mature Secondary Structure
GIAKMKRFTLLALKSHKESLFEAMQKFQEVQFVNLQEEKSEKLEFMQNDCQSELISDLE
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCC
GKQARLKFCLDILERYVEKEKGLKALMQGKKSMNYNELKNLGEKIEWISIYNALKEKDTK
CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
LSALKNEISKPKGEIKALEPWTNFDEKISKAKFNTSTSYLGVLPLNYKDEFRESFDSEIP
HHHHHHHHCCCCCCEEEECCCCCHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHCCCCCC
VSYVETIGENKDGVYLFIVFHNNYFKEASELLKRYGFSKIAFNYDDSPKETIKALEEEIK
HHHHHHHCCCCCCEEEEEEEECCHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHH
SIKKEEIKTIQEIKAFVDKAEDLQIAYEYISLQVDRAKASINILKTNKVVAMEGWVPEDS
HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHEEECHHCEEEEEECCEEEEEECCCCCHH
MKELEGLIRQSEGELYYIEFNDPLDEEEEKVPIMLKNNKIVEPFESITAMYSLPKYKEID
HHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCC
PTPALVPFYLIFFGMMLSDAGYGLVMLIATLLALKTIPMEKATKQMIKLLYYLSYPTIVW
CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHH
GALYGSYFGGIINIPAIWVKPEDSVSSILIISIVFGIIHLYTGLGVKAYMLIRNKRYKDA
HHHHHHHCCCCEECCEEEECCCCCHHHHHHHHHHHHHHHHHHCCCHHEEHHHHCCCHHHH
FYDVGLWYITLTTAIIIVAANFGNITALNPYTNPCKYIMYAGMVGLVLTQGRENKTIGAK
HHHHHHHHHHHHHHHHHHEECCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHH
LGAGLYGLYGITSYVGDIVSYSRLMALGLATGFIGGAFNLMISLLGNGVKAWVFGTLIFV
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
IGHVFNLLINALGAYVHTCRLQYVEYFGKFYEGGGKPFTPFKPNNKYINIIKD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; H2O; H+
Specific reaction: ATP + H2O + H+(in) = ADP + phosphate + H+(out)
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8157629; 8144530 [H]