Definition | Clostridium botulinum A str. Hall, complete genome. |
---|---|
Accession | NC_009698 |
Length | 3,760,560 |
Click here to switch to the map view.
The map label for this gene is appA [H]
Identifier: 153934868
GI number: 153934868
Start: 1316607
End: 1318205
Strand: Reverse
Name: appA [H]
Synonym: CLC_1268
Alternate gene names: 153934868
Gene position: 1318205-1316607 (Counterclockwise)
Preceding gene: 153935630
Following gene: 153936678
Centisome position: 35.05
GC content: 30.02
Gene sequence:
>1599_bases ATGAAAGTGAAGAAACTATTAGCCATGATATTAGTAGCCTCAACTACCTTAGTAGCTACTGCTTGTGGAAATAGCAATAG TTCTAGCGGATCTACAGCCAAAGAAACTTCCGCCAAAGAAAATATTAAGGATGGAGGAAATTTAGTATTTTCTATCCGTG GTGAACCAGAAATACTTAATCCGATTTATGCTTATGATAGGGATACTATGACTATGGATAATGCTCTATTTGCACCTTTA TTTTATATTAATGGAGATAAAATTGATTACACTCTAGCTGAAGAAGTAAAACATTCAGATGACTTTTTAACTTACACAGT AAAACTAAAAAAAGATTTAAAATGGCATGATGGTAAACCTTTAACTGCTGATGATTTAGTATTTACTATGAAACAAATAA TGGATGAAAAACAAGATTCACCTTTTAGAAGTGCCTTTGTTATTAATGATAAACCTGTAGAGGTAAAAAAGGTAGATGAC TTAACCATAGAGTTTAAACTACCAACGGTTCAAATGCCTTTTATGAATTCTTTAGGACAAGTATCTCCTATTCCTAAACA TGTATTTGAAGGAGAAAAGGACATTAAAAAAAGTATTAAAAATGAAAAACCTATAGGTTCTGGTGCCTTTAGATTTAAAG AATCTAAAAAAGGTGAAAGTATAACGCTAGAAAGATTCGATAACTATGTTGGTGGAAAACCTCACCTTGACACCATAACC TATAGAATAATTGCTGATCCAAATTCATCTAAGGTAGCTCTTGAAAATGGCGAAGTTTCTGCTAATTATATAGATATTAG TGGTATAAGTAAATTTGAAAAAAATGAAAAATTAAAAGTAGTTGCCTATGATGAAGGTATGGTGGATAATTTAGTCTTAA ATTGTAAAACAAAAGGTTTAGATAAAAAAGAAGTTAGACAAGCTATAGCCTATGCTTTAAATAAAGATGATTTAATAAAT GCTGCTTATGAAAGTGAAAAATACGCTCCTAAGGCCTATTCCCCACTACCTAAGAATGCCTTATATTATACAGAGGATGT TACTAAATATGGCTTAAATAAAGATAAAGCAAAGGAATTACTAAAAAAATCCGGTGCTGAAAATTTAAAACTTAAGCTAG TTTATAGAAATGATAAAAAGACTTTAGAAAATCAAGCTTTAGTAGTTAAAGAAAATTTAAAAGATATAGGAATAGATGTT GAATTAAAGGGATTAGAAGCAAATGCCTTTTTTAAGCAAATCGATGATCCTTCTAAAGCTGATTTTGACTTGATATTTAA TGCCTATTTAATGGGTAACGAGCCTGATGCTTATAAAGAAGTATTTATGACAAACGGAGCTTTTAACGCTTCTAGATATA ATAATAAAAAACTAGATGACCTTTGGAATAAAGCTGCCGTAGAAACTGATAAAACAAAGAGAGAAGAAATTTATAAAACT ATTCAAAAGGAACTTATAGAAGATATGCCTGTATACCCAATTTGCTATTCTAATGCCACAATAGCAGTAAATAAAAATGT AGGAGGTATAAAAGAAGCAAAAACAGCTCCTATATATATGTTCCAAGACTTATCAAAACTTTATATCATAGAAGAATAA
Upstream 100 bases:
>100_bases TTATGATTTACTTTTATTATTTAACAATTTTTTATTAAAATGGTATCATTTATTAGTAGAATGAGTAGATTAATTTTTAC TATTAGAAAGGGGTTTTTTT
Downstream 100 bases:
>100_bases AAATTTAATAACTAAAAGGACGGTATTAATTGGATCTACCCTTTTAATACCGTTTATTTTTAAATAAAACTGGCATTTTA AACTTTTATAATAATTATTT
Product: putative oligopeptide ABC transporter, oligopeptide-binding protein
Products: ADP; phosphate; dipeptides [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 532; Mature: 532
Protein sequence:
>532_residues MKVKKLLAMILVASTTLVATACGNSNSSSGSTAKETSAKENIKDGGNLVFSIRGEPEILNPIYAYDRDTMTMDNALFAPL FYINGDKIDYTLAEEVKHSDDFLTYTVKLKKDLKWHDGKPLTADDLVFTMKQIMDEKQDSPFRSAFVINDKPVEVKKVDD LTIEFKLPTVQMPFMNSLGQVSPIPKHVFEGEKDIKKSIKNEKPIGSGAFRFKESKKGESITLERFDNYVGGKPHLDTIT YRIIADPNSSKVALENGEVSANYIDISGISKFEKNEKLKVVAYDEGMVDNLVLNCKTKGLDKKEVRQAIAYALNKDDLIN AAYESEKYAPKAYSPLPKNALYYTEDVTKYGLNKDKAKELLKKSGAENLKLKLVYRNDKKTLENQALVVKENLKDIGIDV ELKGLEANAFFKQIDDPSKADFDLIFNAYLMGNEPDAYKEVFMTNGAFNASRYNNKKLDDLWNKAAVETDKTKREEIYKT IQKELIEDMPVYPICYSNATIAVNKNVGGIKEAKTAPIYMFQDLSKLYIIEE
Sequences:
>Translated_532_residues MKVKKLLAMILVASTTLVATACGNSNSSSGSTAKETSAKENIKDGGNLVFSIRGEPEILNPIYAYDRDTMTMDNALFAPL FYINGDKIDYTLAEEVKHSDDFLTYTVKLKKDLKWHDGKPLTADDLVFTMKQIMDEKQDSPFRSAFVINDKPVEVKKVDD LTIEFKLPTVQMPFMNSLGQVSPIPKHVFEGEKDIKKSIKNEKPIGSGAFRFKESKKGESITLERFDNYVGGKPHLDTIT YRIIADPNSSKVALENGEVSANYIDISGISKFEKNEKLKVVAYDEGMVDNLVLNCKTKGLDKKEVRQAIAYALNKDDLIN AAYESEKYAPKAYSPLPKNALYYTEDVTKYGLNKDKAKELLKKSGAENLKLKLVYRNDKKTLENQALVVKENLKDIGIDV ELKGLEANAFFKQIDDPSKADFDLIFNAYLMGNEPDAYKEVFMTNGAFNASRYNNKKLDDLWNKAAVETDKTKREEIYKT IQKELIEDMPVYPICYSNATIAVNKNVGGIKEAKTAPIYMFQDLSKLYIIEE >Mature_532_residues MKVKKLLAMILVASTTLVATACGNSNSSSGSTAKETSAKENIKDGGNLVFSIRGEPEILNPIYAYDRDTMTMDNALFAPL FYINGDKIDYTLAEEVKHSDDFLTYTVKLKKDLKWHDGKPLTADDLVFTMKQIMDEKQDSPFRSAFVINDKPVEVKKVDD LTIEFKLPTVQMPFMNSLGQVSPIPKHVFEGEKDIKKSIKNEKPIGSGAFRFKESKKGESITLERFDNYVGGKPHLDTIT YRIIADPNSSKVALENGEVSANYIDISGISKFEKNEKLKVVAYDEGMVDNLVLNCKTKGLDKKEVRQAIAYALNKDDLIN AAYESEKYAPKAYSPLPKNALYYTEDVTKYGLNKDKAKELLKKSGAENLKLKLVYRNDKKTLENQALVVKENLKDIGIDV ELKGLEANAFFKQIDDPSKADFDLIFNAYLMGNEPDAYKEVFMTNGAFNASRYNNKKLDDLWNKAAVETDKTKREEIYKT IQKELIEDMPVYPICYSNATIAVNKNVGGIKEAKTAPIYMFQDLSKLYIIEE
Specific function: This protein is a component of an oligopeptide permease, a binding protein-dependent transport system. This APP system can completely substitute for the OPP system in both sporulation and genetic competence. AppA can bind and transport tetra- and pentapep
COG id: COG0747
COG function: function code E; ABC-type dipeptide transport system, periplasmic component
Gene ontology:
Cell location: Cell membrane; Lipid-anchor (Probable) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the bacterial solute-binding protein 5 family [H]
Homologues:
Organism=Escherichia coli, GI1789966, Length=501, Percent_Identity=27.5449101796407, Blast_Score=154, Evalue=1e-38, Organism=Escherichia coli, GI1787762, Length=482, Percent_Identity=25.5186721991701, Blast_Score=136, Evalue=3e-33, Organism=Escherichia coli, GI1789887, Length=486, Percent_Identity=24.0740740740741, Blast_Score=131, Evalue=1e-31, Organism=Escherichia coli, GI1787052, Length=479, Percent_Identity=23.1732776617954, Blast_Score=120, Evalue=1e-28, Organism=Escherichia coli, GI1787551, Length=518, Percent_Identity=23.3590733590734, Blast_Score=117, Evalue=3e-27, Organism=Escherichia coli, GI1787495, Length=543, Percent_Identity=25.0460405156538, Blast_Score=108, Evalue=1e-24, Organism=Escherichia coli, GI87081878, Length=532, Percent_Identity=24.812030075188, Blast_Score=93, Evalue=5e-20, Organism=Escherichia coli, GI1789397, Length=463, Percent_Identity=23.7580993520518, Blast_Score=89, Evalue=5e-19, Organism=Escherichia coli, GI87082063, Length=470, Percent_Identity=21.2765957446809, Blast_Score=66, Evalue=4e-12,
Paralogues:
None
Copy number: 660 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 2980 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 40 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000914 [H]
Pfam domain/function: PF00496 SBP_bac_5 [H]
EC number: NA
Molecular weight: Translated: 59933; Mature: 59933
Theoretical pI: Translated: 6.25; Mature: 6.25
Prosite motif: PS00013 PROKAR_LIPOPROTEIN ; PS01040 SBP_BACTERIAL_5
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein) 2.4 %Met (Translated Protein) 3.0 %Cys+Met (Translated Protein) 0.6 %Cys (Mature Protein) 2.4 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKVKKLLAMILVASTTLVATACGNSNSSSGSTAKETSAKENIKDGGNLVFSIRGEPEILN CCHHHHHHHHHHHHHHEEEEECCCCCCCCCCCHHHHCHHHCCCCCCCEEEEECCCCCCCC PIYAYDRDTMTMDNALFAPLFYINGDKIDYTLAEEVKHSDDFLTYTVKLKKDLKWHDGKP CHHHCCCCCEEECCCCEEEEEEECCCEEEEEHHHHHCCCCCEEEEEEEEEECCCCCCCCC LTADDLVFTMKQIMDEKQDSPFRSAFVINDKPVEVKKVDDLTIEFKLPTVQMPFMNSLGQ CCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCEEEEECCEEEEEECCEEECHHHHCCCC VSPIPKHVFEGEKDIKKSIKNEKPIGSGAFRFKESKKGESITLERFDNYVGGKPHLDTIT CCCCCHHHHCCHHHHHHHHCCCCCCCCCCEEECCCCCCCEEEHHHHHCCCCCCCCCCEEE YRIIADPNSSKVALENGEVSANYIDISGISKFEKNEKLKVVAYDEGMVDNLVLNCKTKGL EEEEECCCCCEEEEECCCEEEEEEEECCCCHHCCCCCEEEEEECCCCCCCEEEEEECCCC DKKEVRQAIAYALNKDDLINAAYESEKYAPKAYSPLPKNALYYTEDVTKYGLNKDKAKEL CHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEECHHHHHCCCHHHHHHH LKKSGAENLKLKLVYRNDKKTLENQALVVKENLKDIGIDVELKGLEANAFFKQIDDPSKA HHHCCCCCEEEEEEEECCCHHHHCCEEEEECCHHHCCCEEEEECCCHHHHHHHCCCCCCC DFDLIFNAYLMGNEPDAYKEVFMTNGAFNASRYNNKKLDDLWNKAAVETDKTKREEIYKT CCEEEEEEEEECCCCHHHHHHHEECCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHH IQKELIEDMPVYPICYSNATIAVNKNVGGIKEAKTAPIYMFQDLSKLYIIEE HHHHHHHHCCCCEEEECCEEEEEECCCCCCCCCCCCCEEEEECCCEEEEECC >Mature Secondary Structure MKVKKLLAMILVASTTLVATACGNSNSSSGSTAKETSAKENIKDGGNLVFSIRGEPEILN CCHHHHHHHHHHHHHHEEEEECCCCCCCCCCCHHHHCHHHCCCCCCCEEEEECCCCCCCC PIYAYDRDTMTMDNALFAPLFYINGDKIDYTLAEEVKHSDDFLTYTVKLKKDLKWHDGKP CHHHCCCCCEEECCCCEEEEEEECCCEEEEEHHHHHCCCCCEEEEEEEEEECCCCCCCCC LTADDLVFTMKQIMDEKQDSPFRSAFVINDKPVEVKKVDDLTIEFKLPTVQMPFMNSLGQ CCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCEEEEECCEEEEEECCEEECHHHHCCCC VSPIPKHVFEGEKDIKKSIKNEKPIGSGAFRFKESKKGESITLERFDNYVGGKPHLDTIT CCCCCHHHHCCHHHHHHHHCCCCCCCCCCEEECCCCCCCEEEHHHHHCCCCCCCCCCEEE YRIIADPNSSKVALENGEVSANYIDISGISKFEKNEKLKVVAYDEGMVDNLVLNCKTKGL EEEEECCCCCEEEEECCCEEEEEEEECCCCHHCCCCCEEEEEECCCCCCCEEEEEECCCC DKKEVRQAIAYALNKDDLINAAYESEKYAPKAYSPLPKNALYYTEDVTKYGLNKDKAKEL CHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEECHHHHHCCCHHHHHHH LKKSGAENLKLKLVYRNDKKTLENQALVVKENLKDIGIDVELKGLEANAFFKQIDDPSKA HHHCCCCCEEEEEEEECCCHHHHCCEEEEECCHHHCCCEEEEECCCHHHHHHHCCCCCCC DFDLIFNAYLMGNEPDAYKEVFMTNGAFNASRYNNKKLDDLWNKAAVETDKTKREEIYKT CCEEEEEEEEECCCCHHHHHHHEECCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHH IQKELIEDMPVYPICYSNATIAVNKNVGGIKEAKTAPIYMFQDLSKLYIIEE HHHHHHHHCCCCEEEECCEEEEEECCCCCCCCCCCCCEEEEECCCEEEEECC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; dipeptides [Periplasm]; H2O [C]
Specific reaction: ATP + dipeptides [Periplasm] + H2O = ADP + phosphate + dipeptides [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7997159; 9384377 [H]