| Field | Value |
|---|---|
| Definition | Clostridium botulinum A str. ATCC 3502, complete genome. |
| Accession | NC_009495 |
| Length | 3,886,916 bp |

The map label for this gene is flhA [H]
Identifier: 148380603
GI number: 148380603
Start: 2797018
End: 2799084
Strand: Reverse
Name: flhA [H]
Synonym: CBO2645
Alternate gene names: 148380603
Gene position: 2799084-2797018 (Counterclockwise)
Preceding gene: 148380604
Following gene: 148380602
Centisome position: 72.01
GC content: 32.46
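The coordinate fields above are mutually consistent, and the short sketch below shows one way to check that. It assumes 1-based inclusive coordinates and takes the centisome position to be the gene's transcription start (the higher coordinate, since the gene is on the reverse strand) as a percentage of the genome length. Both assumptions are inferred from the numbers in this record, not from a documented definition, and the function names are illustrative.

```python
GENOME_LENGTH = 3_886_916  # "Length" field of this record, in base pairs


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length_bp = gene_length(2_797_018, 2_799_084)  # Start and End fields
codons = length_bp // 3                        # includes the stop codon
cs = centisome(2_799_084, GENOME_LENGTH)       # reverse strand: gene begins at End

print(length_bp, codons, cs)
```

Under these assumptions the length comes out at 2067 bases, which is 689 codons (688 residues plus a stop codon), and the centisome value rounds to 72.01.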
Gene sequence:
>2067_bases GTGGAAGGTACTAATAATAGATTTAAAATAAAAAACAGTATAGATGTTATTGCAGCTTTTGGAGTTGTAGGGATTGTATT AATGATAATAATACCTCTTCCTTCTGCAATTTTAGATGTACTTTTAGCTTTTAATATAACTCTTTCTGTGGTTATAATTT TAATTACTATGTTTACCACAGAAGTTCTTCAGTTTTCATCTTTTCCTACATTACTATTAATAACTACTCTTTTTAGATTA GGGCTGAATATTTCTTCTACTAGACTTATACTAAGGGATGCCTATGCCGGAAAAATAATAGAAACCTTTGGAAGTCTTGT TACAGGAGGAAATTATGTTATAGGTATAATAATATTTCTTATTATAGTTATAATACAATTTGTAGTTATAACTAGTGGTG CGTCAAGAGTATCAGAGGTTTCTGCTAGATTTACTTTAGATGCTATGCCAGGAAAACAAATGAGTATAGATGCAGATTTA AATGCAGGACTTATAGATGAACAAGGAGCAAAAGAAAAAAGGCAAAATCTTCAAAAAGAAGCGGATTTTTATGGATCTAT GGATGGTGCTTCTAAGTTTGTAAAGGGAGATGCGGTAGCAGGACTTATAATAACTTTTATAAATATAATTGCAGGTATAA TAATAGGAGTTGTTATGCTGAAAATGGATATAGCTACAGCAGCTCAAACTTACATAAGACTTACTATAGGGGATGGACTT GTAGGACAAATACCGGCTCTTTTAATATCTACAGCATCTGGTATATTGGTTACTAGATCTGGAAGTGACGAAAATTTGGG AACAGTTCTTAGTAAACAATTAACAGGATTCCCAAAGGTTTTAGCCATAGCATCTGTAGTACTTTTATTTTTAGCTATGA TTCCAGGGCTTCCTCATTTGGCTTTTTTAATATTAGCTATAGCTAATGCAGTGGCTGCTTATCTTTTATTCAAAGAAGCA AAAGAACAGGTTATTATACAGGAAGAAGCTCAGCAAATGGAAATTACAGAAATAGAAAGTAAAGAACCAGAAAACGTTAT GAATCTAGTATCTGTAGAACCTATGGAAGTAGAAATAGGATATGGACTTATACCTTTAGCAGATGAATCCGCGGGAGGGG ATCTTCTTCAAAGAATAACTTCTGTAAGAAGACAATGTGCTATTGAAATGGGAGTAGTAGTTCAACCTATAAGGATAAGA GATAATCTACAGCTTAAGACTAATGAATATATAATAAAAATTAGAGGAACCACTATAACAAAAGGAGAGCTTATGCCTAA TATGCTTCTTTGTATGGATCCTACAGGAGAAGTAGAAATACCAGGAATAAAGGGGATAGAACCAGCCTTTGGACTTCCAG CTGTATGGATTAATAAAGACCAAAGAGAAGAGGCAGAACTTAAAGGTTTAACAGTGGTAGATCCTACTACAGTTATGGTT ACTCATTTAACTGAGATAATAAAAAATCATTCCTATGAACTTTTAGGAAGACAGGAAGTAAAATTAATATTAGATTCCAT GAAGGAAAAATATAGTGCCGTTACAGAGGAACTTATACCAGATCTTATGACTATAGGAGAAATTCAAAAGGTTCTTCAAA ATCTATTAAAAGAAAGAGTATCCATAAAGGATATGGTTACTATATTAGAATCTTTAGCGGATAATTCTAGAAATACAAAA GACATAGAAGTTTTAACAGAGTACGTAAGATTTTCTTTGGCTAGAAGTATATGCAATCCGTTAATAGATGAAAATGGAGC GTTAACCGTAATAACTCTAGATACGAGTATTGAGCAAACTATAAACAATAATATACAAAAATCTATGCAGGGTTCTTTTC CTGCTTTAGATCCAGATACTACTAGTAATATACTAAATGGATTAAAACAAAAATTAGATGAAGTATATTTTTATAATAAT 
CAAGCAGTAGTTTTGGTTTCACCAAATATAAGACCAGCTTTTAGAAGGCTTATAGAGATGGTATTTCCAGCGGTGAACGT ACTTTCTTTAAATGAGGTACCTAACGATGTGGAGATAAGAACTGAAGGAGTGGTTACGCTACAATGA
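The GC content field is presumably the percentage of G and C bases in the gene sequence above; the record does not state its definition, so treat that as an assumption. A minimal sketch of the calculation, with a hypothetical helper name, could look like this. It strips whitespace first because the sequence is stored here in space-separated blocks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, since the record stores the sequence
    in space-separated blocks.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# Toy input only; paste the full 2067-base sequence to check the record's value.
print(round(gc_content("GGCC AATT"), 2))
```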
Upstream 100 bases:
>100_bases AGTACCACAGGATATGTATGAAGCAGTAGCGGAAATATTAGCTATAGTTTATACTTTAAAAAAGAAAAAATAAACTAGGT ATACAAAATGAGGTGATTTA
Downstream 100 bases:
>100_bases AAATAAAAAAATATGTAGTCAATGATATGAATGAAGCAATGACTAGAATTCGTTATGAACTTGGAGCAGATGCTATAATA ATAAGTCAAAGAAAAATAAG
Product: flagellar biosynthesis protein FlhA
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 688; Mature: 688
Protein sequence:
>688_residues MEGTNNRFKIKNSIDVIAAFGVVGIVLMIIIPLPSAILDVLLAFNITLSVVIILITMFTTEVLQFSSFPTLLLITTLFRL GLNISSTRLILRDAYAGKIIETFGSLVTGGNYVIGIIIFLIIVIIQFVVITSGASRVSEVSARFTLDAMPGKQMSIDADL NAGLIDEQGAKEKRQNLQKEADFYGSMDGASKFVKGDAVAGLIITFINIIAGIIIGVVMLKMDIATAAQTYIRLTIGDGL VGQIPALLISTASGILVTRSGSDENLGTVLSKQLTGFPKVLAIASVVLLFLAMIPGLPHLAFLILAIANAVAAYLLFKEA KEQVIIQEEAQQMEITEIESKEPENVMNLVSVEPMEVEIGYGLIPLADESAGGDLLQRITSVRRQCAIEMGVVVQPIRIR DNLQLKTNEYIIKIRGTTITKGELMPNMLLCMDPTGEVEIPGIKGIEPAFGLPAVWINKDQREEAELKGLTVVDPTTVMV THLTEIIKNHSYELLGRQEVKLILDSMKEKYSAVTEELIPDLMTIGEIQKVLQNLLKERVSIKDMVTILESLADNSRNTK DIEVLTEYVRFSLARSICNPLIDENGALTVITLDTSIEQTINNNIQKSMQGSFPALDPDTTSNILNGLKQKLDEVYFYNN QAVVLVSPNIRPAFRRLIEMVFPAVNVLSLNEVPNDVEIRTEGVVTLQ
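The protein begins with M even though the gene sequence begins with GTG. In bacteria, alternative start codons such as GTG are read as the initiator methionine. The sketch below translates a coding sequence with the standard codon assignments and forces the first residue to M. It assumes the input is a complete coding sequence in frame, starting at the initiator codon; `translate_cds` is an illustrative name, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments, with codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds: str) -> str:
    """Translate an in-frame coding sequence into a protein string."""
    cleaned = "".join(cds.split()).upper()
    if not cleaned or len(cleaned) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    residues = [CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)]
    residues[0] = "M"          # assumption: first codon is the initiator
    if residues[-1] == "*":    # drop a terminal stop codon
        residues.pop()
    return "".join(residues)


# First ten codons of the gene sequence in this record.
print(translate_cds("GTGGAAGGTACTAATAATAGATTTAAAATA"))  # MEGTNNRFKI
```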
Sequences:
>Translated_688_residues MEGTNNRFKIKNSIDVIAAFGVVGIVLMIIIPLPSAILDVLLAFNITLSVVIILITMFTTEVLQFSSFPTLLLITTLFRL GLNISSTRLILRDAYAGKIIETFGSLVTGGNYVIGIIIFLIIVIIQFVVITSGASRVSEVSARFTLDAMPGKQMSIDADL NAGLIDEQGAKEKRQNLQKEADFYGSMDGASKFVKGDAVAGLIITFINIIAGIIIGVVMLKMDIATAAQTYIRLTIGDGL VGQIPALLISTASGILVTRSGSDENLGTVLSKQLTGFPKVLAIASVVLLFLAMIPGLPHLAFLILAIANAVAAYLLFKEA KEQVIIQEEAQQMEITEIESKEPENVMNLVSVEPMEVEIGYGLIPLADESAGGDLLQRITSVRRQCAIEMGVVVQPIRIR DNLQLKTNEYIIKIRGTTITKGELMPNMLLCMDPTGEVEIPGIKGIEPAFGLPAVWINKDQREEAELKGLTVVDPTTVMV THLTEIIKNHSYELLGRQEVKLILDSMKEKYSAVTEELIPDLMTIGEIQKVLQNLLKERVSIKDMVTILESLADNSRNTK DIEVLTEYVRFSLARSICNPLIDENGALTVITLDTSIEQTINNNIQKSMQGSFPALDPDTTSNILNGLKQKLDEVYFYNN QAVVLVSPNIRPAFRRLIEMVFPAVNVLSLNEVPNDVEIRTEGVVTLQ >Mature_688_residues MEGTNNRFKIKNSIDVIAAFGVVGIVLMIIIPLPSAILDVLLAFNITLSVVIILITMFTTEVLQFSSFPTLLLITTLFRL GLNISSTRLILRDAYAGKIIETFGSLVTGGNYVIGIIIFLIIVIIQFVVITSGASRVSEVSARFTLDAMPGKQMSIDADL NAGLIDEQGAKEKRQNLQKEADFYGSMDGASKFVKGDAVAGLIITFINIIAGIIIGVVMLKMDIATAAQTYIRLTIGDGL VGQIPALLISTASGILVTRSGSDENLGTVLSKQLTGFPKVLAIASVVLLFLAMIPGLPHLAFLILAIANAVAAYLLFKEA KEQVIIQEEAQQMEITEIESKEPENVMNLVSVEPMEVEIGYGLIPLADESAGGDLLQRITSVRRQCAIEMGVVVQPIRIR DNLQLKTNEYIIKIRGTTITKGELMPNMLLCMDPTGEVEIPGIKGIEPAFGLPAVWINKDQREEAELKGLTVVDPTTVMV THLTEIIKNHSYELLGRQEVKLILDSMKEKYSAVTEELIPDLMTIGEIQKVLQNLLKERVSIKDMVTILESLADNSRNTK DIEVLTEYVRFSLARSICNPLIDENGALTVITLDTSIEQTINNNIQKSMQGSFPALDPDTTSNILNGLKQKLDEVYFYNN QAVVLVSPNIRPAFRRLIEMVFPAVNVLSLNEVPNDVEIRTEGVVTLQ
Specific function: Involved in the export of flagellum proteins [H]
COG id: COG1298
COG function: function code NU; Flagellar biosynthesis pathway, component FlhA
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the FHIPEP (flagella/HR/invasion proteins export pore) family [H]
Homologues:
Organism=Escherichia coli, GI1788187, Length=664, Percent_Identity=42.6204819277108, Blast_Score=512, Evalue=1e-146,
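The long Percent_Identity value is what a plain ratio produces before rounding. It is consistent with 283 identical positions over the reported length of 664, but that count is inferred from the figure and is not stated in the record. A small sketch of the arithmetic, using a hypothetical helper:

```python
def percent_identity(identical: int, aligned_length: int) -> float:
    """Identical positions as a percentage of the aligned length."""
    if aligned_length <= 0:
        raise ValueError("aligned length must be positive")
    return 100.0 * identical / aligned_length


# 283 is an inferred count, not a value given in the record.
pid = percent_identity(283, 664)
print(round(pid, 2))  # 42.62
```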
Paralogues:
None
Copy number: 10-20 (rich media) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006301
- InterPro: IPR001712 [H]
Pfam domain/function: PF00771 FHIPEP [H]
EC number: NA
Molecular weight: Translated: 75285; Mature: 75285
Theoretical pI: Translated: 4.55; Mature: 4.55
Prosite motif: PS00994 FHIPEP
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
3.2 %Met (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
3.2 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
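These percentages are simple residue counts divided by the protein length. For a 688-residue protein, 0.4 % corresponds to about three cysteines. The sketch below shows the calculation on a toy sequence; the helper name and the rounding to one decimal place are assumptions chosen to match how the values are presented here.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(cleaned.count(r) for r in residues.upper())
    return round(100.0 * count / len(cleaned), 1)


toy = "MCAAAAAAAA"  # toy sequence, not the FlhA protein
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

Passing the 688-residue sequence from this record with `"C"`, `"M"` and `"CM"` would give the figures to compare against the three reported values.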
Secondary structure:
>Translated Secondary Structure MEGTNNRFKIKNSIDVIAAFGVVGIVLMIIIPLPSAILDVLLAFNITLSVVIILITMFTT CCCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH EVLQFSSFPTLLLITTLFRLGLNISSTRLILRDAYAGKIIETFGSLVTGGNYVIGIIIFL HHHHHCCCCHHHHHHHHHHHCCCCCHHHEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHH IIVIIQFVVITSGASRVSEVSARFTLDAMPGKQMSIDADLNAGLIDEQGAKEKRQNLQKE HHHHHHHHHHCCCHHHHHHHHHHEEEECCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHH ADFYGSMDGASKFVKGDAVAGLIITFINIIAGIIIGVVMLKMDIATAAQTYIRLTIGDGL HHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCC VGQIPALLISTASGILVTRSGSDENLGTVLSKQLTGFPKVLAIASVVLLFLAMIPGLPHL HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHH AFLILAIANAVAAYLLFKEAKEQVIIQEEAQQMEITEIESKEPENVMNLVSVEPMEVEIG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCEEEEEC YGLIPLADESAGGDLLQRITSVRRQCAIEMGVVVQPIRIRDNLQLKTNEYIIKIRGTTIT CCEEECCCCCCCHHHHHHHHHHHHHHHHHHCCEEECEEECCCCEEECCCEEEEEECCEEE KGELMPNMLLCMDPTGEVEIPGIKGIEPAFGLPAVWINKDQREEAELKGLTVVDPTTVMV CCCCCCCEEEEECCCCCEECCCCCCCCCCCCCCEEEECCCCHHHHHHCCCEEECCHHHHH THLTEIIKNHSYELLGRQEVKLILDSMKEKYSAVTEELIPDLMTIGEIQKVLQNLLKERV HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH SIKDMVTILESLADNSRNTKDIEVLTEYVRFSLARSICNPLIDENGALTVITLDTSIEQT HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHH INNNIQKSMQGSFPALDPDTTSNILNGLKQKLDEVYFYNNQAVVLVSPNIRPAFRRLIEM HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHEEECCEEEEEECCCCCHHHHHHHHH VFPAVNVLSLNEVPNDVEIRTEGVVTLQ HHHHHCEEECCCCCCCCEEEECCEEEEC >Mature Secondary Structure MEGTNNRFKIKNSIDVIAAFGVVGIVLMIIIPLPSAILDVLLAFNITLSVVIILITMFTT CCCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH EVLQFSSFPTLLLITTLFRLGLNISSTRLILRDAYAGKIIETFGSLVTGGNYVIGIIIFL HHHHHCCCCHHHHHHHHHHHCCCCCHHHEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHH IIVIIQFVVITSGASRVSEVSARFTLDAMPGKQMSIDADLNAGLIDEQGAKEKRQNLQKE HHHHHHHHHHCCCHHHHHHHHHHEEEECCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHH ADFYGSMDGASKFVKGDAVAGLIITFINIIAGIIIGVVMLKMDIATAAQTYIRLTIGDGL HHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCC 
VGQIPALLISTASGILVTRSGSDENLGTVLSKQLTGFPKVLAIASVVLLFLAMIPGLPHL HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHH AFLILAIANAVAAYLLFKEAKEQVIIQEEAQQMEITEIESKEPENVMNLVSVEPMEVEIG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCEEEEEC YGLIPLADESAGGDLLQRITSVRRQCAIEMGVVVQPIRIRDNLQLKTNEYIIKIRGTTIT CCEEECCCCCCCHHHHHHHHHHHHHHHHHHCCEEECEEECCCCEEECCCEEEEEECCEEE KGELMPNMLLCMDPTGEVEIPGIKGIEPAFGLPAVWINKDQREEAELKGLTVVDPTTVMV CCCCCCCEEEEECCCCCEECCCCCCCCCCCCCCEEEECCCCHHHHHHCCCEEECCHHHHH THLTEIIKNHSYELLGRQEVKLILDSMKEKYSAVTEELIPDLMTIGEIQKVLQNLLKERV HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH SIKDMVTILESLADNSRNTKDIEVLTEYVRFSLARSICNPLIDENGALTVITLDTSIEQT HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHH INNNIQKSMQGSFPALDPDTTSNILNGLKQKLDEVYFYNNQAVVLVSPNIRPAFRRLIEM HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHEEECCEEEEEECCCCCHHHHHHHHH VFPAVNVLSLNEVPNDVEIRTEGVVTLQ HHHHHCEEECCCCCCCCEEEECCEEEEC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8097015; 9384377 [H]