Definition: Clostridium botulinum A str. ATCC 19397, complete genome.
Accession: NC_009697
Length: 3,863,450 bp

The map label for this gene is engA

Identifier: 153932865

GI number: 153932865

Start: 2530191

End: 2531510

Strand: Reverse

Name: engA

Synonym: CLB_2396

Alternate gene names: 153932865

Gene position: 2531510-2530191 (Counterclockwise)

Preceding gene: 153931168

Following gene: 153933141

Centisome position: 65.52
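
The coordinate fields above are mutually consistent, and a short sketch makes the arithmetic explicit. It assumes 1-based inclusive coordinates and assumes that the centisome position is taken from the gene's start on its own strand (the higher coordinate, 2531510, for this reverse-strand gene). The record does not state how the value was derived; this assumption reproduces 65.52, whereas the lower coordinate would give 65.49. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 3_863_450          # "Length" field of the record
START, END = 2_530_191, 2_531_510  # "Start" and "End" fields

length = gene_length(START, END)
codons = length // 3               # includes the stop codon
# Assumption: on the reverse strand the gene begins at the higher coordinate.
position = centisome(END, GENOME_LENGTH)

print(length, codons - 1, round(position, 2))  # prints: 1320 439 65.52
```

The 1320 bases correspond to 440 codons, that is, 439 amino acids plus one stop codon, which agrees with the translated protein length reported further down.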

GC content: 30.45
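
A GC content figure like the one above is usually the percentage of G and C bases in the sequence. The sketch below is one minimal way to compute it; it assumes the value refers to the 1320-base gene sequence listed in this record, which the record itself does not confirm, and the function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks are ignored, so a wrapped FASTA body
    can be pasted in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 30 bases of the gene sequence, used only as a small example.
print(round(gc_content("ATGGCAAAACCTATAGTTGCAATAGTAGGA"), 2))
```

Passing the full gene sequence to `gc_content` should give a value comparable to the reported 30.45, provided the assumption about which sequence was used holds.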

Gene sequence:

>1320_bases
ATGGCAAAACCTATAGTTGCAATAGTAGGAAGACCTAATGTAGGGAAATCTACACTATTTAATAAGTTAGCAGGAAAAAG
AATATCTATAGTACAAGATACACCAGGGGTGACAAGAGACAGAATTTATGCAGAGGCAGAATGGCTAAATTACAAATTTA
CAATGATAGATACAGGTGGTATAGAACCAAAAAGTGAAGATATAATAGTATCTCAAATGAGAAGACAGGCGCAAATAGCT
ATAGAAATGGCTAATGTAATAATATTTTTAGTAGATGGAAAAGAAGGATTAGCACCAGCAGACAAAGAAGTAGCACAAAT
GCTTAGAAAAAGTAAGAAACCTGTAGTGCTTGTAGTAAATAAAATTGATAAATTGAAAGATGAGAATAATGCTTATGAGT
TTTATAATCTAGGTATAGGGGATCCTGTAACCATATCTTCATCACAGGCATTAGGCTTAGGTGATATGCTAGATAGAGTT
GTAGAATATTTTAAAGATGATGAATCAGCTGGAGAAGATGATGAGAGAATAAATATAGCTTTTATAGGTAAACCCAATGT
AGGGAAATCATCACTTATAAATAAGTTATTAGGAGAAGAAAGACTTATAGTAAGTGATATACCGGGAACCACCAGGGATT
CCATAGATAGTTATGTAGATACAGATTTTGGTGAGTTTACTTTAATAGATACCGCAGGATTAAGACGTAAGAGTAAAGTA
AAAGAAGAAATAGAAAGATATTCTGTAATAAGGACTTATGCATCTATAGAAAGAGCAGATGTATGTATACTTATGATAGA
TGCTACAGAGGGTATATCTGAGCAGGATCAAAAAATTATAGGATATGCTCATGATATAAATAAAGCTATTTTAGTTATAG
TTAATAAGTGGGATCTAGTTGAAAAAGACGATAAGACTATGGATAAATTTAAAAAAGAGTTAAAAGTTAATTTATCCTTC
ATGCCTTATGCAAAATACTTATTTATATCTGCTAAAACAGGCCAAAGGGTTGTAAAAGTTTTACAAACCGCTAAAGAATG
TTATGATAATTATAATAAGAGAGTAAAAACAGGAGTTTTAAATGATGTAATAAGTCAAGCTATTATGATGAAAGAGCCTC
CTATAGTAGGAACAAAAAGATTAAAAATTTACTATGTAACTCAAATAGGAACAAAACCACCAACTTTTATTTTCTTTGTA
AATGATCCAGCTTGCATACATTTCTCTTATCAAAGATATTTAGAAAATCAATTAAGAGAAAATTTTGATTTCCAAGGTAC
AGGAATAAAATCAGAGTTTAGAGAAAGAAAAGAAAAATAA
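
The gene sequence above is given in the coding orientation, so it can be translated directly to obtain the protein sequence listed later in the record. The following is a minimal sketch using the standard genetic code; bacterial translation table 11 assigns the same amino acids to codons and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA sequence in reading frame 1.

    Unknown codons (for example, containing N) become "X".
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCAAAACCTATAGTTGCAATAGTAGGA"))  # prints: MAKPIVAIVG
print(translate("GAAAAATAA"))                       # prints: EK
```

The first call uses the first 30 bases of the gene and the second uses its last 9 bases; the results match the start (MAKPIVAIVG) and end (EK) of the 439-residue protein sequence.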

Upstream 100 bases:

>100_bases
AAGAATTAAGCAATAGTTTAAAAAGAGAAATTTTAGTATGCGATTATACAGGAGAAGACTTAATAGATATAATAAATAAA
CACAGCAAGGAGGAATAAAA

Downstream 100 bases:

>100_bases
ATTTAAAGATATCCTATAGCTACTGCCACTTAAACTTATTATATAGAGAGTTTTAAGTGGCAGTGAACGCATAGGAATTT
TTAATATAAACTTATTTCTT

Product: GTP-binding protein EngA

Products: NA

Alternate protein names: GTP-binding protein EngA

Number of amino acids: Translated: 439; Mature: 438

Protein sequence:

>439_residues
MAKPIVAIVGRPNVGKSTLFNKLAGKRISIVQDTPGVTRDRIYAEAEWLNYKFTMIDTGGIEPKSEDIIVSQMRRQAQIA
IEMANVIIFLVDGKEGLAPADKEVAQMLRKSKKPVVLVVNKIDKLKDENNAYEFYNLGIGDPVTISSSQALGLGDMLDRV
VEYFKDDESAGEDDERINIAFIGKPNVGKSSLINKLLGEERLIVSDIPGTTRDSIDSYVDTDFGEFTLIDTAGLRRKSKV
KEEIERYSVIRTYASIERADVCILMIDATEGISEQDQKIIGYAHDINKAILVIVNKWDLVEKDDKTMDKFKKELKVNLSF
MPYAKYLFISAKTGQRVVKVLQTAKECYDNYNKRVKTGVLNDVISQAIMMKEPPIVGTKRLKIYYVTQIGTKPPTFIFFV
NDPACIHFSYQRYLENQLRENFDFQGTGIKSEFRERKEK

Sequences:

>Translated_439_residues
MAKPIVAIVGRPNVGKSTLFNKLAGKRISIVQDTPGVTRDRIYAEAEWLNYKFTMIDTGGIEPKSEDIIVSQMRRQAQIA
IEMANVIIFLVDGKEGLAPADKEVAQMLRKSKKPVVLVVNKIDKLKDENNAYEFYNLGIGDPVTISSSQALGLGDMLDRV
VEYFKDDESAGEDDERINIAFIGKPNVGKSSLINKLLGEERLIVSDIPGTTRDSIDSYVDTDFGEFTLIDTAGLRRKSKV
KEEIERYSVIRTYASIERADVCILMIDATEGISEQDQKIIGYAHDINKAILVIVNKWDLVEKDDKTMDKFKKELKVNLSF
MPYAKYLFISAKTGQRVVKVLQTAKECYDNYNKRVKTGVLNDVISQAIMMKEPPIVGTKRLKIYYVTQIGTKPPTFIFFV
NDPACIHFSYQRYLENQLRENFDFQGTGIKSEFRERKEK
>Mature_438_residues
AKPIVAIVGRPNVGKSTLFNKLAGKRISIVQDTPGVTRDRIYAEAEWLNYKFTMIDTGGIEPKSEDIIVSQMRRQAQIAI
EMANVIIFLVDGKEGLAPADKEVAQMLRKSKKPVVLVVNKIDKLKDENNAYEFYNLGIGDPVTISSSQALGLGDMLDRVV
EYFKDDESAGEDDERINIAFIGKPNVGKSSLINKLLGEERLIVSDIPGTTRDSIDSYVDTDFGEFTLIDTAGLRRKSKVK
EEIERYSVIRTYASIERADVCILMIDATEGISEQDQKIIGYAHDINKAILVIVNKWDLVEKDDKTMDKFKKELKVNLSFM
PYAKYLFISAKTGQRVVKVLQTAKECYDNYNKRVKTGVLNDVISQAIMMKEPPIVGTKRLKIYYVTQIGTKPPTFIFFVN
DPACIHFSYQRYLENQLRENFDFQGTGIKSEFRERKEK

Specific function: GTPase that plays an essential role in the late steps of ribosome biogenesis

COG id: COG1160

COG function: function code R; Predicted GTPases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 KH-like domain

Homologues:

Organism=Escherichia coli, GI87082120, Length=460, Percent_Identity=39.5652173913043, Blast_Score=345, Evalue=4e-96
Organism=Escherichia coli, GI1788919, Length=123, Percent_Identity=39.8373983739837, Blast_Score=85, Evalue=7e-18
Organism=Escherichia coli, GI2367268, Length=95, Percent_Identity=44.2105263157895, Blast_Score=84, Evalue=2e-17
Organism=Caenorhabditis elegans, GI17507259, Length=213, Percent_Identity=29.5774647887324, Blast_Score=69, Evalue=4e-12
Organism=Saccharomyces cerevisiae, GI6323665, Length=224, Percent_Identity=31.6964285714286, Blast_Score=97, Evalue=3e-21
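
The homologue lines above follow a regular comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal parsing sketch is shown here; the function name and the `"GI"` dictionary key are hypothetical, and the parser assumes organism names contain no commas, which holds for the entries in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as "GI87082120"
    # Convert the numeric fields where present.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", float), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI87082120, Length=460, "
    "Percent_Identity=39.5652173913043, Blast_Score=345, Evalue=4e-96,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# prints: Escherichia coli 87082120 39.6
```

Parsed this way, the hits can be sorted or filtered by `Evalue` or `Percent_Identity` without further string handling.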

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): DER_CLOB1 (A7FW89)

Other databases:

- EMBL:   CP000726
- RefSeq:   YP_001384700.1
- ProteinModelPortal:   A7FW89
- SMR:   A7FW89
- STRING:   A7FW89
- GeneID:   5394713
- GenomeReviews:   CP000726_GR
- KEGG:   cba:CLB_2396
- eggNOG:   COG1160
- HOGENOM:   HBG592135
- OMA:   KCEKAFD
- ProtClustDB:   PRK00093
- BioCyc:   CBOT441770:CLB_2396-MONOMER
- GO:   GO:0005622
- HAMAP:   MF_00195
- InterPro:   IPR003593
- InterPro:   IPR016484
- InterPro:   IPR006073
- InterPro:   IPR015946
- InterPro:   IPR002917
- InterPro:   IPR005225
- Gene3D:   G3DSA:3.30.300.20
- PIRSF:   PIRSF006485
- PRINTS:   PR00326
- SMART:   SM00382
- TIGRFAMs:   TIGR03594
- TIGRFAMs:   TIGR00231

Pfam domain/function: PF01926 MMR_HSR1

EC number: NA

Molecular weight: Translated: 49752; Mature: 49621
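
The two molecular weights differ by 131 Da, which is consistent with loss of the initiator methionine (average residue mass about 131.19 Da) in the mature form. The sketch below shows one common way to estimate a protein's average molecular weight: summing average residue masses and adding one water molecule. The record does not say which tool or mass table produced its values, so the table used here (standard average residue masses) is an assumption and the result may differ slightly from the reported figures.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight in Da of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Removing an N-terminal Met lowers the mass by one Met residue.
translated, mature = "MAKPIVAIVG", "AKPIVAIVG"
print(round(molecular_weight(translated) - molecular_weight(mature), 2))
# prints: 131.19
```

The example uses only the first ten residues of the protein; applying `molecular_weight` to the full translated and mature sequences gives the same difference of one methionine residue.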

Theoretical pI: Translated: 8.34; Mature: 8.34

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
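
These percentages follow from simple residue counts. Counting in the protein sequence listed in this record gives 3 Cys and 11 Met among the 439 translated residues; the mature form lacks the initiator Met, leaving 10 Met among 438 residues. A minimal sketch of the calculation follows, with a hypothetical helper name and a toy sequence standing in for the full protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# Toy example: one Cys and two Met in four residues.
print(residue_percent("MACM", "C"), residue_percent("MACM", "CM"))
# prints: 25.0 75.0

# The same arithmetic with the counts from this record.
print(round(100 * 3 / 439, 1), round(100 * 11 / 439, 1), round(100 * 14 / 439, 1))
# prints: 0.7 2.5 3.2
print(round(100 * 3 / 438, 1), round(100 * 10 / 438, 1), round(100 * 13 / 438, 1))
# prints: 0.7 2.3 3.0
```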

Secondary structure:

>Translated Secondary Structure
MAKPIVAIVGRPNVGKSTLFNKLAGKRISIVQDTPGVTRDRIYAEAEWLNYKFTMIDTGG
CCCCEEEEECCCCCCHHHHHHHHCCCEEEEEECCCCCCHHHEEEEEEEEEEEEEEEECCC
IEPKSEDIIVSQMRRQAQIAIEMANVIIFLVDGKEGLAPADKEVAQMLRKSKKPVVLVVN
CCCCCCHHHHHHHHHHHHHHEEEEEEEEEEEECCCCCCCCHHHHHHHHHHCCCCEEEEEE
KIDKLKDENNAYEFYNLGIGDPVTISSSQALGLGDMLDRVVEYFKDDESAGEDDERINIA
CHHHHCCCCCCEEEEECCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEEE
FIGKPNVGKSSLINKLLGEERLIVSDIPGTTRDSIDSYVDTDFGEFTLIDTAGLRRKSKV
EEECCCCCHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHCCCCCCEEEEECCCCCHHHHH
KEEIERYSVIRTYASIERADVCILMIDATEGISEQDQKIIGYAHDINKAILVIVNKWDLV
HHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCHHHEEEHCCCCCEEEEEEECCCCC
EKDDKTMDKFKKELKVNLSFMPYAKYLFISAKTGQRVVKVLQTAKECYDNYNKRVKTGVL
CCCHHHHHHHHHHHEEEEEEECCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
NDVISQAIMMKEPPIVGTKRLKIYYVTQIGTKPPTFIFFVNDPACIHFSYQRYLENQLRE
HHHHHHHHHHCCCCCCCCEEEEEEEEEECCCCCCEEEEEECCCEEEEHHHHHHHHHHHHH
NFDFQGTGIKSEFRERKEK
CCCCCCCCCHHHHHHHCCC
>Mature Secondary Structure 
AKPIVAIVGRPNVGKSTLFNKLAGKRISIVQDTPGVTRDRIYAEAEWLNYKFTMIDTGG
CCCEEEEECCCCCCHHHHHHHHCCCEEEEEECCCCCCHHHEEEEEEEEEEEEEEEECCC
IEPKSEDIIVSQMRRQAQIAIEMANVIIFLVDGKEGLAPADKEVAQMLRKSKKPVVLVVN
CCCCCCHHHHHHHHHHHHHHEEEEEEEEEEEECCCCCCCCHHHHHHHHHHCCCCEEEEEE
KIDKLKDENNAYEFYNLGIGDPVTISSSQALGLGDMLDRVVEYFKDDESAGEDDERINIA
CHHHHCCCCCCEEEEECCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEEE
FIGKPNVGKSSLINKLLGEERLIVSDIPGTTRDSIDSYVDTDFGEFTLIDTAGLRRKSKV
EEECCCCCHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHCCCCCCEEEEECCCCCHHHHH
KEEIERYSVIRTYASIERADVCILMIDATEGISEQDQKIIGYAHDINKAILVIVNKWDLV
HHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCHHHEEEHCCCCCEEEEEEECCCCC
EKDDKTMDKFKKELKVNLSFMPYAKYLFISAKTGQRVVKVLQTAKECYDNYNKRVKTGVL
CCCHHHHHHHHHHHEEEEEEECCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
NDVISQAIMMKEPPIVGTKRLKIYYVTQIGTKPPTFIFFVNDPACIHFSYQRYLENQLRE
HHHHHHHHHHCCCCCCCCEEEEEEEEEECCCCCCEEEEEECCCEEEEHHHHHHHHHHHHH
NFDFQGTGIKSEFRERKEK
CCCCCCCCCHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA