Definition: Clostridium botulinum A str. ATCC 3502, complete genome.
Accession: NC_009495
Length: 3,886,916 bp

The map label for this gene is 148379248

Identifier: 148379248

GI number: 148379248

Start: 1380104

End: 1380715

Strand: Direct

Name: 148379248

Synonym: CBO1265

Alternate gene names: NA

Gene position: 1380104-1380715 (Clockwise)

Preceding gene: 148379247

Following gene: 148379249

Centisome position: 35.51
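
The coordinates above are enough to reproduce the gene length and the centisome position. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome is the start coordinate expressed as a percentage of the genome length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
GENOME_LENGTH = 3_886_916  # length of NC_009495 as given in this record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(start / genome_length * 100, 2)


length = gene_length(1380104, 1380715)
position = centisome(1380104, GENOME_LENGTH)
print(length, position)
```

Under these assumptions the computed length agrees with the 612-base gene sequence listed in this record, and the computed percentage agrees with the centisome value.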

GC content: 29.41

Gene sequence:

>612_bases
TTGAGATTAACAAGAGATTTTTATGCTAAGGATGCTAGAGTATTAGCAAAAGAATTATTAGGGAAAGTATTAGTTAGAGA
AGTAGATGGTATTAAATTGAAGGGGAAAATAGTAGAGACAGAAGCTTATATTGGGGCTATAGATAAAGCTTCTCATGCCT
ATGGTGGAAGAAGAACTAAAAGAACAGAACCTCTTTATGGGAAACCGGGTATAGCCTATGTATATTTTATATATGGTAAG
TATTTTTGTTTTAATATCATAAGTAAAACAGAAGGAGAGGCAGAAGGGGTCCTTATAAGAGCTCTAGAACCTTTAGAAAA
TATAAATCTTATATCAAAATTAAGGTTTAATAAGGAATTTGAAGAATTAAATAATTATCAAAGGAAAAATATAACTTCAG
GACCTTCAAAGCTTTGTATGGCTTTTAATATTAACAGAGATAATAACTGGGAAGATTTATGTGAAAGTTCTAGCTTGTAT
GTGGAGGATGTTTTTTATAATGATTTTGAAATTATTGAAACAGTAAGAGTAGGAATAGATTATGCAGAGGAAGCTAGAGA
TTTTTTATGGAGATATTATATAAAAGATAATGCTTTTGTTTCTGTAAAGTAA
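
A GC content figure such as the one reported above can be checked against a nucleotide sequence with a few lines of code. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, not part of any tool used to build this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as found in FASTA blocks) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(gc / len(seq) * 100, 2)


# Toy input: two of the four bases are G or C.
print(gc_content("ATGC"))
```

Passing the 612-base gene sequence (without its `>612_bases` header line) to the same function gives a value that can be compared with the GC content field.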

Upstream 100 bases:

>100_bases
GTATAAAGTTAATAACCATGATAATATAAAAATATATGATTAAAATATAAATATATGAAGAAAAATATTATTAAAATGTT
GAGTCCTAGGAGGAATAAAA

Downstream 100 bases:

>100_bases
CAAGTACTTTAAAATATAAATATTTAAATAAAATAGCTTTGTATAAGTGATATGGGGGATGAAATATGGGGGGAAGTTAT
AATTTATATAAAATAAAAGA

Product: 3-methyladenine DNA glycosylase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 203; Mature: 203

Protein sequence:

>203_residues
MRLTRDFYAKDARVLAKELLGKVLVREVDGIKLKGKIVETEAYIGAIDKASHAYGGRRTKRTEPLYGKPGIAYVYFIYGK
YFCFNIISKTEGEAEGVLIRALEPLENINLISKLRFNKEFEELNNYQRKNITSGPSKLCMAFNINRDNNWEDLCESSSLY
VEDVFYNDFEIIETVRVGIDYAEEARDFLWRYYIKDNAFVSVK
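
The gene sequence starts with TTG, yet the protein starts with M, because an alternative bacterial start codon is read as the initiator methionine. The sketch below shows one way to translate a coding sequence with that rule. It assumes the standard genetic code for internal codons and assumes the first codon is a valid start; `translate_cds` is an illustrative name.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is read as Met,
    even when it is an alternative start such as TTG.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
print(translate_cds("TTGAGATTAACAAGAGATTTTTATGCTAAG"))
```

The 612 bases hold 204 codons; the last one is the TAA stop codon, which leaves the 203 residues reported above.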

Sequences:

>Translated_203_residues
MRLTRDFYAKDARVLAKELLGKVLVREVDGIKLKGKIVETEAYIGAIDKASHAYGGRRTKRTEPLYGKPGIAYVYFIYGK
YFCFNIISKTEGEAEGVLIRALEPLENINLISKLRFNKEFEELNNYQRKNITSGPSKLCMAFNINRDNNWEDLCESSSLY
VEDVFYNDFEIIETVRVGIDYAEEARDFLWRYYIKDNAFVSVK
>Mature_203_residues
MRLTRDFYAKDARVLAKELLGKVLVREVDGIKLKGKIVETEAYIGAIDKASHAYGGRRTKRTEPLYGKPGIAYVYFIYGK
YFCFNIISKTEGEAEGVLIRALEPLENINLISKLRFNKEFEELNNYQRKNITSGPSKLCMAFNINRDNNWEDLCESSSLY
VEDVFYNDFEIIETVRVGIDYAEEARDFLWRYYIKDNAFVSVK

Specific function: Unknown

COG id: COG2094

COG function: function code L; 3-methyladenine DNA glycosylase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the DNA glycosylase MPG family

Homologues:

Organism=Homo sapiens, GI62632769, Length=206, Percent_Identity=39.3203883495146, Blast_Score=141, Evalue=5e-34
Organism=Homo sapiens, GI62632765, Length=206, Percent_Identity=39.3203883495146, Blast_Score=141, Evalue=5e-34
Organism=Homo sapiens, GI62632771, Length=206, Percent_Identity=39.3203883495146, Blast_Score=140, Evalue=5e-34

Paralogues:

None

Copy number: NA

Swissprot (ID and AC): 3MGH_CLOB1 (A7FTE3)

Other databases:

- EMBL:   CP000726
- RefSeq:   YP_001383620.1
- ProteinModelPortal:   A7FTE3
- SMR:   A7FTE3
- STRING:   A7FTE3
- GeneID:   5398012
- GenomeReviews:   CP000726_GR
- KEGG:   cba:CLB_1293
- eggNOG:   COG2094
- HOGENOM:   HBG664239
- OMA:   EHISSQY
- ProtClustDB:   PRK00802
- BioCyc:   CBOT441770:CLB_1293-MONOMER
- HAMAP:   MF_00527
- InterPro:   IPR011034
- InterPro:   IPR003180
- Gene3D:   G3DSA:3.10.300.10
- PANTHER:   PTHR10429
- TIGRFAMs:   TIGR00567

Pfam domain/function: PF02245 Pur_DNA_glyco; SSF50486 FMT_C_like

EC number: 3.2.2.-

Molecular weight: Translated: 23581; Mature: 23581

Theoretical pI: Translated: 7.18; Mature: 7.18

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
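
The percentages above follow from residue counts in the 203-residue protein sequence. A minimal sketch, assuming each percentage is the residue count divided by the sequence length and rounded to one decimal place; `residue_percent` is a hypothetical helper name.

```python
# Protein sequence copied from the ">203_residues" block of this record.
PROTEIN = (
    "MRLTRDFYAKDARVLAKELLGKVLVREVDGIKLKGKIVETEAYIGAIDKASHAYGGRRTKRTEPLYGKPGIAYVYFIYGK"
    "YFCFNIISKTEGEAEGVLIRALEPLENINLISKLRFNKEFEELNNYQRKNITSGPSKLCMAFNINRDNNWEDLCESSSLY"
    "VEDVFYNDFEIIETVRVGIDYAEEARDFLWRYYIKDNAFVSVK"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in the protein."""
    count = sum(protein.count(residue) for residue in residues)
    return round(count / len(protein) * 100, 1)


for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{residue_percent(PROTEIN, residues)} %{label}")
```

Because the translated and mature sequences are identical in this record, the same figures apply to both.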

Secondary structure:

>Translated Secondary Structure
MRLTRDFYAKDARVLAKELLGKVLVREVDGIKLKGKIVETEAYIGAIDKASHAYGGRRTK
CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHHHHHHHHHHHHCCCCCCC
RTEPLYGKPGIAYVYFIYGKYFCFNIISKTEGEAEGVLIRALEPLENINLISKLRFNKEF
CCCCCCCCCCCEEEHHHHHHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHHCCHHH
EELNNYQRKNITSGPSKLCMAFNINRDNNWEDLCESSSLYVEDVFYNDFEIIETVRVGID
HHHHHHHHCCCCCCCCCEEEEEECCCCCCHHHHHCCCCEEEEEECCCCHHHHHHHHHCCC
YAEEARDFLWRYYIKDNAFVSVK
HHHHHHHHHHHHEECCCEEEEEC
>Mature Secondary Structure
MRLTRDFYAKDARVLAKELLGKVLVREVDGIKLKGKIVETEAYIGAIDKASHAYGGRRTK
CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHHHHHHHHHHHHCCCCCCC
RTEPLYGKPGIAYVYFIYGKYFCFNIISKTEGEAEGVLIRALEPLENINLISKLRFNKEF
CCCCCCCCCCCEEEHHHHHHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHHCCHHH
EELNNYQRKNITSGPSKLCMAFNINRDNNWEDLCESSSLYVEDVFYNDFEIIETVRVGID
HHHHHHHHCCCCCCCCCEEEEEECCCCCCHHHHHCCCCEEEEEECCCCHHHHHHHHHCCC
YAEEARDFLWRYYIKDNAFVSVK
HHHHHHHHHHHHEECCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA