Definition Clostridium botulinum B1 str. Okra, complete genome.
Accession NC_010516
Length 3,958,233

The map label for this gene is 170757841

Identifier: 170757841

GI number: 170757841

Start: 3574709

End: 3575626

Strand: Reverse

Name: 170757841

Synonym: CLD_1220

Alternate gene names: NA

Gene position: 3575626-3574709 (Counterclockwise)

Preceding gene: 170755584

Following gene: 170756783

Centisome position: 90.33
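
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates. It also assumes the centisome position is the gene's 5' end expressed as a percentage of the genome length. That is an inference that happens to reproduce the listed value, not a documented definition. The function names are illustrative only.

```python
def gene_length(start, end):
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


# Values taken from the record above.
GENOME_LENGTH = 3958233
START, END = 3574709, 3575626

print(gene_length(START, END))  # 918

# The gene is on the reverse strand, so its 5' end is the higher coordinate.
print(round(centisome(END, GENOME_LENGTH), 2))  # 90.33
```

Using the lower coordinate (3574709) instead gives about 90.31, which is why the higher coordinate is assumed here.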

GC content: 28.0

Gene sequence:

>918_bases
ATGGATTTTAATTATATAGAAGATTATACAGATGGTATAGTTATTAAAGATGTAAGAAACTTTGAATTAGCACATATTTT
TGAATGTGGTCAATGTTTCAGATGGTACAAAACAGAAGAGGGTTCTTATATAGGAGTAGCTTATGGAAAGGTTATAGAAG
TAGAGAAAGCAAATAATGATGTAATATTACATAATGCTACAGAAGAGGATTTTAAAAATATTTGGGCAGAATATTTTGAT
TTATATAGAGATTATAGTGAAATAAAGAATATATTAAGTAAGGATGAAATATTAGCTAAGTCTGTTGAATTTGGACATGG
AATAAGGCTTTTAAAACAAGATCCTTTTGAAATAATAGTGTCTTTCATAATTTCTGCTAATAATAGAATACCAATGATTA
AAAAAGCTATAAAAAATATAAGTGAAAGATGGGGTGATCCTATAGAGTATAAAGGTAACATATATTATAGCTTTCCTACA
GTAGAACAACTTAAAGATGCAACAGAGGATGAATTAAAAGCATGTAGTGTTGGTTTTAGAGCTAAATATATAAAAGATAC
TGTAAATAAAATATACCAAAATTCTATAGAAGAATGTGAGCAATATGAGAAAGAGTATGATATGTTATGGATAAAGAATC
AACAAGATGATATATGTCATAAAGTGCTACAAAATTATAGTGGTATAGGTGCTAAAGTTGCAGATTGTGTTATGTTATTT
TCTATGGAAAAGTATTCTGCATTTCCAGTAGATGTTTGGGTTAAGAGGGCTATGCAATATTTCTATCTAGCCCCTGATGT
ATCCCTAAAAAAAATAAGAGATTTTGGAAGAGAGAAATTTGGAGAATTATCAGGATTTGCCCAACAATATTTGTTTTATT
ATGCAAGAGAAAACAAAATTGACGTTAATCAAGAATAA
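
A GC content figure like the one listed for this gene is normally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming a plain upper- or lower-case nucleotide string with the FASTA header and line breaks already removed. The function name is hypothetical and the example strings are toy inputs, not the full gene.

```python
def gc_content(sequence):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy inputs; the second is the first ten bases of the gene sequence above.
print(gc_content("ATGC"))        # 50.0
print(gc_content("ATGGATTTTA"))  # 20.0
```

To apply it to the record, join the lines of the gene sequence into one string before calling the function.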

Upstream 100 bases:

>100_bases
ATAAGCTGTACTTTATATTTAAAGTACAGCTTATTTATTTTCAAATTTTAGTATATACTATTTAAGAAGTATAAAAATTT
ATTTATAAGGAATGATTTAT

Downstream 100 bases:

>100_bases
TAATAATAAATTGAAGTATAGGAAGCAATAGCATATTAAAATAAATTAAAATTTTTGATATAATCTATGTGGTTAGAATT
TTAAGTTTTTTATATATTTT

Product: putative 8-oxoguanine DNA glycosylase

Products: NA

Alternate protein names: 8-oxoguanine DNA glycosylase; DNA-(apurinic or apyrimidinic site) lyase; AP lyase [H]

Number of amino acids: Translated: 305; Mature: 305

Protein sequence:

>305_residues
MDFNYIEDYTDGIVIKDVRNFELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEKANNDVILHNATEEDFKNIWAEYFD
LYRDYSEIKNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKKAIKNISERWGDPIEYKGNIYYSFPT
VEQLKDATEDELKACSVGFRAKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLF
SMEKYSAFPVDVWVKRAMQYFYLAPDVSLKKIRDFGREKFGELSGFAQQYLFYYARENKIDVNQE
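
The protein sequence follows from the gene sequence by codon-wise translation: 918 bases give 306 codons, which is 305 residues plus a stop codon. The sketch below uses the standard genetic code. The bacterial table (NCBI table 11) assigns the same amino acids to codons and differs only in permitted start codons. The function name is illustrative, and only short fragments of the gene are translated here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 24 and last 9 bases of the gene sequence in this record.
print(translate("ATGGATTTTAATTATATAGAAGAT"))  # MDFNYIED
print(translate("CAAGAATAA"))                 # QE
```

Both fragments agree with the start (MDFNYIED) and end (QE) of the listed protein sequence.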

Sequences:

>Translated_305_residues
MDFNYIEDYTDGIVIKDVRNFELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEKANNDVILHNATEEDFKNIWAEYFD
LYRDYSEIKNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKKAIKNISERWGDPIEYKGNIYYSFPT
VEQLKDATEDELKACSVGFRAKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLF
SMEKYSAFPVDVWVKRAMQYFYLAPDVSLKKIRDFGREKFGELSGFAQQYLFYYARENKIDVNQE
>Mature_305_residues
MDFNYIEDYTDGIVIKDVRNFELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEKANNDVILHNATEEDFKNIWAEYFD
LYRDYSEIKNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKKAIKNISERWGDPIEYKGNIYYSFPT
VEQLKDATEDELKACSVGFRAKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLF
SMEKYSAFPVDVWVKRAMQYFYLAPDVSLKKIRDFGREKFGELSGFAQQYLFYYARENKIDVNQE

Specific function: DNA repair enzyme that incises DNA at 8-oxoG residues. Excises 7,8-dihydro-8-oxoguanine and 2,6-diamino-4-hydroxy-5-N-methylformamidopyrimidine (FAPY) from damaged DNA. Has a beta-lyase activity that nicks DNA 3' to the lesion [H]

COG id: COG0122

COG function: function code L; 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the type-1 OGG1 family [H]

Homologues:

Organism=Homo sapiens, GI4505495, Length=293, Percent_Identity=24.5733788395904, Blast_Score=102, Evalue=4e-22
Organism=Homo sapiens, GI8670534, Length=293, Percent_Identity=24.2320819112628, Blast_Score=100, Evalue=2e-21
Organism=Homo sapiens, GI8670542, Length=289, Percent_Identity=24.2214532871972, Blast_Score=99, Evalue=4e-21
Organism=Homo sapiens, GI8670540, Length=289, Percent_Identity=24.2214532871972, Blast_Score=99, Evalue=4e-21
Organism=Homo sapiens, GI8670530, Length=289, Percent_Identity=24.2214532871972, Blast_Score=99, Evalue=5e-21
Organism=Homo sapiens, GI8670532, Length=289, Percent_Identity=24.2214532871972, Blast_Score=99, Evalue=6e-21
Organism=Saccharomyces cerevisiae, GI6323580, Length=310, Percent_Identity=30.3225806451613, Blast_Score=108, Evalue=1e-24
Organism=Drosophila melanogaster, GI24640654, Length=311, Percent_Identity=29.903536977492, Blast_Score=98, Evalue=9e-21
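
Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be read into a dictionary for sorting or filtering. The parser below is a sketch under that assumption. The function name and the choice of numeric types are illustrative and not part of the source database.

```python
def parse_homologue(line):
    """Parse one homologue entry into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number appears without a key in the listing.
            record["GI"] = field[2:]
    casts = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": int,
        "Evalue": float,
    }
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


entry = parse_homologue(
    "Organism=Homo sapiens, GI4505495, Length=293, "
    "Percent_Identity=24.5733788395904, Blast_Score=102, Evalue=4e-22,"
)
print(entry["Organism"], entry["GI"], entry["Blast_Score"])  # Homo sapiens 4505495 102
```

Parsed this way, the entries can for example be ranked by `Blast_Score` or filtered on `Evalue`.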

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011257
- InterPro:   IPR003265
- InterPro:   IPR003583
- InterPro:   IPR023170
- InterPro:   IPR004577
- InterPro:   IPR012904 [H]

Pfam domain/function: PF00730 HhH-GPD; PF07934 OGG_N [H]

EC number: 4.2.99.18 [H]

Molecular weight: Translated: 35923; Mature: 35923

Theoretical pI: Translated: 4.75; Mature: 4.75

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
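
The percentages above can be reproduced by counting residues in the protein sequence. The sketch below assumes each figure is the residue count divided by the sequence length, rounded to one decimal place. The function name is illustrative. The sequence is the 305-residue protein listed in this record.

```python
PROTEIN = (
    "MDFNYIEDYTDGIVIKDVRNFELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEKANNDVILHNATEEDFKNIWAEYFD"
    "LYRDYSEIKNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKKAIKNISERWGDPIEYKGNIYYSFPT"
    "VEQLKDATEDELKACSVGFRAKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLF"
    "SMEKYSAFPVDVWVKRAMQYFYLAPDVSLKKIRDFGREKFGELSGFAQQYLFYYARENKIDVNQE"
)


def cys_met_content(protein):
    """Percent Cys, Met and Cys+Met, each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


print(len(PROTEIN))              # 305
print(cys_met_content(PROTEIN))  # {'Cys': 2.0, 'Met': 2.0, 'Cys+Met': 3.9}
```

Six cysteines and six methionines in 305 residues give about 1.97 % each. This explains why the combined figure is 3.9 rather than 4.0.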

Secondary structure:

>Translated Secondary Structure
MDFNYIEDYTDGIVIKDVRNFELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEKANND
CCCCCHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECEEEEEEECCCC
VILHNATEEDFKNIWAEYFDLYRDYSEIKNILSKDEILAKSVEFGHGIRLLKQDPFEIIV
EEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCHHHHHH
SFIISANNRIPMIKKAIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR
HHHHHCCCCCHHHHHHHHHHHHHCCCCCEECCEEEECCCCHHHHHCCCHHHHHHHHHHHH
AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLF
HHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHHCCCCCCHHHHHHHHHH
SMEKYSAFPVDVWVKRAMQYFYLAPDVSLKKIRDFGREKFGELSGFAQQYLFYYARENKI
HHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
DVNQE
CCCCC
>Mature Secondary Structure
MDFNYIEDYTDGIVIKDVRNFELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEKANND
CCCCCHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECEEEEEEECCCC
VILHNATEEDFKNIWAEYFDLYRDYSEIKNILSKDEILAKSVEFGHGIRLLKQDPFEIIV
EEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCHHHHHH
SFIISANNRIPMIKKAIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR
HHHHHCCCCCHHHHHHHHHHHHHCCCCCEECCEEEECCCCHHHHHCCCHHHHHHHHHHHH
AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLF
HHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHHCCCCCCHHHHHHHHHH
SMEKYSAFPVDVWVKRAMQYFYLAPDVSLKKIRDFGREKFGELSGFAQQYLFYYARENKI
HHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
DVNQE
CCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9371463 [H]