Definition Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession NC_012563
Length 4,155,278

The map label for this gene is atcC [H]

Identifier: 226947882

GI number: 226947882

Start: 792179

End: 793402

Strand: Reverse

Name: atcC [H]

Synonym: CLM_0737

Alternate gene names: 226947882

Gene position: 793402-792179 (Counterclockwise)
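
The coordinates above can be checked against the sequence and protein lengths reported further down in this record. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; the function name `gene_span` is hypothetical and not part of any database tooling.

```python
def gene_span(start: int, end: int) -> dict:
    """Summarise a gene from 1-based, inclusive chromosome coordinates."""
    length = abs(end - start) + 1      # both endpoints are counted
    codons = length // 3               # complete codons in the reading frame
    return {
        "length_bp": length,
        "codons": codons,
        "residues": codons - 1,        # assumes the final codon is a stop
        "in_frame": length % 3 == 0,
    }


# Start and End as listed in this record.
summary = gene_span(792179, 793402)
print(summary)
```

Under these assumptions the span is 1224 bases, which matches the `>1224_bases` header, and 408 codons minus one stop gives the 407 residues listed for the protein.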

Preceding gene: 226947886

Following gene: 226947877

Centisome position: 19.09
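
A centisome position expresses a coordinate as a percentage of the chromosome length. The sketch below is one plausible way to reproduce the value above. It assumes the position is taken from the first coordinate of the "Gene position" field (793402, the 5' end of this reverse-strand gene); the record does not state this, so treat it as an assumption. The function name `centisome` is hypothetical.

```python
def centisome(position: int, chromosome_length: int) -> float:
    """Return a chromosome coordinate as a percentage of total length."""
    if not 0 < position <= chromosome_length:
        raise ValueError("position lies outside the chromosome")
    return round(100.0 * position / chromosome_length, 2)


# Chromosome length and coordinates as listed in this record.
CHROMOSOME_LENGTH = 4155278
value = centisome(793402, CHROMOSOME_LENGTH)
alternative = centisome(792179, CHROMOSOME_LENGTH)
print(value, alternative)
```

With the higher coordinate the result agrees with the listed 19.09, whereas the lower coordinate gives 19.06, which is why the first convention is assumed here.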

GC content: 34.56
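
GC content is the percentage of bases that are G or C. A minimal sketch of the calculation follows; `gc_percent` is a hypothetical helper, and the demonstration input is only the first 30 bases of the gene sequence in this record, not the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 30 bases of the gene sequence listed in this record.
first_codons = "ATGTTAATTTGTGATAAAAAAAGATTAGAA"
demo = gc_percent(first_codons)
print(demo)
```

Applied to all 1224 bases, the listed figure of 34.56 would correspond to 423 G or C bases (423 / 1224 ≈ 0.3456).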

Gene sequence:

>1224_bases
ATGTTAATTTGTGATAAAAAAAGATTAGAAGATAAGATTATTACCTTCAGTAAATTCGGCGCTACAGAAAAAGATGGAGT
AACAAGATTATCTTTATCAAAGCCAGCTCTTCAAGCAAGAGCAGAATTTTCAAAAAGAATGAATGAACTAGGGGCAAGTA
TAACAACAGATGATATGGGAAATATGTATGCTACTTTCAAAGGATCTGAAGAAAACCTTCCCCATATTGCAATGGGATCG
CATTGTGATTCAGTGGTACAAGGTGGAAACTATGATGGTATTTTAGGGGTACTTACAGCAATGGAAGTAGCTGAAACTAT
AGTTACAAGAAAAATTCCCCATCGTCATCCAATTACAGTTATGATTTGGACTAATGAAGAAGGCGCTCGTTTTGATCCTG
CTATGATGTCCTCAGGTGTTATTACAGGTAAATTTGATAAAGCAGAAATGTTAGATTCTAAGGATAAAAAGGGTATATCC
TTTGGTGAAGCACTAGATGCAAGCGGATATAAAGGTGAAGAAAAAAACAGAATAAATCCCAAAGATTATATGGCTCTTCT
CGAACTTCATATTGAACAAGGTCCAGTTCTAGAATCCTCAAAAATAGATATAGGTGTTGTAGAAGGCGTTGTTGGAATGG
TAAATTATGAATTTGAATTTATTGGCCAAGCCGGTCATGCTGGAACAGTTCCACAAAAAATGAGAAAGGATGCTCTTTTA
GCAGCTTCTGAAGCCATTCAATATTTGCATAAGGAACTTGATAAACTTCATGATAATTTAGTGTATACTACCGGTAGAAT
AATTTGTTTCCCTAATATACATACAATTATACCGGATAATGTTAAATTCACATTAGATGCAAGACATCAAGATCCAAAAA
TAACACAACAAGTGGTTAAAATTATCGAAAATATCCCTAAAGAATTAGCAAATTGTAAAGTCACTTACAGGAAATTATGG
TCACGTAAAACTGTTAGTTTCAATAACCAATTGATTAATTTCGTGGAGAAAAATGCCAATCTTTACGGTTACTCTAATAT
GAGAATGTACAGCGGTCCCGGCCATGATGCCCAATTTGCAGCAGATATGTTACCTACGACTATGATTTTCGTTCCAAGTA
TTGGCGGGCACAGTCATTGTGAAATAGAAAAGACGTCACTTGATAATTGTTTAAAAGGAGCTAACGTGTTGCTACAAACA
ATTTTAGATATAGATGAAAAATAA
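
Although the gene lies on the reverse strand, the sequence above is already written in the coding direction (it begins with ATG and ends with TAA), so it can be translated directly. The sketch below builds the standard codon table and translates up to the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard table is an assumption of this example rather than something stated in the record.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":              # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence in this record.
n_terminus = translate("ATGTTAATTTGTGATAAAAAAAGATTAGAA")
c_terminus = translate("ATTTTAGATATAGATGAAAAATAA")
print(n_terminus, c_terminus)
```

The two fragments translate to MLICDKKRLE and ILDIDEK, which agree with the start and end of the protein sequence listed in this record.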

Upstream 100 bases:

>100_bases
CCATAAAATAAAAGCATTAAAACTAGAAGACATGATACAAATATAGACATGACAAGCGTTTTAGTAAGTTATATACTAAA
TATGCAAGGGAGTGTTTTAT

Downstream 100 bases:

>100_bases
ATATTCACTTATATGGAATCTCAGCAATAGCTTTATTATTACTGGGATTCTACATTTATATTAGTTAATTTAAAATTCTA
AAGGAAACTTTCACAAGAAA

Product: allantoate amidohydrolase

Products: NA

Alternate protein names: ORF4 [H]

Number of amino acids: Translated: 407; Mature: 407

Protein sequence:

>407_residues
MLICDKKRLEDKIITFSKFGATEKDGVTRLSLSKPALQARAEFSKRMNELGASITTDDMGNMYATFKGSEENLPHIAMGS
HCDSVVQGGNYDGILGVLTAMEVAETIVTRKIPHRHPITVMIWTNEEGARFDPAMMSSGVITGKFDKAEMLDSKDKKGIS
FGEALDASGYKGEEKNRINPKDYMALLELHIEQGPVLESSKIDIGVVEGVVGMVNYEFEFIGQAGHAGTVPQKMRKDALL
AASEAIQYLHKELDKLHDNLVYTTGRIICFPNIHTIIPDNVKFTLDARHQDPKITQQVVKIIENIPKELANCKVTYRKLW
SRKTVSFNNQLINFVEKNANLYGYSNMRMYSGPGHDAQFAADMLPTTMIFVPSIGGHSHCEIEKTSLDNCLKGANVLLQT
ILDIDEK

Sequences:

>Translated_407_residues
MLICDKKRLEDKIITFSKFGATEKDGVTRLSLSKPALQARAEFSKRMNELGASITTDDMGNMYATFKGSEENLPHIAMGS
HCDSVVQGGNYDGILGVLTAMEVAETIVTRKIPHRHPITVMIWTNEEGARFDPAMMSSGVITGKFDKAEMLDSKDKKGIS
FGEALDASGYKGEEKNRINPKDYMALLELHIEQGPVLESSKIDIGVVEGVVGMVNYEFEFIGQAGHAGTVPQKMRKDALL
AASEAIQYLHKELDKLHDNLVYTTGRIICFPNIHTIIPDNVKFTLDARHQDPKITQQVVKIIENIPKELANCKVTYRKLW
SRKTVSFNNQLINFVEKNANLYGYSNMRMYSGPGHDAQFAADMLPTTMIFVPSIGGHSHCEIEKTSLDNCLKGANVLLQT
ILDIDEK
>Mature_407_residues
MLICDKKRLEDKIITFSKFGATEKDGVTRLSLSKPALQARAEFSKRMNELGASITTDDMGNMYATFKGSEENLPHIAMGS
HCDSVVQGGNYDGILGVLTAMEVAETIVTRKIPHRHPITVMIWTNEEGARFDPAMMSSGVITGKFDKAEMLDSKDKKGIS
FGEALDASGYKGEEKNRINPKDYMALLELHIEQGPVLESSKIDIGVVEGVVGMVNYEFEFIGQAGHAGTVPQKMRKDALL
AASEAIQYLHKELDKLHDNLVYTTGRIICFPNIHTIIPDNVKFTLDARHQDPKITQQVVKIIENIPKELANCKVTYRKLW
SRKTVSFNNQLINFVEKNANLYGYSNMRMYSGPGHDAQFAADMLPTTMIFVPSIGGHSHCEIEKTSLDNCLKGANVLLQT
ILDIDEK

Specific function: Converts N-carbamyl-L-amino acids to L-amino acids [H]

COG id: COG0624

COG function: function code E; Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M20 family [H]

Homologues:

Organism=Escherichia coli, GI1786726, Length=395, Percent_Identity=32.1518987341772, Blast_Score=208, Evalue=6e-55

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010158
- InterPro:   IPR002933
- InterPro:   IPR011650 [H]

Pfam domain/function: PF07687 M20_dimer; PF01546 Peptidase_M20 [H]

EC number: 3.5.3.- [C]

Molecular weight: Translated: 45248; Mature: 45248
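
A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below illustrates that approach. The mass table and the helper name `molecular_weight` are assumptions of this example, and the record does not say which table produced the value above, so small differences are to be expected.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Small sanity checks: free glycine and the dipeptide Gly-Ala.
glycine = molecular_weight("G")
gly_ala = molecular_weight("GA")
print(round(glycine, 2), round(gly_ala, 2))
```

Passing the 407-residue protein sequence from this record to `molecular_weight` should give a figure close to the listed one, though exact agreement depends on the mass table used.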

Theoretical pI: Translated: 6.57; Mature: 6.57

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
4.2 %Met     (Translated Protein)
5.7 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
4.2 %Met     (Mature Protein)
5.7 %Cys+Met (Mature Protein)
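
These percentages are residue counts divided by the protein length. A minimal sketch follows; `cys_met_content` is a hypothetical helper, and the counts of 6 Cys and 17 Met used in the check are inferred from the listed percentages and the 407-residue length rather than stated in the record.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys, Met and Cys+Met residues, rounded to one decimal."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy peptide: one Cys and one Met in four residues.
toy = cys_met_content("MCAA")

# Inferred counts for this record: 6 Cys and 17 Met in 407 residues.
inferred = [round(100.0 * k / 407, 1) for k in (6, 17, 23)]
print(toy, inferred)
```

The inferred counts reproduce the listed 1.5, 4.2 and 5.7 percent.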

Secondary structure:

>Translated Secondary Structure
MLICDKKRLEDKIITFSKFGATEKDGVTRLSLSKPALQARAEFSKRMNELGASITTDDMG
CCCCCCHHHHHHHEEHHHCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCC
NMYATFKGSEENLPHIAMGSHCDSVVQGGNYDGILGVLTAMEVAETIVTRKIPHRHPITV
CEEEEEECCCCCCCEEEECCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEE
MIWTNEEGARFDPAMMSSGVITGKFDKAEMLDSKDKKGISFGEALDASGYKGEEKNRINP
EEEECCCCCCCCHHHHHCCCEECCCCHHHHHCCCCCCCCCCCHHHCCCCCCCCCCCCCCH
KDYMALLELHIEQGPVLESSKIDIGVVEGVVGMVNYEFEFIGQAGHAGTVPQKMRKDALL
HHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHEEEEEEEECCCCCCCCHHHHHHHHHH
AASEAIQYLHKELDKLHDNLVYTTGRIICFPNIHTIIPDNVKFTLDARHQDPKITQQVVK
HHHHHHHHHHHHHHHHHCCEEEECCCEEECCCCCEECCCCEEEEEECCCCCHHHHHHHHH
IIENIPKELANCKVTYRKLWSRKTVSFNNQLINFVEKNANLYGYSNMRMYSGPGHDAQFA
HHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCEEEECCEEEECCCCCCHHHH
ADMLPTTMIFVPSIGGHSHCEIEKTSLDNCLKGANVLLQTILDIDEK
HHHCCEEEEEEECCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MLICDKKRLEDKIITFSKFGATEKDGVTRLSLSKPALQARAEFSKRMNELGASITTDDMG
CCCCCCHHHHHHHEEHHHCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCC
NMYATFKGSEENLPHIAMGSHCDSVVQGGNYDGILGVLTAMEVAETIVTRKIPHRHPITV
CEEEEEECCCCCCCEEEECCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEE
MIWTNEEGARFDPAMMSSGVITGKFDKAEMLDSKDKKGISFGEALDASGYKGEEKNRINP
EEEECCCCCCCCHHHHHCCCEECCCCHHHHHCCCCCCCCCCCHHHCCCCCCCCCCCCCCH
KDYMALLELHIEQGPVLESSKIDIGVVEGVVGMVNYEFEFIGQAGHAGTVPQKMRKDALL
HHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHEEEEEEEECCCCCCCCHHHHHHHHHH
AASEAIQYLHKELDKLHDNLVYTTGRIICFPNIHTIIPDNVKFTLDARHQDPKITQQVVK
HHHHHHHHHHHHHHHHHCCEEEECCCEEECCCCCEECCCCEEEEEECCCCCHHHHHHHHH
IIENIPKELANCKVTYRKLWSRKTVSFNNQLINFVEKNANLYGYSNMRMYSGPGHDAQFA
HHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCEEEECCEEEECCCCCCHHHH
ADMLPTTMIFVPSIGGHSHCEIEKTSLDNCLKGANVLLQTILDIDEK
HHHCCEEEEEEECCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCCC
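
The prediction above pairs each line of residues with a line of per-residue states. The sketch below summarises such a state string as percentages. It assumes the common three-state convention (H for helix, E for strand, C for coil), which the record does not spell out; `state_composition` is a hypothetical helper.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy example: four helix, two strand and two coil positions.
toy_states = state_composition("HHHHEECC")
print(toy_states)
```

To summarise the full prediction, join the state lines (every second line of a block) into one string before calling the function.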

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1732229 [H]