The gene/protein map for NC_012581 is currently unavailable.
Definition Bacillus anthracis str. CDC 684, complete genome.
Accession NC_012581
Length 5,230,115

Click here to switch to the map view.

The map label for this gene is thyA

Identifier: 227814947

GI number: 227814947

Start: 2150301

End: 2151257

Strand: Reverse

Name: thyA

Synonym: BAMEG_2358

Alternate gene names: 227814947

Gene position: 2151257-2150301 (Counterclockwise)

Preceding gene: 227814950

Following gene: 227814946

Centisome position: 41.13

GC content: 36.47

Gene sequence:

>957_bases
ATGAAACATGCTGAAAATGAATACTTAAATTTATGCCGCCATGTAATGGAACATGGTACGAAGAAAGAAGATCGTACAGG
GACAGGCACTGTATCTGTATTTGGATATCAAATGCGTTTTGATTTAAGTAAAGGTTTTCCTTTATTAACGACAAAGAGAG
TGCCGTTTCGCCTTGTAGCAAGTGAATTGCTTTGGTTTATGAAAGGTGATACAAATATTCGTTATTTATTGCAGCATAAT
AATAATATTTGGAATGAATGGGCATTTAAGAGCTGGGTAGAAAGTGACGAGTATACTGGTCCTGACATGATTGATTTCGG
TCTTCGCTCACAACAAGATGAAGAATTTAAAGTACAGTACGATGAGCAAATGGAATTGTTTAAAAAGAACGTTTTAGAAG
ATGATGAGTTCTCAAATAAATATGGTTATTTAGGAGACGTATACGGTAAGCAGTGGCGTGCTTGGAAAACGACAGCTGGT
GAGACACTTGATCAATTAAAAGATGTAATTGAAATGATTAAGAAAACGCCAGATTCGCGTCGTCTAATTGTTTCTGCTTG
GAATCCTGAAGATGTACCAAGTATGGCATTACCACCTTGTCATACGTTATTCCAATTTTATGTAGCAGATGGCAAACTTT
CTTGTCAGCTATATCAAAGAAGTGGTGACATATTCCTTGGAATTCCATTTAACATTGCAAGTTACTCACTACTGACACAT
TTAATTGCACATGAATGCGGACTTGAAGTGGGAGAATTTGTTCATACAATTGGAGATGCACACATTTATACGAATCATTT
TGAGCAAGTAGAAAAGCAATTGGCACGTGAACCACGTCCATTCCCGAAACTTACACTAAATCCAGATGTGAAATCTGTGT
TTGATTTTGAAATGGAAGATTTAACAATTGAAGGATATGATCCACATCCAGCAATTAAAGCACCAGTTGCAGTGTAA

Upstream 100 bases:

>100_bases
AGAATTACCGTGAGAAATTTTATTAATTAAGATTATTAAATTTACCATTTTTCATGTTATGATAAATTGCATATAAGATT
GAAAGAAGGTTTTACTACAT

Downstream 100 bases:

>100_bases
TTGTAGAGGAGATGAAAAGATGATAGTTTCATTTATGGTCGCAATGGACGAAAATAGAGTAATTGGTAAAGATAATAATT
TACCTTGGCGTTTACCGAGT

Product: thymidylate synthase

Products: NA

Alternate protein names: TS; TSase

Number of amino acids: Translated: 318; Mature: 318

Protein sequence:

>318_residues
MKHAENEYLNLCRHVMEHGTKKEDRTGTGTVSVFGYQMRFDLSKGFPLLTTKRVPFRLVASELLWFMKGDTNIRYLLQHN
NNIWNEWAFKSWVESDEYTGPDMIDFGLRSQQDEEFKVQYDEQMELFKKNVLEDDEFSNKYGYLGDVYGKQWRAWKTTAG
ETLDQLKDVIEMIKKTPDSRRLIVSAWNPEDVPSMALPPCHTLFQFYVADGKLSCQLYQRSGDIFLGIPFNIASYSLLTH
LIAHECGLEVGEFVHTIGDAHIYTNHFEQVEKQLAREPRPFPKLTLNPDVKSVFDFEMEDLTIEGYDPHPAIKAPVAV

Sequences:

>Translated_318_residues
MKHAENEYLNLCRHVMEHGTKKEDRTGTGTVSVFGYQMRFDLSKGFPLLTTKRVPFRLVASELLWFMKGDTNIRYLLQHN
NNIWNEWAFKSWVESDEYTGPDMIDFGLRSQQDEEFKVQYDEQMELFKKNVLEDDEFSNKYGYLGDVYGKQWRAWKTTAG
ETLDQLKDVIEMIKKTPDSRRLIVSAWNPEDVPSMALPPCHTLFQFYVADGKLSCQLYQRSGDIFLGIPFNIASYSLLTH
LIAHECGLEVGEFVHTIGDAHIYTNHFEQVEKQLAREPRPFPKLTLNPDVKSVFDFEMEDLTIEGYDPHPAIKAPVAV
>Mature_318_residues
MKHAENEYLNLCRHVMEHGTKKEDRTGTGTVSVFGYQMRFDLSKGFPLLTTKRVPFRLVASELLWFMKGDTNIRYLLQHN
NNIWNEWAFKSWVESDEYTGPDMIDFGLRSQQDEEFKVQYDEQMELFKKNVLEDDEFSNKYGYLGDVYGKQWRAWKTTAG
ETLDQLKDVIEMIKKTPDSRRLIVSAWNPEDVPSMALPPCHTLFQFYVADGKLSCQLYQRSGDIFLGIPFNIASYSLLTH
LIAHECGLEVGEFVHTIGDAHIYTNHFEQVEKQLAREPRPFPKLTLNPDVKSVFDFEMEDLTIEGYDPHPAIKAPVAV

Specific function: Provides the sole de novo source of dTMP for DNA biosynthesis

COG id: COG0207

COG function: function code F; Thymidylate synthase

Gene ontology:

Cell location: Cytoplasm

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thymidylate synthase family. Bacterial- type thyA subfamily

Homologues:

Organism=Homo sapiens, GI4507751, Length=324, Percent_Identity=47.5308641975309, Blast_Score=300, Evalue=2e-81,
Organism=Escherichia coli, GI1789191, Length=312, Percent_Identity=51.6025641025641, Blast_Score=325, Evalue=3e-90,
Organism=Caenorhabditis elegans, GI71993377, Length=327, Percent_Identity=42.5076452599388, Blast_Score=272, Evalue=2e-73,
Organism=Saccharomyces cerevisiae, GI6324648, Length=196, Percent_Identity=51.530612244898, Blast_Score=200, Evalue=3e-52,
Organism=Drosophila melanogaster, GI17137556, Length=324, Percent_Identity=46.2962962962963, Blast_Score=295, Evalue=2e-80,

Paralogues:

None

Copy number: 391 Molecules/Cell In: Growth Phase, Glucose-minimal MOPS Media. [C]

Swissprot (AC and ID): TYSY_BACAH (A0RDL9)

Other databases:

- EMBL:   CP000485
- RefSeq:   YP_894819.1
- ProteinModelPortal:   A0RDL9
- SMR:   A0RDL9
- STRING:   A0RDL9
- EnsemblBacteria:   EBBACT00000067076
- GeneID:   4546269
- GenomeReviews:   CP000485_GR
- KEGG:   btl:BALH_2000
- eggNOG:   COG0207
- GeneTree:   EBGT00050000001560
- HOGENOM:   HBG588098
- OMA:   RMALAPC
- ProtClustDB:   PRK01827
- BioCyc:   BTHU412694:BALH_2000-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00008
- InterPro:   IPR000398
- InterPro:   IPR020940
- Gene3D:   G3DSA:3.30.572.10
- PANTHER:   PTHR11549:SF2
- PRINTS:   PR00108
- TIGRFAMs:   TIGR03284

Pfam domain/function: PF00303 Thymidylat_synt; SSF55831 Thymidylat_synth_C

EC number: =2.1.1.45

Molecular weight: Translated: 36893; Mature: 36893

Theoretical pI: Translated: 5.13; Mature: 5.13

Prosite motif: PS00091 THYMIDYLATE_SYNTHASE

Important sites: ACT_SITE 200-200

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.8 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
2.8 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKHAENEYLNLCRHVMEHGTKKEDRTGTGTVSVFGYQMRFDLSKGFPLLTTKRVPFRLVA
CCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEEEECCCCCCEEEECCCCHHHHH
SELLWFMKGDTNIRYLLQHNNNIWNEWAFKSWVESDEYTGPDMIDFGLRSQQDEEFKVQY
HHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCEEEH
DEQMELFKKNVLEDDEFSNKYGYLGDVYGKQWRAWKTTAGETLDQLKDVIEMIKKTPDSR
HHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCHHHHHCHHHHHHHHHHHHHHHHHCCCCCC
RLIVSAWNPEDVPSMALPPCHTLFQFYVADGKLSCQLYQRSGDIFLGIPFNIASYSLLTH
EEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCEEEECCHHHHHHHHHHH
LIAHECGLEVGEFVHTIGDAHIYTNHFEQVEKQLAREPRPFPKLTLNPDVKSVFDFEMED
HHHHHHCCHHHHHHHHHCCCCEEHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHCCCHHC
LTIEGYDPHPAIKAPVAV
EEEECCCCCCCCCCCCCC
>Mature Secondary Structure
MKHAENEYLNLCRHVMEHGTKKEDRTGTGTVSVFGYQMRFDLSKGFPLLTTKRVPFRLVA
CCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEEEECCCCCCEEEECCCCHHHHH
SELLWFMKGDTNIRYLLQHNNNIWNEWAFKSWVESDEYTGPDMIDFGLRSQQDEEFKVQY
HHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCEEEH
DEQMELFKKNVLEDDEFSNKYGYLGDVYGKQWRAWKTTAGETLDQLKDVIEMIKKTPDSR
HHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCHHHHHCHHHHHHHHHHHHHHHHHCCCCCC
RLIVSAWNPEDVPSMALPPCHTLFQFYVADGKLSCQLYQRSGDIFLGIPFNIASYSLLTH
EEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCEEEECCHHHHHHHHHHH
LIAHECGLEVGEFVHTIGDAHIYTNHFEQVEKQLAREPRPFPKLTLNPDVKSVFDFEMED
HHHHHHCCHHHHHHHHHCCCCEEHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHCCCHHC
LTIEGYDPHPAIKAPVAV
EEEECCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA