Definition Yersinia pestis CO92 chromosome, complete genome.
Accession NC_003143
Length 4,653,728

The map label for this gene is speA

Identifier: 218928102

GI number: 218928102

Start: 1020337

End: 1022316

Strand: Reverse

Name: speA

Synonym: YPO0929

Alternate gene names: 218928102

Gene position: 1022316-1020337 (Counterclockwise)

Preceding gene: 218928112

Following gene: 218928101

Centisome position: 21.97
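
The coordinate fields above are mutually consistent, and the check can be sketched in a few lines. The record does not define "centisome position"; the sketch assumes it is the gene's first base (the higher coordinate, since the gene is on the reverse strand) expressed as a percentage of the genome length. The function names are illustrative only.

```python
# Coordinates copied from the record above.
GENOME_LENGTH = 4_653_728  # Length
GENE_START = 1_020_337     # Start
GENE_END = 1_022_316       # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


if __name__ == "__main__":
    # Inclusive range, so the +1 matters: this matches the ">1980_bases" header.
    print(gene_length(GENE_START, GENE_END))
    # Reverse strand: assume the gene begins at the higher coordinate.
    print(round(centisome(GENE_END, GENOME_LENGTH), 2))
```

Under this assumption the result rounds to the reported 21.97; using the lower coordinate instead would give about 21.93.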

GC content: 50.1
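
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; it assumes the reported value refers to the gene sequence listed in this record rather than the whole chromosome, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in the wrapped FASTA block) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Toy input; paste the 1980-base gene sequence here to recheck the record.
    print(gc_content("ATGTCTGATG\nATAACTTGAT"))
```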

Gene sequence:

>1980_bases
ATGTCTGATGATAACTTGATTAGCCGTCCGTTAACTGCGGGCGCACATGTTTCTTTACGCTCCATGCAGGAGGTAGCCAT
GAATGATCGAAATGCCAGCAAGATGCTGAGCACGTATAACGTCGCCTACTGGGGTGGCAACTATTATGACGTCAATGAAT
TAGGTCATATCAGTGTTTGCCCTGATCCGGATATTCGTGAGGCGCGTGTCGATCTGGCGCAACTGGTTAAAAAGATGCAG
CTCGAACAAGGGCAACGTTTGCCTGCGCTGTTCTGCTTCCCACAAATTTTACAGCATCGTCTGCGTTCAATTAACGCGGC
GTTTAAACGTGCGCGTGAGTCTTTCGGCTATGAAGGCGGCTACTTCCTGGTTTACCCCATTAAAGTAAACCAACACCGTC
GAGTGATTGAGTCTTTGGTCAATTCCGGCGAACCGTTAGGGCTGGAAGCGGGTTCTAAAGCTGAGATGATGGCGGTGCTG
GCCCATGCTGGCATGACTCGTTCGGTTATCGTGTGTAACGGTTATAAAGACCGTGAATACATTCGTTTGGCTTTGATCGG
TGAAAAACTGGGCCATAAGGTGTATCTGGTCATCGAAAAGATGTCAGAAATCAAAATGGTCCTGGAAGAAGCCGAGCGGC
TGAATGTGGTTCCACGTTTGGGGGTTCGCGCCCGCCTGGCATCACAAGGTTCCGGTAAATGGCAGGCCAGTGGGGGTGAG
AAGTCCAAATTTGGTTTGTCTGCAACGCAGGTATTACAACTGGTTGATATGCTGCGTGAGGCGAATAGCCTGGAAAGTTT
GCAATTACTGCATTTCCACTTAGGTTCTCAACTCTCCAACATCCGTGATATCTCCACCGGCGTGCGTGAATCTGCTCGTT
TCTATGTGGAGTTACACAAGCTGGGCGTTAATATCCAGTGCTTCGATGTGGGCGGGGGCTTGGGCGTCGATTATGAAGGG
ACGCGCTCTCAATCTGATTGCTCCGTTAACTATGGCTTGAATGAATATGCCAATAACGTTATCTGGGGTATCGGTGATGC
CTGTAACGAACATGGTTTGCCGCATCCTACCGTTATTACAGAATCTGGTCGTGCGGTTACCGCTCACCATACGGTATTGG
TGTCCAATGTTATCGGTGTAGAGCGCAATGAGTTCTGTGAGCCGCAGCCACCGGAAGCGGGCGCTCCACGTGCTTTGGAA
AGTTTATGGGATACCTGGCAGGAGATGCAGGAACCGGAAAACCGCCGTTCACTGCGCGAATGGTTGCACGATAGCCAAAT
GGATTTACACGACGTCCATACCCAATATGCTCACGGTATGCTGGATCTGACTCATCGCGCTTGGGCAGAGCAACTTTATC
TAAGCATTTGTAATGAGATCCAAAAACAGTTGGATCCAAGCAACCGCGCCCACCGGCCCATCATTGATGAGCTGCAAGAG
CGGATGGCTGACAAGCTGTATGTGAATTTCTCGCTGTTCCAGTCAATGCCGGATGCATGGGGTATCGATCAACTGTTCCC
GGTCTTGCCATTGGAAGGGTTAGATAAGCCGCCAGAGCGCCGTGCGGTATTGCTGGATATTACCTGTGATTCTGATGGCA
CCATCGATCACTATATTGATGGTGATGGGGTGGCGACCACCATGCCAATGCCACCGTATGATCCGGAAAACCCGCCATTG
CTAGGCTTCTTTATGGTCGGTGCTTACCAGGAAATCTTGGGTAATATGCACAACCTGTTTGGTGACACTGCGGCGGTAGA
TGTTTATGTCTTCCCTGATGGCACGGTAGAGGTTGAGCAGACCGATGAAGGCGATACAGTGGCGGATATGTTGGAATACG
TCCAATTGAACCCTGAAAAATTGCTGGAACATTTCCGCGGTCAGGTAAAAGAAACCGATCTGGATACCGAATTGCAGGCG
CAGTTCCTGGAAGAGTTTGAGGCTGGCTTGTACGGCTACACATATCTTGAAGACGAATAA

Upstream 100 bases:

>100_bases
GCTTTCAGCGTAATATCTGGGCGTTCACACGGATTGTCCTTGTAGGTTGTAGCTGCGATAGTGGCTGACGTTATCGTCAG
TATCGCAATAGAGGCGAATT

Downstream 100 bases:

>100_bases
TCACTAACGATTTATTCAGTATCGATTAAAGGGCAAACTCAGGTTTGTCCTTTTTTTCGGCGCGACCCTCTCCTAAGGCG
GTGGTGATTGCCCGAATATC

Product: arginine decarboxylase

Products: NA

Alternate protein names: ADC

Number of amino acids: Translated: 659; Mature: 658
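
The two residue counts follow from the gene length. The sketch below assumes the 1980-base sequence includes a single stop codon (it ends in TAA) and that the mature form differs only by removal of the initiator methionine, which is what the Mature_658_residues sequence in this record suggests. The helper names are illustrative.

```python
GENE_LENGTH_BASES = 1980  # from the ">1980_bases" header in this record


def translated_length(n_bases: int) -> int:
    """Residues encoded by a coding sequence that ends with one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1  # the stop codon encodes no residue


def mature_length(n_translated: int) -> int:
    """Residues left after removing the initiator Met (assumed processing)."""
    return n_translated - 1


if __name__ == "__main__":
    translated = translated_length(GENE_LENGTH_BASES)
    print(translated, mature_length(translated))
```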

Protein sequence:

>659_residues
MSDDNLISRPLTAGAHVSLRSMQEVAMNDRNASKMLSTYNVAYWGGNYYDVNELGHISVCPDPDIREARVDLAQLVKKMQ
LEQGQRLPALFCFPQILQHRLRSINAAFKRARESFGYEGGYFLVYPIKVNQHRRVIESLVNSGEPLGLEAGSKAEMMAVL
AHAGMTRSVIVCNGYKDREYIRLALIGEKLGHKVYLVIEKMSEIKMVLEEAERLNVVPRLGVRARLASQGSGKWQASGGE
KSKFGLSATQVLQLVDMLREANSLESLQLLHFHLGSQLSNIRDISTGVRESARFYVELHKLGVNIQCFDVGGGLGVDYEG
TRSQSDCSVNYGLNEYANNVIWGIGDACNEHGLPHPTVITESGRAVTAHHTVLVSNVIGVERNEFCEPQPPEAGAPRALE
SLWDTWQEMQEPENRRSLREWLHDSQMDLHDVHTQYAHGMLDLTHRAWAEQLYLSICNEIQKQLDPSNRAHRPIIDELQE
RMADKLYVNFSLFQSMPDAWGIDQLFPVLPLEGLDKPPERRAVLLDITCDSDGTIDHYIDGDGVATTMPMPPYDPENPPL
LGFFMVGAYQEILGNMHNLFGDTAAVDVYVFPDGTVEVEQTDEGDTVADMLEYVQLNPEKLLEHFRGQVKETDLDTELQA
QFLEEFEAGLYGYTYLEDE

Sequences:

>Translated_659_residues
MSDDNLISRPLTAGAHVSLRSMQEVAMNDRNASKMLSTYNVAYWGGNYYDVNELGHISVCPDPDIREARVDLAQLVKKMQ
LEQGQRLPALFCFPQILQHRLRSINAAFKRARESFGYEGGYFLVYPIKVNQHRRVIESLVNSGEPLGLEAGSKAEMMAVL
AHAGMTRSVIVCNGYKDREYIRLALIGEKLGHKVYLVIEKMSEIKMVLEEAERLNVVPRLGVRARLASQGSGKWQASGGE
KSKFGLSATQVLQLVDMLREANSLESLQLLHFHLGSQLSNIRDISTGVRESARFYVELHKLGVNIQCFDVGGGLGVDYEG
TRSQSDCSVNYGLNEYANNVIWGIGDACNEHGLPHPTVITESGRAVTAHHTVLVSNVIGVERNEFCEPQPPEAGAPRALE
SLWDTWQEMQEPENRRSLREWLHDSQMDLHDVHTQYAHGMLDLTHRAWAEQLYLSICNEIQKQLDPSNRAHRPIIDELQE
RMADKLYVNFSLFQSMPDAWGIDQLFPVLPLEGLDKPPERRAVLLDITCDSDGTIDHYIDGDGVATTMPMPPYDPENPPL
LGFFMVGAYQEILGNMHNLFGDTAAVDVYVFPDGTVEVEQTDEGDTVADMLEYVQLNPEKLLEHFRGQVKETDLDTELQA
QFLEEFEAGLYGYTYLEDE
>Mature_658_residues
SDDNLISRPLTAGAHVSLRSMQEVAMNDRNASKMLSTYNVAYWGGNYYDVNELGHISVCPDPDIREARVDLAQLVKKMQL
EQGQRLPALFCFPQILQHRLRSINAAFKRARESFGYEGGYFLVYPIKVNQHRRVIESLVNSGEPLGLEAGSKAEMMAVLA
HAGMTRSVIVCNGYKDREYIRLALIGEKLGHKVYLVIEKMSEIKMVLEEAERLNVVPRLGVRARLASQGSGKWQASGGEK
SKFGLSATQVLQLVDMLREANSLESLQLLHFHLGSQLSNIRDISTGVRESARFYVELHKLGVNIQCFDVGGGLGVDYEGT
RSQSDCSVNYGLNEYANNVIWGIGDACNEHGLPHPTVITESGRAVTAHHTVLVSNVIGVERNEFCEPQPPEAGAPRALES
LWDTWQEMQEPENRRSLREWLHDSQMDLHDVHTQYAHGMLDLTHRAWAEQLYLSICNEIQKQLDPSNRAHRPIIDELQER
MADKLYVNFSLFQSMPDAWGIDQLFPVLPLEGLDKPPERRAVLLDITCDSDGTIDHYIDGDGVATTMPMPPYDPENPPLL
GFFMVGAYQEILGNMHNLFGDTAAVDVYVFPDGTVEVEQTDEGDTVADMLEYVQLNPEKLLEHFRGQVKETDLDTELQAQ
FLEEFEAGLYGYTYLEDE

Specific function: Catalyzes the biosynthesis of agmatine from arginine

COG id: COG1166

COG function: function code E; Arginine decarboxylase (spermidine biosynthesis)

Gene ontology:

Cell location: Periplasmic Protein [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the Orn/Lys/Arg decarboxylase class-II family. SpeA subfamily

Homologues:

Organism=Escherichia coli, GI1789307, Length=659, Percent_Identity=84.8254931714719, Blast_Score=1178, Evalue=0.0,
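
The homologue entry is a comma-separated list of `key=value` tokens, except for the GI, which is written without a separator. A hedged sketch of a parser for this layout follows. `parse_homologue` is a hypothetical helper, and reading `Percent_Identity` as identities over the full `Length` is an assumption; the record does not state the alignment length.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict of string fields."""
    fields = {}
    for token in line.split(","):
        token = token.strip()
        if not token:
            continue  # tolerate the trailing comma
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key.strip()] = value.strip()
        elif token.startswith("GI"):
            fields["GI"] = token[2:]  # "GI1789307" -> "1789307"
    return fields


LINE = ("Organism=Escherichia coli, GI1789307, Length=659, "
        "Percent_Identity=84.8254931714719, Blast_Score=1178, Evalue=0.0,")

hit = parse_homologue(LINE)
# Assumption: the percentage was computed over the full 659-residue length.
identical = round(float(hit["Percent_Identity"]) * int(hit["Length"]) / 100)

if __name__ == "__main__":
    print(hit["Organism"], hit["GI"], identical)
```

Under that assumption the long decimal corresponds to 559 identical positions out of 659.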

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): SPEA_YERPE (Q8ZHG8)

Other databases:

- EMBL:   AL590842
- EMBL:   AE009952
- EMBL:   AE017042
- PIR:   AB0114
- RefSeq:   NP_670612.1
- RefSeq:   NP_994790.1
- RefSeq:   YP_002345977.1
- ProteinModelPortal:   Q8ZHG8
- IntAct:   Q8ZHG8
- GeneID:   1148260
- GeneID:   1173766
- GeneID:   2764242
- GenomeReviews:   AE009952_GR
- GenomeReviews:   AE017042_GR
- GenomeReviews:   AL590842_GR
- KEGG:   ype:YPO0929
- KEGG:   ypk:y3313
- KEGG:   ypm:YP_3513
- HOGENOM:   HBG321436
- OMA:   TCDSDGE
- ProtClustDB:   PRK05354
- BioCyc:   YPES187410:Y3313-MONOMER
- BRENDA:   4.1.1.19
- HAMAP:   MF_01417
- InterPro:   IPR009006
- InterPro:   IPR002985
- InterPro:   IPR022643
- InterPro:   IPR022657
- InterPro:   IPR022644
- InterPro:   IPR022653
- InterPro:   IPR000183
- Gene3D:   G3DSA:2.40.37.10
- PANTHER:   PTHR11482:SF3
- PIRSF:   PIRSF001336
- PRINTS:   PR01180
- PRINTS:   PR01179
- TIGRFAMs:   TIGR01273

Pfam domain/function: PF02784 Orn_Arg_deC_N; PF00278 Orn_DAP_Arg_deC; SSF50621 Racem_decarbox_C

EC number: 4.1.1.19

Molecular weight: Translated: 74105; Mature: 73973

Theoretical pI: Translated: 4.66; Mature: 4.66

Prosite motif: PS00878 ODR_DC_2_1; PS00879 ODR_DC_2_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
4.6 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
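
The percentages above are residue counts divided by sequence length. The following sketch shows that calculation under the assumption that values are rounded to one decimal place; `residue_percent` is an illustrative name and the input is a toy sequence, not the speA protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


if __name__ == "__main__":
    translated = "MCAAAAAAAA"   # toy sequence
    mature = translated[1:]     # initiator Met removed (assumed processing)
    for label, seq in (("Translated", translated), ("Mature", mature)):
        print(label,
              residue_percent(seq, "C"),
              residue_percent(seq, "M"),
              residue_percent(seq, "CM"))
```

Removing the initiator Met lowers the Met count by one while the Cys count stays the same, which is why only the Met and Cys+Met figures differ between the translated and mature forms in this record.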

Secondary structure:

>Translated Secondary Structure
MSDDNLISRPLTAGAHVSLRSMQEVAMNDRNASKMLSTYNVAYWGGNYYDVNELGHISVC
CCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHCEEEEECCCEEECCCCCEEEEC
PDPDIREARVDLAQLVKKMQLEQGQRLPALFCFPQILQHRLRSINAAFKRARESFGYEGG
CCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC
YFLVYPIKVNQHRRVIESLVNSGEPLGLEAGSKAEMMAVLAHAGMTRSVIVCNGYKDREY
EEEEEEEEECHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCHH
IRLALIGEKLGHKVYLVIEKMSEIKMVLEEAERLNVVPRLGVRARLASQGSGKWQASGGE
EEEEEEHHHHCCEEEEEEHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCCEECCCCC
KSKFGLSATQVLQLVDMLREANSLESLQLLHFHLGSQLSNIRDISTGVRESARFYVELHK
CCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEHHH
LGVNIQCFDVGGGLGVDYEGTRSQSDCSVNYGLNEYANNVIWGIGDACNEHGLPHPTVIT
HCCEEEEEECCCCCCCCCCCCCCCCCCEEECCHHHHHCCEEECCCCHHHCCCCCCCEEEE
ESGRAVTAHHTVLVSNVIGVERNEFCEPQPPEAGAPRALESLWDTWQEMQEPENRRSLRE
CCCCEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHH
WLHDSQMDLHDVHTQYAHGMLDLTHRAWAEQLYLSICNEIQKQLDPSNRAHRPIIDELQE
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH
RMADKLYVNFSLFQSMPDAWGIDQLFPVLPLEGLDKPPERRAVLLDITCDSDGTIDHYID
HHHHHHEEEHHHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEC
GDGVATTMPMPPYDPENPPLLGFFMVGAYQEILGNMHNLFGDTAAVDVYVFPDGTVEVEQ
CCCCEEECCCCCCCCCCCCEEHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEC
TDEGDTVADMLEYVQLNPEKLLEHFRGQVKETDLDTELQAQFLEEFEAGLYGYTYLEDE
CCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEECCC
>Mature Secondary Structure 
SDDNLISRPLTAGAHVSLRSMQEVAMNDRNASKMLSTYNVAYWGGNYYDVNELGHISVC
CCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHCEEEEECCCEEECCCCCEEEEC
PDPDIREARVDLAQLVKKMQLEQGQRLPALFCFPQILQHRLRSINAAFKRARESFGYEGG
CCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC
YFLVYPIKVNQHRRVIESLVNSGEPLGLEAGSKAEMMAVLAHAGMTRSVIVCNGYKDREY
EEEEEEEEECHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCHH
IRLALIGEKLGHKVYLVIEKMSEIKMVLEEAERLNVVPRLGVRARLASQGSGKWQASGGE
EEEEEEHHHHCCEEEEEEHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCCEECCCCC
KSKFGLSATQVLQLVDMLREANSLESLQLLHFHLGSQLSNIRDISTGVRESARFYVELHK
CCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEHHH
LGVNIQCFDVGGGLGVDYEGTRSQSDCSVNYGLNEYANNVIWGIGDACNEHGLPHPTVIT
HCCEEEEEECCCCCCCCCCCCCCCCCCEEECCHHHHHCCEEECCCCHHHCCCCCCCEEEE
ESGRAVTAHHTVLVSNVIGVERNEFCEPQPPEAGAPRALESLWDTWQEMQEPENRRSLRE
CCCCEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHH
WLHDSQMDLHDVHTQYAHGMLDLTHRAWAEQLYLSICNEIQKQLDPSNRAHRPIIDELQE
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH
RMADKLYVNFSLFQSMPDAWGIDQLFPVLPLEGLDKPPERRAVLLDITCDSDGTIDHYID
HHHHHHEEEHHHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEC
GDGVATTMPMPPYDPENPPLLGFFMVGAYQEILGNMHNLFGDTAAVDVYVFPDGTVEVEQ
CCCCEEECCCCCCCCCCCCEEHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEC
TDEGDTVADMLEYVQLNPEKLLEHFRGQVKETDLDTELQAQFLEEFEAGLYGYTYLEDE
CCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEECCC
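
The secondary structure blocks alternate one line of residues with one line of per-residue states. A minimal sketch for pairing the lines and summarising the states follows. It assumes the conventional reading of the letters (H = helix, E = strand, C = coil), which the record does not spell out; the function names are hypothetical and the input is a toy example.

```python
from collections import Counter


def parse_pairs(lines):
    """Join alternating residue/state lines into (sequence, states)."""
    stripped = (ln.strip() for ln in lines)
    rows = [s for s in stripped if s and not s.startswith(">")]
    if len(rows) % 2 != 0:
        raise ValueError("expected residue and state lines in pairs")
    sequence = "".join(rows[0::2])
    states = "".join(rows[1::2])
    if len(sequence) != len(states):
        raise ValueError("sequence and state strings differ in length")
    return sequence, states


def state_fractions(states: str) -> dict:
    """Fraction of positions in each state (H, E, C)."""
    counts = Counter(states)
    return {s: counts[s] / len(states) for s in "HEC"}


if __name__ == "__main__":
    toy = [">Toy Secondary Structure", "MSDD", "CCHH", "NLIS", "EECC"]
    seq, states = parse_pairs(toy)
    print(seq, states, state_fractions(states))
```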

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11586360; 12142430