Definition Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 chromosome, complete genome.
Accession NC_011094
Length 4,709,075

The map label for this gene is agp

Identifier: 194736517

GI number: 194736517

Start: 1162374

End: 1163615

Strand: Direct

Name: agp

Synonym: SeSA_A1181

Alternate gene names: 194736517

Gene position: 1162374-1163615 (Clockwise)

Preceding gene: 194738286

Following gene: 194734770

Centisome position: 24.68
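
The coordinate fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, not the database's own code: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the chromosome length (an assumption that happens to reproduce the listed value).

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)


length = gene_length(1162374, 1163615)
print(length)                        # 1242
print(length // 3 - 1)               # 413 codons once the stop codon is excluded
print(centisome(1162374, 4709075))   # 24.68
```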

GC content: 55.23
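
The GC content figure can be recomputed from any nucleotide sequence as the percentage of G and C bases. The sketch below is an assumption about how such a value is typically derived; `gc_content` is a hypothetical helper, and the rounding to two decimals is chosen only to match the precision shown in this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace/newlines so multi-line FASTA bodies can be passed directly.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


print(gc_content("ATGC"))  # 50.0
```

Passing the full 1,242-base gene sequence from this record (without the `>1242_bases` header line) should give a value that can be compared against the GC content listed above.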

Gene sequence:

>1242_bases
ATGAAAAAATCATTACTCGCTGTTGCTGTGGCAGGGGCTGTTTTGTTGTCATCCGCCGTACAGGCGCAGACAACGCCGGA
GGGTTATCAATTACAACAGGTGCTGATGATGAGCCGCCATAATCTGCGGGCACCGCTGGCGAATAATGGCAGCGTACTGG
CGCAGTCGACGCCGAATGCCTGGCCGGCGTGGGACGTTCCCGGCGGGCAACTGACGACGAAAGGCGGCGTGCTGGAAGTC
TATATGGGACACTACACACGTGAATGGCTGGTCGCGCAGGGGCTGATACCGTCGGGAGAATGTCCGGCACCCGACACGGT
ATATGCCTATGCGAATAGTTTGCAGCGCACCGTCGCCACCGCGCAATTTTTCATTACCGGCGCTTTCCCCGGCTGTGATA
TTCCTGTTCATCATCAGGAAAAAATGGGCACTATGGATCCTACCTTCAATCCGGTGATTACCGATGATTCCGCCGCGTTC
CGGCAACAGGCCGTACAGGCGATGGAAAAGGCGCGTAGTCAGCTACATCTTGATGAGAGTTATAAACTGCTTGAGCAGAT
AACGCATTATCAGGACTCGCCGTCCTGCAAAGAGAAGCATCAGTGTTCGCTAATCGACGCGAAAGATACCTTCAGCGCGA
ACTATCAGCAAGAGCCTGGCGTGCAGGGGCCGCTGAAAGTAGGGAACTCGCTGGTGGATGCGTTTACCCTGCAATATTAC
GAAGGGTTTCCGATGGATCAGGTCGCCTGGGGCGGGATCCACACCGATCGGCAGTGGAAAGTGCTGTCAAAACTGAAAAA
CGGCTATCAGGACAGCCTGTTTACCTCACCCACGGTGGCGCGCAATGTCGCTGCGCCGCTGGTGAAATATATCGATAAGG
TGCTGGTTGCCGATCGCGTTAGCGCGCCGAAGGTGACCGTGCTGGTGGGGCATGACTCCAATATCGCGTCGCTGCTGACG
GCGCTGGATTTTAAACCCTATCAGCTCCATGACCAGTATGAGAGAACGCCGATTGGCGGCCAGCTTGTCTTCCAACGCTG
GCATGACGGCAACGCTAACCGGGATTTGATGAAAATCGAGTATGTCTACCAGAGCGCCCGGCAGTTACGCAATGCGGAAG
CGTTAACGCTCAAATCGCCCGCGCAAAGGGTAACCCTGGAACTGAAAGGATGTCCGGTGGATGCGAACGGCTTCTGTCCG
CTGGATAAGTTCGATAACGTCATGAACACTGCCGCAAAATAG

Upstream 100 bases:

>100_bases
TTACTGTAAGAAAAACCCCCGTTTTGCGAAATCGTTCCCGGAAAAATGATCCATTTCTGTCACACTCAGAACGATTTGAT
AACAATAAGAGGTCATAGGG

Downstream 100 bases:

>100_bases
CCGTATGCCCCCGCGCAGGCGGGGGCGTTTGTGTTATACGTTCTTACGTTCGATGATTTGTTCGCCCCAGAAGAGCGAGT
CTTTGTCCGTTTTCTCAAAG

Product: glucose-1-phosphatase/inositol phosphatase

Products: NA

Alternate protein names: G1Pase

Number of amino acids: Translated: 413; Mature: 413

Protein sequence:

>413_residues
MKKSLLAVAVAGAVLLSSAVQAQTTPEGYQLQQVLMMSRHNLRAPLANNGSVLAQSTPNAWPAWDVPGGQLTTKGGVLEV
YMGHYTREWLVAQGLIPSGECPAPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAF
RQQAVQAMEKARSQLHLDESYKLLEQITHYQDSPSCKEKHQCSLIDAKDTFSANYQQEPGVQGPLKVGNSLVDAFTLQYY
EGFPMDQVAWGGIHTDRQWKVLSKLKNGYQDSLFTSPTVARNVAAPLVKYIDKVLVADRVSAPKVTVLVGHDSNIASLLT
ALDFKPYQLHDQYERTPIGGQLVFQRWHDGNANRDLMKIEYVYQSARQLRNAEALTLKSPAQRVTLELKGCPVDANGFCP
LDKFDNVMNTAAK
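
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step, assuming the standard genetic code (whose sense codons are the same as in the bacterial table) and translation that stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence in this record.
print(translate("ATGAAAAAATCATTACTCGCTGTTGCTGTG"))  # MKKSLLAVAV
print(translate("GCCGCAAAATAG"))                    # AAK
```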

Sequences:

>Translated_413_residues
MKKSLLAVAVAGAVLLSSAVQAQTTPEGYQLQQVLMMSRHNLRAPLANNGSVLAQSTPNAWPAWDVPGGQLTTKGGVLEV
YMGHYTREWLVAQGLIPSGECPAPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAF
RQQAVQAMEKARSQLHLDESYKLLEQITHYQDSPSCKEKHQCSLIDAKDTFSANYQQEPGVQGPLKVGNSLVDAFTLQYY
EGFPMDQVAWGGIHTDRQWKVLSKLKNGYQDSLFTSPTVARNVAAPLVKYIDKVLVADRVSAPKVTVLVGHDSNIASLLT
ALDFKPYQLHDQYERTPIGGQLVFQRWHDGNANRDLMKIEYVYQSARQLRNAEALTLKSPAQRVTLELKGCPVDANGFCP
LDKFDNVMNTAAK
>Mature_413_residues
MKKSLLAVAVAGAVLLSSAVQAQTTPEGYQLQQVLMMSRHNLRAPLANNGSVLAQSTPNAWPAWDVPGGQLTTKGGVLEV
YMGHYTREWLVAQGLIPSGECPAPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAF
RQQAVQAMEKARSQLHLDESYKLLEQITHYQDSPSCKEKHQCSLIDAKDTFSANYQQEPGVQGPLKVGNSLVDAFTLQYY
EGFPMDQVAWGGIHTDRQWKVLSKLKNGYQDSLFTSPTVARNVAAPLVKYIDKVLVADRVSAPKVTVLVGHDSNIASLLT
ALDFKPYQLHDQYERTPIGGQLVFQRWHDGNANRDLMKIEYVYQSARQLRNAEALTLKSPAQRVTLELKGCPVDANGFCP
LDKFDNVMNTAAK

Specific function: Absolutely required for the growth of E. coli in a high-phosphate medium containing G-1-P as the sole carbon source. [C]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Periplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the histidine acid phosphatase family

Homologues:

Organism=Escherichia coli, GI1787237, Length=413, Percent_Identity=83.2929782082325, Blast_Score=738, Evalue=0.0,
Organism=Escherichia coli, GI1787215, Length=409, Percent_Identity=33.2518337408313, Blast_Score=204, Evalue=9e-54,

Paralogues:

None

Copy number: 60 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). 380 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). [C]

Swissprot (AC and ID): AGP_SALTY (O33921)

Other databases:

- EMBL:   AE006468
- EMBL:   U75949
- RefSeq:   NP_460090.1
- ProteinModelPortal:   O33921
- SMR:   O33921
- PRIDE:   O33921
- GeneID:   1252635
- GenomeReviews:   AE006468_GR
- KEGG:   stm:STM1117
- HOGENOM:   HBG751992
- OMA:   LLVGHDS
- ProtClustDB:   PRK10173
- BioCyc:   STYP99287:STM1117-MONOMER
- BRENDA:   3.1.3.10
- InterPro:   IPR000560

Pfam domain/function: PF00328 Acid_phosphat_A

EC number: 3.1.3.10

Molecular weight: Translated: 45559; Mature: 45559
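
A theoretical molecular weight of this kind is usually obtained by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that approach and uses commonly tabulated average residue masses; the record does not state which mass table or rounding was used, so small differences from the listed value are possible. `molecular_weight` is a hypothetical helper.

```python
# Average residue masses in daltons (amino acid minus one water); an assumed table.
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified linear peptide, in daltons."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    # Raises KeyError for any letter outside the 20 standard amino acids.
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER


print(round(molecular_weight("G"), 2))  # 75.07, the mass of free glycine
```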

Theoretical pI: Translated: 7.09; Mature: 7.09

Prosite motif: PS00616 HIS_ACID_PHOSPHAT_1; PS00778 HIS_ACID_PHOSPHAT_2

Important sites: ACT_SITE 40-40; ACT_SITE 312-312

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
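
The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows; `cys_met_content` is a hypothetical helper, and rounding to one decimal is assumed from the precision shown above.

```python
def cys_met_content(protein: str) -> dict:
    """Percentages of Cys, Met and Cys+Met residues in a protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    cys = seq.count("C")
    met = seq.count("M")

    def pct(count: int) -> float:
        return round(100 * count / len(seq), 1)

    return {"Cys": pct(cys), "Met": pct(met), "Cys+Met": pct(cys + met)}


print(cys_met_content("MCAA"))  # {'Cys': 25.0, 'Met': 25.0, 'Cys+Met': 50.0}
```

For a 413-residue protein, the listed values are consistent with 6 Cys and 10 Met residues: 6/413 rounds to 1.5 %, 10/413 to 2.4 %, and 16/413 to 3.9 %.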

Secondary structure:

>Translated Secondary Structure
MKKSLLAVAVAGAVLLSSAVQAQTTPEGYQLQQVLMMSRHNLRAPLANNGSVLAQSTPNA
CCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCC
WPAWDVPGGQLTTKGGVLEVYMGHYTREWLVAQGLIPSGECPAPDTVYAYANSLQRTVAT
CCCCCCCCCCEECCCCEEEEEHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH
AQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAFRQQAVQAMEKARSQLHLDES
HEEEEEECCCCCCCCCCCHHHCCCCCCCCCCEECCCHHHHHHHHHHHHHHHHHHCCCCHH
YKLLEQITHYQDSPSCKEKHQCSLIDAKDTFSANYQQEPGVQGPLKVGNSLVDAFTLQYY
HHHHHHHHHCCCCCCCCHHCCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH
EGFPMDQVAWGGIHTDRQWKVLSKLKNGYQDSLFTSPTVARNVAAPLVKYIDKVLVADRV
CCCCCCCEECCCCCCCHHHHHHHHHHCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC
SAPKVTVLVGHDSNIASLLTALDFKPYQLHDQYERTPIGGQLVFQRWHDGNANRDLMKIE
CCCEEEEEEECCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCCCHHHHH
YVYQSARQLRNAEALTLKSPAQRVTLELKGCPVDANGFCPLDKFDNVMNTAAK
HHHHHHHHHCCCCEEEECCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCC
>Mature Secondary Structure
MKKSLLAVAVAGAVLLSSAVQAQTTPEGYQLQQVLMMSRHNLRAPLANNGSVLAQSTPNA
CCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCC
WPAWDVPGGQLTTKGGVLEVYMGHYTREWLVAQGLIPSGECPAPDTVYAYANSLQRTVAT
CCCCCCCCCCEECCCCEEEEEHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH
AQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAFRQQAVQAMEKARSQLHLDES
HEEEEEECCCCCCCCCCCHHHCCCCCCCCCCEECCCHHHHHHHHHHHHHHHHHHCCCCHH
YKLLEQITHYQDSPSCKEKHQCSLIDAKDTFSANYQQEPGVQGPLKVGNSLVDAFTLQYY
HHHHHHHHHCCCCCCCCHHCCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH
EGFPMDQVAWGGIHTDRQWKVLSKLKNGYQDSLFTSPTVARNVAAPLVKYIDKVLVADRV
CCCCCCCEECCCCCCCHHHHHHHHHHCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC
SAPKVTVLVGHDSNIASLLTALDFKPYQLHDQYERTPIGGQLVFQRWHDGNANRDLMKIE
CCCEEEEEEECCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCCCHHHHH
YVYQSARQLRNAEALTLKSPAQRVTLELKGCPVDANGFCPLDKFDNVMNTAAK
HHHHHHHHHCCCCEEEECCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11677609; 9260936