Definition: Streptococcus pyogenes M1 GAS chromosome, complete genome.
Accession: NC_002737
Length: 1,852,441

The map label for this gene is hasC

Identifier: 15675935

GI number: 15675935

Start: 1835310

End: 1836224

Strand: Direct

Name: hasC

Synonym: SPy_2202

Alternate gene names: 15675935

Gene position: 1835310-1836224 (Clockwise)

Preceding gene: 15675934

Following gene: 15675936

Centisome position: 99.08
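
The coordinate fields above are related by simple arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either definition, but both reproduce the listed values. The function names are illustrative only.

```python
# Values taken from the record above.
GENOME_LENGTH = 1_852_441
START, END = 1_835_310, 1_836_224


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                      # 915
print(round(centisome(START, GENOME_LENGTH), 2))    # 99.08
```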

GC content: 42.51

Gene sequence:

>915_bases
ATGACCAAAGTCAGAAAAGCCATTATTCCTGCTGCAGGTCTAGGAACACGTTTTTTACCTGCTACCAAAGCTCTTGCCAA
AGAGATGTTGCCCATCGTTGATAAACCAACCATCCAGTTTATCGTCGAAGAAGCGCTAAAATCTGGCATCGAGGAAATCC
TTGTGGTGACCGGAAAAGCTAAACGCTCTATCGAGGACCATTTTGATTCAAACTTTGAATTAGAATACAACCTCCAAGCT
AAGGGGAAAAATGAACTGTTGAAATTAGTGGATGAAACCACTGCCATTAACCTTCATTTTATCCGTCAAAGCCACCCAAG
AGGGCTGGGAGATGCTGTCTTACAAGCCAAAGCCTTTGTGGGCAATGAACCCTTTGTGGTCATGCTTGGAGATGACTTAA
TGGACATTACAAATGCATCCGCTAAACCTCTCACCAAACAACTCATGGAGGACTATGACAAGACGCATGCATCCACTATC
GCTGTGATGAAAGTTCCTCATGAAGATGTGTCTAGCTATGGGGTTATCGCTCCTCAAGGCAAGGCTGTCAAGGGCCTTTA
CAGTGTAGACACCTTTGTTGAAAAACCACAACCAGAAGATGCGCCTAGTGATTTGGCTATTATTGGTCGTTACCTCCTAA
CCCCTGAAATTTTTGGTATTTTGGAAAGACAGACCCCTGGAGCAGGTAACGAAGTGCAACTCACAGATGCTATCGATACC
CTCAATAAAACTCAGCGTGTCTTTGCACGAGAATTTAAAGGCAATCGTTACGATGTTGGGGATAAATTTGGATTCATGAA
AACATCTATCGACTATGCCTTAGAACACCCACAGGTCAAAGAGGACTTGAAAAATTACATTATCAAACTAGGAAAAGCTT
TGGAAAAAAGTAAAGTACCAACACATTCAAAGTAA
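
A GC content figure such as the one listed above is usually the percentage of G and C bases in the sequence. The following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, not part of the source database, and it is shown on a short prefix of the gene rather than the full 915 bases.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence above: 4 of 10 are G or C.
print(gc_content("ATGACCAAAG"))  # 40.0
```

Passing the full gene sequence (with its line breaks) to the same function gives the value to compare against the GC content field.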

Upstream 100 bases:

>100_bases
AGTCGAAAAATCATATGAAAACGTAACATGATGTCAAATGAATTCCGTTCTGAAAACGAATGTCACTAAGCCATAGATTA
TAAAAAGTGAGGAGTTACTG

Downstream 100 bases:

>100_bases
TGTGTTGGTACTTTACTTTTTTCCTTGTTTATAGCAACTTAAGTTTATATCTTGAAAATTGTCAACGTTCCAAAATATAA
AAGTTTTGTGTTATACAACT

Product: UDP-glucose pyrophosphorylase

Products: NA

Alternate protein names: Alpha-D-glucosyl-1-phosphate uridylyltransferase 1; UDP-glucose pyrophosphorylase 1; UDPGP 1; Uridine diphosphoglucose pyrophosphorylase 1

Number of amino acids: Translated: 304; Mature: 303

Protein sequence:

>304_residues
MTKVRKAIIPAAGLGTRFLPATKALAKEMLPIVDKPTIQFIVEEALKSGIEEILVVTGKAKRSIEDHFDSNFELEYNLQA
KGKNELLKLVDETTAINLHFIRQSHPRGLGDAVLQAKAFVGNEPFVVMLGDDLMDITNASAKPLTKQLMEDYDKTHASTI
AVMKVPHEDVSSYGVIAPQGKAVKGLYSVDTFVEKPQPEDAPSDLAIIGRYLLTPEIFGILERQTPGAGNEVQLTDAIDT
LNKTQRVFAREFKGNRYDVGDKFGFMKTSIDYALEHPQVKEDLKNYIIKLGKALEKSKVPTHSK
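
The 915-base gene corresponds to 305 codons, the last of which is a stop codon, giving the 304 translated residues; the 303-residue mature form presumably lacks the initiator methionine, although the record does not say so. The sketch below translates short fragments of the gene with the standard codon assignments. The `translate` helper and the table construction are illustrative assumptions, not the database's own method.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACCAAAGTCAGAAAAGCC"))  # MTKVRKA (start of the protein)
print(translate("CATTCAAAGTAA"))           # HSK (end of the protein, then stop)
```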

Sequences:

>Translated_304_residues
MTKVRKAIIPAAGLGTRFLPATKALAKEMLPIVDKPTIQFIVEEALKSGIEEILVVTGKAKRSIEDHFDSNFELEYNLQA
KGKNELLKLVDETTAINLHFIRQSHPRGLGDAVLQAKAFVGNEPFVVMLGDDLMDITNASAKPLTKQLMEDYDKTHASTI
AVMKVPHEDVSSYGVIAPQGKAVKGLYSVDTFVEKPQPEDAPSDLAIIGRYLLTPEIFGILERQTPGAGNEVQLTDAIDT
LNKTQRVFAREFKGNRYDVGDKFGFMKTSIDYALEHPQVKEDLKNYIIKLGKALEKSKVPTHSK
>Mature_303_residues
TKVRKAIIPAAGLGTRFLPATKALAKEMLPIVDKPTIQFIVEEALKSGIEEILVVTGKAKRSIEDHFDSNFELEYNLQAK
GKNELLKLVDETTAINLHFIRQSHPRGLGDAVLQAKAFVGNEPFVVMLGDDLMDITNASAKPLTKQLMEDYDKTHASTIA
VMKVPHEDVSSYGVIAPQGKAVKGLYSVDTFVEKPQPEDAPSDLAIIGRYLLTPEIFGILERQTPGAGNEVQLTDAIDTL
NKTQRVFAREFKGNRYDVGDKFGFMKTSIDYALEHPQVKEDLKNYIIKLGKALEKSKVPTHSK

Specific function: May play a role in stationary phase survival. [C]

COG id: COG1210

COG function: function code M; UDP-glucose pyrophosphorylase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the UDPGP type 2 family

Homologues:

Organism=Escherichia coli, GI1787488, Length=291, Percent_Identity=43.298969072165, Blast_Score=233, Evalue=1e-62,
Organism=Escherichia coli, GI1788355, Length=283, Percent_Identity=42.4028268551237, Blast_Score=209, Evalue=2e-55,
Organism=Escherichia coli, GI1788351, Length=237, Percent_Identity=26.5822784810127, Blast_Score=75, Evalue=7e-15,
Organism=Escherichia coli, GI1790224, Length=252, Percent_Identity=25.7936507936508, Blast_Score=69, Evalue=3e-13,
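
The homologue lines above follow a regular comma-separated `key=value` layout, with the GI number as the one bare field. A minimal parsing sketch, assuming that layout holds for every line; `parse_homologue` is a hypothetical helper name.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number appears without a key, e.g. "GI1787488".
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787488, Length=291, "
    "Percent_Identity=43.298969072165, Blast_Score=233, Evalue=1e-62,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1787488 233
```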

Paralogues:

None

Copy number: 120 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 140 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 260 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). [C]

Swissprot (AC and ID): HASC1_STRP1 (P0C0I9)

Other databases:

- EMBL:   AE004092
- EMBL:   CP000017
- RefSeq:   NP_270109.1
- RefSeq:   YP_283216.1
- ProteinModelPortal:   P0C0I9
- SMR:   P0C0I9
- EnsemblBacteria:   EBSTRT00000000005
- EnsemblBacteria:   EBSTRT00000028411
- GeneID:   3571025
- GeneID:   901878
- GenomeReviews:   AE004092_GR
- GenomeReviews:   CP000017_GR
- KEGG:   spy:SPy_2202
- KEGG:   spz:M5005_Spy_1853
- GeneTree:   EBGT00050000026808
- HOGENOM:   HBG688195
- OMA:   HIGAGGE
- ProtClustDB:   CLSK877010
- BioCyc:   SPYO160490:SPY2202-MONOMER
- BioCyc:   SPYO293653:M5005_SPY1853-MONOMER
- InterPro:   IPR005771
- InterPro:   IPR005835
- TIGRFAMs:   TIGR01099

Pfam domain/function: PF00483 NTP_transferase

EC number: 2.7.7.9

Molecular weight: Translated: 33650; Mature: 33519

Theoretical pI: Translated: 6.91; Mature: 6.91

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
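
The percentages above can be reproduced by counting residues in the protein sequence given earlier in this record. The sketch assumes the mature form is the translated sequence without its first residue, which matches the 304 and 303 residue counts listed; `residue_percent` is an illustrative helper.

```python
# Translated protein sequence, copied from this record.
PROTEIN = (
    "MTKVRKAIIPAAGLGTRFLPATKALAKEMLPIVDKPTIQFIVEEALKSGIEEILVVTGKAKRSIEDHFDSNFELEYNLQA"
    "KGKNELLKLVDETTAINLHFIRQSHPRGLGDAVLQAKAFVGNEPFVVMLGDDLMDITNASAKPLTKQLMEDYDKTHASTI"
    "AVMKVPHEDVSSYGVIAPQGKAVKGLYSVDTFVEKPQPEDAPSDLAIIGRYLLTPEIFGILERQTPGAGNEVQLTDAIDT"
    "LNKTQRVFAREFKGNRYDVGDKFGFMKTSIDYALEHPQVKEDLKNYIIKLGKALEKSKVPTHSK"
)
MATURE = PROTEIN[1:]  # assumption: initiator Met removed


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residue letters."""
    hits = sum(1 for aa in sequence if aa in residues)
    return round(100.0 * hits / len(sequence), 1)


for label, seq in (("Translated", PROTEIN), ("Mature", MATURE)):
    print(label, residue_percent(seq, "C"), residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
# Translated 0.0 2.3 2.3
# Mature 0.0 2.0 2.0
```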

Secondary structure:

>Translated Secondary Structure
MTKVRKAIIPAAGLGTRFLPATKALAKEMLPIVDKPTIQFIVEEALKSGIEEILVVTGKA
CCCHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHEEEECCCH
KRSIEDHFDSNFELEYNLQAKGKNELLKLVDETTAINLHFIRQSHPRGLGDAVLQAKAFV
HHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHEEEEEEEECCCCCCHHHHHHHHHHHC
GNEPFVVMLGDDLMDITNASAKPLTKQLMEDYDKTHASTIAVMKVPHEDVSSYGVIAPQG
CCCCEEEEECCCHHHHCCCCCCHHHHHHHHHHHHHHHHEEEEEECCCHHHHHCCCCCCCC
KAVKGLYSVDTFVEKPQPEDAPSDLAIIGRYLLTPEIFGILERQTPGAGNEVQLTDAIDT
CHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHHH
LNKTQRVFAREFKGNRYDVGDKFGFMKTSIDYALEHPQVKEDLKNYIIKLGKALEKSKVP
HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCC
THSK
CCCC
>Mature Secondary Structure 
TKVRKAIIPAAGLGTRFLPATKALAKEMLPIVDKPTIQFIVEEALKSGIEEILVVTGKA
CCHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHEEEECCCH
KRSIEDHFDSNFELEYNLQAKGKNELLKLVDETTAINLHFIRQSHPRGLGDAVLQAKAFV
HHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHEEEEEEEECCCCCCHHHHHHHHHHHC
GNEPFVVMLGDDLMDITNASAKPLTKQLMEDYDKTHASTIAVMKVPHEDVSSYGVIAPQG
CCCCEEEEECCCHHHHCCCCCCHHHHHHHHHHHHHHHHEEEEEECCCHHHHHCCCCCCCC
KAVKGLYSVDTFVEKPQPEDAPSDLAIIGRYLLTPEIFGILERQTPGAGNEVQLTDAIDT
CHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHHH
LNKTQRVFAREFKGNRYDVGDKFGFMKTSIDYALEHPQVKEDLKNYIIKLGKALEKSKVP
HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCC
THSK
CCCC
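
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The sketch below pairs those lines and reports the share of each state. It assumes the common three-state convention (H for helix, E for strand, C for coil), which the record itself does not spell out, and `state_composition` is a hypothetical helper.

```python
from collections import Counter


def state_composition(lines: list[str]) -> dict[str, float]:
    """Percentage of each state letter across alternating residue/state lines."""
    counts = Counter()
    for residues, states in zip(lines[0::2], lines[1::2]):
        residues, states = residues.strip(), states.strip()
        if len(residues) != len(states):
            raise ValueError("residue and state lines differ in length")
        counts.update(states)
    total = sum(counts.values())
    return {state: 100.0 * n / total for state, n in counts.items()}


# Last residue/state pair of either block above.
print(state_composition(["THSK", "CCCC"]))  # {'C': 100.0}
```

The input is the list of lines under one `>` header, header line excluded.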

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11296296