Definition | Streptococcus pyogenes M1 GAS chromosome, complete genome. |
---|---|
Accession | NC_002737 |
Length | 1,852,441 |
The map label for this gene is ung
Identifier: 15674927
GI number: 15674927
Start: 749465
End: 750118
Strand: Direct
Name: ung
Synonym: SPy_0905
Alternate gene names: 15674927
Gene position: 749465-750118 (Clockwise)
Preceding gene: 15674926
Following gene: 15674928
Centisome position: 40.46
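The positional fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state its formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    # Values taken from the Start, End and Length fields of this record.
    print(gene_length(749465, 750118))             # gene length in bases
    print(round(centisome(749465, 1852441), 2))    # centisome position
```

Under these assumptions the gene length agrees with the 654 bases listed for the gene sequence, and the rounded percentage agrees with the centisome field.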
GC content: 42.2
Gene sequence:
>654_bases
ATGGCTCATTCAATTTGGCATGAGAAAATCAAATCATTTTTGCCGGAGCATTACTACGGTCGCATCAATCACTTTTTAGA
TGAAGCCTATGCTTCCGGTCTTGTCTATCCTCCACGAGAAAATGTCTTTAAAGCCTTACAAGTCACTCCTTTGGAAGAAA
CCAAAGTGTTGATTTTAGGACAAGACCCTTATCATGGCCCCAAACAGGCCCAGGGCTTATCATTTTCAGTTCCAGAAGAG
ATTTCTGCTCCGCCATCCCTTATTAATATTTTAAAAGAATTAGCAGATGACATTGGTCCTCGTGACCATCATGATTTAAG
CACTTGGGCTAGTCAAGGGGTTCTGCTTTTGAATGCTTGCTTAACAGTGCCAGCAGGCCAGGCTAATGGGCATGCAGGCT
TAATATGGGAGCCATTTACAGATGCTGTTATTAAAGTGCTGAATGAGAAAGACAGCCCTGTAGTTTTCATTTTGTGGGGA
GCTTATGCAAGGAAGAAAAAAGCCTTCATTACTAATCCAAAGCACCACATCATCGAGAGCCCTCATCCAAGCCCTTTGTC
ATCTTATCGTGGCTTTTTTGGCAGCAAACCCTTTTCAAGAACCAATGCTATTTTAGAAAAAGAGGGCATGACTGGCGTAG
ATTGGCTAAAATAA
Upstream 100 bases:
>100_bases
ATATCATATTTTCAGGGTCTTTCTATTTAAAATGTGGTAAACTAGATAGAAAAGGACATCGTATTTAACTTAGAAACTGA
CGTGATTAGAAAGGAATCTT
Downstream 100 bases:
>100_bases
GCCAGCCATTTCAACTAAATAGCTTAATCTAGACATTTTGATACCTTTAATAAGTGAAATGAGTGCAGTAGATTTTTCCT
ATTAAAGAACGTAAGGTCTT
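The GC content field can be recomputed from any of the nucleotide sequences in this record. The following is a minimal sketch; `gc_content` is an illustrative helper, and it is an assumption that the record's value was computed over the coding sequence alone.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored so that wrapped
    FASTA-style sequence text can be pasted in directly.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # First 24 bases of the gene sequence in this record.
    print(round(gc_content("ATGGCTCATTCAATTTGGCATGAG"), 1))
```

Pasting the full 654-base gene sequence into `gc_content` gives a figure to compare against the GC content field.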
Product: uracil-DNA glycosylase
Products: NA
Alternate protein names: UDG
Number of amino acids: Translated: 217; Mature: 216
Protein sequence:
>217_residues
MAHSIWHEKIKSFLPEHYYGRINHFLDEAYASGLVYPPRENVFKALQVTPLEETKVLILGQDPYHGPKQAQGLSFSVPEE
ISAPPSLINILKELADDIGPRDHHDLSTWASQGVLLLNACLTVPAGQANGHAGLIWEPFTDAVIKVLNEKDSPVVFILWG
AYARKKKAFITNPKHHIIESPHPSPLSSYRGFFGSKPFSRTNAILEKEGMTGVDWLK
Sequences:
>Translated_217_residues
MAHSIWHEKIKSFLPEHYYGRINHFLDEAYASGLVYPPRENVFKALQVTPLEETKVLILGQDPYHGPKQAQGLSFSVPEE
ISAPPSLINILKELADDIGPRDHHDLSTWASQGVLLLNACLTVPAGQANGHAGLIWEPFTDAVIKVLNEKDSPVVFILWG
AYARKKKAFITNPKHHIIESPHPSPLSSYRGFFGSKPFSRTNAILEKEGMTGVDWLK
>Mature_216_residues
AHSIWHEKIKSFLPEHYYGRINHFLDEAYASGLVYPPRENVFKALQVTPLEETKVLILGQDPYHGPKQAQGLSFSVPEEI
SAPPSLINILKELADDIGPRDHHDLSTWASQGVLLLNACLTVPAGQANGHAGLIWEPFTDAVIKVLNEKDSPVVFILWGA
YARKKKAFITNPKHHIIESPHPSPLSSYRGFFGSKPFSRTNAILEKEGMTGVDWLK
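The relationship between the gene sequence and the two protein sequences can be illustrated with a short translation sketch. It assumes the standard codon assignments (which the bacterial genetic code shares) and assumes that the mature form is simply the translated form without its initiator methionine, as the two sequences in this record suggest. `translate` and `mature_form` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


def mature_form(protein: str) -> str:
    """Drop a leading Met (assumed processing; not stated by the record)."""
    return protein[1:] if protein.startswith("M") else protein


if __name__ == "__main__":
    # First eight codons of the gene sequence in this record.
    print(translate("ATGGCTCATTCAATTTGGCATGAG"))
    print(mature_form(translate("ATGGCTCATTCAATTTGGCATGAG")))
```

The 654 bases correspond to 218 codons, that is 217 residues plus the terminating TAA.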
Specific function: Excises uracil residues from DNA; these can arise from misincorporation of dUMP residues by DNA polymerase or from deamination of cytosine.
COG id: COG0692
COG function: function code L; Uracil DNA glycosylase
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the uracil-DNA glycosylase family
Homologues:
Organism=Homo sapiens, GI6224979, Length=213, Percent_Identity=49.2957746478873, Blast_Score=206, Evalue=1e-53
Organism=Homo sapiens, GI19718751, Length=213, Percent_Identity=49.2957746478873, Blast_Score=206, Evalue=2e-53
Organism=Escherichia coli, GI1788934, Length=215, Percent_Identity=48.3720930232558, Blast_Score=204, Evalue=3e-54
Organism=Caenorhabditis elegans, GI17556304, Length=215, Percent_Identity=42.7906976744186, Blast_Score=178, Evalue=2e-45
Organism=Saccharomyces cerevisiae, GI6323620, Length=217, Percent_Identity=40.5529953917051, Blast_Score=144, Evalue=1e-35
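The homologue entries follow a regular `key=value` layout, so they can be turned into structured records for sorting or filtering. This is a rough sketch that assumes every entry carries the same six fields in the same order; the `Homologue` type and `parse_homologues` function are hypothetical names introduced here.

```python
import re
from typing import List, NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: int
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI<digits>, Length=..., Percent_Identity=..., ...
_ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)

# Two entries copied from the Homologues field of this record.
SAMPLE = (
    "Organism=Homo sapiens, GI6224979, Length=213, "
    "Percent_Identity=49.2957746478873, Blast_Score=206, Evalue=1e-53, "
    "Organism=Escherichia coli, GI1788934, Length=215, "
    "Percent_Identity=48.3720930232558, Blast_Score=204, Evalue=3e-54,"
)


def parse_homologues(text: str) -> List[Homologue]:
    """Extract every homologue entry found in the text, in order."""
    return [
        Homologue(
            organism=m["organism"].strip(),
            gi=int(m["gi"]),
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in _ENTRY.finditer(text)
    ]


if __name__ == "__main__":
    for hit in sorted(parse_homologues(SAMPLE), key=lambda h: h.evalue):
        print(hit.organism, hit.gi, f"{hit.percent_identity:.1f}%", hit.evalue)
```

The pattern tolerates entries separated by commas or by line breaks, so it works on the field whether it is stored on one line or one entry per line.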
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): UNG_STRP1 (Q9A072)
Other databases:
- EMBL: AE004092
- EMBL: CP000017
- RefSeq: NP_269101.1
- RefSeq: YP_282071.1
- ProteinModelPortal: Q9A072
- SMR: Q9A072
- EnsemblBacteria: EBSTRT00000000468
- EnsemblBacteria: EBSTRT00000027521
- GeneID: 3572220
- GeneID: 901060
- GenomeReviews: AE004092_GR
- GenomeReviews: CP000017_GR
- KEGG: spy:SPy_0905
- KEGG: spz:M5005_Spy_0708
- GeneTree: EBGT00050000028393
- HOGENOM: HBG605450
- OMA: GAHAQKK
- ProtClustDB: PRK05254
- BioCyc: SPYO160490:SPY0905-MONOMER
- BioCyc: SPYO293653:M5005_SPY0708-MONOMER
- GO: GO:0005737
- HAMAP: MF_00148
- InterPro: IPR002043
- InterPro: IPR018085
- InterPro: IPR005122
- Gene3D: G3DSA:3.40.470.10
- PANTHER: PTHR11264
- TIGRFAMs: TIGR00628
Pfam domain/function: PF03167 UDG; SSF52141 UDNA_glycsylseSF
EC number: 3.2.2.27
Molecular weight: Translated: 24213; Mature: 24082
Theoretical pI: Translated: 7.14; Mature: 7.14
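The two molecular weights above differ by 131, which is what removing one methionine residue would account for. The sketch below estimates a protein's average molecular weight by summing standard average residue masses and adding one water; this is a common approach, but it is an assumption that the record's values were derived this way, and `molecular_weight` is an illustrative name.

```python
# Average masses (Da) of amino acid residues, i.e. amino acid minus water.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water per chain for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


if __name__ == "__main__":
    translated_start = "MAHSIWHEKIK"   # first residues of the translated form
    mature_start = translated_start[1:]
    # The difference is the mass of the removed Met residue.
    print(round(molecular_weight(translated_start)
                - molecular_weight(mature_start), 2))
```

Passing the full translated and mature sequences from this record gives estimates to compare with the listed values; small differences are possible depending on the mass table and rounding used.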
Prosite motif: PS00130 U_DNA_GLYCOSYLASE
Important sites: ACT_SITE 62-62
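To see which residue the active-site annotation points at, a 1-based lookup into the protein sequence is enough. The sketch assumes that the `62-62` range is numbered against the translated (217-residue) sequence; the record does not say which form is used. `residues_in_range` is an illustrative helper.

```python
def residues_in_range(protein: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive numbering."""
    seq = "".join(protein.split())
    if not 1 <= start <= end <= len(seq):
        raise ValueError("range outside the sequence")
    return seq[start - 1:end]


if __name__ == "__main__":
    # First 66 residues of the translated protein from this record.
    n_terminal = (
        "MAHSIWHEKIKSFLPEHYYGRINHFLDEAYASGLVYPPRENVFKALQVTPLEETKVLILG"
        "QDPYHG"
    )
    print(residues_in_range(n_terminal, 62, 62))
```

Under that numbering assumption the annotated position is an aspartate within the `GQDPYH` stretch of the sequence.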
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
0.5 %Met (Mature Protein)
0.9 %Cys+Met (Mature Protein)
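These percentages are simple residue counts divided by sequence length. A minimal sketch follows, assuming each value is rounded to one decimal place and that the combined figure is computed from the combined count; `cys_met_content` is an illustrative name.

```python
from typing import Dict


def cys_met_content(protein: str) -> Dict[str, float]:
    """Percent Cys, Met and Cys+Met in a protein, rounded to one decimal."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


if __name__ == "__main__":
    # Stand-in sequence with the same composition as the translated protein
    # in this record: 217 residues, one Cys, two Met (one of them N-terminal).
    stand_in = "M" + "A" * 214 + "C" + "M"
    print(cys_met_content(stand_in))        # translated form
    print(cys_met_content(stand_in[1:]))    # mature form, initiator Met removed
```

Because the only change between the two forms is the loss of the initiator methionine, the Met and Cys+Met percentages drop while the Cys percentage stays the same.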
Secondary structure:
>Translated Secondary Structure
MAHSIWHEKIKSFLPEHYYGRINHFLDEAYASGLVYPPRENVFKALQVTPLEETKVLILG
CCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEEEE
QDPYHGPKQAQGLSFSVPEEISAPPSLINILKELADDIGPRDHHDLSTWASQGVLLLNAC
CCCCCCCHHCCCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCHHEEHHH
LTVPAGQANGHAGLIWEPFTDAVIKVLNEKDSPVVFILWGAYARKKKAFITNPKHHIIES
HHCCCCCCCCCCCEEECHHHHHHHHHHCCCCCCEEEEEECCHHHHCCCEECCCHHHCCCC
PHPSPLSSYRGFFGSKPFSRTNAILEKEGMTGVDWLK
CCCCCHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
AHSIWHEKIKSFLPEHYYGRINHFLDEAYASGLVYPPRENVFKALQVTPLEETKVLILG
CCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCEEEEEE
QDPYHGPKQAQGLSFSVPEEISAPPSLINILKELADDIGPRDHHDLSTWASQGVLLLNAC
CCCCCCCHHCCCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCHHEEHHH
LTVPAGQANGHAGLIWEPFTDAVIKVLNEKDSPVVFILWGAYARKKKAFITNPKHHIIES
HHCCCCCCCCCCCEEECHHHHHHHHHHCCCCCCEEEEEECCHHHHCCCEECCCHHHCCCC
PHPSPLSSYRGFFGSKPFSRTNAILEKEGMTGVDWLK
CCCCCHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11296296