Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is ung
Identifier: 159185223
GI number: 159185223
Start: 2426413
End: 2427126
Strand: Reverse
Name: ung
Synonym: Atu2455
Alternate gene names: 159185223
Gene position: 2427126-2426413 (Counterclockwise)
Preceding gene: 15889727
Following gene: 15889725
Centisome position: 85.41
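The coordinates above are mutually consistent: End - Start + 1 gives the 714-base gene length, and the centisome value matches the gene's 5' end (position 2427126 on the reverse strand) expressed as a percentage of the 2,841,580-base chromosome. A minimal Python sketch of that arithmetic follows. The function names are illustrative, and the centisome formula is an assumption inferred from the numbers in this record, not a documented definition.

```python
GENOME_LENGTH = 2_841_580           # chromosome length from the record header
START, END = 2_426_413, 2_427_126   # gene coordinates from the record

def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)

print(gene_length(START, END))         # gene length in bases
print(centisome(END, GENOME_LENGTH))   # centisome of the 5' end on the reverse strand
```

Using the Start coordinate instead of End would give a slightly different centisome value, which is why the sketch uses the reverse-strand 5' end.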
GC content: 58.54
Gene sequence:
>714_bases
ATGGCAGAGGCAGGCGTCAAACTCGAAGACAGCTGGAAGCACGTTCTCTCGGGCGAATTCGCCAGCCCCTACATGCAGAA
GCTGAAAGAGTTCCTGCTTGCCGAAAAGACCGCCGGCAAGCGCATCTTTCCGAAGGGCGCGGAATATTTCCGCGCTCTCG
ACCTCACGCCTCTCGACGAGGTCAAGGTCGTCATTCTTGGCCAGGATCCCTACCATGGGCTCGGTCAGGCACATGGGCTC
TGCTTTTCGGTGCAGCCCGGGGTGCGCATTCCGCCCTCGCTCGTCAATATCTACAAGGAATTGCAGAGCGACCTCGGCAT
TCGCCCGGTCAAGCACGGTTTTCTGGAAAGCTGGGCAAAACAGGGCGTGTTGCTGCTCAACAGCGTGCTGACGGTGGAAG
AGGCGCGGGCCGCCTCGCATCAGGGGCAGGGCTGGGAAAAATTCACCGATGCGGTCATCCGCGCCGTGAACGACGAATGT
GACCATGTGGTCTTTCTGCTCTGGGGTTCCTATGCCCAGAAGAAGGCGGCCTTCGTGGACCAGCGCAAACATCTCGTGCT
GCGCTCGCCACACCCGTCACCCCTTTCAGCCCATAACGGGTTTTTCGGCAACGGCCATTTTTCTAAGGCCAACGCTTTCC
TCGTTTCGCATGGCCGTGATCCGATCGACTGGCAATTGCCTGACGTGGTGGAAGGCGACAAAAACCTTCTTTAA
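The GC content field is the percentage of G and C bases in the gene sequence. For a 714-base sequence, the reported 58.54 would correspond to 418 G or C bases. The sketch below shows one way to compute such a value in Python. The function name `gc_content` is illustrative and is not taken from the database that produced this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)

# First 80 bases of the gene sequence above, used only as a short example.
example = (
    "ATGGCAGAGGCAGGCGTCAAACTCGAAGACAGCTGGAAGCAC"
    "GTTCTCTCGGGCGAATTCGCCAGCCCCTACATGCAGAA"
)
print(gc_content(example))
```

Passing the full 714-base sequence instead of `example` would reproduce the calculation for the whole gene.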
Upstream 100 bases:
>100_bases
GCCTTTCGCAAAATACTTTTTCATCTGCGATCCGGATGGTTACAAGATCGAAGTGCTGCAGCGCGGCGACCGGTTCAAAT
AAGCTCTGATGGAGGCGGGC
Downstream 100 bases:
>100_bases
ATACTCATAAATACCTTTGCGCTTTGCCAGTCTCGGGGTTAAAGATAACCGGACTACAAAGTGAAGGATTATGCCCGTGT
CTCTCCTGTCCGCAGAAACC
Product: uracil-DNA glycosylase
Products: NA
Alternate protein names: UDG
Number of amino acids: Translated: 237; Mature: 236
Protein sequence:
>237_residues
MAEAGVKLEDSWKHVLSGEFASPYMQKLKEFLLAEKTAGKRIFPKGAEYFRALDLTPLDEVKVVILGQDPYHGLGQAHGL
CFSVQPGVRIPPSLVNIYKELQSDLGIRPVKHGFLESWAKQGVLLLNSVLTVEEARAASHQGQGWEKFTDAVIRAVNDEC
DHVVFLLWGSYAQKKAAFVDQRKHLVLRSPHPSPLSAHNGFFGNGHFSKANAFLVSHGRDPIDWQLPDVVEGDKNLL
Sequences:
>Translated_237_residues
MAEAGVKLEDSWKHVLSGEFASPYMQKLKEFLLAEKTAGKRIFPKGAEYFRALDLTPLDEVKVVILGQDPYHGLGQAHGL
CFSVQPGVRIPPSLVNIYKELQSDLGIRPVKHGFLESWAKQGVLLLNSVLTVEEARAASHQGQGWEKFTDAVIRAVNDEC
DHVVFLLWGSYAQKKAAFVDQRKHLVLRSPHPSPLSAHNGFFGNGHFSKANAFLVSHGRDPIDWQLPDVVEGDKNLL
>Mature_236_residues
AEAGVKLEDSWKHVLSGEFASPYMQKLKEFLLAEKTAGKRIFPKGAEYFRALDLTPLDEVKVVILGQDPYHGLGQAHGLC
FSVQPGVRIPPSLVNIYKELQSDLGIRPVKHGFLESWAKQGVLLLNSVLTVEEARAASHQGQGWEKFTDAVIRAVNDECD
HVVFLLWGSYAQKKAAFVDQRKHLVLRSPHPSPLSAHNGFFGNGHFSKANAFLVSHGRDPIDWQLPDVVEGDKNLL
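The translated and mature sequences follow from the gene sequence: 714 bases are 238 codons, of which the last (TAA) is a stop, leaving 237 residues, and the mature form in this record is the translated form without its initiator methionine. The Python sketch below illustrates both steps with the standard genetic code. The names `translate` and `mature` are illustrative, and using the standard table for this bacterial gene is an assumption that happens to agree with the sequences shown here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def mature(protein: str) -> str:
    """Drop the initiator methionine, as the mature sequence in this record does."""
    return protein[1:] if protein.startswith("M") else protein

# First eight codons and last six codons of the gene sequence in this record.
print(translate("ATGGCAGAGGCAGGCGTCAAACTC"))
print(translate("GACAAAAACCTTCTTTAA"))
print(mature("MAEAGVKL"))
```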
Specific function: Excises uracil residues from DNA. Uracil can arise from misincorporation of dUMP by DNA polymerase or from deamination of cytosine.
COG id: COG0692
COG function: function code L; Uracil DNA glycosylase
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the uracil-DNA glycosylase family
Homologues:
Organism=Homo sapiens, GI19718751, Length=222, Percent_Identity=54.0540540540541, Blast_Score=230, Evalue=8e-61
Organism=Homo sapiens, GI6224979, Length=222, Percent_Identity=54.0540540540541, Blast_Score=230, Evalue=1e-60
Organism=Escherichia coli, GI1788934, Length=217, Percent_Identity=55.2995391705069, Blast_Score=244, Evalue=3e-66
Organism=Caenorhabditis elegans, GI17556304, Length=217, Percent_Identity=48.8479262672811, Blast_Score=218, Evalue=2e-57
Organism=Saccharomyces cerevisiae, GI6323620, Length=236, Percent_Identity=44.4915254237288, Blast_Score=178, Evalue=8e-46
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): UNG_AGRT5 (Q8UCM8)
Other databases:
- EMBL: AE007869
- PIR: AE2878
- PIR: G97654
- RefSeq: NP_355407.2
- ProteinModelPortal: Q8UCM8
- SMR: Q8UCM8
- STRING: Q8UCM8
- GeneID: 1134493
- GenomeReviews: AE007869_GR
- KEGG: atu:Atu2455
- eggNOG: COG0692
- HOGENOM: HBG605450
- OMA: GAHAQKK
- PhylomeDB: Q8UCM8
- ProtClustDB: PRK05254
- BioCyc: ATUM176299-1:ATU2455-MONOMER
- GO: GO:0005737
- HAMAP: MF_00148
- InterPro: IPR002043
- InterPro: IPR018085
- InterPro: IPR005122
- Gene3D: G3DSA:3.40.470.10
- PANTHER: PTHR11264
- TIGRFAMs: TIGR00628
Pfam domain/function: PF03167 UDG; SSF52141 UDNA_glycsylseSF
EC number: 3.2.2.27
Molecular weight: Translated: 26330; Mature: 26199
Theoretical pI: Translated: 7.41; Mature: 7.41
Prosite motif: PS00130 U_DNA_GLYCOSYLASE
Important sites: ACT_SITE 73-73
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
0.8 %Met (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
0.4 %Met (Mature Protein)
1.3 %Cys+Met (Mature Protein)
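These percentages are residue counts divided by sequence length: the translated protein has 2 Cys and 2 Met in 237 residues, and the mature protein loses one Met along with the initiator residue. A small Python sketch that recomputes them from the protein sequence in this record is given below. The helper name `percent` is illustrative, and rounding to one decimal place is an assumption chosen to match the figures shown.

```python
# Translated protein sequence from this record (237 residues).
TRANSLATED = (
    "MAEAGVKLEDSWKHVLSGEFASPYMQKLKEFLLAEKTAGKRIFPKGAEYFRALDLTPLDEVKVVILGQDPYHGLGQAHGL"
    "CFSVQPGVRIPPSLVNIYKELQSDLGIRPVKHGFLESWAKQGVLLLNSVLTVEEARAASHQGQGWEKFTDAVIRAVNDEC"
    "DHVVFLLWGSYAQKKAAFVDQRKHLVLRSPHPSPLSAHNGFFGNGHFSKANAFLVSHGRDPIDWQLPDVVEGDKNLL"
)
# Mature form: the same sequence without the initiator methionine.
MATURE = TRANSLATED[1:]

def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(residue) for residue in residues)
    return round(100.0 * count / len(seq), 1)

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```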
Secondary structure:
>Translated Secondary Structure
MAEAGVKLEDSWKHVLSGEFASPYMQKLKEFLLAEKTAGKRIFPKGAEYFRALDLTPLDE
CCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCC
VKVVILGQDPYHGLGQAHGLCFSVQPGVRIPPSLVNIYKELQSDLGIRPVKHGFLESWAK
EEEEEECCCCCCCCHHHCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
QGVLLLNSVLTVEEARAASHQGQGWEKFTDAVIRAVNDECDHVVFLLWGSYAQKKAAFVD
CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHH
QRKHLVLRSPHPSPLSAHNGFFGNGHFSKANAFLVSHGRDPIDWQLPDVVEGDKNLL
HHCCEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHCCCCCCC
>Mature Secondary Structure
AEAGVKLEDSWKHVLSGEFASPYMQKLKEFLLAEKTAGKRIFPKGAEYFRALDLTPLDE
CCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCC
VKVVILGQDPYHGLGQAHGLCFSVQPGVRIPPSLVNIYKELQSDLGIRPVKHGFLESWAK
EEEEEECCCCCCCCHHHCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
QGVLLLNSVLTVEEARAASHQGQGWEKFTDAVIRAVNDECDHVVFLLWGSYAQKKAAFVD
CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHH
QRKHLVLRSPHPSPLSAHNGFFGNGHFSKANAFLVSHGRDPIDWQLPDVVEGDKNLL
HHCCEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11743193; 11743194