Definition Prochlorococcus marinus str. NATL1A, complete genome.
Accession NC_008819
Length 1,864,731

The map label for this gene is uvrC

Identifier: 124025708

GI number: 124025708

Start: 921564

End: 923486

Strand: Direct

Name: uvrC

Synonym: NATL1_10011

Alternate gene names: 124025708

Gene position: 921564-923486 (Clockwise)

Preceding gene: 124025707

Following gene: 124025709

Centisome position: 49.42
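
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and genome Length.
print(gene_length(921564, 923486))            # 1923
print(round(centisome(921564, 1864731), 2))   # 49.42
```

Under these assumptions both results agree with the 1923-base gene sequence and the centisome value listed in this record.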

GC content: 30.94

Gene sequence:

>1923_bases
GTGGAACTAATACCGTTAATAAGGGACAAGTCAAGATTATCGGATTTTTTGAAGGATATACCTAATGATCCTGGATGTTA
TTTGATGAAAGATGGTGAGGATAGATTGCTTTATGTTGGTAAATCTAAAAAGTTAAGGAATAGAGTTAGAAGTTATTTTC
GTTCAGGTAATGAATTAAGTCCTAGAATATCTTTAATGGTGAGACAAGTTGCAGATATTGAATTGATAGTTACTGATAAT
GAAAGTGAAGCATTAACATTAGAATCAAATTTAATTAAATCTCACCAACCATATTTCAATGTCTTACTAAAAGATGATAA
AAAGTATCCCTATGTTTGTATTACTTGGGGTGATAAATATCCAAGAATTTTTTTAACTAGAAAAAGGCGTCAACGACAAT
TAAAAGATAAATATTATGGTCCTTATGTAGATGTTTATTTACTTAGAAAAACTCTATTTAGTATAAAAAAATTGTTTCCA
CTCAGGCAAAGAAGAATTCCGCTTTATAAGGATAGAACATGCCTTAATTATTCAATTGGAAGATGCCCTGGTGTTTGCCA
GGAAGAAATAAGTTCAGAAGATTACAAAAACACTTTAAAAAGAGTTGAAATGATATTTCAAGGAAGAACGGATGAATTAA
GAATATTATTAGAAAAACAAATGATTTCTTTTTCAGAGTCATTGAAATTTGAAGAGGCTGGATCAGTTAGAGATCAGCTT
AAGGGTATAGATAGATTGTATGAATCTCAAAAGATGATCATACCAGATTCATCTGTTTGTAGGGATATAATTGCAATGGC
ATCAGAAGAAAATATAAGCTCAGTACAAATTTTTCAAATGCGATCAGGTAAATTAATTGGTCGTTTAGGATATTTCTCAG
ATAATAGTAATTTTAATTCATCTCAAATACTTCAACAAGTAATAGAAAATCATTATTCAAATGTAGATCCTGTTGAAATC
CCATCAGAAATATTAGTTCAACATCAACTTGTAAATAATATTTTAATTTCAGATTGGCTTAGTGAAATAAAAAAGCAAAA
AGTTAATATAAATGTTCCTAAAAGATCTAGAAAAGCAGAGATTATTAAACTCGTAGAAAAAAATGCTAATTTAGAATTAC
AAAGAATTAAACAATCTCATGATAAGAATTTAGTTGAACTTGATGATCTGACTAATATCCTTGATTTAGAAAATATTCCA
AAGAGAATTGAATGTTATGACATAAGCCATATCCAAGGAAGTGACGCTGTTGCATCACAAGTAGTATTTATTGATGGTAT
TGCGGCAAGGCAACACTATAGAAGATATAAAATTAAAAGCCCAAATATAAAAATTGGTCACAGCGACGATTTCGAATCAA
TGGCTGAAGTGATAACTAGAAGATTTAGAAGATGGGCTCGTTTTAAAGAAGAAGGTGGAGATATTAATGCCCTACTAAGT
AATCAAAGCAGTGTTCTAGATAACCTGAATTTAAATGACTGGCCAGATCTCGTTGTGATAGATGGAGGTAAAGGTCAATT
AAGTTCTGTCGTAGCTGCTCTTGAGGAACTTAAACTTGATCAAAATTTAAATGTTATATCTTTAGCAAAAAAGAAGGAGG
AAGTTTTTATTCCTAATGTTAAACAATCATTAGTTACCGAATCAAATCAACCAGGAATGCTTTTGCTAAGGAGACTGAGA
GATGAAGCTCATAGATTTGCAATTACTTTTCATAGGCAAAAAAGGAGTCAACGGATGAAACGTTCTCAGTTAAATGAAAT
ACCGGGTCTTGGACCTCAAAGAATAAAATTATTGCTTGAGCATTTCAGGTCAATTGAGGCAATACAAATGGCTACTTTTT
CTGAACTTTCATCAACACCCGGCTTAGGCAGATCAACTGCTGTTGTTATTAGAAACTATTTTCATCCCGATAAAAATAAA
TAA
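
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C among the A/C/G/T bases; `gc_content` is a hypothetical helper, and the example uses only the first 20 bases of the sequence above rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops whitespace, newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence (7 of 20 are G or C).
print(gc_content("GTGGAACTAATACCGTTAAT"))  # 35.0
```

Passing the complete 1923-base sequence is what would be needed to compare against the 30.94 reported in the record.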

Upstream 100 bases:

>100_bases
CGCTGAAGTAAAGAGTGCAGTATTGAATAAGGATGGATCTGCCTTAAATTTAGCGAGCACAGGTTGGAATTATGGTGGTT
AATTTATCGAGACAAGTGCG

Downstream 100 bases:

>100_bases
TCTATTTTTAAATTATTAACTATTTAGCGACATCTAATCTTTATTAGAGTAATGTATAGAGTGATATGAACGTACCTAAC
ATTTGATTTAATATGAATTT

Product: excinuclease ABC subunit C

Products: NA

Alternate protein names: Protein uvrC; Excinuclease ABC subunit C

Number of amino acids: Translated: 640; Mature: 640

Protein sequence:

>640_residues
MELIPLIRDKSRLSDFLKDIPNDPGCYLMKDGEDRLLYVGKSKKLRNRVRSYFRSGNELSPRISLMVRQVADIELIVTDN
ESEALTLESNLIKSHQPYFNVLLKDDKKYPYVCITWGDKYPRIFLTRKRRQRQLKDKYYGPYVDVYLLRKTLFSIKKLFP
LRQRRIPLYKDRTCLNYSIGRCPGVCQEEISSEDYKNTLKRVEMIFQGRTDELRILLEKQMISFSESLKFEEAGSVRDQL
KGIDRLYESQKMIIPDSSVCRDIIAMASEENISSVQIFQMRSGKLIGRLGYFSDNSNFNSSQILQQVIENHYSNVDPVEI
PSEILVQHQLVNNILISDWLSEIKKQKVNINVPKRSRKAEIIKLVEKNANLELQRIKQSHDKNLVELDDLTNILDLENIP
KRIECYDISHIQGSDAVASQVVFIDGIAARQHYRRYKIKSPNIKIGHSDDFESMAEVITRRFRRWARFKEEGGDINALLS
NQSSVLDNLNLNDWPDLVVIDGGKGQLSSVVAALEELKLDQNLNVISLAKKKEEVFIPNVKQSLVTESNQPGMLLLRRLR
DEAHRFAITFHRQKRSQRMKRSQLNEIPGLGPQRIKLLLEHFRSIEAIQMATFSELSSTPGLGRSTAVVIRNYFHPDKNK
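
The protein sequence can be derived from the gene sequence by codon translation. The sketch below assumes the standard genetic code and assumes that an alternative bacterial start codon (GTG or TTG) in the first position is read as methionine, which matches the GTG start and leading M in this record. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating GTG/TTG is decoded as Met rather than Val/Leu.
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGGAACTAATACCGTTAATAAGGGACAAG"))  # MELIPLIRDK
```

The 1923 bases correspond to 641 codons, that is, 640 residues plus the terminal TAA stop codon.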

Sequences:

>Translated_640_residues
MELIPLIRDKSRLSDFLKDIPNDPGCYLMKDGEDRLLYVGKSKKLRNRVRSYFRSGNELSPRISLMVRQVADIELIVTDN
ESEALTLESNLIKSHQPYFNVLLKDDKKYPYVCITWGDKYPRIFLTRKRRQRQLKDKYYGPYVDVYLLRKTLFSIKKLFP
LRQRRIPLYKDRTCLNYSIGRCPGVCQEEISSEDYKNTLKRVEMIFQGRTDELRILLEKQMISFSESLKFEEAGSVRDQL
KGIDRLYESQKMIIPDSSVCRDIIAMASEENISSVQIFQMRSGKLIGRLGYFSDNSNFNSSQILQQVIENHYSNVDPVEI
PSEILVQHQLVNNILISDWLSEIKKQKVNINVPKRSRKAEIIKLVEKNANLELQRIKQSHDKNLVELDDLTNILDLENIP
KRIECYDISHIQGSDAVASQVVFIDGIAARQHYRRYKIKSPNIKIGHSDDFESMAEVITRRFRRWARFKEEGGDINALLS
NQSSVLDNLNLNDWPDLVVIDGGKGQLSSVVAALEELKLDQNLNVISLAKKKEEVFIPNVKQSLVTESNQPGMLLLRRLR
DEAHRFAITFHRQKRSQRMKRSQLNEIPGLGPQRIKLLLEHFRSIEAIQMATFSELSSTPGLGRSTAVVIRNYFHPDKNK
>Mature_640_residues
MELIPLIRDKSRLSDFLKDIPNDPGCYLMKDGEDRLLYVGKSKKLRNRVRSYFRSGNELSPRISLMVRQVADIELIVTDN
ESEALTLESNLIKSHQPYFNVLLKDDKKYPYVCITWGDKYPRIFLTRKRRQRQLKDKYYGPYVDVYLLRKTLFSIKKLFP
LRQRRIPLYKDRTCLNYSIGRCPGVCQEEISSEDYKNTLKRVEMIFQGRTDELRILLEKQMISFSESLKFEEAGSVRDQL
KGIDRLYESQKMIIPDSSVCRDIIAMASEENISSVQIFQMRSGKLIGRLGYFSDNSNFNSSQILQQVIENHYSNVDPVEI
PSEILVQHQLVNNILISDWLSEIKKQKVNINVPKRSRKAEIIKLVEKNANLELQRIKQSHDKNLVELDDLTNILDLENIP
KRIECYDISHIQGSDAVASQVVFIDGIAARQHYRRYKIKSPNIKIGHSDDFESMAEVITRRFRRWARFKEEGGDINALLS
NQSSVLDNLNLNDWPDLVVIDGGKGQLSSVVAALEELKLDQNLNVISLAKKKEEVFIPNVKQSLVTESNQPGMLLLRRLR
DEAHRFAITFHRQKRSQRMKRSQLNEIPGLGPQRIKLLLEHFRSIEAIQMATFSELSSTPGLGRSTAVVIRNYFHPDKNK

Specific function: The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrC incises both the 5' and 3' sides of the lesion. The N-terminal half is responsible for the 3' incision and the C-terminal half is responsible for the 5' incision.

COG id: COG0322

COG function: function code L; Nuclease subunit of the excinuclease complex

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 UVR domain

Homologues:

Organism=Escherichia coli, GI87081999, Length=626, Percent_Identity=33.0670926517572, Blast_Score=341, Evalue=8e-95
Organism=Escherichia coli, GI1788037, Length=220, Percent_Identity=29.0909090909091, Blast_Score=74, Evalue=2e-14
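
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI token. A minimal parsing sketch follows; `parse_homologue` is a hypothetical helper, and the field layout is inferred only from the two lines in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI87081999
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": float, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI87081999, Length=626, "
    "Percent_Identity=33.0670926517572, Blast_Score=341, Evalue=8e-95,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

The parser tolerates a trailing comma, so it accepts the lines with or without one.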

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): UVRC_PROM1 (A2C249)

Other databases:

- EMBL:   CP000553
- RefSeq:   YP_001014824.1
- ProteinModelPortal:   A2C249
- STRING:   A2C249
- GeneID:   4780129
- GenomeReviews:   CP000553_GR
- KEGG:   pme:NATL1_10011
- eggNOG:   COG0322
- HOGENOM:   HBG566029
- OMA:   DVLYVGK
- ProtClustDB:   PRK00558
- BioCyc:   PMAR167555:NATL1_10011-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00203
- InterPro:   IPR003583
- InterPro:   IPR010994
- InterPro:   IPR001943
- InterPro:   IPR009055
- InterPro:   IPR004791
- InterPro:   IPR001162
- InterPro:   IPR000305
- SMART:   SM00465
- SMART:   SM00278
- TIGRFAMs:   TIGR00194

Pfam domain/function: PF01541 GIY-YIG; PF02151 UVR; PF08459 UvrC_HhH_N; SSF47781 RuvA_2_like; SSF46600 UvrB_C; SSF82771 UvrC_N

EC number: NA

Molecular weight: Translated: 74222; Mature: 74222

Theoretical pI: Translated: 9.79; Mature: 9.79

Prosite motif: PS50151 UVR; PS50164 UVRC_1; PS50165 UVRC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
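
The Cys/Met percentages can be reproduced by counting residues. The sketch below assumes each percentage is the residue count divided by the total sequence length, rounded to one decimal place; `residue_percent` is an illustrative name, and the example uses a synthetic sequence rather than the protein in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


# Synthetic 640-residue sequence containing 7 cysteines.
toy = "C" * 7 + "A" * 633
print(round(residue_percent(toy, "C"), 1))  # 1.1
```

For a 640-residue protein, 7 cysteines give 1.09 %, which rounds to the 1.1 listed above; calling `residue_percent(seq, "CM")` on the full protein sequence would give the combined Cys+Met figure.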

Secondary structure:

>Translated Secondary Structure
MELIPLIRDKSRLSDFLKDIPNDPGCYLMKDGEDRLLYVGKSKKLRNRVRSYFRSGNELS
CCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCEEEECCCHHHHHHHHHHHHCCCCCC
PRISLMVRQVADIELIVTDNESEALTLESNLIKSHQPYFNVLLKDDKKYPYVCITWGDKY
HHHHHHHHHHCCEEEEEECCCCCEEEEHHHHHHCCCCCEEEEEECCCCCCEEEEEECCCC
PRIFLTRKRRQRQLKDKYYGPYVDVYLLRKTLFSIKKLFPLRQRRIPLYKDRTCLNYSIG
CEEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCC
RCPGVCQEEISSEDYKNTLKRVEMIFQGRTDELRILLEKQMISFSESLKFEEAGSVRDQL
CCCCHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHCCCHHHHH
KGIDRLYESQKMIIPDSSVCRDIIAMASEENISSVQIFQMRSGKLIGRLGYFSDNSNFNS
HHHHHHHHCCCEECCCHHHHHHHHHHHCCCCCCCEEEEEECCCCEEEEEEEECCCCCCCH
SQILQQVIENHYSNVDPVEIPSEILVQHQLVNNILISDWLSEIKKQKVNINVPKRSRKAE
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCHHHH
IIKLVEKNANLELQRIKQSHDKNLVELDDLTNILDLENIPKRIECYDISHIQGSDAVASQ
HHHHHHCCCCCHHHHHHHHHCCCCEEHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHH
VVFIDGIAARQHYRRYKIKSPNIKIGHSDDFESMAEVITRRFRRWARFKEEGGDINALLS
HHEECCHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECC
NQSSVLDNLNLNDWPDLVVIDGGKGQLSSVVAALEELKLDQNLNVISLAKKKEEVFIPNV
CCHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEEHHCCCCEECCCH
KQSLVTESNQPGMLLLRRLRDEAHRFAITFHRQKRSQRMKRSQLNEIPGLGPQRIKLLLE
HHHHHCCCCCCHHHHHHHHHHHHHHEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH
HFRSIEAIQMATFSELSSTPGLGRSTAVVIRNYFHPDKNK
HHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC
>Mature Secondary Structure
MELIPLIRDKSRLSDFLKDIPNDPGCYLMKDGEDRLLYVGKSKKLRNRVRSYFRSGNELS
CCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCEEEECCCHHHHHHHHHHHHCCCCCC
PRISLMVRQVADIELIVTDNESEALTLESNLIKSHQPYFNVLLKDDKKYPYVCITWGDKY
HHHHHHHHHHCCEEEEEECCCCCEEEEHHHHHHCCCCCEEEEEECCCCCCEEEEEECCCC
PRIFLTRKRRQRQLKDKYYGPYVDVYLLRKTLFSIKKLFPLRQRRIPLYKDRTCLNYSIG
CEEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCC
RCPGVCQEEISSEDYKNTLKRVEMIFQGRTDELRILLEKQMISFSESLKFEEAGSVRDQL
CCCCHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHCCCHHHHH
KGIDRLYESQKMIIPDSSVCRDIIAMASEENISSVQIFQMRSGKLIGRLGYFSDNSNFNS
HHHHHHHHCCCEECCCHHHHHHHHHHHCCCCCCCEEEEEECCCCEEEEEEEECCCCCCCH
SQILQQVIENHYSNVDPVEIPSEILVQHQLVNNILISDWLSEIKKQKVNINVPKRSRKAE
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCHHHH
IIKLVEKNANLELQRIKQSHDKNLVELDDLTNILDLENIPKRIECYDISHIQGSDAVASQ
HHHHHHCCCCCHHHHHHHHHCCCCEEHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHH
VVFIDGIAARQHYRRYKIKSPNIKIGHSDDFESMAEVITRRFRRWARFKEEGGDINALLS
HHEECCHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECC
NQSSVLDNLNLNDWPDLVVIDGGKGQLSSVVAALEELKLDQNLNVISLAKKKEEVFIPNV
CCHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEEHHCCCCEECCCH
KQSLVTESNQPGMLLLRRLRDEAHRFAITFHRQKRSQRMKRSQLNEIPGLGPQRIKLLLE
HHHHHCCCCCCHHHHHHHHHHHHHHEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH
HFRSIEAIQMATFSELSSTPGLGRSTAVVIRNYFHPDKNK
HHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC
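
The secondary structure blocks interleave residue lines with state lines. The sketch below separates the two and summarizes the state composition, assuming the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not define. Both function names are hypothetical.

```python
from collections import Counter


def split_interleaved(lines):
    """Split alternating residue/state lines into two joined strings."""
    rows = [ln.strip() for ln in lines]
    rows = [r for r in rows if r and not r.startswith(">")]  # skip headers
    return "".join(rows[0::2]), "".join(rows[1::2])


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts.get(s, 0) / len(states) for s in "HEC"}


# Toy block in the same layout as the record (header, residues, states, ...).
seq, states = split_interleaved([">Toy", "MELI", "CCHH", "PLIR", "HHEE"])
print(seq, states)             # MELIPLIR CCHHHHEE
print(ss_composition(states))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Checking that `len(seq) == len(states)` after splitting is a quick way to confirm that the two tracks stayed aligned.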

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA