Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

The map label for this gene is uvrC

Identifier: 49187411

GI number: 49187411

Start: 4324920

End: 4326704

Strand: Reverse

Name: uvrC

Synonym: BAS4416

Alternate gene names: 49187411

Gene position: 4326704-4324920 (Counterclockwise)

Preceding gene: 49187412

Following gene: 49187409

Centisome position: 82.75
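
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes the centisome position is the gene's 5' end (the End coordinate, because the gene lies on the reverse strand) expressed as a percentage of the chromosome length. Neither convention is stated in the record, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_228_663          # chromosome length from the record header
START, END = 4_324_920, 4_326_704  # Start and End fields of the record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# Assumption: on the reverse strand the gene's 5' end is the larger coordinate.
position = centisome(END, GENOME_LENGTH)
print(length, round(position, 2))
```

Under these assumptions the length comes out to 1785 bases and the centisome value rounds to 82.75, which agrees with the fields in this record.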

GC content: 35.97
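
GC content is presumably the percentage of G and C bases in the gene sequence listed in this record. A minimal sketch of that calculation follows. The function name is hypothetical, and the demonstration uses only the first 21 bases of the gene, so it does not reproduce the 35.97 figure reported for the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 21 bases of the gene sequence in this record (a short excerpt only).
prefix = "GTGCACGAACATTTAAAAGAA"
print(round(gc_content(prefix), 2))
```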

Gene sequence:

>1785_bases
GTGCACGAACATTTAAAAGAAAAATTGGCTATTTTACCAGATCAACCTGGTTGTTATTTAATGAAAGATAGGCAAGGAAC
GGTTATATATGTCGGAAAGGCAAAAGTGCTTAAAAATCGTGTGCGCTCGTACTTTACTGGTTCGCATGACGGGAAAACAC
TTCGGTTAGTAGGAGAAATTGTAGATTTTGAATATATTGTAACCTCCTCAAATCTAGAGGCGCTCATTTTGGAGTTAAAC
TTAATAAAAAAACATGACCCAAAATATAATATTCAATTAAAAGATGATAAAACATATCCTTTTATTAAAATTACAGCTGA
GAAACAACCGCGCTTACTTATTACGCGAAATGTAAAAAAGGATAAAGGAAAGTATTTTGGCCCTTATCCGAATGCACAAT
CAGCTCATGAAACGAAAAAACTGCTGGATCGTATGTATCCGCTTCGTAAGTGCTCAAATATGCCGGATAAAGTTTGTTTA
TATTATCATATGGGTCAATGTTTAGCACCTTGTGTGAAAGAAGTGACGGAAGAACAAAATAAAGAAATTGTAGATGAGAT
TATTAAGTTTTTAAATGGTGGGCATAAAGAAGTTCGTTCAGAATTAGAAACAAAAATGTATGAAGCTTCAGAGAAACTAG
AGTTTGAACGTGCAAAAGAGTTACGTGATCAAATCGCTCATATCGATGCGATTATGGAAAAACAAAAGATGATTATGAGT
GATTTAGTGGACCGTGATGTGTTTGGCTATGCAGTTGATAAAGGGTGGATGTGTGTTCAAGTTTTCTTCGTTCGGAAAGG
AAAGTTAATTGAACGTGATGTTTCTATGTTTCCAATATATGATGAACCAGAAGAGGGATTCTTAACGTTTATCGGTCAAT
TTTATGAAAACAGCAGTCATTTTAAGCCGAAAGAAATAGTTGTTCCAGGAAGTATAGACTCAGAATTAGTAGAACGCTTT
TTAGAAGTGGAAGCGACACAGCCGAAACGCGGTAAGAAAAAAGATCTTGTAGAACTGGCAAATAAAAATGCGAAGATTGC
CCTGGAAGAGAAATTCTATTTAATTGAACGTGATGAAGAGCGAACGATTAAAGCTGTAGAGAATTTAGGGAAGCAGCTCG
GAATTGAAACGCCTTATCGTATTGAAGCATTTGATAACTCAAATATTCAAGGGACAAATCCTGTTTCTGCAATGATTGCT
TTTATTGATGGGAAACCAGCTAAGAAAGAATACAGGAAATATAAAATTAAAACAGTTCAAGGACCAGATGATTATGAGTC
TATGAGAGAAGTTGTGAGACGCCGTTATACAAGGGCGCTGAAAGAGGGTTTACCTTTACCAGATTTAATCATTATTGATG
GCGGAAAAGGTCATCTGGCGGCTGCAAGTGATGTTCTAGAAAATGAGCTCGGTTTATATATTCCGATGGCAGGTCTTGTA
AAAGATGACAAACATAAAACATCTCATTTAATTATTGGAGATCCACCTGAACCTGTGATGCTGGAGAGAAATAGCCAAGA
ATTTTATTTATTGCAGCGTGTTCAAGATGAAGTGCATCGATTTGCAATTACATTTCATCGTCAATTACACGGGAAATCTG
TCATTCAATCAGCACTGGATGATATTCCTGGAATCGGTGATAAACGGAAAAAGGTATTGTTAAAACATTTTGGTTCATTA
AAGAAGATGAAAGAAGCTTCTATAGAGGAATTTGTCGAAGCAGGTATGCCGAAAAATGTCGCAGAGACGATTTATACTTA
TTTAACAGATAAGAAGACGTTGTAG

Upstream 100 bases:

>100_bases
CCTTTCCGTTACAATAAAAGAACGTATGTTGGGTGTTTTGGCATAGTATGTTTTCATGTGAAAATAAAAATGCTAAGCAT
GTAGAAATGGAGGGTAACGA

Downstream 100 bases:

>100_bases
TTTACAATGTCTTCTATTTTTTGGTATAATTTCTGAAGATTTAAAATAAATCCAAATGAATGAAAATACGAAAAAGTCAA
CTAAACTTATATGTCTAGTT

Product: excinuclease ABC subunit C

Products: NA

Alternate protein names: Protein uvrC; Excinuclease ABC subunit C

Number of amino acids: Translated: 594; Mature: 594
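
The residue count is consistent with the gene length: 1785 bases make 595 codons, that is 594 residues plus a stop codon. The gene also begins with GTG rather than ATG, while the protein begins with M. The sketch below illustrates both points. It assumes the standard bacterial genetic code and the usual convention that an alternative start codon is read as methionine. The helper names are hypothetical, and the short test sequence is the first seven codons of the gene followed by its final stop codon, not the full gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# A few start codons commonly used in bacteria (not an exhaustive list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading the start codon as Met."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("length is not a multiple of three")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon {codons[0]}")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single trailing stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


expected_residues = 1785 // 3 - 1  # codons minus the stop codon
excerpt = "GTGCACGAACATTTAAAAGAA" + "TAG"  # first 7 codons + final stop codon
print(expected_residues, translate_cds(excerpt))
```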

Protein sequence:

>594_residues
MHEHLKEKLAILPDQPGCYLMKDRQGTVIYVGKAKVLKNRVRSYFTGSHDGKTLRLVGEIVDFEYIVTSSNLEALILELN
LIKKHDPKYNIQLKDDKTYPFIKITAEKQPRLLITRNVKKDKGKYFGPYPNAQSAHETKKLLDRMYPLRKCSNMPDKVCL
YYHMGQCLAPCVKEVTEEQNKEIVDEIIKFLNGGHKEVRSELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMS
DLVDRDVFGYAVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSHFKPKEIVVPGSIDSELVERF
LEVEATQPKRGKKKDLVELANKNAKIALEEKFYLIERDEERTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIA
FIDGKPAKKEYRKYKIKTVQGPDDYESMREVVRRRYTRALKEGLPLPDLIIIDGGKGHLAAASDVLENELGLYIPMAGLV
KDDKHKTSHLIIGDPPEPVMLERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALDDIPGIGDKRKKVLLKHFGSL
KKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL

Sequences:

>Translated_594_residues
MHEHLKEKLAILPDQPGCYLMKDRQGTVIYVGKAKVLKNRVRSYFTGSHDGKTLRLVGEIVDFEYIVTSSNLEALILELN
LIKKHDPKYNIQLKDDKTYPFIKITAEKQPRLLITRNVKKDKGKYFGPYPNAQSAHETKKLLDRMYPLRKCSNMPDKVCL
YYHMGQCLAPCVKEVTEEQNKEIVDEIIKFLNGGHKEVRSELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMS
DLVDRDVFGYAVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSHFKPKEIVVPGSIDSELVERF
LEVEATQPKRGKKKDLVELANKNAKIALEEKFYLIERDEERTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIA
FIDGKPAKKEYRKYKIKTVQGPDDYESMREVVRRRYTRALKEGLPLPDLIIIDGGKGHLAAASDVLENELGLYIPMAGLV
KDDKHKTSHLIIGDPPEPVMLERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALDDIPGIGDKRKKVLLKHFGSL
KKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL
>Mature_594_residues
MHEHLKEKLAILPDQPGCYLMKDRQGTVIYVGKAKVLKNRVRSYFTGSHDGKTLRLVGEIVDFEYIVTSSNLEALILELN
LIKKHDPKYNIQLKDDKTYPFIKITAEKQPRLLITRNVKKDKGKYFGPYPNAQSAHETKKLLDRMYPLRKCSNMPDKVCL
YYHMGQCLAPCVKEVTEEQNKEIVDEIIKFLNGGHKEVRSELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMS
DLVDRDVFGYAVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSHFKPKEIVVPGSIDSELVERF
LEVEATQPKRGKKKDLVELANKNAKIALEEKFYLIERDEERTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIA
FIDGKPAKKEYRKYKIKTVQGPDDYESMREVVRRRYTRALKEGLPLPDLIIIDGGKGHLAAASDVLENELGLYIPMAGLV
KDDKHKTSHLIIGDPPEPVMLERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALDDIPGIGDKRKKVLLKHFGSL
KKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL

Specific function: The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrC incises both the 5' and 3' sides of the lesion. The N-terminal half is responsible for the 3' incision and the C-terminal half is responsible for the 5' incision.

COG id: COG0322

COG function: function code L; Nuclease subunit of the excinuclease complex

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 UVR domain

Homologues:

Organism=Escherichia coli, GI87081999, Length=612, Percent_Identity=37.4183006535948, Blast_Score=358, Evalue=1e-100,
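
The homologue entry above packs several fields into one comma-separated line. A minimal parsing sketch follows, assuming every entry uses this same `key=value` layout with a bare `GI…` token and a trailing comma. The function name and the `GI` dictionary key are illustrative, not part of the record format.

```python
def parse_homologue(line: str) -> dict:
    """Split a homologue line into a dict of field name -> string value."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number appears without an '=' sign in this record.
            record["GI"] = field[2:]
    return record


line = ("Organism=Escherichia coli, GI87081999, Length=612, "
        "Percent_Identity=37.4183006535948, Blast_Score=358, Evalue=1e-100,")
hit = parse_homologue(line)
print(hit["Organism"], round(float(hit["Percent_Identity"]), 1))
```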

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): UVRC_BACAA (C3PAA9)

Other databases:

- EMBL:   CP001598
- RefSeq:   YP_002868799.1
- ProteinModelPortal:   C3PAA9
- EnsemblBacteria:   EBBACT00000130561
- GeneID:   7850082
- GenomeReviews:   CP001598_GR
- KEGG:   bai:BAA_4772
- GeneTree:   EBGT00050000001864
- ProtClustDB:   PRK00558
- GO:   GO:0005737
- HAMAP:   MF_00203
- InterPro:   IPR003583
- InterPro:   IPR010994
- InterPro:   IPR001943
- InterPro:   IPR009055
- InterPro:   IPR004791
- InterPro:   IPR001162
- InterPro:   IPR000305
- SMART:   SM00465
- SMART:   SM00278
- TIGRFAMs:   TIGR00194

Pfam domain/function: PF01541 GIY-YIG; PF02151 UVR; PF08459 UvrC_HhH_N; SSF47781 RuvA_2_like; SSF46600 UvrB_C; SSF82771 UvrC_N

EC number: NA

Molecular weight: Translated: 68399; Mature: 68399

Theoretical pI: Translated: 8.46; Mature: 8.46

Prosite motif: PS50151 UVR; PS50164 UVRC_1; PS50165 UVRC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
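
These percentages are presumably residue counts divided by the protein length. A minimal sketch of that calculation is shown below. The function name is hypothetical, and the demonstration uses only the first 21 residues of the protein, so its values differ from the whole-protein figures above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given residue letters."""
    protein = protein.upper()
    if not protein:
        raise ValueError("protein is empty")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * hits / len(protein), 1)


# First 21 residues of the protein sequence in this record (excerpt only).
excerpt = "MHEHLKEKLAILPDQPGCYLM"
for label, letters in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(label, residue_percent(excerpt, letters))
```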

Secondary structure:

>Translated Secondary Structure
MHEHLKEKLAILPDQPGCYLMKDRQGTVIYVGKAKVLKNRVRSYFTGSHDGKTLRLVGEI
CCHHHHHHHHCCCCCCCEEEEECCCCCEEEECCHHHHHHHHHHHHCCCCCCCEEEEEHHH
VDFEYIVTSSNLEALILELNLIKKHDPKYNIQLKDDKTYPFIKITAEKQPRLLITRNVKK
HCEEEEEECCCCEEEEEEEHHHHCCCCCEEEEEECCCCCCEEEEECCCCCCEEEEECCCC
DKGKYFGPYPNAQSAHETKKLLDRMYPLRKCSNMPDKVCLYYHMGQCLAPCVKEVTEEQN
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHEEHHHHHHHHHHHHHHHHHHHHH
KEIVDEIIKFLNGGHKEVRSELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMS
HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DLVDRDVFGYAVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSH
HHHHCCHHHHEECCCCEEEEEEHHHCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCC
FKPKEIVVPGSIDSELVERFLEVEATQPKRGKKKDLVELANKNAKIALEEKFYLIERDEE
CCCCEEEECCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCEEEEECEEEEEECCHH
RTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIAFIDGKPAKKEYRKYKIKTVQ
HHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHHHEEHCCCCCCHHHHHHEEEEEEC
GPDDYESMREVVRRRYTRALKEGLPLPDLIIIDGGKGHLAAASDVLENELGLYIPMAGLV
CCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHCCCCEEEECHHHC
KDDKHKTSHLIIGDPPEPVMLERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALD
CCCCCCCCEEEECCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DIPGIGDKRKKVLLKHFGSLKKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL
HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCC
>Mature Secondary Structure
MHEHLKEKLAILPDQPGCYLMKDRQGTVIYVGKAKVLKNRVRSYFTGSHDGKTLRLVGEI
CCHHHHHHHHCCCCCCCEEEEECCCCCEEEECCHHHHHHHHHHHHCCCCCCCEEEEEHHH
VDFEYIVTSSNLEALILELNLIKKHDPKYNIQLKDDKTYPFIKITAEKQPRLLITRNVKK
HCEEEEEECCCCEEEEEEEHHHHCCCCCEEEEEECCCCCCEEEEECCCCCCEEEEECCCC
DKGKYFGPYPNAQSAHETKKLLDRMYPLRKCSNMPDKVCLYYHMGQCLAPCVKEVTEEQN
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHEEHHHHHHHHHHHHHHHHHHHHH
KEIVDEIIKFLNGGHKEVRSELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMS
HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DLVDRDVFGYAVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSH
HHHHCCHHHHEECCCCEEEEEEHHHCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCC
FKPKEIVVPGSIDSELVERFLEVEATQPKRGKKKDLVELANKNAKIALEEKFYLIERDEE
CCCCEEEECCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCEEEEECEEEEEECCHH
RTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIAFIDGKPAKKEYRKYKIKTVQ
HHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHHHEEHCCCCCCHHHHHHEEEEEEC
GPDDYESMREVVRRRYTRALKEGLPLPDLIIIDGGKGHLAAASDVLENELGLYIPMAGLV
CCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHCCCCEEEECHHHC
KDDKHKTSHLIIGDPPEPVMLERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALD
CCCCCCCCEEEECCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DIPGIGDKRKKVLLKHFGSLKKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL
HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCC
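
The state strings above can be summarised as a composition. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not define, and uses a hypothetical function name. The demonstration runs on the final state line only, not on the whole prediction.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each secondary-structure state (H, E, C)."""
    states = states.strip().upper()
    if not states:
        raise ValueError("state string is empty")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Final state line of the prediction in this record (excerpt only).
last_line = "HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCC"
print(ss_composition(last_line))
```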

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA