Definition Corynebacterium diphtheriae NCTC 13129 chromosome, complete genome.
Accession NC_002935
Length 2,488,635

Click here to switch to the map view.

The map label for this gene is uvrA2 [H]

Identifier: 38234591

GI number: 38234591

Start: 2081863

End: 2084454

Strand: Direct

Name: uvrA2 [H]

Synonym: DIP2031

Alternate gene names: 38234591

Gene position: 2081863-2084454 (Clockwise)

Preceding gene: 38234587

Following gene: 38234592

Centisome position: 83.65

GC content: 51.2

Gene sequence:

>2592_bases
ATGAGCTATTTACATGAACGCATCACAGACAAGGAAACTCATATCCAACCAAATGTTCGAGTAATGGGAGCACGCCAGAA
CAACCTGCAAAACGTTGATCTTTCTGTGCCCCGCGATGCTCTCGTCGTCTTCACTGGTGTTTCAGGATCGGGGAAATCCT
CTCTTGCCTTTGGAACCCTCTATGCAGAATCACAACGGCGTTATCTCGAATCGGTGGCCCCTTACGCCCGCCGTCTTATC
GACCAAGCCGGAGTGCCCGAGGTTGATTCCATCACAGGAATGCCTCCCGCTGTTGCCCTGCAACAACAGCGAGGCGGACA
AAATTCACGATCAACCGTTGGCAGTATCACCACCATCTCAAGTCTGGTTCGTATGTTGTATTCACGCGCCGGTCAGTATC
CTGAGCACCAGAGCATGCTCTACGCCGAGGATTTCTCCGCCAATACACCCCAAGGTGCATGCCCGAAATGCCATGGAATT
GGCAGGGTGTATGAAGTCGAAGAAAACCAGATGGTACCCGATCCCACCAAAACCATCCGTGAACGAGCAATAGCTTCCTG
GCCTACCGCCTGGCACGGTCACCAGCTTCGAGATGTCTTAGTGGCTCTGGGCTACGACGTGGATGTGCCCTGGAAAGATC
TTCCGAAAAAAGACCGCGACTGGATTCTTTACACTGAAGAAACACCGCACGTACCGGTCTACTCTCGAATGACGCTTGCT
GAAGCTCAAGCAGCCAAAAAAGCAGGCGCTGAGCCCTCTTACAACGGCACCTATGTGGGCGCAAAACGCTATGTGCTTGA
TACTTTTGCCAAAACTAAGAGTGCCTCTATGAAACGCAAGGTAGCACAGTTTCTCACATCCATCCCCTGCCCATCCTGTC
ACGGCAAACGCATCAAGGCAGAGGCACTTTCGGTAACTTTTGCAGGAGTAGATATTGCTGATTTCTCTCAGCTACCTTTG
CACGAGTTAGTAAACCTATTGGACGAAGAAGTTCAGCATGCTTCGAAGAAACTCGCCCTCGATGCTGATGGAACTACCCA
TGAAGCCGCACCAGATGGTCACAGAACACCCAATCATTCTGTGGAGAAACTTGCCACTACTGCTCGACTAGGAGCAGGAC
TCGTTGAGCGCTTACGCCCCATCATAGACCTCGGGCTAGGCTACCTTTCCCTCGATCGAATGACCCCCACTCTTTCCGGT
GGTGAACTGCAGCGTTTGCGCCTAGCTACTCAGTTATCATCTGAACTTTTCGGTGTGGTCTATGTATTAGATGAGCCTTC
TGCAGGTCTGCATCCTCAAGATATTCATGCGCTTCTCGGTGTACTTGATGGTCTCAAAGCGCGGGGAAACAGCCTTTTTG
TTGTGGAGCATTCCATTGAAGTAATGCACCATGCTGACTGGATTGTAGACGTGGGACCAGGAGCAGGTGAACAGGGTGGC
AAGGTTCTATACAGTGGCCAAGTAGAGGGCTTGGCTAAAGTCACGGATTCCGTCACCCGCGGCTATCTTTTTGGTGACAG
TGGGCTGCCACAACATACTCCTCGAAAGCCCAGTCAGTGGATGAAACTTTCGGGAGTTACCCGAAATAACCTCCATAATG
TGGATATAGAGATTCCCCTCGGGGTTCTTACCGCGGTCACCGGCGTCTCAGGTTCAGGTAAGTCCAGTTTGGTGAGTCAG
GCCCTGCCACAACTGGTAGGAAACCGTCTGGGTAAGAACATGTCAGGAGATGAAACCGACTTAGAAGCAGATGATCTCTT
AAGAGCCGAGCAGACTCCCGCAGTCAGCGGAAACATTGTGGGAGATTTCAGCAGTATCCACCGAGTTGTTGCAATCGATC
AGAAACCTATTGGGCGAACCCCTCGCTCAAATATTGCAACCTACACCGGGCTTTTCGATCATGTGCGACGTCGTTTTGCG
GAAACCGTAGAAGCCAAGCGTCGCCGCTATAAACCGGGCAGATTCTCGTTCAACGTCGCTGGCGGCAGATGCCCCACGTG
CGAGGGAGAAGGGTCCGTTATGGTTGAGCTGCTTTTCTTACCATCGGTCTATACAAAATGCCCTGATTGCCACGGCACTC
GTTATCAGTCAAGCACTTTGGAAATTCTGTGGCGAGGTAAGAACGTGTCAGAAGTTTTGGATTTGAATGTCAACGAAGCC
TTGGACTTTTTTGAGGGAGAGTTCGATATTATGCGTTCGTTGACTGCACTGAGGGATGTTGGATTGGGTTATCTACGTTT
GGGGCAGCCTGCTACAGAGTTATCAGGTGGTGAGGCTCAACGAGTCAAACTTGCCACCGAATTGCAGAGGGCTCAACGGG
GTGACACTCTCTATGTGCTAGATGAGCCCACTACAGGTTTACATTGCTCAGATGCGGACCGTTTGATATCTCATCTTCAG
ACTTTGGTGGATTCAGGTAACACGGTAGTGATGGTTGAACTTGATATGCGAATTATTGCAGCGGCGGATTATGTGATTGA
TATGGGCCCCGGCGCAGGAGAAGAAGGTGGGCAAATCGTGGCCGCTGGGACCCCAGCGGAGGTTGCGACTTCAGAAGAAA
GTGTTTCTGCACCATTTTTAGCGGCGCTTTGA

Upstream 100 bases:

>100_bases
AATCCTCGTCATATATCGACGAAATAGACAGTAAAGTTGCAAGCGGCAGGATTATGAGGCTTCCTGCACCTTACACAGAA
GCGATTGGTCAGATTTATCT

Downstream 100 bases:

>100_bases
GGTGAGAACCTATTTATGGTTTGGAAGGCTTATTTATAATCCATTCAAAACTGAGTACCGTATTGAATTTAGTGGATTCC
CTTGCCCCGGTTCATCTGGT

Product: excinuclease ABC subunit A

Products: NA

Alternate protein names: UvrA protein; Excinuclease ABC subunit A [H]

Number of amino acids: Translated: 863; Mature: 862

Protein sequence:

>863_residues
MSYLHERITDKETHIQPNVRVMGARQNNLQNVDLSVPRDALVVFTGVSGSGKSSLAFGTLYAESQRRYLESVAPYARRLI
DQAGVPEVDSITGMPPAVALQQQRGGQNSRSTVGSITTISSLVRMLYSRAGQYPEHQSMLYAEDFSANTPQGACPKCHGI
GRVYEVEENQMVPDPTKTIRERAIASWPTAWHGHQLRDVLVALGYDVDVPWKDLPKKDRDWILYTEETPHVPVYSRMTLA
EAQAAKKAGAEPSYNGTYVGAKRYVLDTFAKTKSASMKRKVAQFLTSIPCPSCHGKRIKAEALSVTFAGVDIADFSQLPL
HELVNLLDEEVQHASKKLALDADGTTHEAAPDGHRTPNHSVEKLATTARLGAGLVERLRPIIDLGLGYLSLDRMTPTLSG
GELQRLRLATQLSSELFGVVYVLDEPSAGLHPQDIHALLGVLDGLKARGNSLFVVEHSIEVMHHADWIVDVGPGAGEQGG
KVLYSGQVEGLAKVTDSVTRGYLFGDSGLPQHTPRKPSQWMKLSGVTRNNLHNVDIEIPLGVLTAVTGVSGSGKSSLVSQ
ALPQLVGNRLGKNMSGDETDLEADDLLRAEQTPAVSGNIVGDFSSIHRVVAIDQKPIGRTPRSNIATYTGLFDHVRRRFA
ETVEAKRRRYKPGRFSFNVAGGRCPTCEGEGSVMVELLFLPSVYTKCPDCHGTRYQSSTLEILWRGKNVSEVLDLNVNEA
LDFFEGEFDIMRSLTALRDVGLGYLRLGQPATELSGGEAQRVKLATELQRAQRGDTLYVLDEPTTGLHCSDADRLISHLQ
TLVDSGNTVVMVELDMRIIAAADYVIDMGPGAGEEGGQIVAAGTPAEVATSEESVSAPFLAAL

Sequences:

>Translated_863_residues
MSYLHERITDKETHIQPNVRVMGARQNNLQNVDLSVPRDALVVFTGVSGSGKSSLAFGTLYAESQRRYLESVAPYARRLI
DQAGVPEVDSITGMPPAVALQQQRGGQNSRSTVGSITTISSLVRMLYSRAGQYPEHQSMLYAEDFSANTPQGACPKCHGI
GRVYEVEENQMVPDPTKTIRERAIASWPTAWHGHQLRDVLVALGYDVDVPWKDLPKKDRDWILYTEETPHVPVYSRMTLA
EAQAAKKAGAEPSYNGTYVGAKRYVLDTFAKTKSASMKRKVAQFLTSIPCPSCHGKRIKAEALSVTFAGVDIADFSQLPL
HELVNLLDEEVQHASKKLALDADGTTHEAAPDGHRTPNHSVEKLATTARLGAGLVERLRPIIDLGLGYLSLDRMTPTLSG
GELQRLRLATQLSSELFGVVYVLDEPSAGLHPQDIHALLGVLDGLKARGNSLFVVEHSIEVMHHADWIVDVGPGAGEQGG
KVLYSGQVEGLAKVTDSVTRGYLFGDSGLPQHTPRKPSQWMKLSGVTRNNLHNVDIEIPLGVLTAVTGVSGSGKSSLVSQ
ALPQLVGNRLGKNMSGDETDLEADDLLRAEQTPAVSGNIVGDFSSIHRVVAIDQKPIGRTPRSNIATYTGLFDHVRRRFA
ETVEAKRRRYKPGRFSFNVAGGRCPTCEGEGSVMVELLFLPSVYTKCPDCHGTRYQSSTLEILWRGKNVSEVLDLNVNEA
LDFFEGEFDIMRSLTALRDVGLGYLRLGQPATELSGGEAQRVKLATELQRAQRGDTLYVLDEPTTGLHCSDADRLISHLQ
TLVDSGNTVVMVELDMRIIAAADYVIDMGPGAGEEGGQIVAAGTPAEVATSEESVSAPFLAAL
>Mature_862_residues
SYLHERITDKETHIQPNVRVMGARQNNLQNVDLSVPRDALVVFTGVSGSGKSSLAFGTLYAESQRRYLESVAPYARRLID
QAGVPEVDSITGMPPAVALQQQRGGQNSRSTVGSITTISSLVRMLYSRAGQYPEHQSMLYAEDFSANTPQGACPKCHGIG
RVYEVEENQMVPDPTKTIRERAIASWPTAWHGHQLRDVLVALGYDVDVPWKDLPKKDRDWILYTEETPHVPVYSRMTLAE
AQAAKKAGAEPSYNGTYVGAKRYVLDTFAKTKSASMKRKVAQFLTSIPCPSCHGKRIKAEALSVTFAGVDIADFSQLPLH
ELVNLLDEEVQHASKKLALDADGTTHEAAPDGHRTPNHSVEKLATTARLGAGLVERLRPIIDLGLGYLSLDRMTPTLSGG
ELQRLRLATQLSSELFGVVYVLDEPSAGLHPQDIHALLGVLDGLKARGNSLFVVEHSIEVMHHADWIVDVGPGAGEQGGK
VLYSGQVEGLAKVTDSVTRGYLFGDSGLPQHTPRKPSQWMKLSGVTRNNLHNVDIEIPLGVLTAVTGVSGSGKSSLVSQA
LPQLVGNRLGKNMSGDETDLEADDLLRAEQTPAVSGNIVGDFSSIHRVVAIDQKPIGRTPRSNIATYTGLFDHVRRRFAE
TVEAKRRRYKPGRFSFNVAGGRCPTCEGEGSVMVELLFLPSVYTKCPDCHGTRYQSSTLEILWRGKNVSEVLDLNVNEAL
DFFEGEFDIMRSLTALRDVGLGYLRLGQPATELSGGEAQRVKLATELQRAQRGDTLYVLDEPTTGLHCSDADRLISHLQT
LVDSGNTVVMVELDMRIIAAADYVIDMGPGAGEEGGQIVAAGTPAEVATSEESVSAPFLAAL

Specific function: The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrA is an ATPase and a DNA-binding protein. A damage recognition complex composed of 2 UvrA and 2 UvrB subunits scans DNA for abnormalities. When the presence of a lesion h

COG id: COG0178

COG function: function code L; Excinuclease ATPase subunit

Gene ontology:

Cell location: Cytoplasm [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 ABC transporter domain [H]

Homologues:

Organism=Escherichia coli, GI2367343, Length=724, Percent_Identity=37.9834254143646, Blast_Score=474, Evalue=1e-135,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017871 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 93715; Mature: 93584

Theoretical pI: Translated: 6.29; Mature: 6.29

Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSYLHERITDKETHIQPNVRVMGARQNNLQNVDLSVPRDALVVFTGVSGSGKSSLAFGTL
CCHHHHHHCCCHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEEHH
YAESQRRYLESVAPYARRLIDQAGVPEVDSITGMPPAVALQQQRGGQNSRSTVGSITTIS
HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHH
SLVRMLYSRAGQYPEHQSMLYAEDFSANTPQGACPKCHGIGRVYEVEENQMVPDPTKTIR
HHHHHHHHHCCCCCCHHCEEEEECCCCCCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHH
ERAIASWPTAWHGHQLRDVLVALGYDVDVPWKDLPKKDRDWILYTEETPHVPVYSRMTLA
HHHHHCCCCCCCCHHHHHHHHHHCCCCCCCHHHCCCCCCCEEEEECCCCCCCCHHHHHHH
EAQAAKKAGAEPSYNGTYVGAKRYVLDTFAKTKSASMKRKVAQFLTSIPCPSCHGKRIKA
HHHHHHHCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE
EALSVTFAGVDIADFSQLPLHELVNLLDEEVQHASKKLALDADGTTHEAAPDGHRTPNHS
EEEEEEEECCCHHCCCCCCHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCCCHH
VEKLATTARLGAGLVERLRPIIDLGLGYLSLDRMTPTLSGGELQRLRLATQLSSELFGVV
HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHCCCCCCCCCHHHHHHHHHHHHHHHEEEE
YVLDEPSAGLHPQDIHALLGVLDGLKARGNSLFVVEHSIEVMHHADWIVDVGPGAGEQGG
EEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHCCCEEEECCCCCCCCCC
KVLYSGQVEGLAKVTDSVTRGYLFGDSGLPQHTPRKPSQWMKLSGVTRNNLHNVDIEIPL
EEEECCCCCHHHHHHHHHHCCEEECCCCCCCCCCCCCHHHHEECCCCCCCCEEEEEEECH
GVLTAVTGVSGSGKSSLVSQALPQLVGNRLGKNMSGDETDLEADDLLRAEQTPAVSGNIV
HHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCCCCEE
GDFSSIHRVVAIDQKPIGRTPRSNIATYTGLFDHVRRRFAETVEAKRRRYKPGRFSFNVA
ECHHHCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEC
GGRCPTCEGEGSVMVELLFLPSVYTKCPDCHGTRYQSSTLEILWRGKNVSEVLDLNVNEA
CCCCCCCCCCCCEEEEEEHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHHHHHCCCHHHH
LDFFEGEFDIMRSLTALRDVGLGYLRLGQPATELSGGEAQRVKLATELQRAQRGDTLYVL
HHHHCCHHHHHHHHHHHHHCCCCEEECCCCHHHCCCCCHHHHHHHHHHHHHHCCCEEEEE
DEPTTGLHCSDADRLISHLQTLVDSGNTVVMVELDMRIIAAADYVIDMGPGAGEEGGQIV
ECCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCEEEEEECEEEECCCCCCCCCCEEE
AAGTPAEVATSEESVSAPFLAAL
ECCCCHHHCCCCHHCCCCHHHCC
>Mature Secondary Structure 
SYLHERITDKETHIQPNVRVMGARQNNLQNVDLSVPRDALVVFTGVSGSGKSSLAFGTL
CHHHHHHCCCHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEEHH
YAESQRRYLESVAPYARRLIDQAGVPEVDSITGMPPAVALQQQRGGQNSRSTVGSITTIS
HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHH
SLVRMLYSRAGQYPEHQSMLYAEDFSANTPQGACPKCHGIGRVYEVEENQMVPDPTKTIR
HHHHHHHHHCCCCCCHHCEEEEECCCCCCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHH
ERAIASWPTAWHGHQLRDVLVALGYDVDVPWKDLPKKDRDWILYTEETPHVPVYSRMTLA
HHHHHCCCCCCCCHHHHHHHHHHCCCCCCCHHHCCCCCCCEEEEECCCCCCCCHHHHHHH
EAQAAKKAGAEPSYNGTYVGAKRYVLDTFAKTKSASMKRKVAQFLTSIPCPSCHGKRIKA
HHHHHHHCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE
EALSVTFAGVDIADFSQLPLHELVNLLDEEVQHASKKLALDADGTTHEAAPDGHRTPNHS
EEEEEEEECCCHHCCCCCCHHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCCCHH
VEKLATTARLGAGLVERLRPIIDLGLGYLSLDRMTPTLSGGELQRLRLATQLSSELFGVV
HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHCCCCCCCCCHHHHHHHHHHHHHHHEEEE
YVLDEPSAGLHPQDIHALLGVLDGLKARGNSLFVVEHSIEVMHHADWIVDVGPGAGEQGG
EEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHCCCEEEECCCCCCCCCC
KVLYSGQVEGLAKVTDSVTRGYLFGDSGLPQHTPRKPSQWMKLSGVTRNNLHNVDIEIPL
EEEECCCCCHHHHHHHHHHCCEEECCCCCCCCCCCCCHHHHEECCCCCCCCEEEEEEECH
GVLTAVTGVSGSGKSSLVSQALPQLVGNRLGKNMSGDETDLEADDLLRAEQTPAVSGNIV
HHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCCCCEE
GDFSSIHRVVAIDQKPIGRTPRSNIATYTGLFDHVRRRFAETVEAKRRRYKPGRFSFNVA
ECHHHCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEC
GGRCPTCEGEGSVMVELLFLPSVYTKCPDCHGTRYQSSTLEILWRGKNVSEVLDLNVNEA
CCCCCCCCCCCCEEEEEEHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHHHHHCCCHHHH
LDFFEGEFDIMRSLTALRDVGLGYLRLGQPATELSGGEAQRVKLATELQRAQRGDTLYVL
HHHHCCHHHHHHHHHHHHHCCCCEEECCCCHHHCCCCCHHHHHHHHHHHHHHCCCEEEEE
DEPTTGLHCSDADRLISHLQTLVDSGNTVVMVELDMRIIAAADYVIDMGPGAGEEGGQIV
ECCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCEEEEEECEEEECCCCCCCCCCEEE
AAGTPAEVATSEESVSAPFLAAL
ECCCCHHHCCCCHHCCCCHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: Hydrolase; Acting on ester bonds [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8269961 [H]