Definition Treponema pallidum subsp. pallidum SS14, complete genome.
Accession NC_010741
Length 1,139,457

The map label for this gene is uvrD [H]

Identifier: 189026251

GI number: 189026251

Start: 1122474

End: 1124486

Strand: Reverse

Name: uvrD [H]

Synonym: TPASS_1028

Alternate gene names: 189026251

Gene position: 1124486-1122474 (Counterclockwise)

Preceding gene: 189026252

Following gene: 189026248

Centisome position: 98.69
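
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates. It also assumes that the centisome position is the gene's first transcribed base (the higher coordinate, since the gene is on the reverse strand) expressed as a percentage of the genome length. That definition is inferred from the numbers in this record, not stated by it, and the function names are illustrative.

```python
GENOME_LENGTH = 1_139_457  # length of NC_010741, from the record header


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


print(gene_length(1_122_474, 1_124_486))              # 2013
print(round(centisome(1_124_486, GENOME_LENGTH), 2))  # 98.69
```

Both results agree with the gene sequence header (2013 bases) and the centisome position listed in this record.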

GC content: 55.39

Gene sequence:

>2013_bases
GTGGAATCCTACCTGAGCGCGCTCAATGAGGCGCAGCGTCAGGCCGTTTGCCATTATGGCAGCCCGCTTCTTATCCTTGC
CGGCGCAGGCTCAGGAAAGACGCGCGTTATCACCACCAAAATCGCCCATCTTATCCGTTCCCGGCAGGTTCGCCCCGAGC
AGATTCTGGCAGTAACCTTTACCAATAAAGCGGCGCGTGAGATGCGCACGCGTGCCTGCGCGCTTGAGTCTGCCGCGCAG
GGGGCAACTATCTGTACCTTCCACGCACTGGGGGTGTGGATCCTGCGTCGCTATGCAGTCCGTCTGGGATTGAACCCCCA
TTTTAGTATTTATGACGACCATGACGTCCGTGCACTCTTGCCAAAAATCCTGCCTCATTGCGATCACAGTCGGGCAGGCA
TGCTCGCGCGTGGAATTTCTCAGGCAAAAGACTATGGGCTCGACTGCGCCTCGTTTGAGTCAGTGCACGCGCGTGTCTCC
GCTCCTGCATGCGCTGCCAGAGCCGTTCTGGGTGACAGGCAGTTTGCGCACGCATATGCGTGCTACCATCGGCGTATGCG
CGAAATGGGAACGGTAGACTTTGGGGATCTGATTATGCTTCCGGTGCAGCTCTTGCGTGAGCACCAGGACGTCGCCGAAC
AGCTGCATGCACGGTGGCAGGTGGTCATGGTAGATGAGTATCAAGACTCAAACGTGGCGCAGTTTCATTTCTTGCAGGTG
CTCACCGGTGCGCACACCTATCTTTGTGTGGTAGGGGACGACGATCAGTCCATCTATCGCTTTCGCGGAGCAGAGGTAAA
AAATATCTTGACCTTCCCTGAGTTCTTTCAAAATACCCAGATTATCCGCCTGGAGTACAACTACCGGTCCACAGACGCAA
TTCTGCGTGTTGCTGATTCGGTAGTGAAAAAAAACCAAGACCGCTTAGGAAAGGCGCTGATTGCCCAGCGCACGGGAGGT
ACTAAGCCGCGCCTGTTCTTGCTGAATAATCAAGATGAAGAAGCTGCGCTGTGCGTGCACCTCATTCAAGAAGCGCGTGC
GCGCGGCATCCCATACGCGGATTGGGCGATTTTATATCGGGTAAATGCACAGTCGCTGAGTTTTGAACAGTGTTTTTTGC
GGAATCGCATTCCGTATCGCATTGTCGGCACGCTCAAATTCTACAGTCGCGCAGAGGTAAAAGACGTGCTGGCGTTTCTC
CAGCTCATAGTCAATGGCTCAGATGAACTGGCCCTCCGGCGGGTGCTGAATAAGCCGCCTCGGGGCATTGGAGAAAAGAC
ACAAGACGCATTGTTTGTCTGTGCACAGCAGGCAGCCATAACTGATTTTACCACACTCCAGTCCACCCACCTGACCGCGC
TTGGCACGCGTGCGCGGCAAAAGGTCAGTAGCTTTCTGTCGCTGTTACGTGCGCTGCGTGCACGCATGCCACAGGCCCCC
GCAGCGGGAGAAGAAGCTCGCACCAGTGCGCCGCCTGAGGAGCCGGTAGAGGAGCGCACCCACGATGCAGAAGGACTTGC
GCGCTTTGTTTCTGTGGTAATGGAACACACGGGGCTGGAAGAATGGTATCGACAGAAGGATGAGGAAGAAGGGACGCAGT
GCGCGGTCAACGTGCAGGAGTTAATGAACGCAGCGTCACTGTATGCATGTTCGCATGAGGGGTTAGTGAGTTTCCTAGAA
CACATCCAATTGGACCAAAATATGGCCGACGAGGGAGGAGCGGCTGACGCAGTGCACTTGATCACCATTCACAATACAAA
AGGGCTGGAGTTTCGACGGGTGATTCTGACCGGACTAGAGAACGGGGTGTTTCCGCGTGATGACGAAGCAGATATACAGG
AAGAGCGGCGTTTGATGTACGTTGCCTGCACGCGGGCTATGGATAGCTTATACCTTACCGCGTGTGCGTACCGCAGAATG
TGGGGACGACACACGGCGATGAAGCCCAGCCGCTTTTTGACCGAGCTTGACTCCGCGCTGTTAGAAATAACCGATCCGCG
ACACTTTGCCTAA
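
The GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows, assuming only unambiguous bases (A, C, G, T) are counted. The function name is illustrative, and the exact rounding used by the database is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases A, C, G, T."""
    # Remove the line breaks introduced by FASTA-style wrapping.
    seq = "".join(seq.split()).upper()
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# Toy input; paste the full 2013-base sequence to check the reported value.
print(gc_content("GGCC\nAATT"))  # 50.0
```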

Upstream 100 bases:

>100_bases
CCCGCCCTGGAAGAATAAGGAACATGCCGTCAGTGCAGGGCGCCGCCCGGGCGCGGTGGCACTCGGTGCGGTGCGGCAGT
TTTCGGTTGCAGGGCTGCGC

Downstream 100 bases:

>100_bases
AGCACTGCAGTGCGCCTAGCGCGGGGAGTCTTGGGTGATGAGTAATTTGACCGCTTCGACTGCAACCCGTATGGCGTTTT
CGGTGTCGTGAACTTGAATG

Product: DNA helicase II

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 670; Mature: 670
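
The residue count matches the gene length: 670 codons plus one TAA stop codon give 670 × 3 + 3 = 2013 bases. The gene starts with GTG rather than ATG. GTG encodes valine inside a reading frame but is read as methionine when it serves as a bacterial start codon, which is why the protein begins with M. The sketch below builds the standard codon table and applies that start-codon rule. Treating only GTG and TTG as alternative starts is a simplifying assumption, and the function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only these alternative start codons are handled.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(cds), 3):
        codon = cds[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and codon in ALT_STARTS:
            residue = "M"  # alternative start codons initiate with methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGAATCCTACCTGAGCGCGCTCAATGAG"))  # MESYLSALNE
```

The output matches the first ten residues of the protein sequence listed in this record.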

Protein sequence:

>670_residues
MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTFTNKAAREMRTRACALESAAQ
GATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALLPKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVS
APACAARAVLGDRQFAHAYACYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV
LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADSVVKKNQDRLGKALIAQRTGG
TKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYRVNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFL
QLIVNGSDELALRRVLNKPPRGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP
AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQELMNAASLYACSHEGLVSFLE
HIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLENGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRM
WGRHTAMKPSRFLTELDSALLEITDPRHFA

Specific function: Essential helicase. May act as a helicase in plasmid pT181 replication [H]

COG id: COG0210

COG function: function code L; Superfamily I DNA and RNA helicases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 uvrD-like helicase C-terminal domain [H]

Homologues:

Organism=Escherichia coli, GI2367296, Length=673, Percent_Identity=34.6210995542348, Blast_Score=362, Evalue=1e-101,
Organism=Escherichia coli, GI48994965, Length=672, Percent_Identity=34.9702380952381, Blast_Score=350, Evalue=2e-97,
Organism=Escherichia coli, GI1787196, Length=368, Percent_Identity=27.445652173913, Blast_Score=97, Evalue=4e-21,
Organism=Saccharomyces cerevisiae, GI6322369, Length=759, Percent_Identity=24.7694334650856, Blast_Score=160, Evalue=5e-40,
Organism=Saccharomyces cerevisiae, GI6324477, Length=679, Percent_Identity=22.2385861561119, Blast_Score=115, Evalue=3e-26,
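
Each homologue line above is a comma-separated list of key=value pairs, with the GI number given as a bare token and a trailing comma at the end of the line. The sketch below shows one way such a line might be parsed into typed fields. The function name and the "GI" key are illustrative and not part of the database format.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of typed fields."""
    fields = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue  # skip the empty piece left by the trailing comma
        key, sep, value = part.partition("=")
        if sep:
            fields[key] = value
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare token such as GI2367296
    # Convert the numeric fields when they are present.
    for name, cast in (("Length", int), ("Percent_Identity", float),
                       ("Blast_Score", int), ("Evalue", float)):
        if name in fields:
            fields[name] = cast(fields[name])
    return fields


record = parse_homologue(
    "Organism=Escherichia coli, GI2367296, Length=673, "
    "Percent_Identity=34.6210995542348, Blast_Score=362, Evalue=1e-101,"
)
print(record["Organism"], record["GI"], round(record["Percent_Identity"], 2))
# Escherichia coli 2367296 34.62
```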

Paralogues:

None

Copy number: 3000 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005751
- InterPro:   IPR013986
- InterPro:   IPR014017
- InterPro:   IPR000212
- InterPro:   IPR014016 [H]

Pfam domain/function: PF00580 UvrD-helicase [H]

EC number: 3.6.4.12 [H]

Molecular weight: Translated: 75401; Mature: 75401

Theoretical pI: Translated: 7.46; Mature: 7.46

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.2 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
2.2 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)
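
The percentages above are residue counts divided by the protein length. For a 670-residue protein, 2.2 % would correspond to 15 residues and 4.5 % to 30 residues. Those counts are inferred from the rounded percentages rather than stated in the record. The sketch below computes such a percentage for any set of one-letter residue codes; the function name and the one-decimal rounding are assumptions.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(aa in residues for aa in protein)
    return round(100.0 * count / len(protein), 1)


# Toy input; paste the full 670-residue sequence to check the reported values.
toy = "MCAA"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```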

Secondary structure:

>Translated Secondary Structure
MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTF
CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEEHHHHHHHHHHHCCCCCHHEEEEEE
TNKAAREMRTRACALESAAQGATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALL
CCHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHH
PKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVSAPACAARAVLGDRQFAHAYA
HHHCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHH
CYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHHHH
LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADS
HHCCCEEEEEEECCCCHHHHHCCCCHHHHHHCHHHHCCCEEEEEEECCCCHHHHHHHHHH
VVKKNQDRLGKALIAQRTGGTKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYR
HHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEE
VNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFLQLIVNGSDELALRRVLNKPP
ECCCCCCHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCC
RGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQE
CCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHCCHHHHHHCCCCCCCHHEEEEHHH
LMNAASLYACSHEGLVSFLEHIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLE
HHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHC
NGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRMWGRHTAMKPSRFLTELDSAL
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH
LEITDPRHFA
HEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8232203 [H]