Definition | Treponema pallidum subsp. pallidum SS14, complete genome. |
---|---|
Accession | NC_010741 |
Length | 1,139,457 |
The map label for this gene is uvrD [H]
Identifier: 189026251
GI number: 189026251
Start: 1122474
End: 1124486
Strand: Reverse
Name: uvrD [H]
Synonym: TPASS_1028
Alternate gene names: 189026251
Gene position: 1124486-1122474 (Counterclockwise)
Preceding gene: 189026252
Following gene: 189026248
Centisome position: 98.69
GC content: 55.39
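The positional fields above are related by simple arithmetic. The sketch below is an illustration, not the database's documented method. It assumes inclusive 1-based coordinates, and it assumes the centisome position is the gene's first base on the reverse strand (the End coordinate) expressed as a percentage of the genome length, which reproduces the listed 98.69. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 1_139_457
START, END = 1_122_474, 1_124_486


def gene_length(start: int, end: int) -> int:
    """Length in bases for inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100 * position / genome_length, 2)


length_bp = gene_length(START, END)             # 2013 bases
codons = length_bp // 3                         # 671 codons including the stop
residues = codons - 1                           # 670 residues once the stop is dropped
gene_centisome = centisome(END, GENOME_LENGTH)  # 98.69

print(length_bp, residues, gene_centisome)
```

The 2013 bases and 670 residues agree with the sequence headers in this record.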
Gene sequence:
>2013_bases GTGGAATCCTACCTGAGCGCGCTCAATGAGGCGCAGCGTCAGGCCGTTTGCCATTATGGCAGCCCGCTTCTTATCCTTGC CGGCGCAGGCTCAGGAAAGACGCGCGTTATCACCACCAAAATCGCCCATCTTATCCGTTCCCGGCAGGTTCGCCCCGAGC AGATTCTGGCAGTAACCTTTACCAATAAAGCGGCGCGTGAGATGCGCACGCGTGCCTGCGCGCTTGAGTCTGCCGCGCAG GGGGCAACTATCTGTACCTTCCACGCACTGGGGGTGTGGATCCTGCGTCGCTATGCAGTCCGTCTGGGATTGAACCCCCA TTTTAGTATTTATGACGACCATGACGTCCGTGCACTCTTGCCAAAAATCCTGCCTCATTGCGATCACAGTCGGGCAGGCA TGCTCGCGCGTGGAATTTCTCAGGCAAAAGACTATGGGCTCGACTGCGCCTCGTTTGAGTCAGTGCACGCGCGTGTCTCC GCTCCTGCATGCGCTGCCAGAGCCGTTCTGGGTGACAGGCAGTTTGCGCACGCATATGCGTGCTACCATCGGCGTATGCG CGAAATGGGAACGGTAGACTTTGGGGATCTGATTATGCTTCCGGTGCAGCTCTTGCGTGAGCACCAGGACGTCGCCGAAC AGCTGCATGCACGGTGGCAGGTGGTCATGGTAGATGAGTATCAAGACTCAAACGTGGCGCAGTTTCATTTCTTGCAGGTG CTCACCGGTGCGCACACCTATCTTTGTGTGGTAGGGGACGACGATCAGTCCATCTATCGCTTTCGCGGAGCAGAGGTAAA AAATATCTTGACCTTCCCTGAGTTCTTTCAAAATACCCAGATTATCCGCCTGGAGTACAACTACCGGTCCACAGACGCAA TTCTGCGTGTTGCTGATTCGGTAGTGAAAAAAAACCAAGACCGCTTAGGAAAGGCGCTGATTGCCCAGCGCACGGGAGGT ACTAAGCCGCGCCTGTTCTTGCTGAATAATCAAGATGAAGAAGCTGCGCTGTGCGTGCACCTCATTCAAGAAGCGCGTGC GCGCGGCATCCCATACGCGGATTGGGCGATTTTATATCGGGTAAATGCACAGTCGCTGAGTTTTGAACAGTGTTTTTTGC GGAATCGCATTCCGTATCGCATTGTCGGCACGCTCAAATTCTACAGTCGCGCAGAGGTAAAAGACGTGCTGGCGTTTCTC CAGCTCATAGTCAATGGCTCAGATGAACTGGCCCTCCGGCGGGTGCTGAATAAGCCGCCTCGGGGCATTGGAGAAAAGAC ACAAGACGCATTGTTTGTCTGTGCACAGCAGGCAGCCATAACTGATTTTACCACACTCCAGTCCACCCACCTGACCGCGC TTGGCACGCGTGCGCGGCAAAAGGTCAGTAGCTTTCTGTCGCTGTTACGTGCGCTGCGTGCACGCATGCCACAGGCCCCC GCAGCGGGAGAAGAAGCTCGCACCAGTGCGCCGCCTGAGGAGCCGGTAGAGGAGCGCACCCACGATGCAGAAGGACTTGC GCGCTTTGTTTCTGTGGTAATGGAACACACGGGGCTGGAAGAATGGTATCGACAGAAGGATGAGGAAGAAGGGACGCAGT GCGCGGTCAACGTGCAGGAGTTAATGAACGCAGCGTCACTGTATGCATGTTCGCATGAGGGGTTAGTGAGTTTCCTAGAA CACATCCAATTGGACCAAAATATGGCCGACGAGGGAGGAGCGGCTGACGCAGTGCACTTGATCACCATTCACAATACAAA AGGGCTGGAGTTTCGACGGGTGATTCTGACCGGACTAGAGAACGGGGTGTTTCCGCGTGATGACGAAGCAGATATACAGG AAGAGCGGCGTTTGATGTACGTTGCCTGCACGCGGGCTATGGATAGCTTATACCTTACCGCGTGTGCGTACCGCAGAATG 
TGGGGACGACACACGGCGATGAAGCCCAGCCGCTTTTTGACCGAGCTTGACTCCGCGCTGTTAGAAATAACCGATCCGCG ACACTTTGCCTAA
Upstream 100 bases:
>100_bases CCCGCCCTGGAAGAATAAGGAACATGCCGTCAGTGCAGGGCGCCGCCCGGGCGCGGTGGCACTCGGTGCGGTGCGGCAGT TTTCGGTTGCAGGGCTGCGC
Downstream 100 bases:
>100_bases AGCACTGCAGTGCGCCTAGCGCGGGGAGTCTTGGGTGATGAGTAATTTGACCGCTTCGACTGCAACCCGTATGGCGTTTT CGGTGTCGTGAACTTGAATG
Product: DNA helicase II
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 670; Mature: 670
Protein sequence:
>670_residues MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTFTNKAAREMRTRACALESAAQ GATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALLPKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVS APACAARAVLGDRQFAHAYACYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADSVVKKNQDRLGKALIAQRTGG TKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYRVNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFL QLIVNGSDELALRRVLNKPPRGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQELMNAASLYACSHEGLVSFLE HIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLENGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRM WGRHTAMKPSRFLTELDSALLEITDPRHFA
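The gene sequence begins with GTG, yet the protein begins with M. In the bacterial genetic code (NCBI translation table 11) GTG can serve as an initiation codon, in which case it is read as methionine. The following minimal sketch shows this on the first 30 and last 21 bases of the gene sequence listed above. The function name `translate_cds` and the reduced set of start codons are assumptions made for illustration; table 11 allows further start codons that are omitted here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as Met."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "GTGGAATCCTACCTGAGCGCGCTCAATGAG"  # start of the gene sequence
last_21 = "GATCCGCGACACTTTGCCTAA"            # end of the gene sequence

print(translate_cds(first_30))  # MESYLSALNE
print(translate_cds(last_21))   # DPRHFA
```

Both fragments match the corresponding ends of the 670-residue protein sequence.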
Sequences:
>Translated_670_residues MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTFTNKAAREMRTRACALESAAQ GATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALLPKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVS APACAARAVLGDRQFAHAYACYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADSVVKKNQDRLGKALIAQRTGG TKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYRVNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFL QLIVNGSDELALRRVLNKPPRGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQELMNAASLYACSHEGLVSFLE HIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLENGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRM WGRHTAMKPSRFLTELDSALLEITDPRHFA >Mature_670_residues MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTFTNKAAREMRTRACALESAAQ GATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALLPKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVS APACAARAVLGDRQFAHAYACYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADSVVKKNQDRLGKALIAQRTGG TKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYRVNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFL QLIVNGSDELALRRVLNKPPRGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQELMNAASLYACSHEGLVSFLE HIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLENGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRM WGRHTAMKPSRFLTELDSALLEITDPRHFA
Specific function: Essential helicase. May act as a helicase in plasmid pT181 replication [H]
COG id: COG0210
COG function: function code L; Superfamily I DNA and RNA helicases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 uvrD-like helicase C-terminal domain [H]
Homologues:
Organism=Escherichia coli, GI2367296, Length=673, Percent_Identity=34.6210995542348, Blast_Score=362, Evalue=1e-101
Organism=Escherichia coli, GI48994965, Length=672, Percent_Identity=34.9702380952381, Blast_Score=350, Evalue=2e-97
Organism=Escherichia coli, GI1787196, Length=368, Percent_Identity=27.445652173913, Blast_Score=97, Evalue=4e-21
Organism=Saccharomyces cerevisiae, GI6322369, Length=759, Percent_Identity=24.7694334650856, Blast_Score=160, Evalue=5e-40
Organism=Saccharomyces cerevisiae, GI6324477, Length=679, Percent_Identity=22.2385861561119, Blast_Score=115, Evalue=3e-26
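Each homologue entry is a run of comma-separated fields that starts with `Organism=`. A minimal sketch of turning such entries into structured records follows. The function name `parse_homologues` and the output keys are hypothetical, and the sample text is the first two entries copied from this record.

```python
def parse_homologues(text: str) -> list[dict]:
    """Split homologue text into one dict per 'Organism=' entry."""
    records = []
    for chunk in text.split("Organism=")[1:]:
        parts = [p.strip() for p in chunk.split(",") if p.strip()]
        # The GI field is the only one written without an '=' sign.
        gi = next(p[2:] for p in parts[1:] if p.startswith("GI") and "=" not in p)
        fields = dict(p.split("=", 1) for p in parts if "=" in p)
        records.append({
            "organism": parts[0],
            "gi": gi,
            "length": int(fields["Length"]),
            "percent_identity": round(float(fields["Percent_Identity"]), 1),
            "blast_score": int(fields["Blast_Score"]),
            "evalue": float(fields["Evalue"]),
        })
    return records


sample = (
    "Organism=Escherichia coli, GI2367296, Length=673, "
    "Percent_Identity=34.6210995542348, Blast_Score=362, Evalue=1e-101, "
    "Organism=Escherichia coli, GI48994965, Length=672, "
    "Percent_Identity=34.9702380952381, Blast_Score=350, Evalue=2e-97,"
)

homologues = parse_homologues(sample)
for record in homologues:
    print(record["organism"], record["gi"], record["percent_identity"], record["evalue"])
```

Rounding the percent identity to one decimal place is a presentation choice in this sketch; the record itself keeps the full precision.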
Paralogues:
None
Copy number: 3000 [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR005751
- InterPro: IPR013986
- InterPro: IPR014017
- InterPro: IPR000212
- InterPro: IPR014016 [H]
Pfam domain/function: PF00580 UvrD-helicase [H]
EC number: 3.6.4.12 [H]
Molecular weight: Translated: 75401; Mature: 75401
Theoretical pI: Translated: 7.46; Mature: 7.46
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.2 %Cys (Translated Protein)
2.2 %Met (Translated Protein)
4.5 %Cys+Met (Translated Protein)
2.2 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
4.5 %Cys+Met (Mature Protein)
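The combined Cys+Met figure (4.5) is not the sum of the two individual figures (2.2 + 2.2 = 4.4). One likely explanation is that each percentage is rounded separately from the raw counts. The sketch below illustrates this; the helper names are hypothetical, and the counts of 15 Cys and 15 Met in 670 residues come from a manual count of the protein sequence in this record, so treat them as an assumption.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = sequence.upper().replace(" ", "")
    count = sum(seq.count(r) for r in residues)
    return round(100 * count / len(seq), 1)


def percent_from_count(count: int, length: int) -> float:
    """Same calculation starting from a known count."""
    return round(100 * count / length, 1)


# Toy sequence: 1 Cys and 1 Met in 10 residues.
toy = "MCAAAAAAAA"
toy_cys = residue_percent(toy, "C")    # 10.0
toy_both = residue_percent(toy, "CM")  # 20.0

# Assumed counts for this protein: 15 Cys, 15 Met, 670 residues.
cys_pct = percent_from_count(15, 670)   # 2.2
both_pct = percent_from_count(30, 670)  # 4.5

print(toy_cys, toy_both, cys_pct, both_pct)
```

Passing the full protein sequence to `residue_percent` would perform the same calculation directly on the record's data.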
Secondary structure:
>Translated Secondary Structure
MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTF
CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEEHHHHHHHHHHHCCCCCHHEEEEEE
TNKAAREMRTRACALESAAQGATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALL
CCHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHH
PKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVSAPACAARAVLGDRQFAHAYA
HHHCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHH
CYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHHHH
LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADS
HHCCCEEEEEEECCCCHHHHHCCCCHHHHHHCHHHHCCCEEEEEEECCCCHHHHHHHHHH
VVKKNQDRLGKALIAQRTGGTKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYR
HHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEE
VNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFLQLIVNGSDELALRRVLNKPP
ECCCCCCHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCC
RGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQE
CCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHCCHHHHHHCCCCCCCHHEEEEHHH
LMNAASLYACSHEGLVSFLEHIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLE
HHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHC
NGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRMWGRHTAMKPSRFLTELDSAL
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH
LEITDPRHFA
HEECCCCCCC
>Mature Secondary Structure
MESYLSALNEAQRQAVCHYGSPLLILAGAGSGKTRVITTKIAHLIRSRQVRPEQILAVTF
CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEEHHHHHHHHHHHCCCCCHHEEEEEE
TNKAAREMRTRACALESAAQGATICTFHALGVWILRRYAVRLGLNPHFSIYDDHDVRALL
CCHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHH
PKILPHCDHSRAGMLARGISQAKDYGLDCASFESVHARVSAPACAARAVLGDRQFAHAYA
HHHCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHH
CYHRRMREMGTVDFGDLIMLPVQLLREHQDVAEQLHARWQVVMVDEYQDSNVAQFHFLQV
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHHHH
LTGAHTYLCVVGDDDQSIYRFRGAEVKNILTFPEFFQNTQIIRLEYNYRSTDAILRVADS
HHCCCEEEEEEECCCCHHHHHCCCCHHHHHHCHHHHCCCEEEEEEECCCCHHHHHHHHHH
VVKKNQDRLGKALIAQRTGGTKPRLFLLNNQDEEAALCVHLIQEARARGIPYADWAILYR
HHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEE
VNAQSLSFEQCFLRNRIPYRIVGTLKFYSRAEVKDVLAFLQLIVNGSDELALRRVLNKPP
ECCCCCCHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCC
RGIGEKTQDALFVCAQQAAITDFTTLQSTHLTALGTRARQKVSSFLSLLRALRARMPQAP
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
AAGEEARTSAPPEEPVEERTHDAEGLARFVSVVMEHTGLEEWYRQKDEEEGTQCAVNVQE
CCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHCCHHHHHHCCCCCCCHHEEEEHHH
LMNAASLYACSHEGLVSFLEHIQLDQNMADEGGAADAVHLITIHNTKGLEFRRVILTGLE
HHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHC
NGVFPRDDEADIQEERRLMYVACTRAMDSLYLTACAYRRMWGRHTAMKPSRFLTELDSAL
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH
LEITDPRHFA
HEECCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8232203 [H]