Definition Mycobacterium sp. MCS chromosome, complete genome.
Accession NC_008146
Length 5,705,448

The map label for this gene is yqhH [H]

Identifier: 108797789

GI number: 108797789

Start: 888756

End: 892361

Strand: Direct

Name: yqhH [H]

Synonym: Mmcs_0811

Alternate gene names: 108797789

Gene position: 888756-892361 (Clockwise)

Preceding gene: 108797785

Following gene: 108797790

Centisome position: 15.58
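
The position fields above are related by simple arithmetic. The following is a minimal sketch, assuming that coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Both assumptions are consistent with the values in this record; the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length."""
    return 100.0 * start / genome_length


print(gene_length(888756, 892361))           # 3606
print(round(centisome(888756, 5705448), 2))  # 15.58
```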

GC content: 63.26
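
GC content is conventionally the percentage of G and C bases in the sequence. The sketch below assumes that definition and an unambiguous A/C/G/T alphabet; the helper name `gc_content` is hypothetical, and the example uses only the first 12 bases of the gene rather than reproducing the full-length figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence in this record.
print(gc_content("ATGCCTGAGTAC"))  # 50.0
```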

Gene sequence:

>3606_bases
ATGCCTGAGTACCGATTTCGGTGGTCGAACCTCCCAGACTGGCTGCTCGACTCCCTCTGCTACGCGCTCCTCGTCAACTA
CGCGAACGTCGACCGCGCAGAAGCGCTACGGTCCGTCTACCGCGCGCGTCCCACGGACGACTTCGTCAAGGAAGCATGGC
CAGTACTTCGAGATGTGTGGTTGTTTCGGGACCGCGACTCGCGAACCCTTGTCGTCAACGCTCTACGCACGCGCCGAGGC
GATGAAGGGCGAATCACCAATCGCCGGGCTCAGATGGAGTATCTTCGCCGACTGCGGAACGCGAAGAACCTCCGAGATGT
CGTCCGCGTGGCATTCATCGCATACGGAGAGACGCGACATGCGGATGCTGATAGTCCTGGTGCGCCCCGCACAAGGTACA
AAGACCGCCGCAGCCCCTCATCAGCCAACGGAGGTTCCGTGACCACCACCACGCAGGAGCCCGTGGAGCACCACGTCGCG
GTTCCCGAAGTCGGCCAGATCGTCACCGTGCGCGGCGCAAATTGGGCAGTTACCGACGTCCAGCAACAGAGCCTGCCTCG
CAGTGCACACGACGATGCGGTCCGGCAGCTCCAGAACGCGGTGACACTCCAGTCGGTCGAGGACGACCGACTCGGCGACG
AGCTACGCGTCGTCTGGGAACTCGAGCCGGGGCGCAGGCTGAAACCCCCCCAGGATCTGCCCACCAAGATCGCTCCCGAT
CGTTTCGATCCGCCCGAGCGGCTGGCGGCTTTCATCGACGCACTGAGGTGGGGCGCCGTCACAAGCGCCGATGTGAAGAC
GGTGCAGGCGCCGTTTCGATCGGGCGCCAAGGTGGAGCCGTACCAGTTGGAGCCGCTCCGACGAGCACTCAGCGCACCGC
GTGCCAACTTGCTTCTCGCCGACGACGTCGGCCTCGGCAAGACGATTGAAGCGGGAATGGTCATCCAGGAACTGTTGCTC
CGCCACCGGGCACGCACGGTCATCATCGTCTGCCCCGCCGGTCTGTCCGTGAAGTGGCAGAACGAGATGCACGACAAGTT
CGGCCTCGACTTCCGGATCGTCAACTCCGAAACGATGAAAGAAGTTCGCCGCACACACGGTGTGCACGCCAATCCGTTCA
CGCTGTTTCCGCGCATCATCGTGTCGATGGCGTGGCTTCCCGGCCCACGCGCGCAGCGCCAGCTCCGCGACGCGCTGATC
ACCAAGTCACGTGGCGCAGGGCCGCGCTTCGCCTTTGACGTCCTCGTCGTCGACGAGGCTCACCACGTGGCTCCGGCAAC
CCCGACAAGAACGGACAAAGTCGGCCTGCGCAAGGGCTATGCCGTCGATTCCCAGCGCACCCGGGCCGTGCGCGACCTCG
CCGAGCGCTCCGAACACCGGCTCTTCCTGTCGGCAACCCCGCACAACGGCTACACCGAGTCATTCACAGCACTACTGGAG
ATGATCGACCCGCAGCGCTTCGTCCGGGGCAAGTTCATCGACGAACAGGCGCTGCAAACCGTCGCCGTACGACGTCTCAA
GAAGGACTTGCCCGGCCAATTCCACGAACGGAAGGTCGAAAAGTTGCTCATCACTCCATCAGCCGACGAAGAAGCAGCAT
ACGACCGGTTGATCGCATTCACGCAGCGGCGCGACACTGCCGCGAAGGGCGACGACCGCGGCGCCAAGGACATGGCTACG
CTGCTCTTGAAGAAGCGGTTCTTCTCGTCGCCCGTGGCTTTCGCCCGAACTGTCGATGTATACAAGGACACCCGCCAGCG
TGGCTTGGTTGTCGACTTCGACGCGGAGTACGACGAAGTCCTCGGTCTCGACGCCGACGAGCTGGAAGAAGGGCGCGTCG
AACAGCCGGAAACTCAAACTCTTCAACAGACCAAGAACGCTCTACCGCCACTTACCGCCGCGGATAAGGACGACCTCGAC
TGGCTGAGCGATTGGGGGCACAGCTACGAAGCACGCCCCGACTCGAAGCTCACAGCCCTCATCGACTACCTCGAAGCCAA
CACGAAAGCCAACGGCGAGTGGCTCAACGAACGAGTCGTCGTCTTCACCGAATACGTCGATACGCTCGAATGGATCCACG
GCATCCTTCGACAACGTGGTTACGAGGCGGACCGCGTCGCGGTGATCGACGGCTCTACGGACGGTGAACAACGCGAAGTC
ATCCGTGCCCGTTTCAACACCGACCCGTCTCGCGAGAAGCTGCGGATTCTGCTGGCGACGGATGCCGCCGGCGAAGGGAT
CGACCTGCAGGATTACTGCCACCGGCTCGTCAACTTCGACATCCCGTTCAATCCCAACCGGCTCGAGCAACGCATCGGCC
GTATCGACCGTTATGGACAGACCCACGACCCGGAGATTCGCCACTTCGCGACAGACGACGAGAAGTCTCAACTCACAAGA
GACGTCGACCTGCTGGCACGAGTCGCCAAGAAAGTCGCGCAGATCATGGCCGATCTCGGTTCTGCCAACGAGATCATCGC
GCCAGACCTGCAACGCCAACTCAGCGGCATCGACGCAGCGCCGCGCAAGGGGAAGGCCGAAAAAAGCCCGATCGGGCAGA
TGCTCGCTGGCGGACGGGACGTCGGCGCAGAGCTCACCAAATTGGCGCAGGACATCACCGAGAGTCGTGAAACGCTCCAC
TTGCAGCCGGCGAACTTGCAGCGGGTCGTCGATGTCGCGTTCGAGCTCGATCGGCTACCTCCGATCGAGGAAGTCGGGTC
GGACCGCACCGACGTTCCCGTCTTTCGCCTGCCCGCGCTCGGCAGCTCCTGGGAGCAGGTGACGCGTGGGCTCACGACGG
CCCTGGACCGTGAAAGTCTTCGACCGATCGCGTTCGATCCGGCCGTGCTCGCGGAAGATCCCGACGTGGTGTACATGCAC
CTCGGAAGTCCCCTCCTTCAGCGCGCCACCCGCCGATTGCGGTCTGCTCTGTGGGGCGGCGAACGGTCGCTGGAAAGGGT
AACCGCAGTCGTCGTACCTGACCTCGAGGAGTCCTTCGCTGCGGCGGTCACCCGCTTGATACTCATCGGGAAGGCAGGGC
TTCGACTTCACGAAGAGGTGTTCCTCGCAGGCACACGGTTGGGCCGTCGCCAAGCAGTCGGTGAACACCGCGCCGAAGAT
CTGCTCGAGAACGCCCTCGACGCGGAGAGCCTCGAGCCCGTCCCGACATCCATCGCCGAACAACTCGCGAAAGCGTGGGA
CGACGACAAGGCCGACGGCCTCCGAGCGCGTGTGGCCAAGGCGGTCGCGGACCGGGTGGAACGACGACAGCAGGCAGTCA
CCGCGCAGCTGGAAGAGCGCCGGACGGCAGATCGTGAACGAGTCATCGCGACGTTCGATCGATTTGGTGCCACGCTGAAA
TCTGCTCTGGCAGAGGCAGAAGCTATCGAATCCGAGCTCACGCTGTTCGATGATGAGCGCCGTCAAAGCGAACGGGACCT
GCGCCATATTCGCGCGCGCATGGACGCGTTGGCCGATGAGCGTGACGAGGAGTTGGTGGCGGTGGACGCACGTTATACCG
ATGTGCAGGCTCGGACGTTCCATGGGGCGGTGTTGTTCGCGTTGTCGCCGAAGGATATCGAGCGGGGGGAGGTGAGCATC
CGATGA

Upstream 100 bases:

>100_bases
CAGTCAATGGATGGTGTGGTTCGCGCGGAACATTGGGAAGTTCGCTGGGTGGTTGTCCCCATCGCGAATCGAATCAGCTG
GGTTGTCGAGTTGACAGTAA

Downstream 100 bases:

>100_bases
GGCGGTCGCGTCGCGGCTTCGGTCCGGATTTCGGTTGGCTCGAGCAGTTCGACGTCGATGGTCCGTTTTTATCGTTGCCG
GTGGTGAAAGAGTTTTGGGC

Product: helicase-like protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1201; Mature: 1200
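
The two residue counts follow from the gene length. A sketch, assuming the 3606-base coding sequence includes the stop codon and that the mature form differs from the translated form only by loss of the initiator methionine (which matches the mature sequence listed in this record); `residue_counts` is an illustrative name.

```python
def residue_counts(cds_bases: int) -> tuple:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS length includes the stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_bases // 3 - 1  # drop the stop codon
    mature = translated - 1          # drop the initiator Met (assumption)
    return translated, mature


print(residue_counts(3606))  # (1201, 1200)
```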

Protein sequence:

>1201_residues
MPEYRFRWSNLPDWLLDSLCYALLVNYANVDRAEALRSVYRARPTDDFVKEAWPVLRDVWLFRDRDSRTLVVNALRTRRG
DEGRITNRRAQMEYLRRLRNAKNLRDVVRVAFIAYGETRHADADSPGAPRTRYKDRRSPSSANGGSVTTTTQEPVEHHVA
VPEVGQIVTVRGANWAVTDVQQQSLPRSAHDDAVRQLQNAVTLQSVEDDRLGDELRVVWELEPGRRLKPPQDLPTKIAPD
RFDPPERLAAFIDALRWGAVTSADVKTVQAPFRSGAKVEPYQLEPLRRALSAPRANLLLADDVGLGKTIEAGMVIQELLL
RHRARTVIIVCPAGLSVKWQNEMHDKFGLDFRIVNSETMKEVRRTHGVHANPFTLFPRIIVSMAWLPGPRAQRQLRDALI
TKSRGAGPRFAFDVLVVDEAHHVAPATPTRTDKVGLRKGYAVDSQRTRAVRDLAERSEHRLFLSATPHNGYTESFTALLE
MIDPQRFVRGKFIDEQALQTVAVRRLKKDLPGQFHERKVEKLLITPSADEEAAYDRLIAFTQRRDTAAKGDDRGAKDMAT
LLLKKRFFSSPVAFARTVDVYKDTRQRGLVVDFDAEYDEVLGLDADELEEGRVEQPETQTLQQTKNALPPLTAADKDDLD
WLSDWGHSYEARPDSKLTALIDYLEANTKANGEWLNERVVVFTEYVDTLEWIHGILRQRGYEADRVAVIDGSTDGEQREV
IRARFNTDPSREKLRILLATDAAGEGIDLQDYCHRLVNFDIPFNPNRLEQRIGRIDRYGQTHDPEIRHFATDDEKSQLTR
DVDLLARVAKKVAQIMADLGSANEIIAPDLQRQLSGIDAAPRKGKAEKSPIGQMLAGGRDVGAELTKLAQDITESRETLH
LQPANLQRVVDVAFELDRLPPIEEVGSDRTDVPVFRLPALGSSWEQVTRGLTTALDRESLRPIAFDPAVLAEDPDVVYMH
LGSPLLQRATRRLRSALWGGERSLERVTAVVVPDLEESFAAAVTRLILIGKAGLRLHEEVFLAGTRLGRRQAVGEHRAED
LLENALDAESLEPVPTSIAEQLAKAWDDDKADGLRARVAKAVADRVERRQQAVTAQLEERRTADRERVIATFDRFGATLK
SALAEAEAIESELTLFDDERRQSERDLRHIRARMDALADERDEELVAVDARYTDVQARTFHGAVLFALSPKDIERGEVSI
R
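
The protein sequence above is the conceptual translation of the gene sequence. The sketch below shows one way to reproduce it, assuming the standard genetic code written in the usual T, C, A, G codon order; for bacterial records the database may have used the closely related bacterial table, which differs only in permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the gene in this record.
print(translate("ATGCCTGAGTACCGATTTCGGTGG"))  # MPEYRFRW
print(translate("GAGGTGAGCATCCGATGA"))        # EVSIR
```

Both fragments agree with the start (MPEYRFRW) and end (EVSIR) of the listed protein sequence.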

Sequences:

>Translated_1201_residues
MPEYRFRWSNLPDWLLDSLCYALLVNYANVDRAEALRSVYRARPTDDFVKEAWPVLRDVWLFRDRDSRTLVVNALRTRRG
DEGRITNRRAQMEYLRRLRNAKNLRDVVRVAFIAYGETRHADADSPGAPRTRYKDRRSPSSANGGSVTTTTQEPVEHHVA
VPEVGQIVTVRGANWAVTDVQQQSLPRSAHDDAVRQLQNAVTLQSVEDDRLGDELRVVWELEPGRRLKPPQDLPTKIAPD
RFDPPERLAAFIDALRWGAVTSADVKTVQAPFRSGAKVEPYQLEPLRRALSAPRANLLLADDVGLGKTIEAGMVIQELLL
RHRARTVIIVCPAGLSVKWQNEMHDKFGLDFRIVNSETMKEVRRTHGVHANPFTLFPRIIVSMAWLPGPRAQRQLRDALI
TKSRGAGPRFAFDVLVVDEAHHVAPATPTRTDKVGLRKGYAVDSQRTRAVRDLAERSEHRLFLSATPHNGYTESFTALLE
MIDPQRFVRGKFIDEQALQTVAVRRLKKDLPGQFHERKVEKLLITPSADEEAAYDRLIAFTQRRDTAAKGDDRGAKDMAT
LLLKKRFFSSPVAFARTVDVYKDTRQRGLVVDFDAEYDEVLGLDADELEEGRVEQPETQTLQQTKNALPPLTAADKDDLD
WLSDWGHSYEARPDSKLTALIDYLEANTKANGEWLNERVVVFTEYVDTLEWIHGILRQRGYEADRVAVIDGSTDGEQREV
IRARFNTDPSREKLRILLATDAAGEGIDLQDYCHRLVNFDIPFNPNRLEQRIGRIDRYGQTHDPEIRHFATDDEKSQLTR
DVDLLARVAKKVAQIMADLGSANEIIAPDLQRQLSGIDAAPRKGKAEKSPIGQMLAGGRDVGAELTKLAQDITESRETLH
LQPANLQRVVDVAFELDRLPPIEEVGSDRTDVPVFRLPALGSSWEQVTRGLTTALDRESLRPIAFDPAVLAEDPDVVYMH
LGSPLLQRATRRLRSALWGGERSLERVTAVVVPDLEESFAAAVTRLILIGKAGLRLHEEVFLAGTRLGRRQAVGEHRAED
LLENALDAESLEPVPTSIAEQLAKAWDDDKADGLRARVAKAVADRVERRQQAVTAQLEERRTADRERVIATFDRFGATLK
SALAEAEAIESELTLFDDERRQSERDLRHIRARMDALADERDEELVAVDARYTDVQARTFHGAVLFALSPKDIERGEVSI
R
>Mature_1200_residues
PEYRFRWSNLPDWLLDSLCYALLVNYANVDRAEALRSVYRARPTDDFVKEAWPVLRDVWLFRDRDSRTLVVNALRTRRGD
EGRITNRRAQMEYLRRLRNAKNLRDVVRVAFIAYGETRHADADSPGAPRTRYKDRRSPSSANGGSVTTTTQEPVEHHVAV
PEVGQIVTVRGANWAVTDVQQQSLPRSAHDDAVRQLQNAVTLQSVEDDRLGDELRVVWELEPGRRLKPPQDLPTKIAPDR
FDPPERLAAFIDALRWGAVTSADVKTVQAPFRSGAKVEPYQLEPLRRALSAPRANLLLADDVGLGKTIEAGMVIQELLLR
HRARTVIIVCPAGLSVKWQNEMHDKFGLDFRIVNSETMKEVRRTHGVHANPFTLFPRIIVSMAWLPGPRAQRQLRDALIT
KSRGAGPRFAFDVLVVDEAHHVAPATPTRTDKVGLRKGYAVDSQRTRAVRDLAERSEHRLFLSATPHNGYTESFTALLEM
IDPQRFVRGKFIDEQALQTVAVRRLKKDLPGQFHERKVEKLLITPSADEEAAYDRLIAFTQRRDTAAKGDDRGAKDMATL
LLKKRFFSSPVAFARTVDVYKDTRQRGLVVDFDAEYDEVLGLDADELEEGRVEQPETQTLQQTKNALPPLTAADKDDLDW
LSDWGHSYEARPDSKLTALIDYLEANTKANGEWLNERVVVFTEYVDTLEWIHGILRQRGYEADRVAVIDGSTDGEQREVI
RARFNTDPSREKLRILLATDAAGEGIDLQDYCHRLVNFDIPFNPNRLEQRIGRIDRYGQTHDPEIRHFATDDEKSQLTRD
VDLLARVAKKVAQIMADLGSANEIIAPDLQRQLSGIDAAPRKGKAEKSPIGQMLAGGRDVGAELTKLAQDITESRETLHL
QPANLQRVVDVAFELDRLPPIEEVGSDRTDVPVFRLPALGSSWEQVTRGLTTALDRESLRPIAFDPAVLAEDPDVVYMHL
GSPLLQRATRRLRSALWGGERSLERVTAVVVPDLEESFAAAVTRLILIGKAGLRLHEEVFLAGTRLGRRQAVGEHRAEDL
LENALDAESLEPVPTSIAEQLAKAWDDDKADGLRARVAKAVADRVERRQQAVTAQLEERRTADRERVIATFDRFGATLKS
ALAEAEAIESELTLFDDERRQSERDLRHIRARMDALADERDEELVAVDARYTDVQARTFHGAVLFALSPKDIERGEVSIR

Specific function: Transcription regulator that activates transcription by stimulating RNA polymerase (RNAP) recycling in case of stress conditions such as supercoiled DNA or high salt concentrations. Probably acts by releasing the RNAP when it is trapped or immobilized on

COG id: COG0553

COG function: function code KL; Superfamily II DNA/RNA helicases, SNF2 family

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 helicase C-terminal domain [H]

Homologues:

Organism=Escherichia coli, GI1786245, Length=515, Percent_Identity=27.5728155339806, Blast_Score=129, Evalue=1e-30,
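
Homologue entries use a comma-separated `key=value` layout, with the GI identifier given as a bare token. The following is a rough parsing sketch under that assumption; `parse_homologue` is a hypothetical helper, not part of any database tooling, and the handling of the bare `GI…` token is a guess at the intended meaning.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict of string fields."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        key, sep, value = part.partition("=")
        if sep:
            fields[key] = value
        elif part.startswith("GI"):
            # Bare token such as "GI1786245": treated as the GI number (assumption).
            fields["GI"] = part[2:]
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786245, Length=515, "
    "Percent_Identity=27.5728155339806, Blast_Score=129, Evalue=1e-30,"
)
print(hit["Organism"], hit["GI"])                # Escherichia coli 1786245
print(round(float(hit["Percent_Identity"]), 1))  # 27.6
```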

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014001
- InterPro:   IPR001650
- InterPro:   IPR014021
- InterPro:   IPR000330 [H]

Pfam domain/function: PF00271 Helicase_C; PF00176 SNF2_N [H]

EC number: 3.6.1.- [C]

Molecular weight: Translated: 135419; Mature: 135288
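
The two molecular weights differ by 131, which is what removal of a single methionine residue would give. A small check, assuming an average methionine residue mass of about 131.19 Da (free methionine minus one water); the constant and function names are illustrative.

```python
# Average mass of a methionine residue within a peptide chain, in daltons
# (assumption: free Met, about 149.21 Da, minus one water, about 18.02 Da).
MET_RESIDUE_MASS = 131.19


def mass_without_initiator_met(translated_mass: float) -> float:
    """Estimate the mature mass after removing the initiator methionine."""
    return translated_mass - MET_RESIDUE_MASS


print(round(mass_without_initiator_met(135419)))  # 135288
```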

Theoretical pI: Translated: 6.21; Mature: 6.21

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
1.2 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.2 %Cys+Met (Mature Protein)
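
These percentages are residue counts divided by sequence length. A minimal sketch, assuming one-letter amino acid codes; `cys_met_percent` is a hypothetical helper, and the example uses only the first 20 residues of the translated protein rather than the full sequence.

```python
def cys_met_percent(protein: str) -> dict:
    """Percentage of Cys (C) and Met (M) residues in a protein sequence."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    cys = 100.0 * protein.count("C") / len(protein)
    met = 100.0 * protein.count("M") / len(protein)
    return {"Cys": cys, "Met": met, "Cys+Met": cys + met}


# First 20 residues of the translated protein in this record.
print(cys_met_percent("MPEYRFRWSNLPDWLLDSLC"))
# {'Cys': 5.0, 'Met': 5.0, 'Cys+Met': 10.0}
```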

Secondary structure:

>Translated Secondary Structure
MPEYRFRWSNLPDWLLDSLCYALLVNYANVDRAEALRSVYRARPTDDFVKEAWPVLRDVW
CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
LFRDRDSRTLVVNALRTRRGDEGRITNRRAQMEYLRRLRNAKNLRDVVRVAFIAYGETRH
HHCCCCCCEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCC
ADADSPGAPRTRYKDRRSPSSANGGSVTTTTQEPVEHHVAVPEVGQIVTVRGANWAVTDV
CCCCCCCCCCCCCHHCCCCCCCCCCCEECCCHHHHHHCCCCCCCCCEEEEECCCEEEECC
QQQSLPRSAHDDAVRQLQNAVTLQSVEDDRLGDELRVVWELEPGRRLKPPQDLPTKIAPD
HHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCC
RFDPPERLAAFIDALRWGAVTSADVKTVQAPFRSGAKVEPYQLEPLRRALSAPRANLLLA
CCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCEEEE
DDVGLGKTIEAGMVIQELLLRHRARTVIIVCPAGLSVKWQNEMHDKFGLDFRIVNSETMK
CCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEECHHHHHHHCCCEEEECHHHHH
EVRRTHGVHANPFTLFPRIIVSMAWLPGPRAQRQLRDALITKSRGAGPRFAFDVLVVDEA
HHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEECC
HHVAPATPTRTDKVGLRKGYAVDSQRTRAVRDLAERSEHRLFLSATPHNGYTESFTALLE
CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHH
MIDPQRFVRGKFIDEQALQTVAVRRLKKDLPGQFHERKVEKLLITPSADEEAAYDRLIAF
HHCHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHEEECCCCCHHHHHHHHHHH
TQRRDTAAKGDDRGAKDMATLLLKKRFFSSPVAFARTVDVYKDTRQRGLVVDFDAEYDEV
HHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEECCCHHHH
LGLDADELEEGRVEQPETQTLQQTKNALPPLTAADKDDLDWLSDWGHSYEARPDSKLTAL
HCCCHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHH
IDYLEANTKANGEWLNERVVVFTEYVDTLEWIHGILRQRGYEADRVAVIDGSTDGEQREV
HHHHHCCCCCCCCHHCCEEEEHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHH
IRARFNTDPSREKLRILLATDAAGEGIDLQDYCHRLVNFDIPFNPNRLEQRIGRIDRYGQ
HHHHCCCCCCHHHEEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCC
THDPEIRHFATDDEKSQLTRDVDLLARVAKKVAQIMADLGSANEIIAPDLQRQLSGIDAA
CCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCC
PRKGKAEKSPIGQMLAGGRDVGAELTKLAQDITESRETLHLQPANLQRVVDVAFELDRLP
CCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHCCC
PIEEVGSDRTDVPVFRLPALGSSWEQVTRGLTTALDRESLRPIAFDPAVLAEDPDVVYMH
CHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHHCCCCCEECCHHCCCCCCEEEEE
LGSPLLQRATRRLRSALWGGERSLERVTAVVVPDLEESFAAAVTRLILIGKAGLRLHEEV
CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHH
FLAGTRLGRRQAVGEHRAEDLLENALDAESLEPVPTSIAEQLAKAWDDDKADGLRARVAK
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHH
AVADRVERRQQAVTAQLEERRTADRERVIATFDRFGATLKSALAEAEAIESELTLFDDER
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH
RQSERDLRHIRARMDALADERDEELVAVDARYTDVQARTFHGAVLFALSPKDIERGEVSI
HHHHHHHHHHHHHHHHHHCCCCCCEEEEECEECCCCCEEECCEEEEEECCCCCCCCCCCC
R
C
>Mature Secondary Structure 
PEYRFRWSNLPDWLLDSLCYALLVNYANVDRAEALRSVYRARPTDDFVKEAWPVLRDVW
CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
LFRDRDSRTLVVNALRTRRGDEGRITNRRAQMEYLRRLRNAKNLRDVVRVAFIAYGETRH
HHCCCCCCEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCC
ADADSPGAPRTRYKDRRSPSSANGGSVTTTTQEPVEHHVAVPEVGQIVTVRGANWAVTDV
CCCCCCCCCCCCCHHCCCCCCCCCCCEECCCHHHHHHCCCCCCCCCEEEEECCCEEEECC
QQQSLPRSAHDDAVRQLQNAVTLQSVEDDRLGDELRVVWELEPGRRLKPPQDLPTKIAPD
HHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCC
RFDPPERLAAFIDALRWGAVTSADVKTVQAPFRSGAKVEPYQLEPLRRALSAPRANLLLA
CCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCEEEE
DDVGLGKTIEAGMVIQELLLRHRARTVIIVCPAGLSVKWQNEMHDKFGLDFRIVNSETMK
CCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEECHHHHHHHCCCEEEECHHHHH
EVRRTHGVHANPFTLFPRIIVSMAWLPGPRAQRQLRDALITKSRGAGPRFAFDVLVVDEA
HHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEECC
HHVAPATPTRTDKVGLRKGYAVDSQRTRAVRDLAERSEHRLFLSATPHNGYTESFTALLE
CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHH
MIDPQRFVRGKFIDEQALQTVAVRRLKKDLPGQFHERKVEKLLITPSADEEAAYDRLIAF
HHCHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHEEECCCCCHHHHHHHHHHH
TQRRDTAAKGDDRGAKDMATLLLKKRFFSSPVAFARTVDVYKDTRQRGLVVDFDAEYDEV
HHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEECCCHHHH
LGLDADELEEGRVEQPETQTLQQTKNALPPLTAADKDDLDWLSDWGHSYEARPDSKLTAL
HCCCHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHH
IDYLEANTKANGEWLNERVVVFTEYVDTLEWIHGILRQRGYEADRVAVIDGSTDGEQREV
HHHHHCCCCCCCCHHCCEEEEHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHH
IRARFNTDPSREKLRILLATDAAGEGIDLQDYCHRLVNFDIPFNPNRLEQRIGRIDRYGQ
HHHHCCCCCCHHHEEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCC
THDPEIRHFATDDEKSQLTRDVDLLARVAKKVAQIMADLGSANEIIAPDLQRQLSGIDAA
CCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCC
PRKGKAEKSPIGQMLAGGRDVGAELTKLAQDITESRETLHLQPANLQRVVDVAFELDRLP
CCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHCCC
PIEEVGSDRTDVPVFRLPALGSSWEQVTRGLTTALDRESLRPIAFDPAVLAEDPDVVYMH
CHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHHCCCCCEECCHHCCCCCCEEEEE
LGSPLLQRATRRLRSALWGGERSLERVTAVVVPDLEESFAAAVTRLILIGKAGLRLHEEV
CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHH
FLAGTRLGRRQAVGEHRAEDLLENALDAESLEPVPTSIAEQLAKAWDDDKADGLRARVAK
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHH
AVADRVERRQQAVTAQLEERRTADRERVIATFDRFGATLKSALAEAEAIESELTLFDDER
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH
RQSERDLRHIRARMDALADERDEELVAVDARYTDVQARTFHGAVLFALSPKDIERGEVSI
HHHHHHHHHHHHHHHHHHCCCCCCEEEEECEECCCCCEEECCEEEEEECCCCCCCCCCCC
R
C

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969508; 9384377 [H]