| Definition | Shigella flexneri 2a str. 2457T, complete genome. |
|---|---|
| Accession | NC_004741 |
| Length | 4,599,354 |
Click here to switch to the map view.
The map label for this gene is uvrA
Identifier: 30064647
GI number: 30064647
Start: 3452899
End: 3455721
Strand: Reverse
Name: uvrA
Synonym: S3583
Alternate gene names: 30064647
Gene position: 3455721-3452899 (Counterclockwise)
Preceding gene: 30064649
Following gene: 30064640
Centisome position: 75.13
GC content: 55.72
Gene sequence:
>2823_bases ATGGATAAGATCGAAGTTCGGGGCGCCCGCACCCATAATCTCAAAAACATCAACCTCGTTATCCCCCGCGACAAGCTCAT TGTCGTGACCGGGCTTTCGGGTTCTGGCAAATCCTCGCTCGCTTTCGACACCTTATATGCCGAAGGGCAGCGCCGTTACG TTGAATCCCTTTCCGCCTACGCGCGGCAGTTTCTGTCACTGATGGAAAAGCCGGACGTCGATCATATTGAGGGGCTTTCT CCTGCCATCTCAATTGAGCAGAAATCGACGTCTCATAACCCGCGTTCTACGGTGGGGACAATCACCGAAATCCACGACTA TTTGCGTTTGTTGTTCGCCCGCGTCGGCGAACCGCGCTGTCCGGACCACGACGTCCCGCTGGCGGCGCAAACCGTCAGCC AGATGGTGGATAACGTGCTGTCGCAGCCGGAAGGCAAGCGTCTGATGCTGCTCGCGCCAATCATTAAAGAGCGCAAAGGC GAACACACCAAAACGCTGGAGAACCTGGCAAGCCAGGGTTACATCCGTGCTCGTATTGATGGCGAAGTCTGCGATCTTTC CGATCCGCCGAAACTGGAACTGCAAAAGAAACATACCATTGAAGTGGTGGTTGATCGCTTCAAGGTGCGTGACGATCTTA CCCAACGTCTTGCGGAGTCGTTTGAAACCGCGCTGGAACTTTCCGGTGGTACAGCGGTAGTGGCGGATATGGACGACCCG AAAGCGGAAGAGCTGCTGTTCTCCGCCAACTTCGCCTGCCCAATTTGCGGCTACAGTATGCGTGAACTGGAGCCGCGACT GTTTTCGTTTAACAACCCGGCAGGTGCCTGCCCGACCTGTGACGGCCTTGGCGTACAGCAATATTTCGATCCTGACCGCG TGATCCAAAACCCCGAGCTGTCACTGGCTGGCGGTGCGATCCGTGGCTGGGATCGCCGCAACTTCTATTACTTCCAGATG CTGAAATCGCTGGCAGATCACTATAAGTTCGACGTCGAAGCGCCGTGGGGCAGCCTGAGCGCGAACGTGCATAAAGTGGT GTTGTACGGTTCTGGCAAAGAAAACATTGAATTCAAATACATGAACGATCGTGGCGATACCTCCATCCGTCGTCATCCGT TCGAAGGCGTGCTGCACAATATGGAGCGCCGTTATAAAGAGACAGAATCCAGTGCGGTACGTGAAGAATTAGCCAAGTTT ATCAGCAATCGCCCATGCGCCAGCTGCGAAGGGACGCGTCTGCGTCGGGAAGCGCGCCACGTTTATGTCGAGAATACGCC GCTGCCTGCTATCTCCGACATGAGCATCGGTCATGCGATGGAATTCTTCAACAATCTCAAACTCGCTGGTCAGCGGGCGA AGATTGCGGAAAAAATTCTTAAAGAGATCGGCGATCGCCTGAAATTCCTCGTAAACGTCGGCCTGAATTACCTGACACTT TCCCGCTCGGCAGAAACACTTTCCGGCGGTGAAGCCCAGCGTATCCGTCTGGCGAGCCAGATTGGTGCGGGCCTGGTTGG CGTTATGTACGTGCTGGACGAGCCGTCTATCGGCCTGCACCAGCGCGATAACGAGCGCCTGTTGGGTACGCTTATCCATC TGCGCGATCTCGGTAATACCGTGATTGTGGTGGAGCATGACGAAGACGCAATTCGCGCCGCTGACCATGTGATCGATATC GGTCCTGGTGCGGGTGTTCACGGCGGTGAAGTGGTCGCAGAAGGTCCGCTGGAAGCGATTATGGCGGTGCCTGAATCGTT GACCGGGCAGTACATGAGCGGTAAACGCAAGATTGAAGTGCCGAAGAAACGCGTTCCGGCGAATCCAGAAAAAGTGCTGA AGCTGACAGGCGCACGCGGCAACAACCTGAAGGACGTGACGCTCACGCTGCCAGTCGGTCTGTTTACCTGCATCACAGGG GTTTCAGGTTCCGGTAAATCGACGCTGATTAACGACACACTGTTCCCGATTGCCCAACGCCAGTTGAATGGTGCGACCAT CGCCGAACCGGCACCGTATCGCGATATTCAGGGACTGGAGCATTTCGATAAAGTGATCGATATCGACCAAAGCCCAATTG GTCGTACTCCACGTTCTAACCCGGCGACCTATACCGGCGTGTTTACGCCTGTGCGCGAACTGTTTGCGGGCGTACCGGAA TCCCGTGCGCGTGGTTATACGCCAGGACGTTTCAGCTTTAACGTCCGTGGCGGGCGCTGCGAAGCCTGTCAGGGCGACGG TGTGATCAAAGTGGAGATGCACTTCCTGCCGGATATCTACGTGCCGTGCGATCAGTGCAAAGGTAAACGCTATAACCGTG AAACGCTGGAAATTAAGTACAAAGGCAAAACCATCCACGAAGTGCTGGATATGACCATCGAAGAGGCGCGTGAGTTCTTT GATGCGGTGCCAGCTCTGGCGCGTAAGCTGCAAACGTTGATGGACGTTGGTCTGACGTACATTCGCCTGGGGCAGTCCGC AACCACACTTTCTGGTGGTGAAGCCCAGCGCGTGAAGCTGGCGCGTGAGCTGTCAAAACGCGGCACCGGGCAGACGCTGT ATATTCTCGACGAGCCGACCACCGGTCTGCACTTCGCCGATATTCAGCAACTCCTCGACGTGCTGCATAAACTGCGCGAT CAGGGCAATACCATTGTGGTGATTGAGCACAATCTCGACGTGATCAAAACCGCTGACTGGATTGTCGACCTGGGACCAGA AGGCGGCAGTGGCGGCGGCGAAATCCTCGTCTCCGGTACGCCAGAAACCGTCGCGGAGTGCGAAGCTTCGCATACGGCAC GCTTCCTCAAGCCGATGCTGTAA
Upstream 100 bases:
>100_bases TGTATATTCATTCAGGTCAATTTGTGTCATAATTAACCGTTTGTGATCGCCGGTAGCACCATGCCACCGGGCAAAAAAGC GTTTAATCCGGGAAAGGTGA
Downstream 100 bases:
>100_bases TCGTTAAGGCCGCTTTCTGAGCGGCCTTTTCCTTTCAGAGTTGTACCAGCAATTTACGCTTTTCTTCCGGTAGTAAATTC ACTGCCTGCTGATAAGACGC
Product: excinuclease ABC subunit A
Products: NA
Alternate protein names: UvrA protein; Excinuclease ABC subunit A [H]
Number of amino acids: Translated: 940; Mature: 940
Protein sequence:
>940_residues MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKPDVDHIEGLS PAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKG EHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDP KAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQM LKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTL SRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDAIRAADHVIDI GPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITG VSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPE SRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRD QGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
Sequences:
>Translated_940_residues MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKPDVDHIEGLS PAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKG EHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDP KAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQM LKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTL SRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDAIRAADHVIDI GPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITG VSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPE SRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRD QGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >Mature_940_residues MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKPDVDHIEGLS PAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKG EHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDP KAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQM LKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTL SRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDAIRAADHVIDI GPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITG VSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPE SRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRD QGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
Specific function: The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrA is an ATPase and a DNA-binding protein. A damage recognition complex composed of 2 UvrA and 2 UvrB subunits scans DNA for abnormalities. When the presence of a lesion h
COG id: COG0178
COG function: function code L; Excinuclease ATPase subunit
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 ABC transporter domains [H]
Homologues:
Organism=Escherichia coli, GI2367343, Length=940, Percent_Identity=100, Blast_Score=1937, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003439 - InterPro: IPR017871 - InterPro: IPR013815 - InterPro: IPR004602 [H]
Pfam domain/function: PF00005 ABC_tran [H]
EC number: NA
Molecular weight: Translated: 103869; Mature: 103869
Theoretical pI: Translated: 6.61; Mature: 6.61
Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 3.4 %Cys+Met (Translated Protein) 1.5 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAY CCCEEECCCCCCCCCEEEEEECCCCEEEEECCCCCCCCCEEHHHHHHHHHHHHHHHHHHH ARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRC HHHHHHHHCCCCCCHHCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC PDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKGEHTKTLENLASQGYIRARID CCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEHHHHHCCCCHHHHHHHHHHCCEEEEEEC GEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDP CCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCEEEEEECCCC KAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL CHHHHEEECCCCCCCCCCCHHHCCCHHEECCCCCCCCCCCCCCCCHHHCCHHHHHCCCCC SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKY EECCCCCCCCCCCCHHHHHHHHHHHHHCEEECCCCCCCCCCCEEEEEEECCCCCCEEEEE MNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKFISNRPCASCEGTRLRREARH ECCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCE VYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTL EEEECCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEE SRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNT CCCCCCCCCCCHHHHHHHHHHCCCCEEHEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCE VIVVEHDEDAIRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV EEEEECCCHHHHHHCCEEECCCCCCCCCCCEEECCHHHHHHHCCHHHCCHHHCCCEEECC PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQR CHHCCCCCHHHHHEEECCCCCCCEEEEEEEHHHHHHHHHCCCCCCCCEEECCHHHHHHHH QLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPE HCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEHHHHHHHHHHCCCCC SRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKY HHHCCCCCCEEEEEECCCEECCCCCCCEEEEEEEECCCCCCCCHHHCCCCCCCCEEEEEE KGKTIHEVLDMTIEEAREFFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKL CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHEEECCCCCCCCCCCHHHHHH ARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW HHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEECCE IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML EEEECCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAY CCCEEECCCCCCCCCEEEEEECCCCEEEEECCCCCCCCCEEHHHHHHHHHHHHHHHHHHH ARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRC HHHHHHHHCCCCCCHHCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC PDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKGEHTKTLENLASQGYIRARID CCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEHHHHHCCCCHHHHHHHHHHCCEEEEEEC GEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDP CCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCEEEEEECCCC KAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL CHHHHEEECCCCCCCCCCCHHHCCCHHEECCCCCCCCCCCCCCCCHHHCCHHHHHCCCCC SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKY EECCCCCCCCCCCCHHHHHHHHHHHHHCEEECCCCCCCCCCCEEEEEEECCCCCCEEEEE MNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKFISNRPCASCEGTRLRREARH ECCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCE VYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTL EEEECCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEE SRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNT CCCCCCCCCCCHHHHHHHHHHCCCCEEHEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCE VIVVEHDEDAIRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV EEEEECCCHHHHHHCCEEECCCCCCCCCCCEEECCHHHHHHHCCHHHCCHHHCCCEEECC PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQR CHHCCCCCHHHHHEEECCCCCCCEEEEEEEHHHHHHHHHCCCCCCCCEEECCHHHHHHHH QLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPE HCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEHHHHHHHHHHCCCCC SRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKY HHHCCCCCCEEEEEECCCEECCCCCCCEEEEEEEECCCCCCCCHHHCCCCCCCCEEEEEE KGKTIHEVLDMTIEEAREFFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKL CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHEEECCCCCCCCCCCHHHHHH ARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW HHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEECCE IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML EEEECCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: Hydrolase; Acting on ester bonds [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11586360; 12142430 [H]