Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is uvrA [H]
Identifier: 187735943
GI number: 187735943
Start: 1743347
End: 1745848
Strand: Reverse
Name: uvrA [H]
Synonym: Amuc_1452
Alternate gene names: 187735943
Gene position: 1745848-1743347 (Counterclockwise)
Preceding gene: 187735944
Following gene: 187735942
Centisome position: 65.53
GC content: 62.35
Gene sequence:
>2502_bases ATGAATCTTCCCATCTCCATCCGCGGCGCGCGCCAGCACAACCTCCGGAACCTGAATCTGGATCTTCCGTCCAACAAGCT CATCGTATTCTGCGGCCCTTCCGGCTCCGGAAAATCTTCCCTGGCGTTTGATACGCTTTTTTCCGAATCCCGCAGGCGTT TTCTGGACTGCCTGTCGGCACGCTCCAGGCAGGGCATGGATCAACCGGAAAAACCGGAAGTGGACAGCATTACCGGGCTG CCCCCGGCCCTGTGCCTGGAGCAATCCGCCAGGCAGCAGAGTTCCCGCACCCTGCTGGGGAGCATCACGGAAATTCTGGA CTACCTGCGCATCCTTTACGCGGCTGCCGGCACGCCCCATGACCCGGAGACAGGAAAGGAACTGGAACGCAAGAGCCCGG ACCGGATTACGGAAGAACTCGTTTCCCTGCCGGAACACACGCGCCTGATTCTGACCGCTCCGGCGGAAAACCTGCTGGCC CAGGATCCCGCAGCGACGCTGGCCGACTTCCAGCGGCAGGGCTTCCTCCGGGTTTACTGGAACGGAGAAATGCGGGATAT TGAAGAAATAAGTTCCCCCGTCCCCCCGCCTCCAGACGCGGCCCTGGTCATTGACCGCCTCATCGTCAGAGGGGAAAATA CGGCCTCGCGCATTGCGGATTCCCTGCAAACGGCTCTCCGTATCAATCCGGACGAGGTGCGGGCCATCATCACCATACCG GGAGAGGAAGCCTCAATCCGGGCCTTCCACACCCGCTACCGCAATCCGGAAACAGGCTTCCTTCTGCCCCAGCTTACGCC CCGCCATTTTTCCTTCAACTCCCCGCTGGGGGCATGCCCCTCCTGCCGGGGAACCGGCCTGAATGAACAGGAAAACGGTC CGTGCCGCGCCTGCGGAGGCCAGCGTCTTTCCCCTCTGGCCCTGGCCGTCACCATGCCTGCGCCGGACCGGGCCTACAAT CTCGCGGAACTGACGGCTCTTCCTCTGGAAGATATGGCAGGAGAACTGGAACGACTGAAAACGCCCCCCTCCCTGGCGGC GGCATTGACCCCGCTCATGGAGGAAATCAACAAACGCGTGCGCTTCCTGAATGAGCTGGGACTCTCCTACCTGTCCCTGG ACCGCCAGGCAAACACCCTCTCCGGAGGCGAACTGCAGAGGGCGCGCCTGGCTTCCCAGCTGGGAGGCGGCCTTTCCGGA GTCCTTTACATCCTGGACGAACCCACGGCCGGACTGCACCCCGCCGATACGGACCGCCTGCTCCGCGCTCTCCGGACGCT CCGGAACCAGGGCAACACAGTACTGGTCGTAGAGCATGATGAACAAATTCTAACCGCGGCGGATCACCTGGTGGACATGG GCCCCGGCTCCGGAACCAACGGAGGCCGTATTCTGGCGCAGGGCTCCCTTGCTGAGATACTGGGAAATTCAGGAAGCCCC ACCGGGGAATGGCTTTCAGGCAAGCGAAACATGCCCGCCTCCGGACGCAAGACGGCTCCTGCGGGGCGTCTGGTACTGAC CGGTGCGGACAAGCACAACCTCAACAACGTCACTCTGAATATCCCGGTCGGCACACTGACCTGCATCTCCGGCCCTTCCG GTTCAGGGAAATCCACCCTTGTCCGGGACTGCCTCATCCCCGCAGTCAGGCAAGATCTCTCCGGGAAAAAGGGTATTCCG CGCCGCGTGCAGGGAACGGAACACTTCAACCGCCTCGTCGTCATCGACCAGTCGCCCATCGGCAAAACGCCGCGCTCCAC ACCGGCCACCGCTACCGGCCTGCTCCAGGTGCTGCGCCCCCTTTACGCACAGCTCCCCCTTTCCAAGCAGAGGGGATATA CGGCGGCGCGCTTTTCCCCCAACATTCGCGGAGGCCGCTGTGAACGGTGCCAGGGAACGGGCATGATTGAAGTGGACATG AACTTTCTGGGAAACGTGGCAATGCCCTGCGACGCCTGCCAGGGGCAGTGCTACAACAGGGAAACGCTGGAAGTCACCTG GAAAGGGAAATCCATTGCCCAGGCGCTGGCCCTGACCGTGGACGAAGCGGCGGAATTCTTTTCCTCCCTGCCCAGAGCCG CCGCCATCCTGAAAAGCATGCAGGACGTAGGGCTGGGATACCTCAATCTCAACCGCAGGGCGGACACCCTTTCCGGCGGA GAATCCCAGCGCATAAAAATAGCTGCGGAACTGGCCAAAGCCCCGGCCTGGAAACTGGAGGAAGACGGGAAACGGGCCCT GTTCATTCTGGACGAACCCACCAGCGGCCTCCACTTCAATGAAGTGGCCCTTCTCCTGGCAGCCCTTTTCCGCCTGAGGG ATGCCGGACACACCATCCTCTGCGTGGAACACCACAAGGACCTGCTCAATGCCGCGGACTACCTGGTGGACATGGGCCCC GGAGCCGGCAGGCACGGCGGCAATATCGTGGCCGAGGGCTCCCCCGCAGATGTAGCGTCCAATCCGGAAGCGCCCACTTC TCCCTGGCTCGTCCCCCGTTAA
Upstream 100 bases:
>100_bases ATCCACCGGCACGCCAAATTCCTGGGAGATATCACGGCGCGCCGCCTCATCATTGAGGAAGGCGGCACGCACCAGGGAGC CTTTACCCGCCTTACGTGAC
Downstream 100 bases:
>100_bases CAACGGCAGCCCCTGAAAATCATTCATCTGTCCTGTCCCCCGGACATCCCGTTCCCGGCGTCTTCGGAAAAAAATCTGCC TCTCCGGGGTTTATTCGCCT
Product: excinuclease ABC, A subunit
Products: NA
Alternate protein names: UvrA protein; Excinuclease ABC subunit A [H]
Number of amino acids: Translated: 833; Mature: 833
Protein sequence:
>833_residues MNLPISIRGARQHNLRNLNLDLPSNKLIVFCGPSGSGKSSLAFDTLFSESRRRFLDCLSARSRQGMDQPEKPEVDSITGL PPALCLEQSARQQSSRTLLGSITEILDYLRILYAAAGTPHDPETGKELERKSPDRITEELVSLPEHTRLILTAPAENLLA QDPAATLADFQRQGFLRVYWNGEMRDIEEISSPVPPPPDAALVIDRLIVRGENTASRIADSLQTALRINPDEVRAIITIP GEEASIRAFHTRYRNPETGFLLPQLTPRHFSFNSPLGACPSCRGTGLNEQENGPCRACGGQRLSPLALAVTMPAPDRAYN LAELTALPLEDMAGELERLKTPPSLAAALTPLMEEINKRVRFLNELGLSYLSLDRQANTLSGGELQRARLASQLGGGLSG VLYILDEPTAGLHPADTDRLLRALRTLRNQGNTVLVVEHDEQILTAADHLVDMGPGSGTNGGRILAQGSLAEILGNSGSP TGEWLSGKRNMPASGRKTAPAGRLVLTGADKHNLNNVTLNIPVGTLTCISGPSGSGKSTLVRDCLIPAVRQDLSGKKGIP RRVQGTEHFNRLVVIDQSPIGKTPRSTPATATGLLQVLRPLYAQLPLSKQRGYTAARFSPNIRGGRCERCQGTGMIEVDM NFLGNVAMPCDACQGQCYNRETLEVTWKGKSIAQALALTVDEAAEFFSSLPRAAAILKSMQDVGLGYLNLNRRADTLSGG ESQRIKIAAELAKAPAWKLEEDGKRALFILDEPTSGLHFNEVALLLAALFRLRDAGHTILCVEHHKDLLNAADYLVDMGP GAGRHGGNIVAEGSPADVASNPEAPTSPWLVPR
Sequences:
>Translated_833_residues MNLPISIRGARQHNLRNLNLDLPSNKLIVFCGPSGSGKSSLAFDTLFSESRRRFLDCLSARSRQGMDQPEKPEVDSITGL PPALCLEQSARQQSSRTLLGSITEILDYLRILYAAAGTPHDPETGKELERKSPDRITEELVSLPEHTRLILTAPAENLLA QDPAATLADFQRQGFLRVYWNGEMRDIEEISSPVPPPPDAALVIDRLIVRGENTASRIADSLQTALRINPDEVRAIITIP GEEASIRAFHTRYRNPETGFLLPQLTPRHFSFNSPLGACPSCRGTGLNEQENGPCRACGGQRLSPLALAVTMPAPDRAYN LAELTALPLEDMAGELERLKTPPSLAAALTPLMEEINKRVRFLNELGLSYLSLDRQANTLSGGELQRARLASQLGGGLSG VLYILDEPTAGLHPADTDRLLRALRTLRNQGNTVLVVEHDEQILTAADHLVDMGPGSGTNGGRILAQGSLAEILGNSGSP TGEWLSGKRNMPASGRKTAPAGRLVLTGADKHNLNNVTLNIPVGTLTCISGPSGSGKSTLVRDCLIPAVRQDLSGKKGIP RRVQGTEHFNRLVVIDQSPIGKTPRSTPATATGLLQVLRPLYAQLPLSKQRGYTAARFSPNIRGGRCERCQGTGMIEVDM NFLGNVAMPCDACQGQCYNRETLEVTWKGKSIAQALALTVDEAAEFFSSLPRAAAILKSMQDVGLGYLNLNRRADTLSGG ESQRIKIAAELAKAPAWKLEEDGKRALFILDEPTSGLHFNEVALLLAALFRLRDAGHTILCVEHHKDLLNAADYLVDMGP GAGRHGGNIVAEGSPADVASNPEAPTSPWLVPR >Mature_833_residues MNLPISIRGARQHNLRNLNLDLPSNKLIVFCGPSGSGKSSLAFDTLFSESRRRFLDCLSARSRQGMDQPEKPEVDSITGL PPALCLEQSARQQSSRTLLGSITEILDYLRILYAAAGTPHDPETGKELERKSPDRITEELVSLPEHTRLILTAPAENLLA QDPAATLADFQRQGFLRVYWNGEMRDIEEISSPVPPPPDAALVIDRLIVRGENTASRIADSLQTALRINPDEVRAIITIP GEEASIRAFHTRYRNPETGFLLPQLTPRHFSFNSPLGACPSCRGTGLNEQENGPCRACGGQRLSPLALAVTMPAPDRAYN LAELTALPLEDMAGELERLKTPPSLAAALTPLMEEINKRVRFLNELGLSYLSLDRQANTLSGGELQRARLASQLGGGLSG VLYILDEPTAGLHPADTDRLLRALRTLRNQGNTVLVVEHDEQILTAADHLVDMGPGSGTNGGRILAQGSLAEILGNSGSP TGEWLSGKRNMPASGRKTAPAGRLVLTGADKHNLNNVTLNIPVGTLTCISGPSGSGKSTLVRDCLIPAVRQDLSGKKGIP RRVQGTEHFNRLVVIDQSPIGKTPRSTPATATGLLQVLRPLYAQLPLSKQRGYTAARFSPNIRGGRCERCQGTGMIEVDM NFLGNVAMPCDACQGQCYNRETLEVTWKGKSIAQALALTVDEAAEFFSSLPRAAAILKSMQDVGLGYLNLNRRADTLSGG ESQRIKIAAELAKAPAWKLEEDGKRALFILDEPTSGLHFNEVALLLAALFRLRDAGHTILCVEHHKDLLNAADYLVDMGP GAGRHGGNIVAEGSPADVASNPEAPTSPWLVPR
Specific function: The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrA is an ATPase and a DNA-binding protein. A damage recognition complex composed of 2 UvrA and 2 UvrB subunits scans DNA for abnormalities. When the presence of a lesion h
COG id: COG0178
COG function: function code L; Excinuclease ATPase subunit
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 ABC transporter domains [H]
Homologues:
Organism=Escherichia coli, GI2367343, Length=553, Percent_Identity=47.377938517179, Blast_Score=489, Evalue=1e-139,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003439 - InterPro: IPR017871 - InterPro: IPR013815 - InterPro: IPR003593 - InterPro: IPR004602 [H]
Pfam domain/function: PF00005 ABC_tran [H]
EC number: NA
Molecular weight: Translated: 89851; Mature: 89851
Theoretical pI: Translated: 7.17; Mature: 7.17
Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 1.6 %Met (Translated Protein) 3.4 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 1.6 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNLPISIRGARQHNLRNLNLDLPSNKLIVFCGPSGSGKSSLAFDTLFSESRRRFLDCLSA CCCCEEECCCCCCCCEEEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH RSRQGMDQPEKPEVDSITGLPPALCLEQSARQQSSRTLLGSITEILDYLRILYAAAGTPH HHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC DPETGKELERKSPDRITEELVSLPEHTRLILTAPAENLLAQDPAATLADFQRQGFLRVYW CCCCCHHHHHCCHHHHHHHHHCCCCCCEEEEECCHHHHHHCCCHHHHHHHHHCCCEEEEE NGEMRDIEEISSPVPPPPDAALVIDRLIVRGENTASRIADSLQTALRINPDEVRAIITIP CCCCHHHHHHCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHEEEEEEEC GEEASIRAFHTRYRNPETGFLLPQLTPRHFSFNSPLGACPSCRGTGLNEQENGPCRACGG CCCCHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC QRLSPLALAVTMPAPDRAYNLAELTALPLEDMAGELERLKTPPSLAAALTPLMEEINKRV CCCCCEEEEEECCCCCCCCCHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH RFLNELGLSYLSLDRQANTLSGGELQRARLASQLGGGLSGVLYILDEPTAGLHPADTDRL HHHHHHCHHHEECCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCHHHH LRALRTLRNQGNTVLVVEHDEQILTAADHLVDMGPGSGTNGGRILAQGSLAEILGNSGSP HHHHHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCCCCCCCCEEEECCCHHHHHCCCCCC TGEWLSGKRNMPASGRKTAPAGRLVLTGADKHNLNNVTLNIPVGTLTCISGPSGSGKSTL CCHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCEEEEECCCCCCCCHHH VRDCLIPAVRQDLSGKKGIPRRVQGTEHFNRLVVIDQSPIGKTPRSTPATATGLLQVLRP HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHH LYAQLPLSKQRGYTAARFSPNIRGGRCERCQGTGMIEVDMNFLGNVAMPCDACQGQCYNR HHHHCCCCHHCCCEEEECCCCCCCCCCCCCCCCCEEEECHHHHCCCCCCCHHHCCCCCCC ETLEVTWKGKSIAQALALTVDEAAEFFSSLPRAAAILKSMQDVGLGYLNLNRRADTLSGG CEEEEEECCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCEEEECCCCCCCCCCC ESQRIKIAAELAKAPAWKLEEDGKRALFILDEPTSGLHFNEVALLLAALFRLRDAGHTIL CCCEEEHHHHHHHCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEE CVEHHKDLLNAADYLVDMGPGAGRHGGNIVAEGSPADVASNPEAPTSPWLVPR EEHHHHHHHHHHHHHEECCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC >Mature Secondary Structure MNLPISIRGARQHNLRNLNLDLPSNKLIVFCGPSGSGKSSLAFDTLFSESRRRFLDCLSA CCCCEEECCCCCCCCEEEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH RSRQGMDQPEKPEVDSITGLPPALCLEQSARQQSSRTLLGSITEILDYLRILYAAAGTPH HHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC DPETGKELERKSPDRITEELVSLPEHTRLILTAPAENLLAQDPAATLADFQRQGFLRVYW CCCCCHHHHHCCHHHHHHHHHCCCCCCEEEEECCHHHHHHCCCHHHHHHHHHCCCEEEEE NGEMRDIEEISSPVPPPPDAALVIDRLIVRGENTASRIADSLQTALRINPDEVRAIITIP CCCCHHHHHHCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHEEEEEEEC GEEASIRAFHTRYRNPETGFLLPQLTPRHFSFNSPLGACPSCRGTGLNEQENGPCRACGG CCCCHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC QRLSPLALAVTMPAPDRAYNLAELTALPLEDMAGELERLKTPPSLAAALTPLMEEINKRV CCCCCEEEEEECCCCCCCCCHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH RFLNELGLSYLSLDRQANTLSGGELQRARLASQLGGGLSGVLYILDEPTAGLHPADTDRL HHHHHHCHHHEECCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCHHHH LRALRTLRNQGNTVLVVEHDEQILTAADHLVDMGPGSGTNGGRILAQGSLAEILGNSGSP HHHHHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCCCCCCCCEEEECCCHHHHHCCCCCC TGEWLSGKRNMPASGRKTAPAGRLVLTGADKHNLNNVTLNIPVGTLTCISGPSGSGKSTL CCHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCEEEEECCCCCCCCHHH VRDCLIPAVRQDLSGKKGIPRRVQGTEHFNRLVVIDQSPIGKTPRSTPATATGLLQVLRP HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHH LYAQLPLSKQRGYTAARFSPNIRGGRCERCQGTGMIEVDMNFLGNVAMPCDACQGQCYNR HHHHCCCCHHCCCEEEECCCCCCCCCCCCCCCCCEEEECHHHHCCCCCCCHHHCCCCCCC ETLEVTWKGKSIAQALALTVDEAAEFFSSLPRAAAILKSMQDVGLGYLNLNRRADTLSGG CEEEEEECCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCEEEECCCCCCCCCCC ESQRIKIAAELAKAPAWKLEEDGKRALFILDEPTSGLHFNEVALLLAALFRLRDAGHTIL CCCEEEHHHHHHHCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEE CVEHHKDLLNAADYLVDMGPGAGRHGGNIVAEGSPADVASNPEAPTSPWLVPR EEHHHHHHHHHHHHHEECCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: Hydrolase; Acting on ester bonds [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8675016 [H]