Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

Click here to switch to the map view.

The map label for this gene is rarA

Identifier: 209397045

GI number: 209397045

Start: 1072462

End: 1073805

Strand: Direct

Name: rarA

Synonym: ECH74115_1054

Alternate gene names: 209397045

Gene position: 1072462-1073805 (Clockwise)

Preceding gene: 209397900

Following gene: 209396596

Centisome position: 19.25

GC content: 53.27

Gene sequence:

>1344_bases
GTGAGCAATCTGTCGCTCGATTTTTCGGATAATACTTTTCAACCTCTGGCCGCGCGTATGCGGCCAGAAAATTTAGCACA
GTATATCGGTCAGCAACATTTGCTGGCTGCGGGGAAGCCGTTGCCGCGCGCTATCGAAGCCGGGCATTTACATTCTATGA
TCCTCTGGGGGCCACCGGGTACCGGCAAAACAACCCTCGCTGAAGTGATTGCCCGCTATGCGAACGCTGATGTGGAACGT
ATTTCTGCCGTCACCTCTGGTGTGAAAGAGATTCGCGAGGCGATCGAGCGCGCCCGGCAAAACCGCAATGCAGGTCGCCG
CACTATTCTTTTTGTTGACGAAGTTCACCGTTTCAACAAAAGCCAGCAGGATGCATTTCTGCCACATATTGAAGACGGCA
CCATCACTTTTATTGGCGCAACCACTGAAAACCCGTCGTTTGAGCTTAATTCGGCACTGCTTTCCCGTGCCCGTGTTTAT
CTGTTGAAATCCCTGAGTACAGAGGATATTGAGCAAGTACTAACTCAGGCGATGGAAGACAAAACCCGTGGCTATGGTGG
TCAGGATATTGTTCTGCCAGATGAAACACGACGTGCCATTGCAGAACTGGTGAATGGCGACGCGCGCCGGGCGTTAAATA
CGCTGGAAATGATGGCGGATATGGCCGAAGTCGATGATAGCGGTAAGCGGGTCCTGAAGCCTGAATTACTGACCGAAATC
GCCGGTGAACGTAGCGCCCGCTTTGATAACAAAGGCGATCGCTTTTACGATCTGATTTCCGCACTGCATAAGTCGGTACG
TGGTAGCGCACCCGATGCGGCGCTGTACTGGTATGCGCGAATTATTACCGCTGGTGGCGATCCGTTATATGTCGCGCGTC
GCTGTCTGGCGATTGCGTCTGAAGACGTCGGTAATGCCGATCCACGGGCGATGCAGGTGGCAATTGCGGCCTGGGATTGC
TTTACTCGCGTTGGCCCGGCGGAAGGTGAACGCGCCATTGCTCAGGCGATTGTTTACCTGGCCTGCGCGCCAAAAAGCAA
CGCTGTCTACACTGCGTTTAAAGCCGCGCTGGCCGATGCTCGCGAACGCCCGGATTATGACGTGCCGGTTCATTTGCGTA
ATGCGCCGACGAAATTAATGAAGGAAATGGGCTACGGGCAGGAATATCGTTACGCTCATGATGAAGCAAACGCTTATGCT
GCCGGTGAGGTTTACTTCCCGCCGGAAATAGCACAAACACGCTATTATTTCCCGACAAACAGGGGCCTTGAAGGCAAGAT
TGGCGAAAAGCTCGCCTGGCTGGCTGAACAGGATCAAAATAGCCCCATAAAACGCTACCGTTAA

Upstream 100 bases:

>100_bases
CTGAAATCCCAGCAAAATGGGGCTGTGGATGCAGCGAAATTTACCTTCACCCCGCCGCAAGGCGTCACGGTAGATGATCA
ACGTAAGTAGAGGCACCTGA

Downstream 100 bases:

>100_bases
TGTTATCGTTGCGGTAATGTTGTTACTGTATCCCTGTGGTCGCAGGCTGTGGCCACATCTCCCATTTAATTCGATAAGCA
CAGGATAAGCATGCTCGATC

Product: recombination factor protein RarA

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 447; Mature: 446

Protein sequence:

>447_residues
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVER
ISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVY
LLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEI
AGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAIAAWDC
FTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR

Sequences:

>Translated_447_residues
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVER
ISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVY
LLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEI
AGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAIAAWDC
FTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>Mature_446_residues
SNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVERI
SAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVYL
LKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEIA
GERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAIAAWDCF
TRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYAA
GEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR

Specific function: Unknown

COG id: COG2256

COG function: function code L; ATPase related to the helicase subunit of the Holliday junction resolvase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the rarA family

Homologues:

Organism=Homo sapiens, GI18426902, Length=438, Percent_Identity=41.0958904109589, Blast_Score=302, Evalue=4e-82,
Organism=Homo sapiens, GI18426904, Length=438, Percent_Identity=36.5296803652968, Blast_Score=243, Evalue=3e-64,
Organism=Escherichia coli, GI1787119, Length=447, Percent_Identity=100, Blast_Score=921, Evalue=0.0,
Organism=Saccharomyces cerevisiae, GI6324111, Length=448, Percent_Identity=40.625, Blast_Score=305, Evalue=8e-84,
Organism=Saccharomyces cerevisiae, GI6324039, Length=237, Percent_Identity=26.5822784810127, Blast_Score=74, Evalue=5e-14,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): RARA_ECO57 (P0AAZ6)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   A99751
- RefSeq:   NP_286769.1
- RefSeq:   NP_309004.1
- ProteinModelPortal:   P0AAZ6
- SMR:   P0AAZ6
- MINT:   MINT-1224488
- EnsemblBacteria:   EBESCT00000024643
- EnsemblBacteria:   EBESCT00000055352
- GeneID:   917730
- GeneID:   958851
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1238
- KEGG:   ecs:ECs0977
- GeneTree:   EBGT00050000009793
- HOGENOM:   HBG635390
- OMA:   TENTTYY
- ProtClustDB:   PRK13342
- BioCyc:   ECOL83334:ECS0977-MONOMER
- InterPro:   IPR003593
- InterPro:   IPR003959
- InterPro:   IPR021886
- SMART:   SM00382

Pfam domain/function: PF00004 AAA; PF12002 MgsA_C

EC number: NA

Molecular weight: Translated: 49627; Mature: 49495

Theoretical pI: Translated: 6.34; Mature: 6.34

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPG
CCCCEECCCCCCHHHHHHHCCHHHHHHHHCHHHHHHCCCCCCHHHHCCCCCEEEEECCCC
TGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNK
CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHCCC
SQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVYLLKSLSTEDIEQVLTQAMED
CCCCCCCCEECCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH
KTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEI
HHCCCCCCEEECCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCEECCHHHHHHH
AGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
HCCHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHH
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADA
CCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHEECCCCCCHHHHHHHHHHHH
RERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYAAGEVYFPPEIAQTRYYFPTN
HHCCCCCCCEEECCCHHHHHHHCCCCCCCCCCCCCCCCEECCEEECCCHHHCCEEECCCC
RGLEGKIGEKLAWLAEQDQNSPIKRYR
CCCCCCHHHHHHHHHHCCCCCCHHCCC
>Mature Secondary Structure 
SNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPG
CCCEECCCCCCHHHHHHHCCHHHHHHHHCHHHHHHCCCCCCHHHHCCCCCEEEEECCCC
TGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNK
CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHCCC
SQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVYLLKSLSTEDIEQVLTQAMED
CCCCCCCCEECCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH
KTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEI
HHCCCCCCEEECCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCEECCHHHHHHH
AGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
HCCHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHH
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADA
CCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHEECCCCCCHHHHHHHHHHHH
RERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYAAGEVYFPPEIAQTRYYFPTN
HHCCCCCCCEEECCCHHHHHHHCCCCCCCCCCCCCCCCEECCEEECCCHHHCCEEECCCC
RGLEGKIGEKLAWLAEQDQNSPIKRYR
CCCCCCHHHHHHHHHHCCCCCCHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796