Definition Streptococcus pneumoniae D39, complete genome.
Accession NC_008533
Length 2,046,115

Click here to switch to the map view.

The map label for this gene is recA

Identifier: 116516275

GI number: 116516275

Start: 1732610

End: 1733776

Strand: Reverse

Name: recA

Synonym: SPD_1739

Alternate gene names: 116516275

Gene position: 1733776-1732610 (Counterclockwise)

Preceding gene: 116515783

Following gene: 116517012

Centisome position: 84.74

GC content: 41.82

Gene sequence:

>1167_bases
ATGGCGAAAAAACCAAAAAAATTAGAAGAAATTTCAAAAAAATTTGGGGCAGAACGTGAAAAGGCCTTGAATGACGCTCT
TAAATTGATTGAGAAAGACTTTGGTAAAGGATCAATCATGCGTTTGGGTGAACGTGCGGAGCAAAAGGTGCAAGTGATGA
GCTCAGGTTCTTTAGCTCTTGACATTGCCCTTGGCTCAGGTGGTTATCCTAAGGGACGTATCATCGAAATCTATGGCCCA
GAGTCATCTGGTAAGACAACGGTTGCCCTTCATGCAGTTGCACAAGCGCAAAAAGAAGGTGGGATTGCTGCCTTTATCGA
TGCGGAACATGCCCTTGATCCAGCTTATGCTGCGGCCCTTGGTGTCAATATTGACGAATTGCTCTTGTCTCAACCAGACT
CAGGAGAGCAAGGTCTTGAGATTGCGGGAAAATTGATTGACTCAGGTGCAGTTGATCTTGTCGTAGTCGACTCAGTTGCT
GCCCTTGTTCCTCGTGCGGAAATTGATGGAGATATCGGAGATAGCCATGTTGGTTTGCAGGCTCGTATGATGAGCCAGGC
CATGCGTAAACTTGGCGCCTCTATCAATAAAACCAAAACAATTGCCATTTTTATCAACCAATTGCGTGAAAAAGTTGGAG
TGATGTTTGGAAATCCAGAAACAACACCGGGCGGACGTGCTTTGAAATTCTATGCTTCAGTCCGCTTGGATGTTCGTGGT
AATACACAAATTAAGGGAACTGGTGATCAAAAAGAAACCAATGTCGGTAAAGAAACTAAGATTAAGGTTGTAAAAAATAA
GGTAGCTCCACCGTTTAAGGAAGCCGTAGTTGAAATTATGTACGGAGAAGGAATTTCTAAGACTGGTGAGCTTTTGAAGA
TTGCAAGCGATTTGGATATTATCAAAAAAGCAGGGGCTTGGTATTCTTACAAAGATGAAAAAATTGGGCAAGGTTCTGAG
AATGCTAAGAAATACTTGGCAGAGCACCCAGAAATCTTTGATGAAATTGATAAGCAAGTCCGTTCTAAATTTGGCTTGAT
TGATGGAGAAGAAGTTTCAGAACAAGATACTGAAAACAAAAAAGATGAGCCAAAGAAAGAAGAAGCAGTGAATGAAGAAG
TTCCGCTTGACTTAGGCGATGAACTTGAAATCGAAATTGAAGAATAA

Upstream 100 bases:

>100_bases
TATGCATGCCTTTAACCTAGTTCGCAAGGCTTTATTAAGTGACTAACTTTTGATATAATAGTAGATAGGTCTGAGGATCA
TTAGAATGTAGGAGAATAGA

Downstream 100 bases:

>100_bases
GCTGTTAAAGCAGTGGAGAAATCCGCTACTTTTTCGATTTTTGATTCAAGTTTTTAGATTATATATAGTAGCTTGAAATA
AGATATGAACAACTTTATTA

Product: recombinase A

Products: NA

Alternate protein names: Recombinase A

Number of amino acids: Translated: 388; Mature: 387

Protein sequence:

>388_residues
MAKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLALDIALGSGGYPKGRIIEIYGP
ESSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAALGVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVA
ALVPRAEIDGDIGDSHVGLQARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRG
NTQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDIIKKAGAWYSYKDEKIGQGSE
NAKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENKKDEPKKEEAVNEEVPLDLGDELEIEIEE

Sequences:

>Translated_388_residues
MAKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLALDIALGSGGYPKGRIIEIYGP
ESSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAALGVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVA
ALVPRAEIDGDIGDSHVGLQARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRG
NTQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDIIKKAGAWYSYKDEKIGQGSE
NAKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENKKDEPKKEEAVNEEVPLDLGDELEIEIEE
>Mature_387_residues
AKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLALDIALGSGGYPKGRIIEIYGPE
SSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAALGVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVAA
LVPRAEIDGDIGDSHVGLQARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRGN
TQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDIIKKAGAWYSYKDEKIGQGSEN
AKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENKKDEPKKEEAVNEEVPLDLGDELEIEIEE

Specific function: Can catalyze the hydrolysis of ATP in the presence of single-stranded DNA, the ATP-dependent uptake of single-stranded DNA by duplex DNA, and the ATP-dependent hybridization of homologous single-stranded DNAs. It interacts with lexA causing its activation

COG id: COG0468

COG function: function code L; RecA/RadA recombinase

Gene ontology:

Cell location: Cytoplasm

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the recA family

Homologues:

Organism=Escherichia coli, GI1789051, Length=323, Percent_Identity=63.4674922600619, Blast_Score=411, Evalue=1e-116,

Paralogues:

None

Copy number: 1548 Molecules/Cell In: Growth Phase, Glucose-minimal MOPS Media. 800-1200 (L-broth) 40,000-60,000 (L-broth + Nalidixate) 4,000 Molecules/Cell In: Glucose minimal media [C]

Swissprot (AC and ID): RECA_STRP2 (Q04IM6)

Other databases:

- EMBL:   CP000410
- RefSeq:   YP_817162.1
- ProteinModelPortal:   Q04IM6
- SMR:   Q04IM6
- STRING:   Q04IM6
- EnsemblBacteria:   EBSTRT00000019736
- GeneID:   4442603
- GenomeReviews:   CP000410_GR
- KEGG:   spd:SPD_1739
- eggNOG:   COG0468
- GeneTree:   EBGT00050000027961
- HOGENOM:   HBG339889
- OMA:   GRDNTIT
- ProtClustDB:   PRK09354
- GO:   GO:0005737
- HAMAP:   MF_00268
- InterPro:   IPR003593
- InterPro:   IPR013765
- InterPro:   IPR020584
- InterPro:   IPR020588
- InterPro:   IPR020587
- PANTHER:   PTHR22942:SF1
- PRINTS:   PR00142
- SMART:   SM00382
- TIGRFAMs:   TIGR02012

Pfam domain/function: PF00154 RecA

EC number: NA

Molecular weight: Translated: 41950; Mature: 41819

Theoretical pI: Translated: 4.84; Mature: 4.84

Prosite motif: PS00321 RECA_1; PS50162 RECA_2; PS50163 RECA_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MAKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLAL
CCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCEEE
DIALGSGGYPKGRIIEIYGPESSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAAL
EEEECCCCCCCCEEEEEECCCCCCCEEEEHHHHHHHHHCCCEEEEEECHHHCCHHHHHHH
GVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVAALVPRAEIDGDIGDSHVGLQ
CCCHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEEHHHHHHHCCCHHCCCCCCCCCCHHH
ARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRG
HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCEEECCCCCCCCCCEEEEEEEEEEEECC
NTQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDI
CEEEECCCCCHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHH
IKKAGAWYSYKDEKIGQGSENAKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENK
HHHCCCCCCCCCCCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHCHHHCCCC
KDEPKKEEAVNEEVPLDLGDELEIEIEE
CCCCHHHHHCCCCCCCCCCCCEEEEECC
>Mature Secondary Structure 
AKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLAL
CCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCEEE
DIALGSGGYPKGRIIEIYGPESSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAAL
EEEECCCCCCCCEEEEEECCCCCCCEEEEHHHHHHHHHCCCEEEEEECHHHCCHHHHHHH
GVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVAALVPRAEIDGDIGDSHVGLQ
CCCHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEEHHHHHHHCCCHHCCCCCCCCCCHHH
ARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRG
HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCEEECCCCCCCCCCEEEEEEEEEEEECC
NTQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDI
CEEEECCCCCHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHH
IKKAGAWYSYKDEKIGQGSENAKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENK
HHHCCCCCCCCCCCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHCHHHCCCC
KDEPKKEEAVNEEVPLDLGDELEIEIEE
CCCCHHHHHCCCCCCCCCCCCEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA