Definition | Candidatus Protochlamydia amoebophila UWE25, complete genome. |
---|---|
Accession | NC_005861 |
Length | 2,414,465 |
The map label for this gene is rpoA
Identifier: 46446067
GI number: 46446067
Start: 560293
End: 561408
Strand: Direct
Name: rpoA
Synonym: pc0433
Alternate gene names: 46446067
Gene position: 560293-561408 (Clockwise)
Preceding gene: 46446066
Following gene: 46446068
Centisome position: 23.21
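The coordinate fields above are consistent with one another if the gene length is taken as End - Start + 1 and the centisome position as the start coordinate expressed as a percentage of the genome length. That formula is an assumption, not something the record states. A minimal sketch (the function names are hypothetical):

```python
# Values copied from the record above.
GENOME_LENGTH = 2_414_465  # "Length" field
START, END = 560_293, 561_408  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1116
print(round(centisome(START, GENOME_LENGTH), 2))  # 23.21
```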
GC content: 38.08
Gene sequence:
>1116_bases
ATGTCAGTAAAATACGGCAAATTTGAAATGCCTCATAAGATTACAGTAGATCAAGAGTCTCCTGAATCTAATTTTGCCCG
TTATGTGGCTGAGCCTTTCGAAAGAGGATTTGGACATACAATTGGAAATGCATTGCGTAGAATGATGTTGTCGTCTTTAG
AAGCTCCAGCTATCATTTCTGTTCGCGTAGAGGGCATTCCTCACGAATATATGGCTATTGAAGGAATTTCTGAGGATATG
ACTAATATTATCCTCAACTTTAAAGGAGCTTTGCTACGCAAACTTCCTACTGAAGAAACTCCTAGAGATACGCGTATTTT
GACAAAAGTCGTCGAAGTGACACAAGACGACTTAGATCGCAATCAGGGGCAATACTGCGTTACATTGCAAGATGTGGTTC
AAGAAGGCAATTTTGAAATTGTTAATCCAGAATTACATCTTTTTACTGTGACTAAGCCTATGCGTCGTCAAGTAGATCTT
CGAATTGCCTTTGGACGCGGTTATGTCCCGTCTGAGCGTCACGTGGTACGTGATAAAACGTCCGATGAAATTTTAGTTGA
TGCAGCTTTTTCACCTGTTCGTTTAATCAATTATTTTATCGAGAATACACGGGTAGGTCAAGATACTGATTTTGATCGAC
TCATCATGGAAGTTACCACTGATGGGCGTATTACTCCTGCTGAAGCTTTGAGTTTTGCAGTTCAAATTGGGCTTAAACAT
TTTGAAGTGTTTAATCAATTTAATAATTACGCCCTTTCTTTTGACGAAAAAGATGGAGATCGTAACGGTGATCAAGATGA
GTTAATGGATAAACTTTCTTTAGGAATTGATGAAATCGAGTTATCAGTTCGTTCAGCTAACTGCTTAACTGGTGCTAATA
TCGAGACACTCGCAGAATTGGTTTGCATTCCAGAACGTAGGATGTTAGAATTCAGAAACTTCGGTAAAAAATCCTTAAAT
GAAATTAAAGCTAAACTTCATGAGATGTCATTGCACTTAGGCATGGACTTGAGTCGTTTTGGGGTTTCACCTGATAATGT
CAAAGATAAAATCAAGCAGTACCGTGAAGAAAAGAAAAAGAAAAAAGAATTAGTTAAACACGAAGATGCTAAGTAG
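The GC content field can be recomputed from the gene sequence. The sketch below assumes the percentage is the number of G and C bases divided by the number of A, C, G and T bases; the record does not state the formula it used, and `gc_content` is a hypothetical helper name. Whitespace and other non-base characters in the sequence are ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative input, not the gene itself.
print(gc_content("ATGC ATGC"))  # 50.0
```

Under that assumption, the listed value of 38.08 for a 1116-base gene would correspond to 425 G or C bases.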
Upstream 100 bases:
>100_bases
GTATCAGCATTAAAAGGCTTATTAATAGACTGCTCATTTGCTTCATACGATCAAGAGCCAAACGTTCATCTCAACTATTA
CTATTTTTAGGAGTCACTTC
Downstream 100 bases:
>100_bases
GTAAACAGTTATGAGACACCTCAATCAAACATGTAAGCTCAATCGAACCACGTCTCATAGACGTTGCATGTTCGCCAACA
TGTTAAAATCTTTAATATCG
Product: DNA-directed RNA polymerase subunit alpha
Products: NA
Alternate protein names: RNAP subunit alpha; RNA polymerase subunit alpha; Transcriptase subunit alpha
Number of amino acids: Translated: 371; Mature: 370
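The two amino-acid counts follow from the 1116-base gene sequence: 372 codons, of which the last (TAG) is a stop codon, give 371 translated residues. The mature count of 370 matches removal of the initiator methionine, which is how the Translated and Mature sequences in this record differ. A minimal sketch of that arithmetic (`protein_lengths` is a hypothetical helper, and it assumes a complete coding sequence ending in a stop codon):

```python
def protein_lengths(cds_length: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a complete CDS.

    Assumes the CDS ends with one stop codon and that the mature
    protein differs only by loss of the initiator methionine.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_length // 3 - 1  # drop the stop codon
    mature = translated - 1           # drop the initiator Met
    return translated, mature


print(protein_lengths(1116))  # (371, 370)
```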
Protein sequence:
>371_residues
MSVKYGKFEMPHKITVDQESPESNFARYVAEPFERGFGHTIGNALRRMMLSSLEAPAIISVRVEGIPHEYMAIEGISEDM
TNIILNFKGALLRKLPTEETPRDTRILTKVVEVTQDDLDRNQGQYCVTLQDVVQEGNFEIVNPELHLFTVTKPMRRQVDL
RIAFGRGYVPSERHVVRDKTSDEILVDAAFSPVRLINYFIENTRVGQDTDFDRLIMEVTTDGRITPAEALSFAVQIGLKH
FEVFNQFNNYALSFDEKDGDRNGDQDELMDKLSLGIDEIELSVRSANCLTGANIETLAELVCIPERRMLEFRNFGKKSLN
EIKAKLHEMSLHLGMDLSRFGVSPDNVKDKIKQYREEKKKKKELVKHEDAK
Sequences:
>Translated_371_residues
MSVKYGKFEMPHKITVDQESPESNFARYVAEPFERGFGHTIGNALRRMMLSSLEAPAIISVRVEGIPHEYMAIEGISEDM
TNIILNFKGALLRKLPTEETPRDTRILTKVVEVTQDDLDRNQGQYCVTLQDVVQEGNFEIVNPELHLFTVTKPMRRQVDL
RIAFGRGYVPSERHVVRDKTSDEILVDAAFSPVRLINYFIENTRVGQDTDFDRLIMEVTTDGRITPAEALSFAVQIGLKH
FEVFNQFNNYALSFDEKDGDRNGDQDELMDKLSLGIDEIELSVRSANCLTGANIETLAELVCIPERRMLEFRNFGKKSLN
EIKAKLHEMSLHLGMDLSRFGVSPDNVKDKIKQYREEKKKKKELVKHEDAK
>Mature_370_residues
SVKYGKFEMPHKITVDQESPESNFARYVAEPFERGFGHTIGNALRRMMLSSLEAPAIISVRVEGIPHEYMAIEGISEDMT
NIILNFKGALLRKLPTEETPRDTRILTKVVEVTQDDLDRNQGQYCVTLQDVVQEGNFEIVNPELHLFTVTKPMRRQVDLR
IAFGRGYVPSERHVVRDKTSDEILVDAAFSPVRLINYFIENTRVGQDTDFDRLIMEVTTDGRITPAEALSFAVQIGLKHF
EVFNQFNNYALSFDEKDGDRNGDQDELMDKLSLGIDEIELSVRSANCLTGANIETLAELVCIPERRMLEFRNFGKKSLNE
IKAKLHEMSLHLGMDLSRFGVSPDNVKDKIKQYREEKKKKKELVKHEDAK
Specific function: DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates
COG id: COG0202
COG function: function code K; DNA-directed RNA polymerase, alpha subunit/40 kD subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the RNA polymerase alpha chain family
Homologues:
Organism=Escherichia coli, GI1789690, Length=340, Percent_Identity=37.0588235294118, Blast_Score=189, Evalue=2e-49
Paralogues:
None
Copy number:
- 850 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli)
- 560 Molecules/Cell In: Stationary-Phase, Rich-Media (Based on E. coli)
- 6,847 Molecules/Cell In: Growth Phase, Glucose-minimal MOPS Media
- 7,000 Molecules/Cell In: Glucose minimal media
Swissprot (AC and ID): RPOA_PARUW (Q6ME42)
Other databases:
- EMBL: BX908798
- RefSeq: YP_007432.1
- ProteinModelPortal: Q6ME42
- STRING: Q6ME42
- GeneID: 2781452
- GenomeReviews: BX908798_GR
- KEGG: pcu:pc0433
- NMPDR: fig|264201.1.peg.433
- eggNOG: COG0202
- HOGENOM: HBG430844
- OMA: FGTTLGN
- ProtClustDB: PRK05182
- BioCyc: CPRO264201:PC0433-MONOMER
- HAMAP: MF_00059
- InterPro: IPR011261
- InterPro: IPR011262
- InterPro: IPR009025
- InterPro: IPR011263
- InterPro: IPR011260
- InterPro: IPR011773
- Gene3D: G3DSA:2.170.120.12
- ProDom: PD001179
- SMART: SM00662
- TIGRFAMs: TIGR02027
Pfam domain/function: PF01000 RNA_pol_A_bac; PF03118 RNA_pol_A_CTD; PF01193 RNA_pol_L; SSF47789 RNAP_alpha_C; SSF56553 RNAP_insert; SSF55257 RNAP_RBP11-like
EC number: 2.7.7.6
Molecular weight: Translated: 42445; Mature: 42314
Theoretical pI: Translated: 5.38; Mature: 5.38
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.8 %Cys (Translated Protein)
- 3.2 %Met (Translated Protein)
- 4.0 %Cys+Met (Translated Protein)
- 0.8 %Cys (Mature Protein)
- 3.0 %Met (Mature Protein)
- 3.8 %Cys+Met (Mature Protein)
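The Cys/Met percentages can be recomputed from the protein sequences as residue counts divided by sequence length. This is a sketch under that assumption, with rounding to one decimal place chosen to match the precision shown in the record; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()  # drop whitespace and line breaks
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Small illustrative input, not the rpoA protein itself.
toy = "MACM"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

Passing the Translated and Mature sequences from this record with `"C"`, `"M"` and `"CM"` would give the six values to compare against the list above.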
Secondary structure:
>Translated Secondary Structure
MSVKYGKFEMPHKITVDQESPESNFARYVAEPFERGFGHTIGNALRRMMLSSLEAPAIIS
CCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEE
VRVEGIPHEYMAIEGISEDMTNIILNFKGALLRKLPTEETPRDTRILTKVVEVTQDDLDR
EEECCCCCHHHEECCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHC
NQGQYCVTLQDVVQEGNFEIVNPELHLFTVTKPMRRQVDLRIAFGRGYVPSERHVVRDKT
CCCCEEEEHHHHHHCCCEEEECCCEEEEEECHHHHHCEEEEEEECCCCCCCCCCEECCCC
SDEILVDAAFSPVRLINYFIENTRVGQDTDFDRLIMEVTTDGRITPAEALSFAVQIGLKH
CCCEEEECCCCHHHHHHHHHHCCCCCCCCCHHHHHEEECCCCCCCHHHHHHHHHHHHHHH
FEVFNQFNNYALSFDEKDGDRNGDQDELMDKLSLGIDEIELSVRSANCLTGANIETLAEL
HHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHCCHHHHEEEEECCCCCCCCCHHHHHHH
VCIPERRMLEFRNFGKKSLNEIKAKLHEMSLHLGMDLSRFGVSPDNVKDKIKQYREEKKK
HHCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHHHHH
KKELVKHEDAK
HHHHHHCCCCC
>Mature Secondary Structure
SVKYGKFEMPHKITVDQESPESNFARYVAEPFERGFGHTIGNALRRMMLSSLEAPAIIS
CCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEE
VRVEGIPHEYMAIEGISEDMTNIILNFKGALLRKLPTEETPRDTRILTKVVEVTQDDLDR
EEECCCCCHHHEECCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHC
NQGQYCVTLQDVVQEGNFEIVNPELHLFTVTKPMRRQVDLRIAFGRGYVPSERHVVRDKT
CCCCEEEEHHHHHHCCCEEEECCCEEEEEECHHHHHCEEEEEEECCCCCCCCCCEECCCC
SDEILVDAAFSPVRLINYFIENTRVGQDTDFDRLIMEVTTDGRITPAEALSFAVQIGLKH
CCCEEEECCCCHHHHHHHHHHCCCCCCCCCHHHHHEEECCCCCCCHHHHHHHHHHHHHHH
FEVFNQFNNYALSFDEKDGDRNGDQDELMDKLSLGIDEIELSVRSANCLTGANIETLAEL
HHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHCCHHHHEEEEECCCCCCCCCHHHHHHH
VCIPERRMLEFRNFGKKSLNEIKAKLHEMSLHLGMDLSRFGVSPDNVKDKIKQYREEKKK
HHCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHHHHH
KKELVKHEDAK
HHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA