| Definition | Rhodopseudomonas palustris HaA2, complete genome. |
|---|---|
| Accession | NC_007778 |
| Length | 5,331,656 |
Click here to switch to the map view.
The map label for this gene is rpoN2 [H]
Identifier: 86747781
GI number: 86747781
Start: 739032
End: 740672
Strand: Reverse
Name: rpoN2 [H]
Synonym: RPB_0655
Alternate gene names: 86747781
Gene position: 740672-739032 (Counterclockwise)
Preceding gene: 86747782
Following gene: 86747780
Centisome position: 13.89
GC content: 66.54
Gene sequence:
>1641_bases ATGGCACTAACTCAACGCCTCGAGTTCCGACAGTCACAGTCGCTGGTGATGACGCCGCAGCTGATGCAGGCGATCAAGCT GCTGCAACTGTCGAATCTGGACCTCGCGACCTTCGTCGAGGACGAACTCGAGAAAAACCCCCTGCTGGACCGGGCCAGTG ACAACGCCGAACCGCCGGTCGCCGGCGAGGCAGTGATGGAGCGCGCCGAGGCCAGCGGCGACGATTTCGGCGGGAGCGAG AGCAGCGGAGACGGCTCTGACTTCTCCGACGGCGTCGGCAGCGACTCGTTCGAGCCGGGCGCCGAGGATTGGATGCACCG CGACCTCGGCAGCCGCAGCGAGATCGAGCAGACGCTCGATACCGGCATGGAAAACGTGTTTCCGGAAGAGCCGGCCGAGG CCGCCGCCCGCGCCGCGCAGGACGCGGCTCCAGCCTCCTACACCGAATGGGGCGGCGGCGCCTCCAGCGACGAGGGCTAC AATCTCGAGGCCTTCGTCGCCGCCGAAACCTCCCTGGCCGATCGCCTCGCCGAGCAGCTTGCGGTGGCGCTCACCGCGCC GTCGCAGCGCATGATCGGCCAATATCTGATCGACCTCGTGGATGACGCCGGATATCTGCCGCCCGACCTCGGTGACGCCG CGGAGCGTCTCGGCACCACCCAGGCCGAAGTCGAAGCCGTCGTCGCCGTTCTGCAGACCTTCGATCCGCCGGGGATCTGC GCGCGCTCGCTGGCCGAATGCCTGGCGATCCAGTTGCGCGAACTCGACCGGTTCGACCCGGCGATGCAGGCTTTGGTCGA GAATCTGGATCTCCTCGCCAAGCGCGACATCGCCAGCCTCCGCAAGCTCTGCGGCGTCGACGACGAGGATCTCGCCGATA TGATCGGCGAAATCCGCCATCTCGACCCGAAGCCGGGTCTGAAATTCGCATCGTCGCGGGTGCAGACCGTGGTGCCGGAC GTGTTCGTCCGCCCCGGCCCGGACGGCGGCTGGCTGGTCGAACTCAACAGCGACACGCTGCCGAAGGTGCTGGTCAACCA GTCCTATTACTCCGAACTGTCGAAGACGATCCGCAAGGACGGCGACAAGTCGTACTTCTCCGACTGCCTGCAGAACGCCA CCTGGCTGGTGCGCGCGCTCGACCAGCGTGCCCGCACCATCCTGAAAGTGGCGACCGAGATCGTGCGCCAGCAGGACGGC TTCTTCACCCACGGCGTCGCGCATCTGCGGCCGCTGAATCTGAAGGCGGTGGCCGACGCGATCCAGATGCACGAATCCAC GGTATCGCGCGTGACCGCCAACAAATACATGGCGACCAATCGGGGCACGTTCGAACTCAAGTATTTCTTTACCGCTTCGA TCGCTTCCGCCGACGGCGGCGAGGCGCATTCGGCCGAAGCCGTGCGCCATCACATCCGGCAGTTGATCGACGGCGAAGAG CCGACCGCCATCCTGTCGGACGACACCATCGTCGAACGGCTGCGCGAAGCCGGCATCGAGATTGCGCGCCGCACCGTCGC GAAGTATCGCGAGGCGATGCGGATCCCGTCGTCGGTGCAGCGGCGGCGCGACAAGCAGAGCATGCTCGGCACGGCCCTGG CGGCGCCCGCCGATCGGTCCCGCGACACCGCTCCGGCTTGA
Upstream 100 bases:
>100_bases GGTGGCCCCCTACTCCGAATTTCCAATTCGTCAAGGCGTGTACATCAGCCGCGGAAAGGCGTAAGCAAGAAACGGACCAA CTTGGATCGTTCCTGCCGCC
Downstream 100 bases:
>100_bases TTACGCCGCTGCTCGGACTACTGTCATTACCCCCAAGGACGAAACGAGGCATCCCATGACTTTTCGGGTCTCCGGCAAAA GCATCAGCGTCGGCGAAGCC
Product: RNA polymerase factor sigma-54
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 546; Mature: 545
Protein sequence:
>546_residues MALTQRLEFRQSQSLVMTPQLMQAIKLLQLSNLDLATFVEDELEKNPLLDRASDNAEPPVAGEAVMERAEASGDDFGGSE SSGDGSDFSDGVGSDSFEPGAEDWMHRDLGSRSEIEQTLDTGMENVFPEEPAEAAARAAQDAAPASYTEWGGGASSDEGY NLEAFVAAETSLADRLAEQLAVALTAPSQRMIGQYLIDLVDDAGYLPPDLGDAAERLGTTQAEVEAVVAVLQTFDPPGIC ARSLAECLAIQLRELDRFDPAMQALVENLDLLAKRDIASLRKLCGVDDEDLADMIGEIRHLDPKPGLKFASSRVQTVVPD VFVRPGPDGGWLVELNSDTLPKVLVNQSYYSELSKTIRKDGDKSYFSDCLQNATWLVRALDQRARTILKVATEIVRQQDG FFTHGVAHLRPLNLKAVADAIQMHESTVSRVTANKYMATNRGTFELKYFFTASIASADGGEAHSAEAVRHHIRQLIDGEE PTAILSDDTIVERLREAGIEIARRTVAKYREAMRIPSSVQRRRDKQSMLGTALAAPADRSRDTAPA
Sequences:
>Translated_546_residues MALTQRLEFRQSQSLVMTPQLMQAIKLLQLSNLDLATFVEDELEKNPLLDRASDNAEPPVAGEAVMERAEASGDDFGGSE SSGDGSDFSDGVGSDSFEPGAEDWMHRDLGSRSEIEQTLDTGMENVFPEEPAEAAARAAQDAAPASYTEWGGGASSDEGY NLEAFVAAETSLADRLAEQLAVALTAPSQRMIGQYLIDLVDDAGYLPPDLGDAAERLGTTQAEVEAVVAVLQTFDPPGIC ARSLAECLAIQLRELDRFDPAMQALVENLDLLAKRDIASLRKLCGVDDEDLADMIGEIRHLDPKPGLKFASSRVQTVVPD VFVRPGPDGGWLVELNSDTLPKVLVNQSYYSELSKTIRKDGDKSYFSDCLQNATWLVRALDQRARTILKVATEIVRQQDG FFTHGVAHLRPLNLKAVADAIQMHESTVSRVTANKYMATNRGTFELKYFFTASIASADGGEAHSAEAVRHHIRQLIDGEE PTAILSDDTIVERLREAGIEIARRTVAKYREAMRIPSSVQRRRDKQSMLGTALAAPADRSRDTAPA >Mature_545_residues ALTQRLEFRQSQSLVMTPQLMQAIKLLQLSNLDLATFVEDELEKNPLLDRASDNAEPPVAGEAVMERAEASGDDFGGSES SGDGSDFSDGVGSDSFEPGAEDWMHRDLGSRSEIEQTLDTGMENVFPEEPAEAAARAAQDAAPASYTEWGGGASSDEGYN LEAFVAAETSLADRLAEQLAVALTAPSQRMIGQYLIDLVDDAGYLPPDLGDAAERLGTTQAEVEAVVAVLQTFDPPGICA RSLAECLAIQLRELDRFDPAMQALVENLDLLAKRDIASLRKLCGVDDEDLADMIGEIRHLDPKPGLKFASSRVQTVVPDV FVRPGPDGGWLVELNSDTLPKVLVNQSYYSELSKTIRKDGDKSYFSDCLQNATWLVRALDQRARTILKVATEIVRQQDGF FTHGVAHLRPLNLKAVADAIQMHESTVSRVTANKYMATNRGTFELKYFFTASIASADGGEAHSAEAVRHHIRQLIDGEEP TAILSDDTIVERLREAGIEIARRTVAKYREAMRIPSSVQRRRDKQSMLGTALAAPADRSRDTAPA
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of the nitrogen fixation genes [H]
COG id: COG1508
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-54 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789594, Length=532, Percent_Identity=35.7142857142857, Blast_Score=305, Evalue=3e-84,
Paralogues:
None
Copy number: 70 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000394 - InterPro: IPR007046 - InterPro: IPR007634 [H]
Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]
EC number: NA
Molecular weight: Translated: 59499; Mature: 59368
Theoretical pI: Translated: 4.37; Mature: 4.37
Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 2.4 %Met (Translated Protein) 3.1 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 2.2 %Met (Mature Protein) 2.9 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MALTQRLEFRQSQSLVMTPQLMQAIKLLQLSNLDLATFVEDELEKNPLLDRASDNAEPPV CCCHHHHHHHHCCCEEECHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCC AGEAVMERAEASGDDFGGSESSGDGSDFSDGVGSDSFEPGAEDWMHRDLGSRSEIEQTLD HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHH TGMENVFPEEPAEAAARAAQDAAPASYTEWGGGASSDEGYNLEAFVAAETSLADRLAEQL HHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHH AVALTAPSQRMIGQYLIDLVDDAGYLPPDLGDAAERLGTTQAEVEAVVAVLQTFDPPGIC HHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHCCCCCHH ARSLAECLAIQLRELDRFDPAMQALVENLDLLAKRDIASLRKLCGVDDEDLADMIGEIRH HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC LDPKPGLKFASSRVQTVVPDVFVRPGPDGGWLVELNSDTLPKVLVNQSYYSELSKTIRKD CCCCCCHHHHHHHHHHHCCHHEECCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHCC GDKSYFSDCLQNATWLVRALDQRARTILKVATEIVRQQDGFFTHGVAHLRPLNLKAVADA CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCHHHHCCCCHHHHHHH IQMHESTVSRVTANKYMATNRGTFELKYFFTASIASADGGEAHSAEAVRHHIRQLIDGEE HHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEHHCCCCCCCHHHHHHHHHHHHHHCCCC PTAILSDDTIVERLREAGIEIARRTVAKYREAMRIPSSVQRRRDKQSMLGTALAAPADRS CCEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCC RDTAPA CCCCCC >Mature Secondary Structure ALTQRLEFRQSQSLVMTPQLMQAIKLLQLSNLDLATFVEDELEKNPLLDRASDNAEPPV CCHHHHHHHHCCCEEECHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCC AGEAVMERAEASGDDFGGSESSGDGSDFSDGVGSDSFEPGAEDWMHRDLGSRSEIEQTLD HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHH TGMENVFPEEPAEAAARAAQDAAPASYTEWGGGASSDEGYNLEAFVAAETSLADRLAEQL HHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHH AVALTAPSQRMIGQYLIDLVDDAGYLPPDLGDAAERLGTTQAEVEAVVAVLQTFDPPGIC HHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHCCCCCHH ARSLAECLAIQLRELDRFDPAMQALVENLDLLAKRDIASLRKLCGVDDEDLADMIGEIRH HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC LDPKPGLKFASSRVQTVVPDVFVRPGPDGGWLVELNSDTLPKVLVNQSYYSELSKTIRKD CCCCCCHHHHHHHHHHHCCHHEECCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHCC GDKSYFSDCLQNATWLVRALDQRARTILKVATEIVRQQDGFFTHGVAHLRPLNLKAVADA CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCHHHHCCCCHHHHHHH IQMHESTVSRVTANKYMATNRGTFELKYFFTASIASADGGEAHSAEAVRHHIRQLIDGEE HHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEHHCCCCCCCHHHHHHHHHHHHHHCCCC PTAILSDDTIVERLREAGIEIARRTVAKYREAMRIPSSVQRRRDKQSMLGTALAAPADRS CCEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCC RDTAPA CCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 1991712; 12597275 [H]