Definition | Novosphingobium aromaticivorans DSM 12444 chromosome, complete genome. |
---|---|
Accession | NC_007794 |
Length | 3,561,584 |
Click here to switch to the map view.
The map label for this gene is rpoN2 [H]
Identifier: 87198322
GI number: 87198322
Start: 319545
End: 321041
Strand: Direct
Name: rpoN2 [H]
Synonym: Saro_0297
Alternate gene names: 87198322
Gene position: 319545-321041 (Clockwise)
Preceding gene: 87198321
Following gene: 87198327
Centisome position: 8.97
GC content: 67.94
Gene sequence:
>1497_bases ATGGCGCTCGGGCCGAGGCTCGATATCCGGCAGACCCAGTCGCTGGTGATGACGCCGCAGCTCCAGCAGGCGATCAAGCT GCTGGCGTTGTCGAACCTCGAGGTCGAGGCCTTCATCGGCGAGGCGCTGGAGGCCAATCCGCTGCTCGAAATCGGTGAGA CGGCACCCGCCGAGGCGGTCGACGCCGGCCCCGAAGACCTGCGCCGCACGCATCTCGAATCCTCGCCGGTAGACCAGCTC GTTGCCGAGGGGCGGGTGGAGGAGGATCGCCCGCTCGACATCGATGTCACTGCGCTCGACCGCGATCGGGACACCGGGGA TGGCGATTTCGGGGGCGGCACGCTCGAGTTGTCCTCGACCCGCGAAAGTGGCGGCGGCGAGGGGCCGGACATCGACGAAC GGGGGCGCATCGAGGAAACGCTTGCCGAACATCTCCACGCCCAGATCGGCGCGACGACCTCGGACGCGCAGCTCCTCTTC GTCGCCCGCTGGCTGATCGACCAGCTCGACGAGGCAGGATATCTCGCCATGCCGATTGGCGAAGTGGCCGAGGCGCTGGG CCTTTCGCCGCTGGTGGTCGAGCGGGCGCTGGCTCTCGTCCAGTCGCTCGACCCGACCGGGGTCGGCGCACGCAACCTGG CGGAATGCATCGCGCTCCAGGCCCGGGAGGCGGACCGCTATGATCCGTGCATGGCGCGGCTGATCGACAATCTGGAACTT GTTGCGCGCGGCGAGATCGCGCGGCTGAAACGCCTGTGCCAGGTCGACGACGAGGACTTTGCCGACATGCTGGCCGAGCT GCGCGGCTACGACCCGCGCCCCGGCCTGCGCTTCGGCGGGGGCGCGGCGGAGCCGGTCGTGCCCGATATCCTGGTGCGCG CGGCAAAGGGCGGCTGGGACATCGCGCTCAATCAGGCGACCCTGCCGCGCCTCGTCGTCAATCGCAGCTACTACGTGGAG ATGCGCGGGGCCTGTGTCGGCAAGGAGGCCAAGGCCTGGCTGGGGGAGAAGCTGGCCGACGCGAACTGGCTGCTGAAGGC GCTCGACCAGCGGCAGAAGACCATCCTCAAGGTCGCGGCCGAGATCGTGAAGCAGCAGGACGGCTTCTTCCGGCACGGCG TCGCGCACTTGCGCCCGTTGACGCTGAAGACCGTGGCCGAAGCGATATCGATGCATGAATCGACCGTCAGCCGCGTGACT TCGAACAAGTACCTCCATTGCGACCGGGGTACCTTCGAGCTGAAGTATTTCTTCACTTCGGGCGTCGGCTCTTCCGACGG TGAGGGCGCTTCGGCCGCGGCGGTGAAGGCTGCGATCCGCCAGCTCATCGATGCCGAGGACCCCAAGGCAATCCTTTCGG ACGATGCCCTGGTCGATCTGCTCAAGGCGCGGGGCTTCGACCTTGCCCGGCGCACGGTCGCCAAGTACCGCGAGGCGATC GGGCTCGGAAGTTCGGTCCAGCGCCGCCGCCAGAAAACACTCGCCGGGGTGCGCTGA
Upstream 100 bases:
>100_bases GCGCTCGTGGCCGATGCCAACGTGCGGCGGCTCTATCTGGGTGAAGGCTTCACCCTGTGACGATGGCCGGCTGATGCGGA TCTCGGGGGGGATCTAGGCC
Downstream 100 bases:
>100_bases CGCCCTGTTGCGACGGATTGCTGGCCTTCACGCCCGACGTCCTTGTACGTGACATTCCCGCCCGGTCGGGCAAGCTCGCA ATCGCGAAGTAGCGCCTATT
Product: RNA polymerase factor sigma-54
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 498; Mature: 497
Protein sequence:
>498_residues MALGPRLDIRQTQSLVMTPQLQQAIKLLALSNLEVEAFIGEALEANPLLEIGETAPAEAVDAGPEDLRRTHLESSPVDQL VAEGRVEEDRPLDIDVTALDRDRDTGDGDFGGGTLELSSTRESGGGEGPDIDERGRIEETLAEHLHAQIGATTSDAQLLF VARWLIDQLDEAGYLAMPIGEVAEALGLSPLVVERALALVQSLDPTGVGARNLAECIALQAREADRYDPCMARLIDNLEL VARGEIARLKRLCQVDDEDFADMLAELRGYDPRPGLRFGGGAAEPVVPDILVRAAKGGWDIALNQATLPRLVVNRSYYVE MRGACVGKEAKAWLGEKLADANWLLKALDQRQKTILKVAAEIVKQQDGFFRHGVAHLRPLTLKTVAEAISMHESTVSRVT SNKYLHCDRGTFELKYFFTSGVGSSDGEGASAAAVKAAIRQLIDAEDPKAILSDDALVDLLKARGFDLARRTVAKYREAI GLGSSVQRRRQKTLAGVR
Sequences:
>Translated_498_residues MALGPRLDIRQTQSLVMTPQLQQAIKLLALSNLEVEAFIGEALEANPLLEIGETAPAEAVDAGPEDLRRTHLESSPVDQL VAEGRVEEDRPLDIDVTALDRDRDTGDGDFGGGTLELSSTRESGGGEGPDIDERGRIEETLAEHLHAQIGATTSDAQLLF VARWLIDQLDEAGYLAMPIGEVAEALGLSPLVVERALALVQSLDPTGVGARNLAECIALQAREADRYDPCMARLIDNLEL VARGEIARLKRLCQVDDEDFADMLAELRGYDPRPGLRFGGGAAEPVVPDILVRAAKGGWDIALNQATLPRLVVNRSYYVE MRGACVGKEAKAWLGEKLADANWLLKALDQRQKTILKVAAEIVKQQDGFFRHGVAHLRPLTLKTVAEAISMHESTVSRVT SNKYLHCDRGTFELKYFFTSGVGSSDGEGASAAAVKAAIRQLIDAEDPKAILSDDALVDLLKARGFDLARRTVAKYREAI GLGSSVQRRRQKTLAGVR >Mature_497_residues ALGPRLDIRQTQSLVMTPQLQQAIKLLALSNLEVEAFIGEALEANPLLEIGETAPAEAVDAGPEDLRRTHLESSPVDQLV AEGRVEEDRPLDIDVTALDRDRDTGDGDFGGGTLELSSTRESGGGEGPDIDERGRIEETLAEHLHAQIGATTSDAQLLFV ARWLIDQLDEAGYLAMPIGEVAEALGLSPLVVERALALVQSLDPTGVGARNLAECIALQAREADRYDPCMARLIDNLELV ARGEIARLKRLCQVDDEDFADMLAELRGYDPRPGLRFGGGAAEPVVPDILVRAAKGGWDIALNQATLPRLVVNRSYYVEM RGACVGKEAKAWLGEKLADANWLLKALDQRQKTILKVAAEIVKQQDGFFRHGVAHLRPLTLKTVAEAISMHESTVSRVTS NKYLHCDRGTFELKYFFTSGVGSSDGEGASAAAVKAAIRQLIDAEDPKAILSDDALVDLLKARGFDLARRTVAKYREAIG LGSSVQRRRQKTLAGVR
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of the nitrogen fixation genes [H]
COG id: COG1508
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-54 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789594, Length=499, Percent_Identity=37.0741482965932, Blast_Score=310, Evalue=1e-85,
Paralogues:
None
Copy number: 70 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000394 - InterPro: IPR007046 - InterPro: IPR007634 [H]
Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]
EC number: NA
Molecular weight: Translated: 54011; Mature: 53880
Theoretical pI: Translated: 4.71; Mature: 4.71
Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein) 1.4 %Met (Translated Protein) 2.4 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 1.2 %Met (Mature Protein) 2.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MALGPRLDIRQTQSLVMTPQLQQAIKLLALSNLEVEAFIGEALEANPLLEIGETAPAEAV CCCCCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCHHH DAGPEDLRRTHLESSPVDQLVAEGRVEEDRPLDIDVTALDRDRDTGDGDFGGGTLELSST CCCHHHHHHHHCCCCCHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCEEEECCC RESGGGEGPDIDERGRIEETLAEHLHAQIGATTSDAQLLFVARWLIDQLDEAGYLAMPIG CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEECCHH EVAEALGLSPLVVERALALVQSLDPTGVGARNLAECIALQAREADRYDPCMARLIDNLEL HHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHCHHH VARGEIARLKRLCQVDDEDFADMLAELRGYDPRPGLRFGGGAAEPVVPDILVRAAKGGWD HHCCHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCE IALNQATLPRLVVNRSYYVEMRGACVGKEAKAWLGEKLADANWLLKALDQRQKTILKVAA EEECCCHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH EIVKQQDGFFRHGVAHLRPLTLKTVAEAISMHESTVSRVTSNKYLHCDRGTFELKYFFTS HHHHHCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEEEEC GVGSSDGEGASAAAVKAAIRQLIDAEDPKAILSDDALVDLLKARGFDLARRTVAKYREAI CCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH GLGSSVQRRRQKTLAGVR CCCHHHHHHHHHHHCCCC >Mature Secondary Structure ALGPRLDIRQTQSLVMTPQLQQAIKLLALSNLEVEAFIGEALEANPLLEIGETAPAEAV CCCCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCHHH DAGPEDLRRTHLESSPVDQLVAEGRVEEDRPLDIDVTALDRDRDTGDGDFGGGTLELSST CCCHHHHHHHHCCCCCHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCEEEECCC RESGGGEGPDIDERGRIEETLAEHLHAQIGATTSDAQLLFVARWLIDQLDEAGYLAMPIG CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEECCHH EVAEALGLSPLVVERALALVQSLDPTGVGARNLAECIALQAREADRYDPCMARLIDNLEL HHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHCHHH VARGEIARLKRLCQVDDEDFADMLAELRGYDPRPGLRFGGGAAEPVVPDILVRAAKGGWD HHCCHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCE IALNQATLPRLVVNRSYYVEMRGACVGKEAKAWLGEKLADANWLLKALDQRQKTILKVAA EEECCCHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH EIVKQQDGFFRHGVAHLRPLTLKTVAEAISMHESTVSRVTSNKYLHCDRGTFELKYFFTS HHHHHCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEEEEEEC GVGSSDGEGASAAAVKAAIRQLIDAEDPKAILSDDALVDLLKARGFDLARRTVAKYREAI CCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH GLGSSVQRRRQKTLAGVR CCCHHHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 1991712; 12597275 [H]