Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

Click here to switch to the map view.

The map label for this gene is rpoN [H]

Identifier: 15887683

GI number: 15887683

Start: 325172

End: 326716

Strand: Reverse

Name: rpoN [H]

Synonym: Atu0332

Alternate gene names: 15887683

Gene position: 326716-325172 (Counterclockwise)

Preceding gene: 15887684

Following gene: 159184262

Centisome position: 11.5

GC content: 58.77

Gene sequence:

>1545_bases
ATGGCATTGTCTGCCAGCCTTTTGCTGCGTCAAAACCAGTCTCTCGTGATGACACCGCAACTGATGCAGTCCATCCAGCT
GCTTCAGATGACGCATTTCGAGCTGACGCAGTTTATCGCGCAGGAGGTGGAGCGCAATCCATTGCTGGAAATTGCAGCAA
ACGATGGCGATTTGGGCAGCGATACGCCTGTCGACGTCGATAATTTCGGCGACGCTGAAGCTTCGGATGCGGCCGTCAGG
GTGACGCCCGACAGCGATGACTGGTACGGGGAGCGCGCCGCAAATCTTGGCGAGCAGCTGGACACGAGCTTCGAAAACGT
CTTTCCCGACGATGGCGAACCGAGAAAAGCCGATGCGCCCGAACTCGCAAGCCAATGGAAATCCATGCCCGGGCAGGAGT
CCGGAGAAAGCTACGACCTCGACGACTTCGTGGCCGCCCGACAAAGCCTCAGCGACCACCTCAATCAGCAACTGCCGCTC
GCCATCTCCGCCGCCGAGGACAGGATGATCGCCGATGCGCTGATCGGCCAGCTCGATGAGACCGGTTATATCGCCGCGGA
TGCCGTCGACGACGTTGCAGAACGGCTTGGTGCCACACCATCGGCGGTCGAGTACGTTCTGAAGACGCTTCAGGGTTTTG
ATCCGCCGGGCATCTTTTCCCGTTCCTTGAGCGAATGTCTGGCAACCCAGCTCGCACAGAAGGACCGGCTCGACCCGGCC
ATGCGAGCTTTTGTCGATAATCTCGAGCTTCTGGCGAAACGCGATTTTGCCTCCTTAAAAAAACTCTGCGGCGTCGATGA
AGAAGACCTTCTCGACATGCTGGCGGAAATCCGCACGCTCAATCCGCGCCCCGGCGCGGGTTACGATTCGATGGTTTCGG
AAACAATCGTTCCTGATATCATCGTCCGCCCCTCCTCCACCGGTGGCTGGCTGGTGGAGATCAATCCCGACACCCTGCCG
CGTGTGCTCATCAACCAGAGCTACTTTGCGGAGGTTTCAAAACATAAGGCGCGCGCAGGAGAGGATCAGGATTTCCTGTC
CGAATGCATGCAGACAGCCCATTGGCTGACCCGCAGCCTCGATCAGCGCGCCCGGACGATCATGAAGGTGGCGAGCGAAA
TCGTACGCCAGCAGGATGCCTTCCTCATCAACGGCGTCGACCAGCTGCGCCCGCTGAACCTCAAGACGGTGGCCGACGCC
ATCAAGATGCATGAATCCACCGTCAGCCGCGTGACGTCGAAGAAATACATGCTGACGCCGCGCGGACTTTTCGAGCTCAA
ATATTTCTTCAGCGTGTCGATCAGCGCCGTGGAGGGCGGTGATAGCCATTCGGCGGAAGCGGTACGTCACCGCATCAAGG
CGATGATCGCGCAGGAAGCCGCTGAGGCCGTCCTTTCCGACGACGATATCGTCGACAATCTGAAAAAGACCGGCATCGAT
ATCGCACGCCGCACCGTCGCCAAATATCGCGAGGCGATGAATATTCCCTCCTCCGTGCAGAGGCGCCGGGAGAAGAAAGC
GATGGCGAAGCTTTCCGCCTTCTGA

Upstream 100 bases:

>100_bases
CAAGATAAAAAAGCAATTTTTGGGCCAAGTTCTGTCTCCTCCGCAAAATTTATGAAAACTGTGGCTATGGAGAAGCTGTA
AGGGGAGTTTCGCGTCCGCC

Downstream 100 bases:

>100_bases
GCAACGCGGCGATCACGTTTGCGGCGGTTTTTTGCCGCACCATTGACTCTTGTCACGGCAGGGGCTAGAAGCCCGCCGCA
CACGTCAGCAAGGTCATTTG

Product: RNA polymerase factor sigma-54

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 514; Mature: 513

Protein sequence:

>514_residues
MALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGSDTPVDVDNFGDAEASDAAVR
VTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAPELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPL
AISAAEDRMIADALIGQLDETGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA
MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDIIVRPSSTGGWLVEINPDTLP
RVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSLDQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADA
IKMHESTVSRVTSKKYMLTPRGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID
IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF

Sequences:

>Translated_514_residues
MALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGSDTPVDVDNFGDAEASDAAVR
VTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAPELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPL
AISAAEDRMIADALIGQLDETGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA
MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDIIVRPSSTGGWLVEINPDTLP
RVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSLDQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADA
IKMHESTVSRVTSKKYMLTPRGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID
IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF
>Mature_513_residues
ALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGSDTPVDVDNFGDAEASDAAVRV
TPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAPELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPLA
ISAAEDRMIADALIGQLDETGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPAM
RAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDIIVRPSSTGGWLVEINPDTLPR
VLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSLDQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADAI
KMHESTVSRVTSKKYMLTPRGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGIDI
ARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of the nitrogen fixation genes (nif operon), glnA and dctA for dicar

COG id: COG1508

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-54 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789594, Length=514, Percent_Identity=36.3813229571984, Blast_Score=298, Evalue=6e-82,

Paralogues:

None

Copy number: 70 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000394
- InterPro:   IPR007046
- InterPro:   IPR007634 [H]

Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]

EC number: NA

Molecular weight: Translated: 56755; Mature: 56623

Theoretical pI: Translated: 4.46; Mature: 4.46

Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGS
CCCCHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC
DTPVDVDNFGDAEASDAAVRVTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAP
CCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCH
ELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPLAISAAEDRMIADALIGQLDE
HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHCCC
TGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA
CCCEEHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHH
MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDI
HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCE
IVRPSSTGGWLVEINPDTLPRVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSL
EECCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH
DQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADAIKMHESTVSRVTSKKYMLTP
HHHHHHHHHHHHHHHHHHCCCEECCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCEEECC
RGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID
CHHHHHHHHHHEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH
IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF
HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
ALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGS
CCCHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC
DTPVDVDNFGDAEASDAAVRVTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAP
CCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCH
ELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPLAISAAEDRMIADALIGQLDE
HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHCCC
TGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA
CCCEEHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHH
MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDI
HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCE
IVRPSSTGGWLVEINPDTLPRVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSL
EECCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH
DQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADAIKMHESTVSRVTSKKYMLTP
HHHHHHHHHHHHHHHHHHCCCEECCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCEEECC
RGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID
CHHHHHHHHHHEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH
IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF
HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9537369 [H]