Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is rpoN [H]
Identifier: 15887683
GI number: 15887683
Start: 325172
End: 326716
Strand: Reverse
Name: rpoN [H]
Synonym: Atu0332
Alternate gene names: 15887683
Gene position: 326716-325172 (Counterclockwise)
Preceding gene: 15887684
Following gene: 159184262
Centisome position: 11.5
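The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that gene length is the inclusive span between Start and End, and that the centisome position is the gene's coordinate expressed as a percentage of the chromosome length. The function names are illustrative and not part of the database. That the record measures the centisome from the higher coordinate (the 5' end of a reverse-strand gene) is an inference from the numbers, not something the record states.

```python
def gene_length(start, end):
    """Inclusive length in bases between two 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position, genome_length):
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 2_841_580      # Length field of NC_003062
START, END = 325_172, 326_716  # Start and End fields of this record

print(gene_length(START, END))
# Assumption: for a reverse-strand gene the record uses the higher coordinate.
print(round(centisome(END, GENOME_LENGTH), 1))
```

Under these assumptions the computed length agrees with the 1545 bases given in the gene sequence header, and the rounded percentage agrees with the centisome value listed above.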
GC content: 58.77
Gene sequence:
>1545_bases ATGGCATTGTCTGCCAGCCTTTTGCTGCGTCAAAACCAGTCTCTCGTGATGACACCGCAACTGATGCAGTCCATCCAGCT GCTTCAGATGACGCATTTCGAGCTGACGCAGTTTATCGCGCAGGAGGTGGAGCGCAATCCATTGCTGGAAATTGCAGCAA ACGATGGCGATTTGGGCAGCGATACGCCTGTCGACGTCGATAATTTCGGCGACGCTGAAGCTTCGGATGCGGCCGTCAGG GTGACGCCCGACAGCGATGACTGGTACGGGGAGCGCGCCGCAAATCTTGGCGAGCAGCTGGACACGAGCTTCGAAAACGT CTTTCCCGACGATGGCGAACCGAGAAAAGCCGATGCGCCCGAACTCGCAAGCCAATGGAAATCCATGCCCGGGCAGGAGT CCGGAGAAAGCTACGACCTCGACGACTTCGTGGCCGCCCGACAAAGCCTCAGCGACCACCTCAATCAGCAACTGCCGCTC GCCATCTCCGCCGCCGAGGACAGGATGATCGCCGATGCGCTGATCGGCCAGCTCGATGAGACCGGTTATATCGCCGCGGA TGCCGTCGACGACGTTGCAGAACGGCTTGGTGCCACACCATCGGCGGTCGAGTACGTTCTGAAGACGCTTCAGGGTTTTG ATCCGCCGGGCATCTTTTCCCGTTCCTTGAGCGAATGTCTGGCAACCCAGCTCGCACAGAAGGACCGGCTCGACCCGGCC ATGCGAGCTTTTGTCGATAATCTCGAGCTTCTGGCGAAACGCGATTTTGCCTCCTTAAAAAAACTCTGCGGCGTCGATGA AGAAGACCTTCTCGACATGCTGGCGGAAATCCGCACGCTCAATCCGCGCCCCGGCGCGGGTTACGATTCGATGGTTTCGG AAACAATCGTTCCTGATATCATCGTCCGCCCCTCCTCCACCGGTGGCTGGCTGGTGGAGATCAATCCCGACACCCTGCCG CGTGTGCTCATCAACCAGAGCTACTTTGCGGAGGTTTCAAAACATAAGGCGCGCGCAGGAGAGGATCAGGATTTCCTGTC CGAATGCATGCAGACAGCCCATTGGCTGACCCGCAGCCTCGATCAGCGCGCCCGGACGATCATGAAGGTGGCGAGCGAAA TCGTACGCCAGCAGGATGCCTTCCTCATCAACGGCGTCGACCAGCTGCGCCCGCTGAACCTCAAGACGGTGGCCGACGCC ATCAAGATGCATGAATCCACCGTCAGCCGCGTGACGTCGAAGAAATACATGCTGACGCCGCGCGGACTTTTCGAGCTCAA ATATTTCTTCAGCGTGTCGATCAGCGCCGTGGAGGGCGGTGATAGCCATTCGGCGGAAGCGGTACGTCACCGCATCAAGG CGATGATCGCGCAGGAAGCCGCTGAGGCCGTCCTTTCCGACGACGATATCGTCGACAATCTGAAAAAGACCGGCATCGAT ATCGCACGCCGCACCGTCGCCAAATATCGCGAGGCGATGAATATTCCCTCCTCCGTGCAGAGGCGCCGGGAGAAGAAAGC GATGGCGAAGCTTTCCGCCTTCTGA
Upstream 100 bases:
>100_bases CAAGATAAAAAAGCAATTTTTGGGCCAAGTTCTGTCTCCTCCGCAAAATTTATGAAAACTGTGGCTATGGAGAAGCTGTA AGGGGAGTTTCGCGTCCGCC
Downstream 100 bases:
>100_bases GCAACGCGGCGATCACGTTTGCGGCGGTTTTTTGCCGCACCATTGACTCTTGTCACGGCAGGGGCTAGAAGCCCGCCGCA CACGTCAGCAAGGTCATTTG
Product: RNA polymerase factor sigma-54
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 514; Mature: 513
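The two residue counts follow from the length of the coding sequence. A minimal sketch, assuming the 1545-base sequence includes the terminal stop codon (it ends in TGA) and that the mature form differs from the translated form only by removal of the initiator methionine; `protein_lengths` is a hypothetical helper name.

```python
def protein_lengths(cds_bases):
    """Return (translated, mature) residue counts for a CDS length in bases."""
    if cds_bases % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1  # the stop codon encodes no residue
    mature = translated - 1  # assumption: initiator Met is removed
    return translated, mature


print(protein_lengths(1545))
```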
Protein sequence:
>514_residues MALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGSDTPVDVDNFGDAEASDAAVR VTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAPELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPL AISAAEDRMIADALIGQLDETGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDIIVRPSSTGGWLVEINPDTLP RVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSLDQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADA IKMHESTVSRVTSKKYMLTPRGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF
Sequences:
>Translated_514_residues MALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGSDTPVDVDNFGDAEASDAAVR VTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAPELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPL AISAAEDRMIADALIGQLDETGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDIIVRPSSTGGWLVEINPDTLP RVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSLDQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADA IKMHESTVSRVTSKKYMLTPRGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF >Mature_513_residues ALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGSDTPVDVDNFGDAEASDAAVRV TPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAPELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPLA ISAAEDRMIADALIGQLDETGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPAM RAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDIIVRPSSTGGWLVEINPDTLPR VLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSLDQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADAI KMHESTVSRVTSKKYMLTPRGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGIDI ARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of the nitrogen fixation genes (nif operon), glnA and dctA for dicar
COG id: COG1508
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-54 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789594, Length=514, Percent_Identity=36.3813229571984, Blast_Score=298, Evalue=6e-82,
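A homologue line like the one above can be split into named fields for downstream filtering. This is a sketch only: it assumes fields are comma-separated, that most are `key=value` pairs, and that the bare `GI…` token is a GI number. The function name and the `GI` key are illustrative choices, not a database API.

```python
def parse_homologue(line):
    """Parse one comma-separated homologue line into a dict of strings."""
    fields = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue  # skip the empty piece left by the trailing comma
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # assumption: bare token is a GI number
    return fields


line = ("Organism=Escherichia coli, GI1789594, Length=514, "
        "Percent_Identity=36.3813229571984, Blast_Score=298, Evalue=6e-82,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 1))
```

Values are kept as strings so that the caller decides how to convert them, for example `float(hit["Evalue"])` for threshold filtering.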
Paralogues:
None
Copy number: 70 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000394
- InterPro: IPR007046
- InterPro: IPR007634 [H]
Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]
EC number: NA
Molecular weight: Translated: 56755; Mature: 56623
Theoretical pI: Translated: 4.46; Mature: 4.46
Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
3.7 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
2.9 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
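The percentages above are residue counts divided by sequence length. The sketch below shows the calculation on a synthetic stand-in sequence rather than the real protein: it is built with 3 Cys and 16 Met in 514 residues, a composition chosen to reproduce the reported figures. `cys_met_percent` is a hypothetical helper, and the mature form is assumed to be the translated form minus its first residue.

```python
def cys_met_percent(seq):
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    n = len(seq)
    cys = 100.0 * seq.count("C") / n
    met = 100.0 * seq.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Synthetic stand-in, NOT the real sequence: 16 Met, 3 Cys, 495 Ala.
translated = "M" * 16 + "C" * 3 + "A" * 495
mature = translated[1:]  # assumption: only the initiator Met is removed

print(cys_met_percent(translated))
print(cys_met_percent(mature))
```

Removing one methionine lowers the Met percentage in the mature form while leaving the rounded Cys percentage unchanged, which is the pattern seen in the values listed above.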
Secondary structure:
>Translated Secondary Structure MALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGS CCCCHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC DTPVDVDNFGDAEASDAAVRVTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAP CCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCH ELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPLAISAAEDRMIADALIGQLDE HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHCCC TGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA CCCEEHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHH MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDI HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCE IVRPSSTGGWLVEINPDTLPRVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSL EECCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH DQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADAIKMHESTVSRVTSKKYMLTP HHHHHHHHHHHHHHHHHHCCCEECCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCEEECC RGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID CHHHHHHHHHHEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCC >Mature Secondary Structure ALSASLLLRQNQSLVMTPQLMQSIQLLQMTHFELTQFIAQEVERNPLLEIAANDGDLGS CCCHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC DTPVDVDNFGDAEASDAAVRVTPDSDDWYGERAANLGEQLDTSFENVFPDDGEPRKADAP CCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCH ELASQWKSMPGQESGESYDLDDFVAARQSLSDHLNQQLPLAISAAEDRMIADALIGQLDE HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHCCC TGYIAADAVDDVAERLGATPSAVEYVLKTLQGFDPPGIFSRSLSECLATQLAQKDRLDPA CCCEEHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHH MRAFVDNLELLAKRDFASLKKLCGVDEEDLLDMLAEIRTLNPRPGAGYDSMVSETIVPDI HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCE IVRPSSTGGWLVEINPDTLPRVLINQSYFAEVSKHKARAGEDQDFLSECMQTAHWLTRSL EECCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH DQRARTIMKVASEIVRQQDAFLINGVDQLRPLNLKTVADAIKMHESTVSRVTSKKYMLTP HHHHHHHHHHHHHHHHHHCCCEECCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCEEECC 
RGLFELKYFFSVSISAVEGGDSHSAEAVRHRIKAMIAQEAAEAVLSDDDIVDNLKKTGID CHHHHHHHHHHEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH IARRTVAKYREAMNIPSSVQRRREKKAMAKLSAF HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9537369 [H]