Definition Hyphomonas neptunium ATCC 15444 chromosome, complete genome.
Accession NC_008358
Length 3,705,021

Click here to switch to the map view.

The map label for this gene is rpoN [H]

Identifier: 114800351

GI number: 114800351

Start: 189908

End: 191407

Strand: Direct

Name: rpoN [H]

Synonym: HNE_0206

Alternate gene names: 114800351

Gene position: 189908-191407 (Clockwise)

Preceding gene: 114797827

Following gene: 114798703

Centisome position: 5.13

GC content: 63.53

Gene sequence:

>1500_bases
ATGGCCATGAAACAATCACTCGACATGCGCCAGGGCCAGTCCCTGGTAATGACGCCGCAGCTGCAGCAGGCGATCAAGCT
TCTGCAGCTGTCGAACCAGGAACTGTCCGAATTCATCGAGGCTGAGCTTGAGCGCAATCCGCTGCTCGTGAAGGTCGAGG
ATGAGGGCGGGGCGTCGGGCACGGCCGAGGACGCCGACCCCAAGAAGCGCGAGCAGCTGACCCTCGACGACAATTCCGGC
CTTGGCGAAGCGCGCGACCAGCTTGACGCGCCCGGTGAGGACGTCTTCGAGCCCGGCACAGGCTCTGATACCGGATCGGA
CGAGGCGCGCGGCCCCTCCGCCACCACTGACTGGTCAGGCGCGTCCGGGGGCGGTTCGTCCTCGGAGGAGTTTGACTATG
CCGCCAATGTTTCCGGCGACATCACGCTGCACGAACATCTCCATACCCAGCTGAACTTTGCCGGCCTTTCGGCGGCTGAC
CGGTTGATCGCCGCCCGCCTGATCGACGAGATGGACGAGACCGGCTATCTGCGCGCGCCGCTGGACGATGTAGCCAAGGC
GCTGGGCGCCGACATTGCCAATGTCGAGGCAGTCCTCGCCGTCTGCCAGGGCTTTGATCCGACCGGCGTGATGGCCCGCT
CCGTGCCCGAATGCCTCGCGCTCCAGCTGAAGGATCGCGGCCGGCTTGATCCGGCCATGCAGGCGATGCTGGACAATCTG
CATCTTGTGGCCCGCCACGACATGAAGGCGCTGATGGACGTGTGCGGTGTCGACAAGGCCGACATCCAGGACATGCTGCT
GGAGCTGCGCCAGCTTTCGCCCAAGCCGGGGGCGGGTTTTTCCAGCGATACCACGGTTGCCGTGGCGCCGGATGTTTTCG
TCCGTGAACTGCCCAATGGCATGTTTGCTGTGGAGCTCAACTCCGAGAACCTTCCGCGCGTGCTGATGGACAAGGCATAT
TATGCCGAGGTGACCGCTCTGCCGATGCGCGAGAAGGAGAAGGAATTCATCTCCGAATGCGCGGCGTCTGCCAGCTGGCT
GGTCAAGTCTCTGGATCAGCGGGCGCGCACCATATTGAAGGTGGCGAGCGAGATTGTTCGCCAGCAGGACGGGTTTTACG
CCCATGGCGTGGCGCACCTGCGCCCGCTGAACCTCAAACAGGTGGCTGATGCGATCGAGATGCATGAGTCCACCGTCAGC
CGTGTGACGACCAACAAGTACATGGCCACGCCGCGCGGCCTGTTTGAGTTCAAATACTTCTTTTCGGCCTCCATTCCGGC
CACCGGCGGCGGCGAGGCCCACTCGGCAGAAGCCGTCCGTCACCGGATCAAGCAGCTGATCGAGGACGAGGCACCTGAAG
ACGTGCTGTCGGACGACCAGATCGTGGACATCCTCACCGGTTTCGGCATCGAGATTGCCCGCCGCACGGTGGCAAAGTAT
CGCGAGAGCCTGAATATTCCCTCCAGCGTGCAACGCCGCCGCATGGGCCGGGCGGGGTAG

Upstream 100 bases:

>100_bases
TCTGTTTGCAGGCACGCCGGAAGACGTGCTGAAGAATGAGGACGTCCGCAGGGTCTATCTGGGTGAGCAGTTCGCCGCTT
GATCTCAGGGGGCGCGGCTT

Downstream 100 bases:

>100_bases
GGTTTTTGTTTTTTGCTCGGCGGTGTTGCTGAGACCTTCGGTCTCAGCCCGCCTGCGCCTTCGCTTTGCTCGTGGGGGCA
TCAAGCCTTGAGTGTAGTAG

Product: RNA polymerase factor sigma-54

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 499; Mature: 498

Protein sequence:

>499_residues
MAMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASGTAEDADPKKREQLTLDDNSG
LGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSGASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAAD
RLIAARLIDEMDETGYLRAPLDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL
HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNGMFAVELNSENLPRVLMDKAY
YAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILKVASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVS
RVTTNKYMATPRGLFEFKYFFSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY
RESLNIPSSVQRRRMGRAG

Sequences:

>Translated_499_residues
MAMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASGTAEDADPKKREQLTLDDNSG
LGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSGASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAAD
RLIAARLIDEMDETGYLRAPLDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL
HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNGMFAVELNSENLPRVLMDKAY
YAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILKVASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVS
RVTTNKYMATPRGLFEFKYFFSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY
RESLNIPSSVQRRRMGRAG
>Mature_498_residues
AMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASGTAEDADPKKREQLTLDDNSGL
GEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSGASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAADR
LIAARLIDEMDETGYLRAPLDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNLH
LVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNGMFAVELNSENLPRVLMDKAYY
AEVTALPMREKEKEFISECAASASWLVKSLDQRARTILKVASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVSR
VTTNKYMATPRGLFEFKYFFSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKYR
ESLNIPSSVQRRRMGRAG

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of the nitrogen fixation genes. This sigma factor is required for th

COG id: COG1508

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-54 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789594, Length=505, Percent_Identity=38.6138613861386, Blast_Score=330, Evalue=1e-91,

Paralogues:

None

Copy number: 70 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000394
- InterPro:   IPR007046
- InterPro:   IPR007634 [H]

Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]

EC number: NA

Molecular weight: Translated: 54316; Mature: 54185

Theoretical pI: Translated: 4.51; Mature: 4.51

Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MAMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASG
CCCCHHCHHHCCCCEEECHHHHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEECCCCCCC
TAEDADPKKREQLTLDDNSGLGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSG
CCCCCCCHHHCCEEECCCCCCCHHHHHCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCC
ASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAADRLIAARLIDEMDETGYLRAP
CCCCCCCCCCCCEECCCCCCEEEHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCEECC
LDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL
HHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH
HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNG
HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCHHHHHHCCCC
MFAVELNSENLPRVLMDKAYYAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILK
EEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
VASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVSRVTTNKYMATPRGLFEFKYF
HHHHHHHHCCCCEECCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH
FSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY
HCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHHCCCHHHHHHHHHHH
RESLNIPSSVQRRRMGRAG
HHHCCCCHHHHHHHCCCCC
>Mature Secondary Structure 
AMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASG
CCCHHCHHHCCCCEEECHHHHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEECCCCCCC
TAEDADPKKREQLTLDDNSGLGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSG
CCCCCCCHHHCCEEECCCCCCCHHHHHCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCC
ASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAADRLIAARLIDEMDETGYLRAP
CCCCCCCCCCCCEECCCCCCEEEHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCEECC
LDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL
HHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH
HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNG
HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCHHHHHHCCCC
MFAVELNSENLPRVLMDKAYYAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILK
EEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
VASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVSRVTTNKYMATPRGLFEFKYF
HHHHHHHHCCCCEECCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH
FSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY
HCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHHCCCHHHHHHHHHHH
RESLNIPSSVQRRRMGRAG
HHHCCCCHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1459461; 7898437; 9260957; 11259647 [H]