Definition | Hyphomonas neptunium ATCC 15444 chromosome, complete genome. |
---|---|
Accession | NC_008358 |
Length | 3,705,021 |
Click here to switch to the map view.
The map label for this gene is rpoN [H]
Identifier: 114800351
GI number: 114800351
Start: 189908
End: 191407
Strand: Direct
Name: rpoN [H]
Synonym: HNE_0206
Alternate gene names: 114800351
Gene position: 189908-191407 (Clockwise)
Preceding gene: 114797827
Following gene: 114798703
Centisome position: 5.13
GC content: 63.53
Gene sequence:
>1500_bases ATGGCCATGAAACAATCACTCGACATGCGCCAGGGCCAGTCCCTGGTAATGACGCCGCAGCTGCAGCAGGCGATCAAGCT TCTGCAGCTGTCGAACCAGGAACTGTCCGAATTCATCGAGGCTGAGCTTGAGCGCAATCCGCTGCTCGTGAAGGTCGAGG ATGAGGGCGGGGCGTCGGGCACGGCCGAGGACGCCGACCCCAAGAAGCGCGAGCAGCTGACCCTCGACGACAATTCCGGC CTTGGCGAAGCGCGCGACCAGCTTGACGCGCCCGGTGAGGACGTCTTCGAGCCCGGCACAGGCTCTGATACCGGATCGGA CGAGGCGCGCGGCCCCTCCGCCACCACTGACTGGTCAGGCGCGTCCGGGGGCGGTTCGTCCTCGGAGGAGTTTGACTATG CCGCCAATGTTTCCGGCGACATCACGCTGCACGAACATCTCCATACCCAGCTGAACTTTGCCGGCCTTTCGGCGGCTGAC CGGTTGATCGCCGCCCGCCTGATCGACGAGATGGACGAGACCGGCTATCTGCGCGCGCCGCTGGACGATGTAGCCAAGGC GCTGGGCGCCGACATTGCCAATGTCGAGGCAGTCCTCGCCGTCTGCCAGGGCTTTGATCCGACCGGCGTGATGGCCCGCT CCGTGCCCGAATGCCTCGCGCTCCAGCTGAAGGATCGCGGCCGGCTTGATCCGGCCATGCAGGCGATGCTGGACAATCTG CATCTTGTGGCCCGCCACGACATGAAGGCGCTGATGGACGTGTGCGGTGTCGACAAGGCCGACATCCAGGACATGCTGCT GGAGCTGCGCCAGCTTTCGCCCAAGCCGGGGGCGGGTTTTTCCAGCGATACCACGGTTGCCGTGGCGCCGGATGTTTTCG TCCGTGAACTGCCCAATGGCATGTTTGCTGTGGAGCTCAACTCCGAGAACCTTCCGCGCGTGCTGATGGACAAGGCATAT TATGCCGAGGTGACCGCTCTGCCGATGCGCGAGAAGGAGAAGGAATTCATCTCCGAATGCGCGGCGTCTGCCAGCTGGCT GGTCAAGTCTCTGGATCAGCGGGCGCGCACCATATTGAAGGTGGCGAGCGAGATTGTTCGCCAGCAGGACGGGTTTTACG CCCATGGCGTGGCGCACCTGCGCCCGCTGAACCTCAAACAGGTGGCTGATGCGATCGAGATGCATGAGTCCACCGTCAGC CGTGTGACGACCAACAAGTACATGGCCACGCCGCGCGGCCTGTTTGAGTTCAAATACTTCTTTTCGGCCTCCATTCCGGC CACCGGCGGCGGCGAGGCCCACTCGGCAGAAGCCGTCCGTCACCGGATCAAGCAGCTGATCGAGGACGAGGCACCTGAAG ACGTGCTGTCGGACGACCAGATCGTGGACATCCTCACCGGTTTCGGCATCGAGATTGCCCGCCGCACGGTGGCAAAGTAT CGCGAGAGCCTGAATATTCCCTCCAGCGTGCAACGCCGCCGCATGGGCCGGGCGGGGTAG
Upstream 100 bases:
>100_bases TCTGTTTGCAGGCACGCCGGAAGACGTGCTGAAGAATGAGGACGTCCGCAGGGTCTATCTGGGTGAGCAGTTCGCCGCTT GATCTCAGGGGGCGCGGCTT
Downstream 100 bases:
>100_bases GGTTTTTGTTTTTTGCTCGGCGGTGTTGCTGAGACCTTCGGTCTCAGCCCGCCTGCGCCTTCGCTTTGCTCGTGGGGGCA TCAAGCCTTGAGTGTAGTAG
Product: RNA polymerase factor sigma-54
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 499; Mature: 498
Protein sequence:
>499_residues MAMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASGTAEDADPKKREQLTLDDNSG LGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSGASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAAD RLIAARLIDEMDETGYLRAPLDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNGMFAVELNSENLPRVLMDKAY YAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILKVASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVS RVTTNKYMATPRGLFEFKYFFSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY RESLNIPSSVQRRRMGRAG
Sequences:
>Translated_499_residues MAMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASGTAEDADPKKREQLTLDDNSG LGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSGASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAAD RLIAARLIDEMDETGYLRAPLDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNGMFAVELNSENLPRVLMDKAY YAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILKVASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVS RVTTNKYMATPRGLFEFKYFFSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY RESLNIPSSVQRRRMGRAG >Mature_498_residues AMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASGTAEDADPKKREQLTLDDNSGL GEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSGASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAADR LIAARLIDEMDETGYLRAPLDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNLH LVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNGMFAVELNSENLPRVLMDKAYY AEVTALPMREKEKEFISECAASASWLVKSLDQRARTILKVASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVSR VTTNKYMATPRGLFEFKYFFSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKYR ESLNIPSSVQRRRMGRAG
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of the nitrogen fixation genes. This sigma factor is required for th
COG id: COG1508
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-54 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789594, Length=505, Percent_Identity=38.6138613861386, Blast_Score=330, Evalue=1e-91,
Paralogues:
None
Copy number: 70 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000394 - InterPro: IPR007046 - InterPro: IPR007634 [H]
Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]
EC number: NA
Molecular weight: Translated: 54316; Mature: 54185
Theoretical pI: Translated: 4.51; Mature: 4.51
Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein) 3.4 %Met (Translated Protein) 4.2 %Cys+Met (Translated Protein) 0.8 %Cys (Mature Protein) 3.2 %Met (Mature Protein) 4.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MAMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASG CCCCHHCHHHCCCCEEECHHHHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEECCCCCCC TAEDADPKKREQLTLDDNSGLGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSG CCCCCCCHHHCCEEECCCCCCCHHHHHCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCC ASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAADRLIAARLIDEMDETGYLRAP CCCCCCCCCCCCEECCCCCCEEEHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCEECC LDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL HHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNG HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCHHHHHHCCCC MFAVELNSENLPRVLMDKAYYAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILK EEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVSRVTTNKYMATPRGLFEFKYF HHHHHHHHCCCCEECCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH FSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY HCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHHCCCHHHHHHHHHHH RESLNIPSSVQRRRMGRAG HHHCCCCHHHHHHHCCCCC >Mature Secondary Structure AMKQSLDMRQGQSLVMTPQLQQAIKLLQLSNQELSEFIEAELERNPLLVKVEDEGGASG CCCHHCHHHCCCCEEECHHHHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEECCCCCCC TAEDADPKKREQLTLDDNSGLGEARDQLDAPGEDVFEPGTGSDTGSDEARGPSATTDWSG CCCCCCCHHHCCEEECCCCCCCHHHHHCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCC ASGGGSSSEEFDYAANVSGDITLHEHLHTQLNFAGLSAADRLIAARLIDEMDETGYLRAP CCCCCCCCCCCCEECCCCCCEEEHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCEECC LDDVAKALGADIANVEAVLAVCQGFDPTGVMARSVPECLALQLKDRGRLDPAMQAMLDNL HHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH HLVARHDMKALMDVCGVDKADIQDMLLELRQLSPKPGAGFSSDTTVAVAPDVFVRELPNG HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCHHHHHHCCCC MFAVELNSENLPRVLMDKAYYAEVTALPMREKEKEFISECAASASWLVKSLDQRARTILK EEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VASEIVRQQDGFYAHGVAHLRPLNLKQVADAIEMHESTVSRVTTNKYMATPRGLFEFKYF HHHHHHHHCCCCEECCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH FSASIPATGGGEAHSAEAVRHRIKQLIEDEAPEDVLSDDQIVDILTGFGIEIARRTVAKY HCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHHCCCHHHHHHHHHHH RESLNIPSSVQRRRMGRAG HHHCCCCHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 1459461; 7898437; 9260957; 11259647 [H]