Definition: Chromohalobacter salexigens DSM 3043 chromosome, complete genome.
Accession: NC_007963
Length: 3,696,649

The map label for this gene is algU [H]

Identifier: 92113750

GI number: 92113750

Start: 1847586

End: 1848191

Strand: Direct

Name: algU [H]

Synonym: Csal_1626

Alternate gene names: 92113750

Gene position: 1847586-1848191 (Clockwise)

Preceding gene: 92113748

Following gene: 92113751

Centisome position: 49.98
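
The coordinate fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_696_649  # "Length"
START = 1_847_586          # "Start"
END = 1_848_191            # "End"


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 606
print(centisome(START, GENOME_LENGTH))  # 49.98
```

The 606 bases correspond to 202 codons: 201 amino acids plus the stop codon.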

GC content: 62.05

Gene sequence:

>606_bases
ATGGGCACAAGGGAAACCGATCACCAGCTCGTTGAGCGTGCCCAGAAGGGAGACACCCGCGCTTTCGACCTCCTGGTCAA
GAAATATCAGCACAAGATCATCGGACTGATCGGCCGCTATGTGCACGATCCCGCCGAAGTGCAGGACGTGGCGCAGGAGG
CTTTCATCAAAGCATATCGTGCGCTCGGCAAGTTTCGCTCTGAAAGCGCCTTTTACACCTGGATGTACCGTATCGCGATC
AACACCGCCAAGAACCATCTCGTCTCGCGCGGTCGCCGACCGCCGGGCAGCGACATGGACATCGTCGATGCCGAGGTGCT
CGATCACAGCGGTCGCCTGTCCGATATCGATACCCCCGAGGCGGCGCTGCAGCGCGACCAGCTCGAGGCAGTGGTGTTCG
AGGTGATCGAGAACCTGCCGGAAGACCTGCGCACCGCGATCACGCTGCGGGAGATGGACGGTCTCGCCTACGAGGACATC
GCCAACATCATGCAGTGTCCGGTCGGCACGGTACGCTCGCGCATCTTTCGCGCGCGCGAAGCGGTGGACAAGGCCATCGC
CCCGTTGTTGAGCACGTCGAACAAGGCGGACGCCGCCGTCGAGTGA
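
The GC content field can be recomputed from a FASTA-style block like the one above. The helper below is a sketch under the assumption that GC content means the percentage of G and C bases over the whole sequence; `gc_percent` is a hypothetical name, and the short input is a toy sequence rather than the gene itself.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G and C bases in a FASTA-style block.

    Header lines (starting with '>') and whitespace are ignored.
    """
    seq = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    ).upper()
    if not seq:
        raise ValueError("no sequence data found")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input: 6 of 8 bases are G or C.
print(gc_percent(">toy\nATGC\nGGCC\n"))  # 75.0
```

Pasting the 606-base gene sequence into the same function should give a value comparable to the 62.05 reported in the record.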

Upstream 100 bases:

>100_bases
CGACTGGAACATGATGACCACACAGCAGATGCAAATTTCATGCGGTGCCGCCGAAGCGGGCGGCCCGCGCCATGGTGCCT
CCAGCATTGAGGGGGAGTGA

Downstream 100 bases:

>100_bases
TGAAAAATGCAATCCATGCCATCGCGAGTGAACTGTTCGGCGACTTTGACGTCTCAACGGGGTATCAGTGGAATGAGTGG
GCGCTACGACGCCATGTGAG

Product: RNA polymerase sigma factor AlgU

Products: NA

Alternate protein names: Sigma-30 [H]

Number of amino acids: Translated: 201; Mature: 200

Protein sequence:

>201_residues
MGTRETDHQLVERAQKGDTRAFDLLVKKYQHKIIGLIGRYVHDPAEVQDVAQEAFIKAYRALGKFRSESAFYTWMYRIAI
NTAKNHLVSRGRRPPGSDMDIVDAEVLDHSGRLSDIDTPEAALQRDQLEAVVFEVIENLPEDLRTAITLREMDGLAYEDI
ANIMQCPVGTVRSRIFRAREAVDKAIAPLLSTSNKADAAVE
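
The protein sequence above is the conceptual translation of the gene sequence. A minimal sketch of that translation follows, assuming the standard codon assignments (which the bacterial translation table shares) and a reading frame that starts at the first base; `translate` is an illustrative helper, not the tool used to build the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGGCACAAGGGAAACCGATCACCAGCTC"))  # MGTRETDHQL
print(translate("GTCGAGTGA"))                       # VE
```

The mature form listed in the record is the same sequence without the initiator methionine, which is why it is one residue shorter.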

Sequences:

>Translated_201_residues
MGTRETDHQLVERAQKGDTRAFDLLVKKYQHKIIGLIGRYVHDPAEVQDVAQEAFIKAYRALGKFRSESAFYTWMYRIAI
NTAKNHLVSRGRRPPGSDMDIVDAEVLDHSGRLSDIDTPEAALQRDQLEAVVFEVIENLPEDLRTAITLREMDGLAYEDI
ANIMQCPVGTVRSRIFRAREAVDKAIAPLLSTSNKADAAVE
>Mature_200_residues
GTRETDHQLVERAQKGDTRAFDLLVKKYQHKIIGLIGRYVHDPAEVQDVAQEAFIKAYRALGKFRSESAFYTWMYRIAIN
TAKNHLVSRGRRPPGSDMDIVDAEVLDHSGRLSDIDTPEAALQRDQLEAVVFEVIENLPEDLRTAITLREMDGLAYEDIA
NIMQCPVGTVRSRIFRAREAVDKAIAPLLSTSNKADAAVE

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor regulates genes such as algD, involved in alginate biosynthesis [H]

COG id: COG1595

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. ECF subfamily [H]

Homologues:

Organism=Escherichia coli, GI1788926, Length=190, Percent_Identity=61.5789473684211, Blast_Score=249, Evalue=9e-68
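
A homologue line in this layout can be split into typed fields for downstream filtering. The parser below is a sketch that assumes comma-separated `key=value` pairs plus a bare `GI<number>` token, as in the entry above; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # a trailing comma, if present, is ignored
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788926, Length=190, "
    "Percent_Identity=61.5789473684211, Blast_Score=249, Evalue=9e-68,"
)
print(hit["Organism"], hit["GI"], hit["Length"])
# Identical positions implied by the percentage, if Length is the
# length it was computed over (an assumption, not stated in the record).
print(round(hit["Percent_Identity"] * hit["Length"] / 100))  # 117
```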

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000838
- InterPro:   IPR007627
- InterPro:   IPR013249
- InterPro:   IPR014286
- InterPro:   IPR013325
- InterPro:   IPR013324 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF08281 Sigma70_r4_2 [H]

EC number: NA

Molecular weight: Translated: 22598; Mature: 22467

Theoretical pI: Translated: 5.68; Mature: 5.68

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
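
The percentages above follow directly from residue counts. As a sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place, the figures can be reproduced from the translated sequence in this record; `cys_met_percent` is an illustrative name.

```python
# Translated protein sequence, copied from the record (201 residues).
TRANSLATED = (
    "MGTRETDHQLVERAQKGDTRAFDLLVKKYQHKIIGLIGRYVHDPAEVQDVAQEAFIKAYRALGKFRSESAFYTWMYRIAI"
    "NTAKNHLVSRGRRPPGSDMDIVDAEVLDHSGRLSDIDTPEAALQRDQLEAVVFEVIENLPEDLRTAITLREMDGLAYEDI"
    "ANIMQCPVGTVRSRIFRAREAVDKAIAPLLSTSNKADAAVE"
)
# Mature form: initiator methionine removed.
MATURE = TRANSLATED[1:]


def cys_met_percent(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return round(cys, 1), round(met, 1), round(cys + met, 1)


print(cys_met_percent(TRANSLATED))  # (0.5, 2.5, 3.0)
print(cys_met_percent(MATURE))      # (0.5, 2.0, 2.5)
```

The translated protein contains one cysteine and five methionines; removing the initiator methionine leaves four, which accounts for the lower mature values.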

Secondary structure:

>Translated Secondary Structure
MGTRETDHQLVERAQKGDTRAFDLLVKKYQHKIIGLIGRYVHDPAEVQDVAQEAFIKAYR
CCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH
ALGKFRSESAFYTWMYRIAINTAKNHLVSRGRRPPGSDMDIVDAEVLDHSGRLSDIDTPE
HHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCHH
AALQRDQLEAVVFEVIENLPEDLRTAITLREMDGLAYEDIANIMQCPVGTVRSRIFRARE
HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHHCCHHHHHHHHHHHHH
AVDKAIAPLLSTSNKADAAVE
HHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure 
GTRETDHQLVERAQKGDTRAFDLLVKKYQHKIIGLIGRYVHDPAEVQDVAQEAFIKAYR
CCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH
ALGKFRSESAFYTWMYRIAINTAKNHLVSRGRRPPGSDMDIVDAEVLDHSGRLSDIDTPE
HHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCHH
AALQRDQLEAVVFEVIENLPEDLRTAITLREMDGLAYEDIANIMQCPVGTVRSRIFRARE
HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHHCCHHHHHHHHHHHHH
AVDKAIAPLLSTSNKADAAVE
HHHHHHHHHHHCCCCCCCCCC
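
The state strings above can be summarized as the share of residues in each state. This is a sketch assuming that the one-letter states are H for helix and C for coil, the only two that appear in this record; `state_fractions` is a hypothetical helper, shown on a toy string.

```python
from collections import Counter


def state_fractions(ss: str) -> dict:
    """Percentage of residues in each secondary-structure state.

    Whitespace and line breaks in the input are ignored.
    """
    states = "".join(ss.split())
    if not states:
        raise ValueError("empty secondary-structure string")
    counts = Counter(states)
    return {
        state: round(100.0 * n / len(states), 1)
        for state, n in sorted(counts.items())
    }


# Toy input: four helix and four coil positions over two lines.
print(state_fractions("CCHH\nHHCC"))  # {'C': 50.0, 'H': 50.0}
```

Applied to the prediction lines in the record, a helix share well above the coil share would be in keeping with the "Alpha" structure class listed further down.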

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8432708; 8378309; 7961421; 10984043; 7737518 [H]