Definition Candidatus Solibacter usitatus Ellin6076 chromosome, complete genome.
Accession NC_008536
Length 9,965,640

Click here to switch to the map view.

The map label for this gene is algU [H]

Identifier: 116622031

GI number: 116622031

Start: 3697861

End: 3698541

Strand: Reverse

Name: algU [H]

Synonym: Acid_2916

Alternate gene names: 116622031

Gene position: 3698541-3697861 (Counterclockwise)

Preceding gene: 116622033

Following gene: 116622019

Centisome position: 37.11

GC content: 60.06

Gene sequence:

>681_bases
GTGCAAGGAGCATCATATGATCGACAGCCCGCAGGATACGTCGACGGAACACAAATGCGAATCACTCCGGAAGTGCTGCA
AAGCGCACAAGGGGGTAATGCCGCGGACTTCGACCGGATCGTGCTCGCGTACCGAAGCAAAGTCATGGCCACCGTCGGTC
GGATGATCGGCCGTCCGGAGGACGCCGAGGATGTGACCCAGGAGGTTTTTACCCGTCTATATTTCATGCTGAGGAATCTG
CGGGAGCCCGCGGCTTTCGAGATCTGGCTTCACAGGATGACGGTAAACGCAGCCTACGACTACTTGCGCGGCCCTCGCGG
ACGCAGTAACCGCCGGGAGGCGCGCGTGGCAGAACTCCCCGAAACGCAGCTGGCTCTGATGGACGCCGTCGCGGCGCGGC
GGGCGCAACTGGACGACCGAGAGCACGAACGGGTTCGCGAACTGGTGGAAGATCTACTCGCCGCCCTGTCCGACGCGGAC
CGGATCCTCATGATTTTCAGAGAAGTGGAAGGGCTGTCTCTTCAGGAGATAGAGACGATTTACAAGACGAACCAGAACGC
CCTGAAGGTCCGACTGTTCCGCGCTCGTCAACGAATGCTGAAAGCGTTCCATAACCAGCGGGGTCCGCGATGCGCGGAAT
TCCCGGCCCTCCGCAGGGAATTTGCACATAACGTTGAGTAG

Upstream 100 bases:

>100_bases
GCCTCCGACCGGGGACAAGAAGTGGAACAGTGCACGGTCTTCGACGATACGTTCGCAGGCCTCGCGATGTCGTTCGCCAA
GAACGAGGTGCCGCTGGGCA

Downstream 100 bases:

>100_bases
CGGCTCAAAGCAGTAGGCCTCACCAAAAGGGGTGACCATATTTTCGAATTCCTGGCTTTCGCTCGGAATCGGTGGAGCCG
CGGGGCATTTATGCGACCCT

Product: ECF subfamily RNA polymerase sigma-24 factor

Products: NA

Alternate protein names: Sigma-30 [H]

Number of amino acids: Translated: 226; Mature: 226

Protein sequence:

>226_residues
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPEDAEDVTQEVFTRLYFMLRNL
REPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELPETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDAD
RILMIFREVEGLSLQEIETIYKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE

Sequences:

>Translated_226_residues
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPEDAEDVTQEVFTRLYFMLRNL
REPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELPETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDAD
RILMIFREVEGLSLQEIETIYKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
>Mature_226_residues
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPEDAEDVTQEVFTRLYFMLRNL
REPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELPETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDAD
RILMIFREVEGLSLQEIETIYKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor regulates genes such as algD, involved in alginate biosynthesis [H]

COG id: COG1595

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. ECF subfamily [H]

Homologues:

Organism=Escherichia coli, GI1788926, Length=193, Percent_Identity=32.1243523316062, Blast_Score=77, Evalue=7e-16,

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000838
- InterPro:   IPR007627
- InterPro:   IPR013249
- InterPro:   IPR014286
- InterPro:   IPR013325
- InterPro:   IPR013324 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF08281 Sigma70_r4_2 [H]

EC number: NA

Molecular weight: Translated: 26096; Mature: 26096

Theoretical pI: Translated: 8.98; Mature: 8.98

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
4.0 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
4.0 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPE
CCCCCCCCCCCCCCCCCCEEECHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC
DAEDVTQEVFTRLYFMLRNLREPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELP
CHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCC
ETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDADRILMIFREVEGLSLQEIETI
HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHH
YKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPE
CCCCCCCCCCCCCCCCCCEEECHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC
DAEDVTQEVFTRLYFMLRNLREPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELP
CHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCC
ETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDADRILMIFREVEGLSLQEIETI
HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHH
YKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8432708; 8378309; 7961421; 10984043; 7737518 [H]