Definition | Candidatus Solibacter usitatus Ellin6076 chromosome, complete genome. |
---|---|
Accession | NC_008536 |
Length | 9,965,640 |
The map label for this gene is algU [H]
Identifier: 116622031
GI number: 116622031
Start: 3697861
End: 3698541
Strand: Reverse
Name: algU [H]
Synonym: Acid_2916
Alternate gene names: 116622031
Gene position: 3698541-3697861 (Counterclockwise)
Preceding gene: 116622033
Following gene: 116622019
Centisome position: 37.11
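The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch: the function names are illustrative, and the centisome formula (position as a percentage of the chromosome length) is an assumption about how the value is derived, although it reproduces the 37.11 listed here.

```python
GENOME_LENGTH = 9_965_640           # chromosome length from the record header
START, END = 3_697_861, 3_698_541   # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of chromosome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
codons = length // 3                # includes the terminal stop codon

print(length)                                      # 681
print(codons - 1)                                  # 226 residues
print(round(centisome(START, GENOME_LENGTH), 2))   # 37.11
```

The 681 bases correspond to 227 codons, that is, the 226 residues reported below plus one stop codon. Using the End coordinate instead of Start in `centisome` also rounds to 37.11, so this check does not reveal which coordinate the value is based on.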
GC content: 60.06
Gene sequence:
>681_bases GTGCAAGGAGCATCATATGATCGACAGCCCGCAGGATACGTCGACGGAACACAAATGCGAATCACTCCGGAAGTGCTGCA AAGCGCACAAGGGGGTAATGCCGCGGACTTCGACCGGATCGTGCTCGCGTACCGAAGCAAAGTCATGGCCACCGTCGGTC GGATGATCGGCCGTCCGGAGGACGCCGAGGATGTGACCCAGGAGGTTTTTACCCGTCTATATTTCATGCTGAGGAATCTG CGGGAGCCCGCGGCTTTCGAGATCTGGCTTCACAGGATGACGGTAAACGCAGCCTACGACTACTTGCGCGGCCCTCGCGG ACGCAGTAACCGCCGGGAGGCGCGCGTGGCAGAACTCCCCGAAACGCAGCTGGCTCTGATGGACGCCGTCGCGGCGCGGC GGGCGCAACTGGACGACCGAGAGCACGAACGGGTTCGCGAACTGGTGGAAGATCTACTCGCCGCCCTGTCCGACGCGGAC CGGATCCTCATGATTTTCAGAGAAGTGGAAGGGCTGTCTCTTCAGGAGATAGAGACGATTTACAAGACGAACCAGAACGC CCTGAAGGTCCGACTGTTCCGCGCTCGTCAACGAATGCTGAAAGCGTTCCATAACCAGCGGGGTCCGCGATGCGCGGAAT TCCCGGCCCTCCGCAGGGAATTTGCACATAACGTTGAGTAG
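The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the helper name `gc_content` is hypothetical, and the example input is only the first 20 bases of the sequence above, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored, so sequences wrapped in 80-base blocks
    (as in this record) can be passed in unchanged.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence above, used only as a short example.
example = "GTGCAAGGAG CATCATATGA"
print(gc_content(example))  # 45.0
```

Passing the full 681-base sequence to the same function should give a value close to the 60.06 listed, provided the record uses this definition.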
Upstream 100 bases:
>100_bases GCCTCCGACCGGGGACAAGAAGTGGAACAGTGCACGGTCTTCGACGATACGTTCGCAGGCCTCGCGATGTCGTTCGCCAA GAACGAGGTGCCGCTGGGCA
Downstream 100 bases:
>100_bases CGGCTCAAAGCAGTAGGCCTCACCAAAAGGGGTGACCATATTTTCGAATTCCTGGCTTTCGCTCGGAATCGGTGGAGCCG CGGGGCATTTATGCGACCCT
Product: ECF subfamily RNA polymerase sigma-24 factor
Products: NA
Alternate protein names: Sigma-30 [H]
Number of amino acids: Translated: 226; Mature: 226
Protein sequence:
>226_residues MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPEDAEDVTQEVFTRLYFMLRNL REPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELPETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDAD RILMIFREVEGLSLQEIETIYKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
Sequences:
>Translated_226_residues MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPEDAEDVTQEVFTRLYFMLRNL REPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELPETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDAD RILMIFREVEGLSLQEIETIYKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
>Mature_226_residues MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPEDAEDVTQEVFTRLYFMLRNL REPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELPETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDAD RILMIFREVEGLSLQEIETIYKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor regulates genes such as algD, involved in alginate biosynthesis [H]
COG id: COG1595
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family. ECF subfamily [H]
Homologues:
Organism=Escherichia coli, GI1788926, Length=193, Percent_Identity=32.1243523316062, Blast_Score=77, Evalue=7e-16,
Paralogues:
None
Copy number: <10 [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284
- InterPro: IPR000838
- InterPro: IPR007627
- InterPro: IPR013249
- InterPro: IPR014286
- InterPro: IPR013325
- InterPro: IPR013324 [H]
Pfam domain/function: PF04542 Sigma70_r2; PF08281 Sigma70_r4_2 [H]
EC number: NA
Molecular weight: Translated: 26096; Mature: 26096
Theoretical pI: Translated: 8.98; Mature: 8.98
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
4.0 %Met (Translated Protein)
4.4 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
4.0 %Met (Mature Protein)
4.4 %Cys+Met (Mature Protein)
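These percentages follow from residue counts over the protein length: the 226-residue sequence above contains 1 Cys and 9 Met. A minimal sketch of the calculation is given below; `residue_percent` is a hypothetical helper, and the stand-in string is not the real protein, only a placeholder with the same Cys and Met counts.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein, to 1 decimal."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(residue) for residue in residues.upper())
    return round(100.0 * hits / len(seq), 1)


# Placeholder with the same composition as the record: 1 Cys, 9 Met, 226 residues.
stand_in = "C" * 1 + "M" * 9 + "A" * 216

print(residue_percent(stand_in, "C"))   # 0.4
print(residue_percent(stand_in, "M"))   # 4.0
print(residue_percent(stand_in, "CM"))  # 4.4
```

The translated and mature proteins are identical in this record, so both sets of values are the same.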
Secondary structure:
>Translated Secondary Structure
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPE
CCCCCCCCCCCCCCCCCCEEECHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC
DAEDVTQEVFTRLYFMLRNLREPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELP
CHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCC
ETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDADRILMIFREVEGLSLQEIETI
HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHH
YKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MQGASYDRQPAGYVDGTQMRITPEVLQSAQGGNAADFDRIVLAYRSKVMATVGRMIGRPE
CCCCCCCCCCCCCCCCCCEEECHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC
DAEDVTQEVFTRLYFMLRNLREPAAFEIWLHRMTVNAAYDYLRGPRGRSNRREARVAELP
CHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCC
ETQLALMDAVAARRAQLDDREHERVRELVEDLLAALSDADRILMIFREVEGLSLQEIETI
HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHH
YKTNQNALKVRLFRARQRMLKAFHNQRGPRCAEFPALRREFAHNVE
HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 8432708; 8378309; 7961421; 10984043; 7737518 [H]