Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is algU [H]

Identifier: 187736323

GI number: 187736323

Start: 2232176

End: 2232769

Strand: Reverse

Name: algU [H]

Synonym: Amuc_1836

Alternate gene names: 187736323

Gene position: 2232769-2232176 (Counterclockwise)

Preceding gene: 187736331

Following gene: 187736320

Centisome position: 83.81
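
Note: the coordinate fields above are mutually consistent, which the short Python sketch below checks. The centisome formula is an assumption (position of the gene's first transcribed base as a percentage of genome length), not something the record states; the variable names are illustrative only.

```python
# Values copied from the record above (coordinates are 1-based and inclusive).
GENOME_LENGTH = 2_664_102
START, END = 2_232_176, 2_232_769

# Inclusive span of the gene on the chromosome.
gene_length = END - START + 1

# The coding sequence includes one stop codon, which is not translated.
codons = gene_length // 3
residues = codons - 1

# Assumption: centisome = 100 * (first transcribed base) / genome length.
# On the reverse strand the first transcribed base is the higher coordinate.
centisome = round(100 * END / GENOME_LENGTH, 2)

print(gene_length, residues, centisome)
```

Under this assumption the sketch gives 594 bases, 197 residues and a centisome position of 83.81, matching the reported fields.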

GC content: 58.08

Gene sequence:

>594_bases
ATGGATGAAGCACAAGACATGCCGCCTGCGGAACCAGACGAAGATGCCAAGCTGATGCTGAGGGTCAGGAATGGCGACGC
TTCCGCCATGGAAATGCTGGTCCGCAAACACCAGAATTCCGTATATGCGACGGTGGCCCGTATGCTGAACAACGGTCCGG
AGACAGAGGACATCGCCCAGCAGGTATTCATCCGCATCTGGAAGGGAGCCGGGAATTATGAACCTTCCGCACGGTTTACC
ACCTGGATGTTCACCATCCTGCGCAATCTGGTGTTCAATGAAGTGCGCCGCCAGAAGCGCAAGCCCACCACCTCCGCAGA
CGCCATGGAGGAGGAAGGAGGCATGGCCGTGTTTCTGGAACCTTCCCAGACCCCGGACGAAGCACTGGAACATACGGAAC
TCCAGCACGCCGTGGACGCGGCAATCGCCGCTCTGCCGGAAAAGGCGCGGCTGGCCGTCCAACTGCGCCGTTTCGAAAAC
ATGCCCTATGAGGAAATAGCCCGGGCGCTGGATATGACGGTTCCCGCTACCAAAAGCCTGCTGTTCCGCGCCAGAAACAT
GCTGAAGGAGGCTCTTGCTTCCTTTTTATCCTGA
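
Note: the GC content field can be recomputed from the gene sequence above. The sketch below is one minimal way to do so; `gc_content` is a hypothetical helper, and the demo input is only the first 30 bases of the sequence rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100 * gc / len(seq)


# First 30 bases of the gene sequence above, used only as a short demo input.
demo = "ATGGATGAAGCACAAGACATGCCGCCTGCG"
print(f"{gc_content(demo):.2f}")
```

Passing the full 594-base sequence to the same function gives a value to compare with the reported 58.08, which would correspond to 345 G or C bases out of 594.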

Upstream 100 bases:

>100_bases
CATCTAGTCAGCCGTGTAAAAATGGGATAGACCTTTTTCCGGAATGCGCCCTGCCGGCTGCAACCTTTCACCGGAGACGG
GGTTTATAAAAGGGAATTTC

Downstream 100 bases:

>100_bases
CAATGGCGTCCTCAAAACAAGTTAAGAACACACCGCCGGCTTTCCCGCCAAAGGGTGGGGAGCCGGTTCCTGGTACAGTT
TCCCGTCTTTAGCGGCAAAT

Product: RNA polymerase, sigma-24 subunit, ECF subfamily

Products: NA

Alternate protein names: Sigma-30 [H]

Number of amino acids: Translated: 197; Mature: 197

Protein sequence:

>197_residues
MDEAQDMPPAEPDEDAKLMLRVRNGDASAMEMLVRKHQNSVYATVARMLNNGPETEDIAQQVFIRIWKGAGNYEPSARFT
TWMFTILRNLVFNEVRRQKRKPTTSADAMEEEGGMAVFLEPSQTPDEALEHTELQHAVDAAIAALPEKARLAVQLRRFEN
MPYEEIARALDMTVPATKSLLFRARNMLKEALASFLS
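
Note: the protein sequence above is the conceptual translation of the gene sequence, which is already given in coding orientation (it starts with ATG and ends with TGA). The sketch below assumes the standard genetic code, whose codon-to-amino-acid assignments are the same as in the bacterial translation table; `translate` is a hypothetical helper, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last three codons of the gene sequence above.
print(translate("ATGGATGAAGCACAAGACATGCCGCCTGCG"))
print(translate("TTATCCTGA"))
```

The two demo calls return MDEAQDMPPA and LS, which agree with the first ten and last two residues of the protein sequence.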

Sequences:

>Translated_197_residues
MDEAQDMPPAEPDEDAKLMLRVRNGDASAMEMLVRKHQNSVYATVARMLNNGPETEDIAQQVFIRIWKGAGNYEPSARFT
TWMFTILRNLVFNEVRRQKRKPTTSADAMEEEGGMAVFLEPSQTPDEALEHTELQHAVDAAIAALPEKARLAVQLRRFEN
MPYEEIARALDMTVPATKSLLFRARNMLKEALASFLS
>Mature_197_residues
MDEAQDMPPAEPDEDAKLMLRVRNGDASAMEMLVRKHQNSVYATVARMLNNGPETEDIAQQVFIRIWKGAGNYEPSARFT
TWMFTILRNLVFNEVRRQKRKPTTSADAMEEEGGMAVFLEPSQTPDEALEHTELQHAVDAAIAALPEKARLAVQLRRFEN
MPYEEIARALDMTVPATKSLLFRARNMLKEALASFLS

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor regulates genes such as algD, involved in alginate biosynthesis [H]

COG id: COG1595

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. ECF subfamily [H]

Homologues:

Organism=Escherichia coli, GI1788926, Length=176, Percent_Identity=33.5227272727273, Blast_Score=98, Evalue=3e-22

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000838
- InterPro:   IPR007627
- InterPro:   IPR013249
- InterPro:   IPR014286
- InterPro:   IPR013325
- InterPro:   IPR013324 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF08281 Sigma70_r4_2 [H]

EC number: NA

Molecular weight: Translated: 22298; Mature: 22298

Theoretical pI: Translated: 4.99; Mature: 4.99

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
6.1 %Met     (Translated Protein)
6.1 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
6.1 %Met     (Mature Protein)
6.1 %Cys+Met (Mature Protein)
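
Note: the Cys/Met percentages above follow directly from residue counts in the 197-residue protein sequence. A minimal sketch, with `percent` as a hypothetical helper and rounding to one decimal place assumed from the precision shown in the record:

```python
# Protein sequence copied from the record above.
PROTEIN = (
    "MDEAQDMPPAEPDEDAKLMLRVRNGDASAMEMLVRKHQNSVYATVARMLNNGPETEDIAQQVFIRIWKGAGNYEPSARFT"
    "TWMFTILRNLVFNEVRRQKRKPTTSADAMEEEGGMAVFLEPSQTPDEALEHTELQHAVDAAIAALPEKARLAVQLRRFEN"
    "MPYEEIARALDMTVPATKSLLFRARNMLKEALASFLS"
)


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100 * hits / len(seq), 1)


cys = percent(PROTEIN, "C")
met = percent(PROTEIN, "M")
cys_met = percent(PROTEIN, "CM")

print(cys, met, cys_met)
```

The sequence contains 12 methionines and no cysteines, so the sketch gives 0.0, 6.1 and 6.1, in line with the values listed for both the translated and mature protein.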

Secondary structure:

>Translated Secondary Structure
MDEAQDMPPAEPDEDAKLMLRVRNGDASAMEMLVRKHQNSVYATVARMLNNGPETEDIAQ
CCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCHHHHHH
QVFIRIWKGAGNYEPSARFTTWMFTILRNLVFNEVRRQKRKPTTSADAMEEEGGMAVFLE
HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCEEEEEC
PSQTPDEALEHTELQHAVDAAIAALPEKARLAVQLRRFENMPYEEIARALDMTVPATKSL
CCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHH
LFRARNMLKEALASFLS
HHHHHHHHHHHHHHHHC
>Mature Secondary Structure
MDEAQDMPPAEPDEDAKLMLRVRNGDASAMEMLVRKHQNSVYATVARMLNNGPETEDIAQ
CCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCHHHHHH
QVFIRIWKGAGNYEPSARFTTWMFTILRNLVFNEVRRQKRKPTTSADAMEEEGGMAVFLE
HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCEEEEEC
PSQTPDEALEHTELQHAVDAAIAALPEKARLAVQLRRFENMPYEEIARALDMTVPATKSL
CCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHH
LFRARNMLKEALASFLS
HHHHHHHHHHHHHHHHC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8432708; 8378309; 7961421; 10984043; 7737518 [H]