Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is phoR [H]

Identifier: 187735604

GI number: 187735604

Start: 1324301

End: 1325530

Strand: Reverse

Name: phoR [H]

Synonym: Amuc_1109

Alternate gene names: 187735604

Gene position: 1325530-1324301 (Counterclockwise)

Preceding gene: 187735605

Following gene: 187735600

Centisome position: 49.76
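
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming that the centisome position is the gene's 5' coordinate (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record does not state either convention, but both reproduce the listed values. The function names are illustrative only.

```python
GENOME_LENGTH = 2_664_102            # "Length" field of NC_010655
START, END = 1_324_301, 1_325_530    # "Start" and "End" fields
STRAND = "Reverse"                   # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive coordinates (an assumption)."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100.0 * position / genome_length, 2)


# Assumption: the 5' end of a reverse-strand gene is its higher coordinate.
five_prime = END if STRAND == "Reverse" else START

print(gene_length(START, END))               # 1230
print(centisome(five_prime, GENOME_LENGTH))  # 49.76
```

Using the lower coordinate (1324301) instead would give 49.71, which is why the sketch assumes the 5' end is the reference point.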

GC content: 57.15
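
GC content is conventionally the percentage of bases that are G or C. As a rough sketch of how such a value could be computed (the helper name `gc_content` is hypothetical, and the record does not say which tool produced its figure):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence in this record: 7 of 12 are G or C.
print(round(gc_content("ATGTCGGCCATC"), 2))  # 58.33
```

For a 1230-base gene, the listed 57.15 would correspond to 703 G or C bases (703 / 1230 ≈ 57.15 %), assuming the field is a plain percentage.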

Gene sequence:

>1230_bases
ATGTCGGCCATCGACTACATCCTAACCATCCTTATTCTGGCCTCCCTGTACCTGAACTGGCATCTGGTCCAGGTATGTCG
GTCCGCCATGAAAGCCCGGAAAAAAGCCTTGCGGGACGCTCAGCGCCTGCTGAAGCGCGGGGAAGAAGCGCAGGAACAGG
CCATCGCGGACAAACGGCGCTTCCTGGAAGCTCTGGGAGAGGCCTTCCTGCTCATCGGTCCATCCGGACACATCGTGCTG
GCTAATACGCTGGCCAAAGAACTCTTTCAGGAAGAAAAGCTGGAAGGGCGCAAAGTGGGGGCCCTGGTCTGCAACCAGGA
ATTGCTGGGGCATGTTCAGGAAGCATTCGATACGGACGGCCCCGTCACCAAGGAATTCACGCTGAGCGCCGCCAATTCCC
CCGGCGGCGTGCAAAACGGCATCACGGCGTGGCATCTGGACAGCGCCATCACGGACGCCCCAATCAGAGAAAAGCGCATC
CTGCTGCGCAACATCACGCAGAACTACCTCACCAACCAGATGCGCCGGGACTTCGTGGCAAACGCCTCCCACGAGCTGCG
TACGCCCCTCACCATCATCGTGGGATATCTGGAAAACCTGATGGAGGACGATCTGGTGGAGGAAAGTCCCGGACTGGCCC
GCAAATTCATCGGAGTCATGCACCAGAACAGCCAGAGGCTGATGAACATTATTGAAGACATGCTCATGATCTCCAAACTC
GAATCAGGCCACAAGGCGATTCTGAAGGAGCAGTGGTTCCGCCTCACCTCCTGCGCGGACGACGTCTTCTCCCGTCTGGA
TTCCATCCGGGAGAAAAAACAGGCCGTCCTGCACATGGACATTCCCACGGATTGGGAACTTTATGGAGATCCCTTTTACT
GGACGCAAATTCTGTTCAATTTGGTGGAAAACGCCCTCAAGCAAAACACGGAGCCGGGACTTTCCATTACTGTGGCCGCC
GCCAAAACACAGGACGCCTGCGTCATCACCGTCACGGATACGGGCGTGGGCATTCCTGTGGAAAGCATCCCCTTCCTCTT
CAACCGCTTTTACCGGGTGGAAACCCACCACTCCTCGGAAATCAAGGGAACGGGCCTAGGCCTCTCCATTGTGAAACGCG
CCGTGGAAGCCCACGACGGAGCCATCACCGTCTCCAGCATCCCCCACCGGGAAACTGTTTTTACCATCACCATTCCCCTG
AAAAGGTTCCGGGAAGAAAAGGCGGCGTAA
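
Because the gene is annotated on the reverse strand and the sequence above starts with ATG and ends with the stop codon TAA, it is presumably shown in coding orientation, that is, as the reverse complement of the forward-strand genome slice 1324301-1325530. A minimal sketch of that conversion, with a hypothetical helper name:

```python
# Translation table mapping each base to its complement (case preserved).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Last 10 bases of the gene sequence as listed in this record.
gene_3prime = "GGCGGCGTAA"
print(reverse_complement(gene_3prime))  # TTACGCCGCC
```

Under that assumption, the forward-strand slice would begin with TTACGCCGCC; applying the function twice returns the original sequence.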

Upstream 100 bases:

>100_bases
TCCCTACGCGGACTGCATTGAAACCGTCCGCAGCATCGGCTACCGCTTTACCCTGCCGGGACGCCGCATCCCGGAGGAAA
AGGCTTAACACTCCGGTTCC

Downstream 100 bases:

>100_bases
AACGCCGATGCGGAACAGTCTGACATGGCCTGCCGTTAAAACCGGAAAAAAGGCATCCTTTCAACCCGGAGGGATGGCCC
ACGAGACATCCGCCTGTTTT

Product: histidine kinase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 409; Mature: 408
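
The residue counts follow from the gene length: 1230 bases form 410 codons, the last of which is the stop codon TAA, leaving 409 translated residues. The mature count of 408 is consistent with removal of the initiator methionine, since the mature sequence in this record starts at the second residue. The sketch below illustrates the translation step, assuming the standard codon assignments; `translate` is an illustrative helper, not the tool used to build the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGGCCATCGACTACATCCTAACCATC"))  # MSAIDYILTI (first 10 codons)
print(translate("GAAAAGGCGGCGTAA"))                 # EKAA (last 5 codons, incl. stop)
print(1230 // 3 - 1)                                # 409
```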

Protein sequence:

>409_residues
MSAIDYILTILILASLYLNWHLVQVCRSAMKARKKALRDAQRLLKRGEEAQEQAIADKRRFLEALGEAFLLIGPSGHIVL
ANTLAKELFQEEKLEGRKVGALVCNQELLGHVQEAFDTDGPVTKEFTLSAANSPGGVQNGITAWHLDSAITDAPIREKRI
LLRNITQNYLTNQMRRDFVANASHELRTPLTIIVGYLENLMEDDLVEESPGLARKFIGVMHQNSQRLMNIIEDMLMISKL
ESGHKAILKEQWFRLTSCADDVFSRLDSIREKKQAVLHMDIPTDWELYGDPFYWTQILFNLVENALKQNTEPGLSITVAA
AKTQDACVITVTDTGVGIPVESIPFLFNRFYRVETHHSSEIKGTGLGLSIVKRAVEAHDGAITVSSIPHRETVFTITIPL
KRFREEKAA

Sequences:

>Translated_409_residues
MSAIDYILTILILASLYLNWHLVQVCRSAMKARKKALRDAQRLLKRGEEAQEQAIADKRRFLEALGEAFLLIGPSGHIVL
ANTLAKELFQEEKLEGRKVGALVCNQELLGHVQEAFDTDGPVTKEFTLSAANSPGGVQNGITAWHLDSAITDAPIREKRI
LLRNITQNYLTNQMRRDFVANASHELRTPLTIIVGYLENLMEDDLVEESPGLARKFIGVMHQNSQRLMNIIEDMLMISKL
ESGHKAILKEQWFRLTSCADDVFSRLDSIREKKQAVLHMDIPTDWELYGDPFYWTQILFNLVENALKQNTEPGLSITVAA
AKTQDACVITVTDTGVGIPVESIPFLFNRFYRVETHHSSEIKGTGLGLSIVKRAVEAHDGAITVSSIPHRETVFTITIPL
KRFREEKAA
>Mature_408_residues
SAIDYILTILILASLYLNWHLVQVCRSAMKARKKALRDAQRLLKRGEEAQEQAIADKRRFLEALGEAFLLIGPSGHIVLA
NTLAKELFQEEKLEGRKVGALVCNQELLGHVQEAFDTDGPVTKEFTLSAANSPGGVQNGITAWHLDSAITDAPIREKRIL
LRNITQNYLTNQMRRDFVANASHELRTPLTIIVGYLENLMEDDLVEESPGLARKFIGVMHQNSQRLMNIIEDMLMISKLE
SGHKAILKEQWFRLTSCADDVFSRLDSIREKKQAVLHMDIPTDWELYGDPFYWTQILFNLVENALKQNTEPGLSITVAAA
KTQDACVITVTDTGVGIPVESIPFLFNRFYRVETHHSSEIKGTGLGLSIVKRAVEAHDGAITVSSIPHRETVFTITIPLK
RFREEKAA

Specific function: Member of the two-component regulatory system phoP/phoR involved in the regulation of alkaline phosphatase genes. PhoR may function as a membrane-associated protein kinase that phosphorylates phoP in response to environmental signals [H]

COG id: COG0642

COG function: function code T; Signal transduction histidine kinase

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

Organism=Escherichia coli, GI1786600, Length=260, Percent_Identity=33.4615384615385, Blast_Score=124, Evalue=7e-30,
Organism=Escherichia coli, GI145693157, Length=233, Percent_Identity=34.7639484978541, Blast_Score=113, Evalue=2e-26,
Organism=Escherichia coli, GI1789149, Length=227, Percent_Identity=33.4801762114537, Blast_Score=108, Evalue=5e-25,
Organism=Escherichia coli, GI1788393, Length=232, Percent_Identity=31.0344827586207, Blast_Score=105, Evalue=4e-24,
Organism=Escherichia coli, GI1788713, Length=232, Percent_Identity=31.4655172413793, Blast_Score=102, Evalue=5e-23,
Organism=Escherichia coli, GI48994928, Length=232, Percent_Identity=30.6034482758621, Blast_Score=95, Evalue=6e-21,
Organism=Escherichia coli, GI1786783, Length=227, Percent_Identity=29.9559471365639, Blast_Score=92, Evalue=7e-20,
Organism=Escherichia coli, GI1790346, Length=237, Percent_Identity=30.379746835443, Blast_Score=88, Evalue=1e-18,
Organism=Escherichia coli, GI1786912, Length=233, Percent_Identity=31.3304721030043, Blast_Score=87, Evalue=2e-18,
Organism=Escherichia coli, GI1790436, Length=251, Percent_Identity=27.4900398406374, Blast_Score=84, Evalue=2e-17,
Organism=Escherichia coli, GI87081816, Length=246, Percent_Identity=28.4552845528455, Blast_Score=81, Evalue=1e-16,
Organism=Escherichia coli, GI1790861, Length=217, Percent_Identity=28.5714285714286, Blast_Score=78, Evalue=1e-15,
Organism=Escherichia coli, GI1788549, Length=232, Percent_Identity=25.4310344827586, Blast_Score=69, Evalue=5e-13,
Organism=Escherichia coli, GI1790551, Length=205, Percent_Identity=28.780487804878, Blast_Score=67, Evalue=3e-12,
Organism=Escherichia coli, GI87082128, Length=243, Percent_Identity=24.6913580246914, Blast_Score=66, Evalue=4e-12,
Organism=Escherichia coli, GI1787374, Length=244, Percent_Identity=23.3606557377049, Blast_Score=64, Evalue=2e-11,
Organism=Escherichia coli, GI1790300, Length=243, Percent_Identity=28.8065843621399, Blast_Score=62, Evalue=6e-11,
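
Each homologue line above is a comma-separated list of `key=value` pairs plus a bare GI identifier. As a rough sketch of how these lines might be read into typed records for filtering or sorting (the function name and the `"GI"` dictionary key are hypothetical choices made here):

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    # Convert the numeric fields to numbers.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1786600, Length=260, "
        "Percent_Identity=33.4615384615385, Blast_Score=124, Evalue=7e-30,")
hit = parse_hit(line)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1), hit["Evalue"])
```

With the hits parsed this way, a threshold such as `hit["Evalue"] < 1e-20` could be used to keep only the stronger matches.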

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003594
- InterPro:   IPR000014
- InterPro:   IPR013767
- InterPro:   IPR004358
- InterPro:   IPR003661
- InterPro:   IPR005467
- InterPro:   IPR009082 [H]

Pfam domain/function: PF02518 HATPase_c; PF00512 HisKA; PF00989 PAS [H]

EC number: 2.7.13.3 [H]

Molecular weight: Translated: 45944; Mature: 45812

Theoretical pI: Translated: 6.65; Mature: 6.65

Prosite motif: PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
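
These percentages are residue counts divided by the protein length. By a manual count, the translated sequence in this record appears to contain 4 Cys and 9 Met residues, and the mature form one Met fewer; the sketch below checks that these counts reproduce the listed values. The helper name `residue_percent` is hypothetical.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = "".join(seq.split()).upper()
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


# Toy sequence: 1 Cys and 1 Met in 10 residues.
print(residue_percent("MCAAAAAAAA", "C"))   # 10.0
print(residue_percent("MCAAAAAAAA", "CM"))  # 20.0

# Counts assumed from a manual tally of the sequences in this record.
for label, length, n_cys, n_met in (("Translated", 409, 4, 9),
                                    ("Mature", 408, 4, 8)):
    print(label,
          round(100.0 * n_cys / length, 1),
          round(100.0 * n_met / length, 1),
          round(100.0 * (n_cys + n_met) / length, 1))
# Translated 1.0 2.2 3.2
# Mature 1.0 2.0 2.9
```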

Secondary structure:

>Translated Secondary Structure
MSAIDYILTILILASLYLNWHLVQVCRSAMKARKKALRDAQRLLKRGEEAQEQAIADKRR
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
FLEALGEAFLLIGPSGHIVLANTLAKELFQEEKLEGRKVGALVCNQELLGHVQEAFDTDG
HHHHHCCEEEEECCCCCEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCC
PVTKEFTLSAANSPGGVQNGITAWHLDSAITDAPIREKRILLRNITQNYLTNQMRRDFVA
CCCCEEEEECCCCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
NASHELRTPLTIIVGYLENLMEDDLVEESPGLARKFIGVMHQNSQRLMNIIEDMLMISKL
CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHH
ESGHKAILKEQWFRLTSCADDVFSRLDSIREKKQAVLHMDIPTDWELYGDPFYWTQILFN
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCCCCCCCCCHHHHHHHHH
LVENALKQNTEPGLSITVAAAKTQDACVITVTDTGVGIPVESIPFLFNRFYRVETHHSSE
HHHHHHHCCCCCCCEEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHHHHHEECCCCCC
IKGTGLGLSIVKRAVEAHDGAITVSSIPHRETVFTITIPLKRFREEKAA
CCCCCCCHHHHHHHHHHCCCEEEECCCCCCCEEEEEEECHHHHHHHCCC
>Mature Secondary Structure 
SAIDYILTILILASLYLNWHLVQVCRSAMKARKKALRDAQRLLKRGEEAQEQAIADKRR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
FLEALGEAFLLIGPSGHIVLANTLAKELFQEEKLEGRKVGALVCNQELLGHVQEAFDTDG
HHHHHCCEEEEECCCCCEEEHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCC
PVTKEFTLSAANSPGGVQNGITAWHLDSAITDAPIREKRILLRNITQNYLTNQMRRDFVA
CCCCEEEEECCCCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
NASHELRTPLTIIVGYLENLMEDDLVEESPGLARKFIGVMHQNSQRLMNIIEDMLMISKL
CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHH
ESGHKAILKEQWFRLTSCADDVFSRLDSIREKKQAVLHMDIPTDWELYGDPFYWTQILFN
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCCCCCCCCCHHHHHHHHH
LVENALKQNTEPGLSITVAAAKTQDACVITVTDTGVGIPVESIPFLFNRFYRVETHHSSE
HHHHHHHCCCCCCCEEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHHHHHEECCCCCC
IKGTGLGLSIVKRAVEAHDGAITVSSIPHRETVFTITIPLKRFREEKAA
CCCCCCCHHHHHHHHHHCCCEEEECCCCCCCEEEEEEECHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 3142862; 9387221; 9384377 [H]