Definition Prochlorococcus marinus str. MIT 9301, complete genome.
Accession NC_009091
Length 1,641,879


The map label for this gene is phoR [H]

Identifier: 126696579

GI number: 126696579

Start: 1054684

End: 1055844

Strand: Reverse

Name: phoR [H]

Synonym: P9301_12411

Alternate gene names: 126696579

Gene position: 1055844-1054684 (Counterclockwise)
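
The interval above is inclusive (1055844 - 1054684 + 1 = 1161 bases, matching the gene sequence length), and because the gene is on the reverse strand, the coding sequence is the reverse complement of that genome interval. A minimal Python sketch follows. It assumes 1-based coordinates (the usual GenBank convention, not stated in this record) and a genome held as a plain string; `extract_gene` and the toy genome are illustrative names, not part of the record.

```python
# Map upper- and lower-case bases to their complements.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def extract_gene(genome: str, start: int, end: int, strand: str) -> str:
    """Extract a gene given 1-based inclusive coordinates (start <= end).

    `strand` is assumed to be "Forward" or "Reverse", as written in this record.
    """
    segment = genome[start - 1:end]
    return reverse_complement(segment) if strand == "Reverse" else segment


# Length implied by the coordinates in this record.
gene_length = 1055844 - 1054684 + 1

# Toy genome (hypothetical) to show the reverse-strand behaviour.
toy_genome = "AACCGTTTAG"
toy_gene = extract_gene(toy_genome, 3, 6, "Reverse")  # interval "CCGT"
```

With the real genome sequence of NC_009091 loaded into `genome`, `extract_gene(genome, 1054684, 1055844, "Reverse")` would be expected to return the 1161-base sequence listed under "Gene sequence".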

Preceding gene: 126696580

Following gene: 126696577

Centisome position: 64.31
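
The centisome value appears to be the gene's 5' coordinate expressed as a percentage of the genome length. This is an inference from the numbers in this record (1055844 of 1,641,879 bases), not a documented definition, so the sketch below should be read as a consistency check only; the function name `centisome` is illustrative.

```python
GENOME_LENGTH = 1_641_879   # "Length" field of this record
GENE_5PRIME = 1_055_844     # first coordinate of "Gene position" (reverse strand)


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to two decimals."""
    return round(100.0 * position / genome_length, 2)


value = centisome(GENE_5PRIME, GENOME_LENGTH)
print(value)
```

Using the other coordinate (1054684) gives 64.24 rather than 64.31, which is why the 5' end is assumed here.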

GC content: 27.48
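
GC content is conventionally the percentage of G and C bases in a sequence. The listed value of 27.48 is consistent with 319 G or C bases out of the 1161 in the gene (319 / 1161 = 27.48%). A small sketch of the calculation, with `gc_content` as an illustrative name and a short fragment as input:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, rounded to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 15 bases of the gene sequence in this record: 2 G + 1 C out of 15.
fragment_gc = gc_content("ATGAAAATGAAAACT")
```

Applying `gc_content` to the full 1161-base gene sequence would be expected to reproduce the listed figure.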

Gene sequence:

>1161_bases
ATGAAAATGAAAACTCTACAAGAAGTATTATTGTTTTTACATCAGAAGTGGAAAATAAAAGTCAAGCATTTTTCTCAATC
AAAAGAAAAAAAATTTGAAGTTGATAAAAATAAGTATAAAAGAACTATTGATTTCCCATTTGACAAAATTAGACCTCAAG
AAATGTTGTCTTGGTTGGATTATTCATCACAAGGATGGATAATCTTATCATCAGACCTAACAATAAAATTTATTAATAAA
AAGGCTTTATCTCTTATTAGGTTTATTAAATATAAAGATGTCATTGGAAAAGCAATTAATGATATTAATGAGCTTGAATT
ACTGAGAAATAAAATTCTTTACTCAAGAAAAAAAGATTTTCCAGTCAGCATTGATTTCACCATCGCAGGAGAACCTATTT
GGGCAAATATTGTTAGAGGAAGTAAAAAGAATTATTTAATATTGTTGGAAAGTAAATTATCAATTGAATCCATAAAAAAA
CGACAAAATCAATTAATTAATGATGTATCTCATGAATTGAAAACCCCTCTTACATCTCTTATTTTAATTGGGGAAAGACT
GGAAGCAGTAGTATCAAAAAAGGATAGGTATCTTGTTAAAAGGCTTAAGAAAGAGTCAAAAAGATTGCGAAAAATGGCCG
AAGAGACTTTAGAGCTTTCCAAGTTAGAAAGTAGCGAATCTTTTAATAAAAATAAAAAAATATCTATTTCAGATTTGGTA
ATGGAATCTTGGCAAACCTTAAAACCGCTTGCAGATAAAAAAAATATAAAAATAAATTTATTTATACCAACAAAATATTA
TATTTCTGGTGATATTGAAAATTTAAAGAGAGCATTTATAAATATTTTGGATAACTCTATACGTTACTCCCCAACAAATG
AGGAAATACAAATTGAAATTTTTAAGAGAGACAGCTCTGTTGTTATAAGAGTGAGGGATAAAGGTATTGGATTAGAAGAA
AATGAATATAATGATATTTTTTCTCGTTTTTATCGTGGGGATCCATCAAGGACTAAATTTAAGAAAAGTGGTAGTGGTTT
AGGTCTTTCAATAACAAAAAAAATAATCAATAACAATAAAGGGTTTATAAAGGCTTTTAATCATAAAGATGGTGGAGCGA
TTGTTGAAACCATCTTTCCGTGTATTAATACCGATATATAA

Upstream 100 bases:

>100_bases
CTGTAGATGTTCATGTTAGGTGGCTCAGAGAGAAACTAGAAGAAGATCCCTCAGCTCCAAAGTTTCTTAAAACTGTAAGA
GGTTTTGGATATAAGTTTGG

Downstream 100 bases:

>100_bases
TAAAAATTTAAATTCTTAAAGGTAAAAAAAAAGACCTGTATTTAGCAGGTCTTTTTAATGTGTGTAAGATTTCTAAATTA
AACTTTAATTTAGAATTTAA

Product: two-component sensor histidine kinase, phosphate sensing

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 386; Mature: 386
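
The residue count follows from the gene length: 1161 bases are 387 codons, of which the last is a stop codon, leaving 386 amino acids. A minimal translation sketch is given below. It assumes the standard codon assignments (shared by the bacterial genetic code, NCBI table 11) and does not handle ambiguous bases; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence in this record.
n_terminus = translate("ATGAAAATGAAAACTCTACAAGAA")
c_terminus = translate("ACCGATATATAA")
expected_residues = (1161 - 3) // 3
```

The two fragments translate to the start (MKMKTLQE) and end (TDI) of the protein sequence listed in this record.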

Protein sequence:

>386_residues
MKMKTLQEVLLFLHQKWKIKVKHFSQSKEKKFEVDKNKYKRTIDFPFDKIRPQEMLSWLDYSSQGWIILSSDLTIKFINK
KALSLIRFIKYKDVIGKAINDINELELLRNKILYSRKKDFPVSIDFTIAGEPIWANIVRGSKKNYLILLESKLSIESIKK
RQNQLINDVSHELKTPLTSLILIGERLEAVVSKKDRYLVKRLKKESKRLRKMAEETLELSKLESSESFNKNKKISISDLV
MESWQTLKPLADKKNIKINLFIPTKYYISGDIENLKRAFINILDNSIRYSPTNEEIQIEIFKRDSSVVIRVRDKGIGLEE
NEYNDIFSRFYRGDPSRTKFKKSGSGLGLSITKKIINNNKGFIKAFNHKDGGAIVETIFPCINTDI

Sequences:

>Translated_386_residues
MKMKTLQEVLLFLHQKWKIKVKHFSQSKEKKFEVDKNKYKRTIDFPFDKIRPQEMLSWLDYSSQGWIILSSDLTIKFINK
KALSLIRFIKYKDVIGKAINDINELELLRNKILYSRKKDFPVSIDFTIAGEPIWANIVRGSKKNYLILLESKLSIESIKK
RQNQLINDVSHELKTPLTSLILIGERLEAVVSKKDRYLVKRLKKESKRLRKMAEETLELSKLESSESFNKNKKISISDLV
MESWQTLKPLADKKNIKINLFIPTKYYISGDIENLKRAFINILDNSIRYSPTNEEIQIEIFKRDSSVVIRVRDKGIGLEE
NEYNDIFSRFYRGDPSRTKFKKSGSGLGLSITKKIINNNKGFIKAFNHKDGGAIVETIFPCINTDI
>Mature_386_residues
MKMKTLQEVLLFLHQKWKIKVKHFSQSKEKKFEVDKNKYKRTIDFPFDKIRPQEMLSWLDYSSQGWIILSSDLTIKFINK
KALSLIRFIKYKDVIGKAINDINELELLRNKILYSRKKDFPVSIDFTIAGEPIWANIVRGSKKNYLILLESKLSIESIKK
RQNQLINDVSHELKTPLTSLILIGERLEAVVSKKDRYLVKRLKKESKRLRKMAEETLELSKLESSESFNKNKKISISDLV
MESWQTLKPLADKKNIKINLFIPTKYYISGDIENLKRAFINILDNSIRYSPTNEEIQIEIFKRDSSVVIRVRDKGIGLEE
NEYNDIFSRFYRGDPSRTKFKKSGSGLGLSITKKIINNNKGFIKAFNHKDGGAIVETIFPCINTDI

Specific function: Member of the two-component regulatory system sphR/sphS. Sensory kinase. Is involved in inducible production of alkaline phosphatase in response to phosphate limitation as it is directly involved in the regulation of phoA transcription in response to phos

COG id: COG0642

COG function: function code T; Signal transduction histidine kinase

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

Organism=Escherichia coli, GI1786783, Length=227, Percent_Identity=31.7180616740088, Blast_Score=111, Evalue=8e-26,
Organism=Escherichia coli, GI1788393, Length=230, Percent_Identity=28.2608695652174, Blast_Score=99, Evalue=7e-22,
Organism=Escherichia coli, GI1790346, Length=226, Percent_Identity=31.4159292035398, Blast_Score=96, Evalue=4e-21,
Organism=Escherichia coli, GI1786912, Length=226, Percent_Identity=27.4336283185841, Blast_Score=93, Evalue=3e-20,
Organism=Escherichia coli, GI1786600, Length=269, Percent_Identity=24.1635687732342, Blast_Score=89, Evalue=7e-19,
Organism=Escherichia coli, GI1787894, Length=228, Percent_Identity=24.5614035087719, Blast_Score=75, Evalue=6e-15,
Organism=Escherichia coli, GI1788713, Length=228, Percent_Identity=25.8771929824561, Blast_Score=75, Evalue=7e-15,
Organism=Escherichia coli, GI1789808, Length=218, Percent_Identity=25.2293577981651, Blast_Score=75, Evalue=8e-15,
Organism=Escherichia coli, GI145693157, Length=234, Percent_Identity=27.3504273504274, Blast_Score=74, Evalue=1e-14,
Organism=Escherichia coli, GI1790551, Length=199, Percent_Identity=27.1356783919598, Blast_Score=74, Evalue=2e-14,
Organism=Escherichia coli, GI1789403, Length=223, Percent_Identity=28.6995515695067, Blast_Score=71, Evalue=1e-13,
Organism=Escherichia coli, GI1789149, Length=313, Percent_Identity=23.961661341853, Blast_Score=71, Evalue=1e-13,
Organism=Escherichia coli, GI48994928, Length=226, Percent_Identity=20.353982300885, Blast_Score=70, Evalue=3e-13,
Organism=Escherichia coli, GI1788279, Length=208, Percent_Identity=28.8461538461538, Blast_Score=67, Evalue=2e-12,
Organism=Escherichia coli, GI1788549, Length=226, Percent_Identity=26.9911504424779, Blast_Score=67, Evalue=3e-12,
Organism=Escherichia coli, GI1790436, Length=222, Percent_Identity=22.5225225225225, Blast_Score=66, Evalue=4e-12,
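
Each homologue line above is a comma-separated list of `key=value` fields, plus a bare GI identifier and a trailing comma. A hedged sketch of a parser for this layout follows; the function name `parse_homologue` and the choice of output types are assumptions, and the layout is inferred only from the lines in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # Bare identifier such as "GI1786783"; keep the digits as a string.
            record["GI"] = field[2:]
    # Convert the numeric fields where present.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786783, Length=227, "
    "Percent_Identity=31.7180616740088, Blast_Score=111, Evalue=8e-26,"
)
```

Parsing all sixteen lines this way makes it straightforward to sort hits by `Evalue` or filter by `Percent_Identity`.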

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003594
- InterPro:   IPR000014
- InterPro:   IPR013767
- InterPro:   IPR004358
- InterPro:   IPR003661
- InterPro:   IPR005467
- InterPro:   IPR009082 [H]

Pfam domain/function: PF02518 HATPase_c; PF00512 HisKA; PF00989 PAS [H]

EC number: 2.7.13.3 [H]

Molecular weight: Translated: 45036; Mature: 45036

Theoretical pI: Translated: 10.46; Mature: 10.46

Prosite motif: PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
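
The percentages above can be checked directly against the 386-residue protein sequence in this record, which contains 1 Cys and 5 Met. A short sketch, with `residue_percent` as an illustrative name and one-decimal rounding assumed from the listed values:

```python
# Protein sequence copied from this record (386 residues).
PROTEIN = (
    "MKMKTLQEVLLFLHQKWKIKVKHFSQSKEKKFEVDKNKYKRTIDFPFDKIRPQEMLSWLDYSSQGWIILSSDLTIKFINK"
    "KALSLIRFIKYKDVIGKAINDINELELLRNKILYSRKKDFPVSIDFTIAGEPIWANIVRGSKKNYLILLESKLSIESIKK"
    "RQNQLINDVSHELKTPLTSLILIGERLEAVVSKKDRYLVKRLKKESKRLRKMAEETLELSKLESSESFNKNKKISISDLV"
    "MESWQTLKPLADKKNIKINLFIPTKYYISGDIENLKRAFINILDNSIRYSPTNEEIQIEIFKRDSSVVIRVRDKGIGLEE"
    "NEYNDIFSRFYRGDPSRTKFKKSGSGLGLSITKKIINNNKGFIKAFNHKDGGAIVETIFPCINTDI"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


cys = residue_percent(PROTEIN, "C")       # 1 of 386 residues
met = residue_percent(PROTEIN, "M")       # 5 of 386 residues
cys_met = residue_percent(PROTEIN, "CM")  # 6 of 386 residues
```

Because the translated and mature sequences are identical in this record, the same values apply to both.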

Secondary structure:

>Translated Secondary Structure
MKMKTLQEVLLFLHQKWKIKVKHFSQSKEKKFEVDKNKYKRTIDFPFDKIRPQEMLSWLD
CCHHHHHHHHHHHHCCCEEEEEHHCCCHHHHHCCCHHHCCEEECCCHHHCCHHHHHHHHC
YSSQGWIILSSDLTIKFINKKALSLIRFIKYKDVIGKAINDINELELLRNKILYSRKKDF
CCCCCEEEEECCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
PVSIDFTIAGEPIWANIVRGSKKNYLILLESKLSIESIKKRQNQLINDVSHELKTPLTSL
CEEEEEEECCCCHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCHHHHH
ILIGERLEAVVSKKDRYLVKRLKKESKRLRKMAEETLELSKLESSESFNKNKKISISDLV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEHHHHH
MESWQTLKPLADKKNIKINLFIPTKYYISGDIENLKRAFINILDNSIRYSPTNEEIQIEI
HHHHHHHCCCCCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHHHCCCEECCCCCEEEEEE
FKRDSSVVIRVRDKGIGLEENEYNDIFSRFYRGDPSRTKFKKSGSGLGLSITKKIINNNK
EECCCEEEEEEECCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCHHHHHHHHCCCC
GFIKAFNHKDGGAIVETIFPCINTDI
CEEEEECCCCCCHHHHHHHHHHCCCC
>Mature Secondary Structure
MKMKTLQEVLLFLHQKWKIKVKHFSQSKEKKFEVDKNKYKRTIDFPFDKIRPQEMLSWLD
CCHHHHHHHHHHHHCCCEEEEEHHCCCHHHHHCCCHHHCCEEECCCHHHCCHHHHHHHHC
YSSQGWIILSSDLTIKFINKKALSLIRFIKYKDVIGKAINDINELELLRNKILYSRKKDF
CCCCCEEEEECCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
PVSIDFTIAGEPIWANIVRGSKKNYLILLESKLSIESIKKRQNQLINDVSHELKTPLTSL
CEEEEEEECCCCHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCHHHHH
ILIGERLEAVVSKKDRYLVKRLKKESKRLRKMAEETLELSKLESSESFNKNKKISISDLV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEHHHHH
MESWQTLKPLADKKNIKINLFIPTKYYISGDIENLKRAFINILDNSIRYSPTNEEIQIEI
HHHHHHHCCCCCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHHHCCCEECCCCCEEEEEE
FKRDSSVVIRVRDKGIGLEENEYNDIFSRFYRGDPSRTKFKKSGSGLGLSITKKIINNNK
EECCCEEEEEEECCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCHHHHHHHHCCCC
GFIKAFNHKDGGAIVETIFPCINTDI
CEEEEECCCCCCHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8497200 [H]