Definition Halothermothrix orenii H 168 chromosome, complete genome.
Accession NC_011899
Length 2,578,146

The map label for this gene is ylaK [H]

Identifier: 220931934

GI number: 220931934

Start: 1187740

End: 1189059

Strand: Direct

Name: ylaK [H]

Synonym: Hore_10930

Alternate gene names: 220931934

Gene position: 1187740-1189059 (Clockwise)

Preceding gene: 220931933

Following gene: 220931936

Centisome position: 46.07
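
Note: the centisome position appears to be the gene start coordinate expressed as a percentage of the chromosome length. That definition is an assumption, but it reproduces the value listed in this record. A minimal Python sketch (the function names are illustrative and not part of the database):

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive gene coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the Start, End and Length fields of this record.
print(gene_length(1187740, 1189059))
print(round(centisome(1187740, 2578146), 2))
```

The gene length computed this way agrees with the 1320 bases given in the gene sequence header.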

GC content: 37.12
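
Note: GC content is the percentage of G and C bases in a nucleotide sequence. The record's figure presumably refers to the full 1320-base gene sequence; the sketch below only demonstrates the calculation on the first twelve bases of the gene, and the helper name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence, used only as a short demonstration.
print(gc_content("ATGGCCAGTAAA"))
```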

Gene sequence:

>1320_bases
ATGGCCAGTAAAATATTTGTACTTGATACCAATGTATTATTAATGGACCCGATGGCTATTTATTCTTTTGACGAACATGA
AATAGTTATTCCCTTACCGGTAGTGGAAGAGATAGATAATTTAAAAAACAAGAATAATTCAATCGGGTATAATGCCCGGG
AGACTTCCAGGATTTTAAATGACCTGAGGAGTAAAGGTCGCCTTAATGAGGGAGTAGAACTTCCTGATGGAGGTTTATTA
AGACTAGTGATAGATTTTAATGATATGGTTTTACCTAAAGGTTTAAGTTTTACCAAAATGGATAACCGTATCTTATCTAC
TGCTATTAATATAAAAAAAGAAGAACCCGGGCGGGAAGTAATTCTGGTATCAAATGATATTAATTTAAGATTAATAGCCG
ATGCCTTTGGTTTAAAAGCTGAAGAACATAAATCCAGCAGGTTAAAAGATGAAGATATATATACAGGAATTAAAGAAATT
GATGTTCCTTCTAAAGTAATAGATTTATTTTTTAGTAATGAGGGTATTCCCCCTGAGGATATAAATAATGAGAATATAGA
ACTATATCCCCAGGAAATGGTTCAGTTTAATGCAATTGATGTTGAAAACAAACATGCCCTCGGGAGATATGATGGTAACA
GGGTGGTACCCTTTAAATTTTTGCGAAAACAGGCCTGGGGTATTAAACCAAGAAACAGGGAACAGTCCATGGCCTTTGAA
TTGTTACTCAATGATGATATTAAGTTAGTAACCCTTGCTGGAAAGGCAGGGACCGGGAAAACCCTGTTAGCCCTTGCCTG
TGGCCTGGCCAAAGTGACTGATGAAGGAAGGTATAACCGTCTTCTGGTTGCCAGACCGGTTGTTCCCATGGGTAATGATA
TAGGTTTTCTTCCGGGGAGCAAAGAAGAGAAGTTACAGCCCTGGATGCAACCAATATTTGACAACCTTGAATATATCCTG
GGGCAACCCGATAATAGTTCCGATTATTCTTTTGAATATCTGGTTGATAAAGATTTAATTCAGGTGGAGGCATTAACCTA
TATTAGAGGTAGATCTATTCCCGGTCAGTTTATAATAATTGATGAGGCCCAGAATTTATCCAGTCATGAGATAAAAACAA
TTATAACCAGGGTTGGTAAGGGTAGCAAGATAGTTTTAACCGGAGATCCTTACCAGATAGATAACCCTTATTTAGATTTA
CACAATAATGGCTTAACCCATCTGGCACACAAATTTCATCAGGAGAAAATAGGAGGTCATGTTACCCTTGTTAAAGGGGA
AAGGTCAGAGCTTGCCGAAATAGCAAGTAAAATTTTATAA

Upstream 100 bases:

>100_bases
ATTCAGGAATTAATAGGAGAAAAACTTGAGAGAACTTTTGATAAGAAAACAGGTCTCTCATTGTGGTCAATGTAAGGTTC
TAAATGAAGGGAGGGGAGTT

Downstream 100 bases:

>100_bases
CTCTGAAATATAATAGCTTCCAGGGGATTTTTATAATTCTGGTTGTAAAAAACTGGACGCGACTTCTGTTATAAAACAAA
TGCCTCTCCATTTCCTTTAA

Product: PhoH family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 439; Mature: 438

Protein sequence:

>439_residues
MASKIFVLDTNVLLMDPMAIYSFDEHEIVIPLPVVEEIDNLKNKNNSIGYNARETSRILNDLRSKGRLNEGVELPDGGLL
RLVIDFNDMVLPKGLSFTKMDNRILSTAINIKKEEPGREVILVSNDINLRLIADAFGLKAEEHKSSRLKDEDIYTGIKEI
DVPSKVIDLFFSNEGIPPEDINNENIELYPQEMVQFNAIDVENKHALGRYDGNRVVPFKFLRKQAWGIKPRNREQSMAFE
LLLNDDIKLVTLAGKAGTGKTLLALACGLAKVTDEGRYNRLLVARPVVPMGNDIGFLPGSKEEKLQPWMQPIFDNLEYIL
GQPDNSSDYSFEYLVDKDLIQVEALTYIRGRSIPGQFIIIDEAQNLSSHEIKTIITRVGKGSKIVLTGDPYQIDNPYLDL
HNNGLTHLAHKFHQEKIGGHVTLVKGERSELAEIASKIL

Sequences:

>Translated_439_residues
MASKIFVLDTNVLLMDPMAIYSFDEHEIVIPLPVVEEIDNLKNKNNSIGYNARETSRILNDLRSKGRLNEGVELPDGGLL
RLVIDFNDMVLPKGLSFTKMDNRILSTAINIKKEEPGREVILVSNDINLRLIADAFGLKAEEHKSSRLKDEDIYTGIKEI
DVPSKVIDLFFSNEGIPPEDINNENIELYPQEMVQFNAIDVENKHALGRYDGNRVVPFKFLRKQAWGIKPRNREQSMAFE
LLLNDDIKLVTLAGKAGTGKTLLALACGLAKVTDEGRYNRLLVARPVVPMGNDIGFLPGSKEEKLQPWMQPIFDNLEYIL
GQPDNSSDYSFEYLVDKDLIQVEALTYIRGRSIPGQFIIIDEAQNLSSHEIKTIITRVGKGSKIVLTGDPYQIDNPYLDL
HNNGLTHLAHKFHQEKIGGHVTLVKGERSELAEIASKIL
>Mature_438_residues
ASKIFVLDTNVLLMDPMAIYSFDEHEIVIPLPVVEEIDNLKNKNNSIGYNARETSRILNDLRSKGRLNEGVELPDGGLLR
LVIDFNDMVLPKGLSFTKMDNRILSTAINIKKEEPGREVILVSNDINLRLIADAFGLKAEEHKSSRLKDEDIYTGIKEID
VPSKVIDLFFSNEGIPPEDINNENIELYPQEMVQFNAIDVENKHALGRYDGNRVVPFKFLRKQAWGIKPRNREQSMAFEL
LLNDDIKLVTLAGKAGTGKTLLALACGLAKVTDEGRYNRLLVARPVVPMGNDIGFLPGSKEEKLQPWMQPIFDNLEYILG
QPDNSSDYSFEYLVDKDLIQVEALTYIRGRSIPGQFIIIDEAQNLSSHEIKTIITRVGKGSKIVLTGDPYQIDNPYLDLH
NNGLTHLAHKFHQEKIGGHVTLVKGERSELAEIASKIL
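
Note: the translated and mature lengths follow from the gene length. 1320 bases give 440 codons, the last of which is a stop codon, leaving 439 residues; the mature form here is the translated sequence without its initial methionine, giving 438. The sketch below illustrates this with the standard codon assignments (the bacterial table uses the same amino acid assignments and differs only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 39 bases of the gene are translated.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first = translate("ATGGCCAGTAAAATATTTGTACTTGATACC")          # first 30 bases
last = translate("AGGTCAGAGCTTGCCGAAATAGCAAGTAAAATTTTATAA")  # last 39 bases
print(first, last)

translated_len = 1320 // 3 - 1    # drop the stop codon
mature_len = translated_len - 1   # drop the initial methionine
print(translated_len, mature_len)
```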

Specific function: Unknown

COG id: COG1875

COG function: function code T; Predicted ATPase related to phosphate starvation-inducible protein PhoH

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PINc domain [H]

Homologues:

Organism=Escherichia coli, GI145693103, Length=169, Percent_Identity=39.0532544378698, Blast_Score=110, Evalue=2e-25,
Organism=Escherichia coli, GI1787257, Length=191, Percent_Identity=31.9371727748691, Blast_Score=95, Evalue=8e-21,
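
Note: each homologue entry is a comma-separated list of key=value fields, with the GI number given as a bare token. A minimal parsing sketch, assuming that layout holds for every entry (the function name `parse_homologue` is hypothetical):

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue entry into a dict of string fields."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            # The GI number has no key in the record; store it under "GI".
            fields["GI"] = part[2:]
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI145693103, Length=169, "
    "Percent_Identity=39.0532544378698, Blast_Score=110, Evalue=2e-25,"
)
print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 2))
```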

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003714
- InterPro:   IPR006596 [H]

Pfam domain/function: PF02562 PhoH [H]

EC number: NA

Molecular weight: Translated: 49372; Mature: 49241

Theoretical pI: Translated: 5.16; Mature: 5.16

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
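
Note: these percentages are residue counts divided by sequence length. The listed values are consistent with 1 Cys and 9 Met in the 439-residue translated sequence, and 1 Cys and 8 Met in the 438-residue mature sequence. A small sketch of the calculation, shown on the first 20 residues only (the helper name is illustrative):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


head = "MASKIFVLDTNVLLMDPMAI"  # first 20 residues of the translated protein
print(residue_percent(head, "C"))
print(residue_percent(head, "M"))
print(residue_percent(head, "CM"))
```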

Secondary structure:

>Translated Secondary Structure
MASKIFVLDTNVLLMDPMAIYSFDEHEIVIPLPVVEEIDNLKNKNNSIGYNARETSRILN
CCCEEEEEECCEEEECCHHEEECCCCCEEECCHHHHHHHHHHCCCCCCCCCHHHHHHHHH
DLRSKGRLNEGVELPDGGLLRLVIDFNDMVLPKGLSFTKMDNRILSTAINIKKEEPGREV
HHHHCCCCCCCCCCCCCCEEEEEEECCCEECCCCCCCHHHHHHHHHHHEECCCCCCCCEE
ILVSNDINLRLIADAFGLKAEEHKSSRLKDEDIYTGIKEIDVPSKVIDLFFSNEGIPPED
EEEECCCCEEEEEHHHCCCCHHHHHHCCCCHHHHCCHHHCCCCHHHHHHHCCCCCCCCCC
INNENIELYPQEMVQFNAIDVENKHALGRYDGNRVVPFKFLRKQAWGIKPRNREQSMAFE
CCCCCEEEEEHHHEEEEEECCCCCCCCCCCCCCEECHHHHHHHHHCCCCCCCCCHHEEEE
LLLNDDIKLVTLAGKAGTGKTLLALACGLAKVTDEGRYNRLLVARPVVPMGNDIGFLPGS
EEECCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCEECCCCC
KEEKLQPWMQPIFDNLEYILGQPDNSSDYSFEYLVDKDLIQVEALTYIRGRSIPGQFIII
CHHHHHHHHHHHHHCHHEEECCCCCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCEEEE
DEAQNLSSHEIKTIITRVGKGSKIVLTGDPYQIDNPYLDLHNNGLTHLAHKFHQEKIGGH
ECCCCCCHHHHHHHHHHCCCCCEEEEECCCEECCCCEEEECCCCHHHHHHHHHHHHCCCE
VTLVKGERSELAEIASKIL
EEEEECCHHHHHHHHHHCC
>Mature Secondary Structure 
ASKIFVLDTNVLLMDPMAIYSFDEHEIVIPLPVVEEIDNLKNKNNSIGYNARETSRILN
CCEEEEEECCEEEECCHHEEECCCCCEEECCHHHHHHHHHHCCCCCCCCCHHHHHHHHH
DLRSKGRLNEGVELPDGGLLRLVIDFNDMVLPKGLSFTKMDNRILSTAINIKKEEPGREV
HHHHCCCCCCCCCCCCCCEEEEEEECCCEECCCCCCCHHHHHHHHHHHEECCCCCCCCEE
ILVSNDINLRLIADAFGLKAEEHKSSRLKDEDIYTGIKEIDVPSKVIDLFFSNEGIPPED
EEEECCCCEEEEEHHHCCCCHHHHHHCCCCHHHHCCHHHCCCCHHHHHHHCCCCCCCCCC
INNENIELYPQEMVQFNAIDVENKHALGRYDGNRVVPFKFLRKQAWGIKPRNREQSMAFE
CCCCCEEEEEHHHEEEEEECCCCCCCCCCCCCCEECHHHHHHHHHCCCCCCCCCHHEEEE
LLLNDDIKLVTLAGKAGTGKTLLALACGLAKVTDEGRYNRLLVARPVVPMGNDIGFLPGS
EEECCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCEECCCCC
KEEKLQPWMQPIFDNLEYILGQPDNSSDYSFEYLVDKDLIQVEALTYIRGRSIPGQFIII
CHHHHHHHHHHHHHCHHEEECCCCCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCEEEE
DEAQNLSSHEIKTIITRVGKGSKIVLTGDPYQIDNPYLDLHNNGLTHLAHKFHQEKIGGH
ECCCCCCHHHHHHHHHHCCCCCEEEEECCCEECCCCEEEECCCCHHHHHHHHHHHHCCCE
VTLVKGERSELAEIASKIL
EEEEECCHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]