| Definition | Deinococcus geothermalis DSM 11300 plasmid pDGEO01, complete sequence. |
|---|---|
| Accession | NC_008010 |
| Length | 574,127 |
The map label for this gene is sasA [H]
Identifier: 94972310
GI number: 94972310
Start: 176965
End: 178506
Strand: Direct
Name: sasA [H]
Synonym: Dgeo_2847
Alternate gene names: 94972310
Gene position: 176965-178506 (Clockwise)
Preceding gene: 94972319
Following gene: 94972309
Centisome position: 30.82
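The coordinate fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the centisome position is the gene start expressed as a percentage of the replicon length; the record does not state the formula, so this is inferred from the values. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Gene start as a percentage of the replicon length (assumed definition)."""
    return round(100.0 * start / replicon_length, 2)


if __name__ == "__main__":
    # Values taken from this record: Start, End, and plasmid Length.
    print(gene_length(176965, 178506))   # 1542
    print(centisome(176965, 574127))     # 30.82
```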
GC content: 67.38
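GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch follows, assuming the value is computed over the full gene sequence with whitespace ignored; `gc_content` is a hypothetical helper, not part of any database tooling.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)
```

Passing the gene sequence listed in this record (with or without the spaces between its 80-base chunks) gives the percentage to compare against the reported figure.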
Gene sequence:
>1542_bases ATGCCTGCCCTGATCCCCCGCGACGTGGTGCGCGTCCGCTTTCCCTCACCCCGGCTGCCTCTGAGCGAGGGGACCGTGTG GGGCGTGGCGTTGACCCTGATGGCCGCCGTGTTGCTGGCCGACATCCTGACTCCCGCCTCACTTGCTGTGGGTACGGTTC TCAGCGCTTCGGTGGCCTTTGCGGCGCTGGGTGCCTCGCGGCGCACCGCCTGGCGCCTAACCGGTCTGGCCGTGCTGGCG AATCTGGCTGCGGGTCTTTGGAACGGTGTGCGCGACGGCGTGCATCCCACAGATCTGGCCAACCGAGCGGTGAGCATCCT GGCCGTTTTGTTGGTCGGTTACCTGACCATCCGCGCGCGGGAAGCCTCCGAGCGGGCCGCCGCCCTGCAGGAGGAGGAGC GGCAACTTCGGCGGGAACGGGCTTTGCGCAGCTTGGCGGAGGATATGGGCGGTCCCTTGGGGCAGGCAGAGTTTGTGGAG CGGGCCGCCGCGGCGCTGGTACGGCTGACTGGCGCCAGGACTGTTGAGATCGGTGCGGTGGACCGAGCGACCCTCCGCGC GCCGCATGCTCTGGCCTGCGCGCCAGGTGCCTTTCCAGCCCCAAGTCGCCTCAATACCCGGATTCCGCTCGAATTCCTGG CGCATCCGGTCGGGTCTGGAGATGTGTGGGCGGTGGACGGTGGCCGCGTCTTCCTGGCCCGGTTGCGCCGCCCCGCTGCC GGCGACCTGTTGGTGATCCTGACCGCCCCCACCACACCGCCTAACCTGACGAGCGAGGCCATCCGGGCGTTGCAACCCCT GCTGGAACGCACCGCTTTGCTCGACGATCTGCGCGCAAACCGGGAGCAATTGGCCGAGCAGAGCGAATTGCTGCGTGACC TGATCTACGCTTTTTCCCATGACCTGCGCACGCCCCTCCTCGCCAACGCCATGAACATGCAGGCGGCCCTGAAGGGCGCT TATGGTCCCCTCCCAGAGGCATACCGCGCCACGCTGGTCAACGGGCTTAGCGCCAATGAGACCCTCCTCAACCTGGCCGA CCAACTGCTGCTGGTCGCCCGGTACGAGAGCGGCGTAGAGGCGGGGGAAGACCACCTGAGCGTGAACCTGCGTGACCTCG TTCTCCATGTGGTGAATGATCTGCGCCCCCACGCCCAGGCGCGGCGAGTTACCTTTGAGCTCCACCTGGACGGCGTGCGG GTCTGCGGCAATCCCCATGACCTGCGGCGAGCCGTCCAGAATCTGCTGGACAATGCGATCAAATTCAGTCCACCCGGCGG AACGGTGAGCCTTGGCCTCTTGGCAGATGGCGACGAGGCTGTGTTCAGCGTGCAGGATGAGGGTCCCGGTGTTCCGCCTG GCCGGGAGGGGCGGCTGTTCCAGCGCTTCCGCAGTGGCGGAGCCGGGGGCGGTACCGGCCTGGGCCTCTACCTGACCCGC CGAATTGCCGAGGCGCACGGAGGCAGTGTTCGGTATCACCGCACGGCGCAGGCCAGAAGCCTCTTTGTCCTGACCCTTCC CCTGGAGACCCGACATGCCTGA
Upstream 100 bases:
>100_bases GCTGATTCTGCTGATTCTGGCTCTCCAAGACACGTGAGAGCCGCGCCCCCTGCTCTCTTCCCCAAGCCTCCGTCATGTGC CGCTGTCCTACCCTGACCCC
Downstream 100 bases:
>100_bases ACCTATCCGCATCCTGATCGTCGAGGACCATGCCTTTACCCGCGATGGCCTGCGTGCGAGCCTCAATCTCGAAGCGGATC TGCGCGTCGTAGCAGAGGCC
Product: histidine kinase
Products: NA
Alternate protein names: Synechococcus adaptive sensor protein A [H]
Number of amino acids: Translated: 513; Mature: 512
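The two residue counts follow from the gene length. The sketch below assumes the 1,542-base gene includes a terminal stop codon (the listed sequence ends in TGA) and that the "Mature" form differs only by removal of the initiator methionine; the second point is an inference from the one-residue difference, not something the record states.

```python
def residue_counts(n_bases: int, has_stop: bool = True,
                   initiator_removed: bool = True) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1 if has_stop else codons
    mature = translated - 1 if initiator_removed else translated
    return translated, mature


if __name__ == "__main__":
    print(residue_counts(1542))  # (513, 512)
```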
Protein sequence:
>513_residues MPALIPRDVVRVRFPSPRLPLSEGTVWGVALTLMAAVLLADILTPASLAVGTVLSASVAFAALGASRRTAWRLTGLAVLA NLAAGLWNGVRDGVHPTDLANRAVSILAVLLVGYLTIRAREASERAAALQEEERQLRRERALRSLAEDMGGPLGQAEFVE RAAAALVRLTGARTVEIGAVDRATLRAPHALACAPGAFPAPSRLNTRIPLEFLAHPVGSGDVWAVDGGRVFLARLRRPAA GDLLVILTAPTTPPNLTSEAIRALQPLLERTALLDDLRANREQLAEQSELLRDLIYAFSHDLRTPLLANAMNMQAALKGA YGPLPEAYRATLVNGLSANETLLNLADQLLLVARYESGVEAGEDHLSVNLRDLVLHVVNDLRPHAQARRVTFELHLDGVR VCGNPHDLRRAVQNLLDNAIKFSPPGGTVSLGLLADGDEAVFSVQDEGPGVPPGREGRLFQRFRSGGAGGGTGLGLYLTR RIAEAHGGSVRYHRTAQARSLFVLTLPLETRHA
Sequences:
>Translated_513_residues MPALIPRDVVRVRFPSPRLPLSEGTVWGVALTLMAAVLLADILTPASLAVGTVLSASVAFAALGASRRTAWRLTGLAVLA NLAAGLWNGVRDGVHPTDLANRAVSILAVLLVGYLTIRAREASERAAALQEEERQLRRERALRSLAEDMGGPLGQAEFVE RAAAALVRLTGARTVEIGAVDRATLRAPHALACAPGAFPAPSRLNTRIPLEFLAHPVGSGDVWAVDGGRVFLARLRRPAA GDLLVILTAPTTPPNLTSEAIRALQPLLERTALLDDLRANREQLAEQSELLRDLIYAFSHDLRTPLLANAMNMQAALKGA YGPLPEAYRATLVNGLSANETLLNLADQLLLVARYESGVEAGEDHLSVNLRDLVLHVVNDLRPHAQARRVTFELHLDGVR VCGNPHDLRRAVQNLLDNAIKFSPPGGTVSLGLLADGDEAVFSVQDEGPGVPPGREGRLFQRFRSGGAGGGTGLGLYLTR RIAEAHGGSVRYHRTAQARSLFVLTLPLETRHA
>Mature_512_residues PALIPRDVVRVRFPSPRLPLSEGTVWGVALTLMAAVLLADILTPASLAVGTVLSASVAFAALGASRRTAWRLTGLAVLAN LAAGLWNGVRDGVHPTDLANRAVSILAVLLVGYLTIRAREASERAAALQEEERQLRRERALRSLAEDMGGPLGQAEFVER AAAALVRLTGARTVEIGAVDRATLRAPHALACAPGAFPAPSRLNTRIPLEFLAHPVGSGDVWAVDGGRVFLARLRRPAAG DLLVILTAPTTPPNLTSEAIRALQPLLERTALLDDLRANREQLAEQSELLRDLIYAFSHDLRTPLLANAMNMQAALKGAY GPLPEAYRATLVNGLSANETLLNLADQLLLVARYESGVEAGEDHLSVNLRDLVLHVVNDLRPHAQARRVTFELHLDGVRV CGNPHDLRRAVQNLLDNAIKFSPPGGTVSLGLLADGDEAVFSVQDEGPGVPPGREGRLFQRFRSGGAGGGTGLGLYLTRR IAEAHGGSVRYHRTAQARSLFVLTLPLETRHA
Specific function: May be involved in signal transduction. Participates in the kaiABC clock protein complex, which constitutes the main circadian regulator in cyanobacteria, via its interaction with kaiC. Required for robustness of the circadian rhythm of gene expression an
COG id: NA
COG function: NA
Gene ontology:
Cell location: Integral Membrane Protein. Inner Membrane [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 histidine kinase domain [H]
Homologues:
Organism=Escherichia coli, GI1790861, Length=286, Percent_Identity=29.3706293706294, Blast_Score=90, Evalue=3e-19
Organism=Escherichia coli, GI1789403, Length=203, Percent_Identity=33.9901477832512, Blast_Score=89, Evalue=8e-19
Organism=Escherichia coli, GI1786912, Length=284, Percent_Identity=30.6338028169014, Blast_Score=87, Evalue=3e-18
Organism=Escherichia coli, GI87081816, Length=252, Percent_Identity=30.5555555555556, Blast_Score=86, Evalue=5e-18
Organism=Escherichia coli, GI1786600, Length=303, Percent_Identity=27.0627062706271, Blast_Score=79, Evalue=9e-16
Organism=Escherichia coli, GI1788279, Length=228, Percent_Identity=27.1929824561404, Blast_Score=76, Evalue=4e-15
Organism=Escherichia coli, GI1786783, Length=282, Percent_Identity=27.3049645390071, Blast_Score=74, Evalue=2e-14
Organism=Escherichia coli, GI1790551, Length=194, Percent_Identity=30.9278350515464, Blast_Score=74, Evalue=3e-14
Organism=Escherichia coli, GI1789149, Length=246, Percent_Identity=29.2682926829268, Blast_Score=73, Evalue=4e-14
Organism=Escherichia coli, GI145693157, Length=238, Percent_Identity=27.7310924369748, Blast_Score=70, Evalue=2e-13
Organism=Escherichia coli, GI1788393, Length=232, Percent_Identity=27.5862068965517, Blast_Score=70, Evalue=3e-13
Organism=Escherichia coli, GI1788713, Length=262, Percent_Identity=24.0458015267176, Blast_Score=66, Evalue=6e-12
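The homologue entries share a fixed `key=value` layout, so they can be turned into structured records with a regular expression. This is a sketch only: the `Hit` type and `parse_hits` function are hypothetical names, and the pattern assumes every entry carries the same six fields in the same order as the entries above.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One homologue entry; fields are assumed to appear in this fixed order.
_HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_hits(text: str) -> list[Hit]:
    """Extract every homologue entry found in the text, in order of appearance."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in _HIT_RE.finditer(text)
    ]
```

Because the pattern does not rely on line breaks, it works whether the entries are listed one per line or run together with commas.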
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594
- InterPro: IPR011649
- InterPro: IPR004358
- InterPro: IPR003661
- InterPro: IPR005467
- InterPro: IPR009082
- InterPro: IPR012336
- InterPro: IPR012335 [H]
Pfam domain/function: PF02518 HATPase_c; PF00512 HisKA; PF07689 KaiB [H]
EC number: 2.7.3.- [C]
Molecular weight: Translated: 54875; Mature: 54744
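The difference between the two weights (54875 - 54744 = 131) matches the average mass of a single methionine residue (about 131.19 Da), consistent with the mature form lacking only the initiator Met. A rough sketch of an average-mass calculation follows; the residue masses are commonly tabulated average values, and whether the record was computed with exactly this table is an assumption.

```python
# Average residue masses in daltons (free amino acid minus one water).
# Commonly tabulated values; the table used by the source database is unknown.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def average_mass(protein: str) -> float:
    """Average molecular mass (Da) of an unmodified linear peptide."""
    residues = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in residues) + WATER
```

Applying `average_mass` to the translated and mature sequences in this record gives values to compare against the reported weights.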
Theoretical pI: Translated: 9.43; Mature: 9.43
Prosite motif: PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
1.0 %Met (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
0.8 %Met (Mature Protein)
1.2 %Cys+Met (Mature Protein)
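These percentages are residue counts divided by sequence length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in the record; `cys_met_percent` is a hypothetical helper.

```python
def cys_met_percent(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n = len(residues)
    cys = 100.0 * residues.count("C") / n
    met = 100.0 * residues.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }
```

Removing the initiator methionine lowers the Met count by one, which is why the Met and Cys+Met figures differ between the translated and mature forms while the Cys figure does not.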
Secondary structure:
>Translated Secondary Structure
MPALIPRDVVRVRFPSPRLPLSEGTVWGVALTLMAAVLLADILTPASLAVGTVLSASVAF
CCCCCCCCHHEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH
AALGASRRTAWRLTGLAVLANLAAGLWNGVRDGVHPTDLANRAVSILAVLLVGYLTIRAR
HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
EASERAAALQEEERQLRRERALRSLAEDMGGPLGQAEFVERAAAALVRLTGARTVEIGAV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEECCC
DRATLRAPHALACAPGAFPAPSRLNTRIPLEFLAHPVGSGDVWAVDGGRVFLARLRRPAA
CHHHHCCCCCEEECCCCCCCCCCCCCCCCHHHHHCCCCCCCEEEECCCHHHHHHHCCCCC
GDLLVILTAPTTPPNLTSEAIRALQPLLERTALLDDLRANREQLAEQSELLRDLIYAFSH
CCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DLRTPLLANAMNMQAALKGAYGPLPEAYRATLVNGLSANETLLNLADQLLLVARYESGVE
HCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCC
AGEDHLSVNLRDLVLHVVNDLRPHAQARRVTFELHLDGVRVCGNPHDLRRAVQNLLDNAI
CCCHHHCCHHHHHHHHHHHHHCCCHHHHEEEEEEEECCEEECCCHHHHHHHHHHHHHHHE
KFSPPGGTVSLGLLADGDEAVFSVQDEGPGVPPGREGRLFQRFRSGGAGGGTGLGLYLTR
EECCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHH
RIAEAHGGSVRYHRTAQARSLFVLTLPLETRHA
HHHHHCCCCEEEEECCCCCEEEEEEECCCCCCC
>Mature Secondary Structure
PALIPRDVVRVRFPSPRLPLSEGTVWGVALTLMAAVLLADILTPASLAVGTVLSASVAF
CCCCCCCHHEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH
AALGASRRTAWRLTGLAVLANLAAGLWNGVRDGVHPTDLANRAVSILAVLLVGYLTIRAR
HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
EASERAAALQEEERQLRRERALRSLAEDMGGPLGQAEFVERAAAALVRLTGARTVEIGAV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEECCC
DRATLRAPHALACAPGAFPAPSRLNTRIPLEFLAHPVGSGDVWAVDGGRVFLARLRRPAA
CHHHHCCCCCEEECCCCCCCCCCCCCCCCHHHHHCCCCCCCEEEECCCHHHHHHHCCCCC
GDLLVILTAPTTPPNLTSEAIRALQPLLERTALLDDLRANREQLAEQSELLRDLIYAFSH
CCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DLRTPLLANAMNMQAALKGAYGPLPEAYRATLVNGLSANETLLNLADQLLLVARYESGVE
HCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCC
AGEDHLSVNLRDLVLHVVNDLRPHAQARRVTFELHLDGVRVCGNPHDLRRAVQNLLDNAI
CCCHHHCCHHHHHHHHHHHHHCCCHHHHEEEEEEEECCEEECCCHHHHHHHHHHHHHHHE
KFSPPGGTVSLGLLADGDEAVFSVQDEGPGVPPGREGRLFQRFRSGGAGGGTGLGLYLTR
EECCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHH
RIAEAHGGSVRYHRTAQARSLFVLTLPLETRHA
HHHHHCCCCEEEEECCCCCEEEEEEECCCCCCC
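The state strings in the secondary structure prediction can be summarised as per-state percentages. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not define, and takes only the state characters as input, not the residue lines; `state_fractions` is a hypothetical helper.

```python
from collections import Counter


def state_fractions(states: str) -> dict[str, float]:
    """Percentage of H, E, and C characters in a predicted-state string."""
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty state string")
    counts = Counter(s)
    return {k: round(100.0 * counts.get(k, 0) / len(s), 1) for k in "HEC"}


if __name__ == "__main__":
    # Toy input, not taken from the record.
    print(state_fractions("CCHHHHEECC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```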
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA