| Definition | Methylobacterium chloromethanicum CM4, complete genome. |
|---|---|
| Accession | NC_011757 |
| Length | 5,777,908 |
The map label for this gene is atoS [C]
Identifier: 218532673
GI number: 218532673
Start: 5074765
End: 5076822
Strand: Reverse
Name: atoS [C]
Synonym: Mchl_4788
Alternate gene names: 218532673
Gene position: 5076822-5074765 (Counterclockwise)
Preceding gene: 218532674
Following gene: 218532671
Centisome position: 87.87
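The coordinate fields above can be cross-checked against each other. The sketch below is illustrative only: the function and variable names are hypothetical, and the rule that the centisome position is computed from the 5' end of the gene (the larger coordinate on the reverse strand) is an assumption inferred from the numbers in this record, not a documented definition.

```python
GENOME_LENGTH = 5_777_908           # "Length" field of NC_011757
START, END = 5_074_765, 5_076_822   # "Start" and "End" fields
STRAND = "Reverse"                  # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Inclusive length, in bases, of a 1-based [start, end] interval."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome length.

    Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
    """
    five_prime = end if strand == "Reverse" else start
    return round(100 * five_prime / genome_length, 2)


length_bp = gene_length(START, END)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids in the translated protein
position = centisome(START, END, STRAND, GENOME_LENGTH)

print(length_bp, residues, position)  # prints: 2058 685 87.87
```

Under these assumptions the computed values agree with the 2058-base gene sequence, the 685-residue protein, and the centisome position of 87.87 reported in this record.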
GC content: 69.97
Gene sequence:
>2058_bases GTGCGCCGAGGTCTCGTCCTGCTCACCCTCGCAGCCCTGCTGCCGCTGGTGCTGCTCTCCGGGGCACTGTCGTTTCTGCT CCTGCACCAGGAGCAGGCCACGATGCGCTCAGAGGCGGTGCACCAGGTCGAGCGGATGCTCGGCAGCGTCGACCGGGAAC TCTATACCCAGATCGAGCTCCTCAAGGTTCTGGCCCAATCGCCGATACTCGACGGCAACCAGCCGAGTCTGGCTGCCTTC CACGACCTGGCCGAGCGCTTCCAGAGCCAGCTTCCACTCTGGCACCGTATCATCCTCGCCGACGCCGAGGGCCACATGCT GGTGCGAACGGGTGCCTCCTTCGGCACCCCGCTCCCGCCCCTCGTCGACGAGAGGAGCTACAGGCGGGTGCTCGACACGG GTGAGCCTACCATCGGCGACGTCGCGGCCTCGGCCTCCAGCAGCGATCCGCCGCGGGCGAGCTTCCGCGTGCCCGTCCGG CGCGACGGCGCGATCCGCTACGTGCTGACGGGCGTGGTCTCCACCAAGCGCCTGACCGAGCTCCTCGCAACGACCGGCCT CGATCCCGCTTGGCGGCCCTACCTCGTCGACGGCTCCGGCCGCATCGCCGCTAGCCTACGCCGGCCCGACACCGTGGGTC GGCAAGCGATGGCGCCGACCGTCCGCGCCCGGGAAGGCGGCGCGTCGGGCGTCTACGACGGCATCTCGCCGGAAGGCGAG CTGCTGGTCACCGCCTTCCGCAAGTCGGACAGGACCGGATGGTCGGTGCACGTCGCCATCCCGCTGGGCGTCTACAACCA GCCGCTGACGCGGGCCGCCTGGGTCCTGGCCTGCGCCGGGGCCGCCGCGGTCCTGTTGACGGCAGCCTTCATCCTCCTGC TCCGGCGCGAGCTCCGGGCGCAGCGCCGGGAGGCGCTCACCCATGAGCGGGCGGTGCGCATGGAAGCCCTCGGGCGCATG ACCGGCGGTGTCGCGCACGACTTCAACAACCTGCTTATGGTCATCCTGGGCAATCTGGAGATGCTCGGGCGGCGCAACCA GGAGCCGCGCCTGGAGCGCTACGTCACAGCCATCCGCAAGGCGGCCGAGCGCGGCACACACCTCACCCGCGAGCTGCTCG CCTTTTCGCGGGGCCAGGCCAGCCAGTCCGAGGTCGTCGATCTCAATGAGCGGTTGGCCAGTACGCTCACCATGATCCGG CAATCCGTCAGCGGCCACATCCACGTCGAGACCGACCTCGTCCCCGGGCGGCACGCCGTCAGGCTCGACCCGCTGCAGTT CGACCTCGCGCTGCTCAACATCGCCGCCAACGCCCGCGACGCCATGCCGGAGGGCGGCAGCCTGCGGATCGCGACCCGCC GGGCCCTGCTGCCGGGCCGGTCCGGGCGCGAAGGGATCGCGTTGTCCATCAAGGACACGGGCGGCGGCATCCCGCCGGAA GCCCTGCCGCACGTGTTCGAGCCGTTCTTCACGACCAAGGACGTTGGGAAGGGGACCGGCCTCGGCCTGAGCCAGGTCTT CGGCTTCGCCAAGCAGTCGGGCGGCGCGGCCGACATCGAGAGCCGGTCAGGACAGGGCACGACCGTCACCCTGCACTTGC CGCTGGCGCAGGAAGAGGTACCGGCAGCGAGGATGCTTGCGGACGCAGATGGAGCCTCGCCAAGCTCGACCGCGGCCCGG GTGGTCCTGATCGATGACAACGACGAGGTGCGCACCGTGACGGCCTCCTTCCTGGAGGATGCTGGCTTCCGGGTCGAGCA AGCCAACTCTGCGCAGGCCGGGTTGGACCTGCTTGAGCGAAGTGGTGCCGACATCCTGGTGAGCGACTTGGTCATGCCCG GCGGCATGGACGGGCTGGCGTTTGCCAACGAGGCGCGCCGGCGCTGGCCGCACCTGCCGGTGATCCTGGTGTCGGGCTAC 
AGCACCTCGGCGGCGCGGGCGACTGAGCTTGGCTACTCGCTCTACATGAAGCCGTTCGACATGGCCGAGCTCGCCAAGGG CATCCGGGCTCAGCTAGGCCCGAGAGACCGACAGTTGGCGGTGCCGGGACCTAGTTGA
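The GC content field (69.97) is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the record does not state how the database itself computes or rounds the value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, because the sequence in this record is
    broken into space-separated chunks.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# Small illustrative inputs, not taken from the record.
print(gc_content("ATGC"))      # 50.0
print(gc_content("GT GC\n"))   # 75.0
```

Pasting the full 2058-base sequence into `gc_content` and rounding to two decimals would be one way to check the reported figure.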
Upstream 100 bases:
>100_bases CCGGGTCGCACCTCGATTGTGTCAGAAAACGTCGGCTTGAATGGAAAGCTAGGCTTTTTGCACTAGGTCTGGTTCAGCCG AACCATGGGGATCGACGAGG
Downstream 100 bases:
>100_bases CCGCTGCGACGGGAGCAAGCCGGGATAGCCCGACCCGAGGCCACACGCTACTCCCTGCCACTTCAGCCTTGTCCCTCGCC GTCCTCAGCCGGCCTGCGGC
Product: histidine kinase
Products: NA
Alternate protein names: Blue-light-activated histidine kinase; Response regulator [H]
Number of amino acids: Translated: 685; Mature: 685
Protein sequence:
>685_residues MRRGLVLLTLAALLPLVLLSGALSFLLLHQEQATMRSEAVHQVERMLGSVDRELYTQIELLKVLAQSPILDGNQPSLAAF HDLAERFQSQLPLWHRIILADAEGHMLVRTGASFGTPLPPLVDERSYRRVLDTGEPTIGDVAASASSSDPPRASFRVPVR RDGAIRYVLTGVVSTKRLTELLATTGLDPAWRPYLVDGSGRIAASLRRPDTVGRQAMAPTVRAREGGASGVYDGISPEGE LLVTAFRKSDRTGWSVHVAIPLGVYNQPLTRAAWVLACAGAAAVLLTAAFILLLRRELRAQRREALTHERAVRMEALGRM TGGVAHDFNNLLMVILGNLEMLGRRNQEPRLERYVTAIRKAAERGTHLTRELLAFSRGQASQSEVVDLNERLASTLTMIR QSVSGHIHVETDLVPGRHAVRLDPLQFDLALLNIAANARDAMPEGGSLRIATRRALLPGRSGREGIALSIKDTGGGIPPE ALPHVFEPFFTTKDVGKGTGLGLSQVFGFAKQSGGAADIESRSGQGTTVTLHLPLAQEEVPAARMLADADGASPSSTAAR VVLIDDNDEVRTVTASFLEDAGFRVEQANSAQAGLDLLERSGADILVSDLVMPGGMDGLAFANEARRRWPHLPVILVSGY STSAARATELGYSLYMKPFDMAELAKGIRAQLGPRDRQLAVPGPS
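The protein sequence can be related back to the gene sequence by translation. Note that the gene begins with GTG rather than ATG: GTG encodes valine inside a reading frame but is read as methionine when it serves as a bacterial start codon, which is why the protein begins with M. The sketch below assumes the standard genetic code and treats ATG, GTG, and TTG as start codons; the names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
# Standard genetic code, built in TCAG order for each codon position.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")       # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                 # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("GTGCGCCGAGGTCTCGTC"))  # MRRGLV
```

The output matches the first six residues of the protein sequence above (MRRGLV).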
Sequences:
>Translated_685_residues MRRGLVLLTLAALLPLVLLSGALSFLLLHQEQATMRSEAVHQVERMLGSVDRELYTQIELLKVLAQSPILDGNQPSLAAF HDLAERFQSQLPLWHRIILADAEGHMLVRTGASFGTPLPPLVDERSYRRVLDTGEPTIGDVAASASSSDPPRASFRVPVR RDGAIRYVLTGVVSTKRLTELLATTGLDPAWRPYLVDGSGRIAASLRRPDTVGRQAMAPTVRAREGGASGVYDGISPEGE LLVTAFRKSDRTGWSVHVAIPLGVYNQPLTRAAWVLACAGAAAVLLTAAFILLLRRELRAQRREALTHERAVRMEALGRM TGGVAHDFNNLLMVILGNLEMLGRRNQEPRLERYVTAIRKAAERGTHLTRELLAFSRGQASQSEVVDLNERLASTLTMIR QSVSGHIHVETDLVPGRHAVRLDPLQFDLALLNIAANARDAMPEGGSLRIATRRALLPGRSGREGIALSIKDTGGGIPPE ALPHVFEPFFTTKDVGKGTGLGLSQVFGFAKQSGGAADIESRSGQGTTVTLHLPLAQEEVPAARMLADADGASPSSTAAR VVLIDDNDEVRTVTASFLEDAGFRVEQANSAQAGLDLLERSGADILVSDLVMPGGMDGLAFANEARRRWPHLPVILVSGY STSAARATELGYSLYMKPFDMAELAKGIRAQLGPRDRQLAVPGPS >Mature_685_residues MRRGLVLLTLAALLPLVLLSGALSFLLLHQEQATMRSEAVHQVERMLGSVDRELYTQIELLKVLAQSPILDGNQPSLAAF HDLAERFQSQLPLWHRIILADAEGHMLVRTGASFGTPLPPLVDERSYRRVLDTGEPTIGDVAASASSSDPPRASFRVPVR RDGAIRYVLTGVVSTKRLTELLATTGLDPAWRPYLVDGSGRIAASLRRPDTVGRQAMAPTVRAREGGASGVYDGISPEGE LLVTAFRKSDRTGWSVHVAIPLGVYNQPLTRAAWVLACAGAAAVLLTAAFILLLRRELRAQRREALTHERAVRMEALGRM TGGVAHDFNNLLMVILGNLEMLGRRNQEPRLERYVTAIRKAAERGTHLTRELLAFSRGQASQSEVVDLNERLASTLTMIR QSVSGHIHVETDLVPGRHAVRLDPLQFDLALLNIAANARDAMPEGGSLRIATRRALLPGRSGREGIALSIKDTGGGIPPE ALPHVFEPFFTTKDVGKGTGLGLSQVFGFAKQSGGAADIESRSGQGTTVTLHLPLAQEEVPAARMLADADGASPSSTAAR VVLIDDNDEVRTVTASFLEDAGFRVEQANSAQAGLDLLERSGADILVSDLVMPGGMDGLAFANEARRRWPHLPVILVSGY STSAARATELGYSLYMKPFDMAELAKGIRAQLGPRDRQLAVPGPS
Specific function: Photosensitive kinase and response regulator that is involved in increased bacterial virulence upon exposure to light [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Integral Membrane Protein. Inner Membrane [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 response regulatory domain [H]
Homologues:
Organism=Escherichia coli, GI1788549, Length=245, Percent_Identity=31.4285714285714, Blast_Score=103, Evalue=3e-23
Organism=Escherichia coli, GI1790436, Length=270, Percent_Identity=29.2592592592593, Blast_Score=99, Evalue=1e-21
Organism=Escherichia coli, GI1790300, Length=283, Percent_Identity=28.9752650176678, Blast_Score=79, Evalue=1e-15
Organism=Escherichia coli, GI48994928, Length=224, Percent_Identity=29.9107142857143, Blast_Score=75, Evalue=1e-14
Organism=Escherichia coli, GI1788713, Length=280, Percent_Identity=24.2857142857143, Blast_Score=72, Evalue=9e-14
Organism=Escherichia coli, GI145693157, Length=317, Percent_Identity=23.0283911671924, Blast_Score=72, Evalue=1e-13
Organism=Escherichia coli, GI87081816, Length=365, Percent_Identity=26.027397260274, Blast_Score=69, Evalue=7e-13
Organism=Escherichia coli, GI1786600, Length=228, Percent_Identity=26.3157894736842, Blast_Score=65, Evalue=2e-11
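The homologue entries follow a regular `key=value` pattern, so they can be parsed into structured records for sorting or filtering. The following is a sketch under that assumption; the regular expression and the names `HOMOLOGUE_RE` and `parse_homologues` are hypothetical and are not part of any database tooling.

```python
import re

# One homologue entry: organism, GI identifier, alignment length,
# percent identity, BLAST score, and E-value.
HOMOLOGUE_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"(?P<gi>GI\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in HOMOLOGUE_RE.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


# The first two entries from the Homologues field of this record.
sample = (
    "Organism=Escherichia coli, GI1788549, Length=245, "
    "Percent_Identity=31.4285714285714, Blast_Score=103, Evalue=3e-23, "
    "Organism=Escherichia coli, GI1790436, Length=270, "
    "Percent_Identity=29.2592592592593, Blast_Score=99, Evalue=1e-21,"
)
hits = parse_homologues(sample)
best = min(hits, key=lambda h: h["evalue"])
print(best["gi"], best["score"])  # GI1788549 103
```

Because the pattern tolerates any whitespace between fields, it should work whether the entries sit on one line or on separate lines.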
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594
- InterPro: IPR011006
- InterPro: IPR001610
- InterPro: IPR000014
- InterPro: IPR000700
- InterPro: IPR013767
- InterPro: IPR004358
- InterPro: IPR003661
- InterPro: IPR005467
- InterPro: IPR009082
- InterPro: IPR001789 [H]
Pfam domain/function: PF02518 HATPase_c; PF00512 HisKA; PF00989 PAS; PF00072 Response_reg [H]
EC number: 2.7.13.3 [H]
Molecular weight: Translated: 73812; Mature: 73812
Theoretical pI: Translated: 8.88; Mature: 8.88
Prosite motif: PS50110 RESPONSE_REGULATORY; PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.1 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.1 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
2.5 %Cys+Met (Mature Protein)
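The reported Cys and Met percentages (0.1 and 2.3) do not add up to the reported combined value (2.5). One plausible explanation is that each figure is rounded independently from raw residue counts. The sketch below illustrates this; the function name `cys_met_content` is hypothetical, and the demonstration uses a synthetic 685-residue sequence with 1 Cys and 16 Met, counts that are consistent with the reported percentages but should be re-checked against the protein sequence in this record.

```python
def cys_met_content(protein: str) -> dict[str, float]:
    """Percentages of Cys, Met, and Cys+Met, each rounded to one decimal."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    n = len(seq)
    cys = seq.count("C")
    met = seq.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# Synthetic stand-in (an assumption): 1 Cys + 16 Met + 668 Ala = 685 residues.
synthetic = "C" + "M" * 16 + "A" * 668
content = cys_met_content(synthetic)
print(content)  # {'Cys': 0.1, 'Met': 2.3, 'Cys+Met': 2.5}
```

Here 1/685 is about 0.15%, 16/685 is about 2.34%, and 17/685 is about 2.48%, so the three values round to 0.1, 2.3, and 2.5 respectively.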
Secondary structure:
>Translated Secondary Structure MRRGLVLLTLAALLPLVLLSGALSFLLLHQEQATMRSEAVHQVERMLGSVDRELYTQIEL CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LKVLAQSPILDGNQPSLAAFHDLAERFQSQLPLWHRIILADAEGHMLVRTGASFGTPLPP HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCEEEEECCCCCCCCCC LVDERSYRRVLDTGEPTIGDVAASASSSDPPRASFRVPVRRDGAIRYVLTGVVSTKRLTE CCCCHHHHHHHHCCCCCHHHHHHCCCCCCCCCCCEECCCCCCCCEEEHHHHHHHHHHHHH LLATTGLDPAWRPYLVDGSGRIAASLRRPDTVGRQAMAPTVRAREGGASGVYDGISPEGE HHHHCCCCCCCCCEEECCCCCEEEECCCCCHHCHHHHCCCHHCCCCCCCCCCCCCCCCCC LLVTAFRKSDRTGWSVHVAIPLGVYNQPLTRAAWVLACAGAAAVLLTAAFILLLRRELRA EEEEEECCCCCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH QRREALTHERAVRMEALGRMTGGVAHDFNNLLMVILGNLEMLGRRNQEPRLERYVTAIRK HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHCCCCCCHHHHHHHHHHHH AAERGTHLTRELLAFSRGQASQSEVVDLNERLASTLTMIRQSVSGHIHVETDLVPGRHAV HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEE RLDPLQFDLALLNIAANARDAMPEGGSLRIATRRALLPGRSGREGIALSIKDTGGGIPPE EECCHHHHHHHHHHHCCHHHCCCCCCCEEEEEHHHHCCCCCCCCCEEEEEECCCCCCCHH ALPHVFEPFFTTKDVGKGTGLGLSQVFGFAKQSGGAADIESRSGQGTTVTLHLPLAQEEV HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCHHHC PAARMLADADGASPSSTAARVVLIDDNDEVRTVTASFLEDAGFRVEQANSAQAGLDLLER CHHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHH SGADILVSDLVMPGGMDGLAFANEARRRWPHLPVILVSGYSTSAARATELGYSLYMKPFD CCCCEEHHHHCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHCHHEECCCCC MAELAKGIRAQLGPRDRQLAVPGPS HHHHHHHHHHHCCCCCCEEECCCCC >Mature Secondary Structure MRRGLVLLTLAALLPLVLLSGALSFLLLHQEQATMRSEAVHQVERMLGSVDRELYTQIEL CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LKVLAQSPILDGNQPSLAAFHDLAERFQSQLPLWHRIILADAEGHMLVRTGASFGTPLPP HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCEEEEECCCCCCCCCC LVDERSYRRVLDTGEPTIGDVAASASSSDPPRASFRVPVRRDGAIRYVLTGVVSTKRLTE CCCCHHHHHHHHCCCCCHHHHHHCCCCCCCCCCCEECCCCCCCCEEEHHHHHHHHHHHHH LLATTGLDPAWRPYLVDGSGRIAASLRRPDTVGRQAMAPTVRAREGGASGVYDGISPEGE HHHHCCCCCCCCCEEECCCCCEEEECCCCCHHCHHHHCCCHHCCCCCCCCCCCCCCCCCC 
LLVTAFRKSDRTGWSVHVAIPLGVYNQPLTRAAWVLACAGAAAVLLTAAFILLLRRELRA EEEEEECCCCCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH QRREALTHERAVRMEALGRMTGGVAHDFNNLLMVILGNLEMLGRRNQEPRLERYVTAIRK HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHCCCCCCHHHHHHHHHHHH AAERGTHLTRELLAFSRGQASQSEVVDLNERLASTLTMIRQSVSGHIHVETDLVPGRHAV HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEE RLDPLQFDLALLNIAANARDAMPEGGSLRIATRRALLPGRSGREGIALSIKDTGGGIPPE EECCHHHHHHHHHHHCCHHHCCCCCCCEEEEEHHHHCCCCCCCCCEEEEEECCCCCCCHH ALPHVFEPFFTTKDVGKGTGLGLSQVFGFAKQSGGAADIESRSGQGTTVTLHLPLAQEEV HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCHHHC PAARMLADADGASPSSTAARVVLIDDNDEVRTVTASFLEDAGFRVEQANSAQAGLDLLER CHHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHH SGADILVSDLVMPGGMDGLAFANEARRRWPHLPVILVSGYSTSAARATELGYSLYMKPFD CCCCEEHHHHCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHCHHEECCCCC MAELAKGIRAQLGPRDRQLAVPGPS HHHHHHHHHHHCCCCCCEEECCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA