Definition: Geobacter bemidjiensis Bem chromosome, complete genome.
Accession: NC_011146
Length: 4,615,150

The map label for this gene is atoS [C]

Identifier: 197117185

GI number: 197117185

Start: 930183

End: 932798

Strand: Direct

Name: atoS [C]

Synonym: Gbem_0793

Alternate gene names: 197117185

Gene position: 930183-932798 (Clockwise)

Preceding gene: 308535193

Following gene: 197117186

Centisome position: 20.15
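
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the gene start expressed as a percentage of the chromosome length (the record does not state its formula; the function names are illustrative only). Under those assumptions the listed values are reproduced.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(930183, 932798)
position = centisome(930183, 4615150)
print(length)              # 2616
print(f"{position:.2f}")   # 20.15
```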

GC content: 60.97

Gene sequence:

>2616_bases
ATGGATCGCTTTCTTTCACTTAAACTGCGCACCCAGATGATGATCATGATCCTGTTAATGGCTATAGGATCGATGGGCGT
CATCCTGTATTCCGCGGAAAGGCAGCGCGAAGCCGATTTCCGTGAAGCTGCACACCTCTCCCTGAGCCTCGCAACTGCGG
TGCATAACGACCAGAATGTACTGCTGTCGGGCGCCGAGCAACTTCTGAGCACCCTTTCCTATGTCGAGGCCGTGAGAAAG
AGGGATGCCGCCGAGGTGAACCGGATCCTGGCCGGACTGCTGAAAAAATCCCCCCAGATCAGCAATCTCCTCATGATCGC
ACCTGACGGAACCATCTGGGCCTCGGCGGTGCCGATGGCAAAGCCGATAAACGCGGCGGAGAGGAGGTACTTCAAGGCGG
CCCTGCAAAGCGGCCGTTTCTCCTCCGGGGAGTACACCATAGGCAAGGTACTCTCCAAACCGGCGCTCAGCTTCGGCTAT
CCCATTTTGGATGAAAACGGAAAGGTGCAGGACGTCGCGGTAGCGGCCTTCACGCTCACCAACTACGAGAAGCTGTTGGA
CTCCCGCAACCTCCCCCGCAACACCTCGCTTTTGCTGCTGGACCACAAAGGGACCGTCCTCTTCAGCAACAAGCCCCGGG
AACTGGTCGGCAAACAGGACCGGCCCGACCTCTTCGCCGCCATGAAAAAGGAAGATCTGGGCAGCTTCAGCGCCAAGGAC
ATCAACGGCATCCAGAAGGTGATGGCCTACCACCAGGTTTACCTCAAGGGGGAGAGCGAGCCGTACATGTACGTCCGTGC
CGGCATCGACAAGGAGTGGGTCGACAAAAAGAGCTCGCTGCCGTTGATGGCGAACATCTCGGCCATGGCGGCCATCGTCT
TGCTCACGGCGCTGATGGCGCTGTACATAAGCAAGCGGGCGATTCTCGACAAGGTGCGGGCGCTCAGGGAGGGGACCCAG
AAGATCTCGGCCGGCGACCTGACCGTCCGGGTGAACGACAACGTCGCCGGAGGGGAGCTGGGCGAGCTCGGAGCCGCTTT
CGACAGCATGGCGCAGCAGCTGTCGGTCGACATCGAGAGAAGAAAGCGCGCCGAGGCAGAGGCCCTCGCCAAGGGCGAGG
AACTCGACAGGTACTTCAACAACAGCCTGGACCTCCTCTGCATCGCCGACACAAGCGGCTGTTTCCGGCGCCTCAACCCG
GCTTGGGAAACGGCTCTCGGGTTTCCCCTGGACCGGCTTGTGGGGCAGAACTTCCTGGAGATGGTGCATCCCGATGACGT
GGCTGCCACGGCCGATGCTGTCGCGGTATTGGAAAGAAATGGGGAGATCCGGAGTTTCACCAACCGGTTCCTGCACGCCG
ACGGGTCCTACCGCTCGATCGAGTGGCACTCGGTGCGGCCGGATGGAAATATCATCTTCGGTGCCGCCAGGGACGTGACC
GACAAGATCCTCGCTGAGGAGGAGAAGCTCCAGCTCGAGCGCCAGCTCCTGCACATGCAGAAACTGGAGAGCCTGGGAGT
GCTGGCGGGGGGGATAGCGCACGATTTCAACAACATCCTGATGGCCATCATGGGGAACGCCGAACTCGCCTTGATGCGCA
TCGACAAGGACTCTCCGGCGGTGGACAACCTGCAGAAGATCGAGCAGGCATCCACCCGCGCCGCGGACCTGGCCCGTCAG
ATGCTGGCCTACTCCGGTAAAGGGAGGTTCGTGGTGGGAAATACCGACCTGAACCGCCTGCTGCAAGAGATGCTGCACAT
GCTGGAGCTGTCCGTCTCCAAGAGCGCGATTGTGAGGCTGGACCTGCACCCTTCTCTGCTGCAGATAGAAGCCGATGCGA
CCCAGATCAGGCAGGTAGTGATGAACCTGGTCATCAACGCCTCCGAGGCCCTCGGCGACGGCCGCGGCTTCATCTCGATC
AAAACCCTGAGCCTTAAGTGCGACCGCAGCTACCTCAAAAAGGCGTGGCTCGACGTGGACGTACCGGAAGGCGATTACGT
CTGCCTGGAGGTGACCGACACCGGGTGCGGCATGGATAGCGCCACCAGGAGCAAGATTTTCGACCCCTTCTTCACCACGA
AATTCACCGGGCGCGGGCTGGGCATGGCCGCGACGCTCGGCATCGTAAGGGGGCACCGCGGGGCGATTATCGTCGAGAGC
GAGCCGAACCAGGGGAGTTGCTTCAGGGTTCTTTTCCCTGCGGCCGGCAGCGCAGCCGAGCTGTCCGAAGCGCCGTGCAG
CCAGGAGAACTGGAGGGGGCACGGCACCTTGCTGCTTGTCGACGACGAGGAAGCCGTGCGCTGCATCGCCTCGGAAATGC
TGCAGCAACTAGGGTTCGACCTGCTGACGGCATCCGACGGCAGGACCGCCATCGAGCTCTTCTCGGCCAATCCCGGGATC
GGGGTCGTCCTTCTCGACCTCACGATGCCGCAGATGGACGGCGAGCGCTGCCTTGTCGAACTGCAAAAAATAAGGCCGGA
CGTGAAAGTCGTGATGTCGAGCGGCTACACCGAATACGAAGTGACGCAAAAACTGAGAGGCAAACAGCTGGCGGGGTTCA
TCCAGAAGCCGTACAAGCTGAGCGCGCTGGTGGAAGCATTGAGGCCGATTGCCTGA
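
The GC content field could be rechecked against the wrapped sequence block above. This is a sketch only: the helper names are hypothetical, the short example string is made up for illustration, and the record does not say how its own value was computed.

```python
def read_fasta_body(text: str) -> str:
    """Join wrapped sequence lines, skipping blank lines and '>' header lines."""
    lines = [ln.strip() for ln in text.splitlines()]
    return "".join(ln for ln in lines if ln and not ln.startswith(">")).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same layout as the record (not the real gene).
example = """>8_bases
ATGG
CCTA
"""
seq = read_fasta_body(example)
print(len(seq), f"{gc_percent(seq):.2f}")  # 8 50.00
```

Pasting the 2616-base block into `example` in place of the toy input should give a value comparable to the GC content field.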

Upstream 100 bases:

>100_bases
CATAATAAATTTTTATATCTTAAATCTCCTCTTCCCGCCGAAACAGTAGAGATCATCCCAGGATCATGTAGCAGCAATAA
AGTATGTGACAGGAGCTGTC

Downstream 100 bases:

>100_bases
GCGCATCCTGACATAGCTAAACCTGAAACCCCGAGGCGGCAATGCACCGGGGTTTTTTTTATGCCTGCCCACCGCCGAGG
TTTCTCTGTCAGCAACGTGC

Product: PAS domain-containing sensor histidine kinase response regulator

Products: NA

Alternate protein names: Blue-light-activated histidine kinase; Response regulator [H]

Number of amino acids: Translated: 871; Mature: 871
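
The residue count is consistent with the gene length: 2616 bases are 872 codons, and the final codon (TGA) is a stop. A minimal sketch of that check, with a hypothetical function name:

```python
def expected_residues(cds_bases: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


print(expected_residues(2616))  # 871
```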

Protein sequence:

>871_residues
MDRFLSLKLRTQMMIMILLMAIGSMGVILYSAERQREADFREAAHLSLSLATAVHNDQNVLLSGAEQLLSTLSYVEAVRK
RDAAEVNRILAGLLKKSPQISNLLMIAPDGTIWASAVPMAKPINAAERRYFKAALQSGRFSSGEYTIGKVLSKPALSFGY
PILDENGKVQDVAVAAFTLTNYEKLLDSRNLPRNTSLLLLDHKGTVLFSNKPRELVGKQDRPDLFAAMKKEDLGSFSAKD
INGIQKVMAYHQVYLKGESEPYMYVRAGIDKEWVDKKSSLPLMANISAMAAIVLLTALMALYISKRAILDKVRALREGTQ
KISAGDLTVRVNDNVAGGELGELGAAFDSMAQQLSVDIERRKRAEAEALAKGEELDRYFNNSLDLLCIADTSGCFRRLNP
AWETALGFPLDRLVGQNFLEMVHPDDVAATADAVAVLERNGEIRSFTNRFLHADGSYRSIEWHSVRPDGNIIFGAARDVT
DKILAEEEKLQLERQLLHMQKLESLGVLAGGIAHDFNNILMAIMGNAELALMRIDKDSPAVDNLQKIEQASTRAADLARQ
MLAYSGKGRFVVGNTDLNRLLQEMLHMLELSVSKSAIVRLDLHPSLLQIEADATQIRQVVMNLVINASEALGDGRGFISI
KTLSLKCDRSYLKKAWLDVDVPEGDYVCLEVTDTGCGMDSATRSKIFDPFFTTKFTGRGLGMAATLGIVRGHRGAIIVES
EPNQGSCFRVLFPAAGSAAELSEAPCSQENWRGHGTLLLVDDEEAVRCIASEMLQQLGFDLLTASDGRTAIELFSANPGI
GVVLLDLTMPQMDGERCLVELQKIRPDVKVVMSSGYTEYEVTQKLRGKQLAGFIQKPYKLSALVEALRPIA

Sequences:

>Translated_871_residues
MDRFLSLKLRTQMMIMILLMAIGSMGVILYSAERQREADFREAAHLSLSLATAVHNDQNVLLSGAEQLLSTLSYVEAVRK
RDAAEVNRILAGLLKKSPQISNLLMIAPDGTIWASAVPMAKPINAAERRYFKAALQSGRFSSGEYTIGKVLSKPALSFGY
PILDENGKVQDVAVAAFTLTNYEKLLDSRNLPRNTSLLLLDHKGTVLFSNKPRELVGKQDRPDLFAAMKKEDLGSFSAKD
INGIQKVMAYHQVYLKGESEPYMYVRAGIDKEWVDKKSSLPLMANISAMAAIVLLTALMALYISKRAILDKVRALREGTQ
KISAGDLTVRVNDNVAGGELGELGAAFDSMAQQLSVDIERRKRAEAEALAKGEELDRYFNNSLDLLCIADTSGCFRRLNP
AWETALGFPLDRLVGQNFLEMVHPDDVAATADAVAVLERNGEIRSFTNRFLHADGSYRSIEWHSVRPDGNIIFGAARDVT
DKILAEEEKLQLERQLLHMQKLESLGVLAGGIAHDFNNILMAIMGNAELALMRIDKDSPAVDNLQKIEQASTRAADLARQ
MLAYSGKGRFVVGNTDLNRLLQEMLHMLELSVSKSAIVRLDLHPSLLQIEADATQIRQVVMNLVINASEALGDGRGFISI
KTLSLKCDRSYLKKAWLDVDVPEGDYVCLEVTDTGCGMDSATRSKIFDPFFTTKFTGRGLGMAATLGIVRGHRGAIIVES
EPNQGSCFRVLFPAAGSAAELSEAPCSQENWRGHGTLLLVDDEEAVRCIASEMLQQLGFDLLTASDGRTAIELFSANPGI
GVVLLDLTMPQMDGERCLVELQKIRPDVKVVMSSGYTEYEVTQKLRGKQLAGFIQKPYKLSALVEALRPIA
>Mature_871_residues
MDRFLSLKLRTQMMIMILLMAIGSMGVILYSAERQREADFREAAHLSLSLATAVHNDQNVLLSGAEQLLSTLSYVEAVRK
RDAAEVNRILAGLLKKSPQISNLLMIAPDGTIWASAVPMAKPINAAERRYFKAALQSGRFSSGEYTIGKVLSKPALSFGY
PILDENGKVQDVAVAAFTLTNYEKLLDSRNLPRNTSLLLLDHKGTVLFSNKPRELVGKQDRPDLFAAMKKEDLGSFSAKD
INGIQKVMAYHQVYLKGESEPYMYVRAGIDKEWVDKKSSLPLMANISAMAAIVLLTALMALYISKRAILDKVRALREGTQ
KISAGDLTVRVNDNVAGGELGELGAAFDSMAQQLSVDIERRKRAEAEALAKGEELDRYFNNSLDLLCIADTSGCFRRLNP
AWETALGFPLDRLVGQNFLEMVHPDDVAATADAVAVLERNGEIRSFTNRFLHADGSYRSIEWHSVRPDGNIIFGAARDVT
DKILAEEEKLQLERQLLHMQKLESLGVLAGGIAHDFNNILMAIMGNAELALMRIDKDSPAVDNLQKIEQASTRAADLARQ
MLAYSGKGRFVVGNTDLNRLLQEMLHMLELSVSKSAIVRLDLHPSLLQIEADATQIRQVVMNLVINASEALGDGRGFISI
KTLSLKCDRSYLKKAWLDVDVPEGDYVCLEVTDTGCGMDSATRSKIFDPFFTTKFTGRGLGMAATLGIVRGHRGAIIVES
EPNQGSCFRVLFPAAGSAAELSEAPCSQENWRGHGTLLLVDDEEAVRCIASEMLQQLGFDLLTASDGRTAIELFSANPGI
GVVLLDLTMPQMDGERCLVELQKIRPDVKVVMSSGYTEYEVTQKLRGKQLAGFIQKPYKLSALVEALRPIA

Specific function: Photosensitive kinase and response regulator that is involved in increased bacterial virulence upon exposure to light [H]

COG id: COG0642

COG function: function code T; Signal transduction histidine kinase

Gene ontology:

Cell location: Integral Membrane Protein. Inner Membrane [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 response regulatory domain [H]

Homologues:

Organism=Escherichia coli, GI1788549, Length=398, Percent_Identity=26.8844221105528, Blast_Score=118, Evalue=1e-27
Organism=Escherichia coli, GI1790436, Length=247, Percent_Identity=29.9595141700405, Blast_Score=99, Evalue=1e-21
Organism=Escherichia coli, GI48994928, Length=549, Percent_Identity=22.9508196721311, Blast_Score=94, Evalue=3e-20
Organism=Escherichia coli, GI87081816, Length=368, Percent_Identity=25.2717391304348, Blast_Score=85, Evalue=2e-17
Organism=Escherichia coli, GI1790300, Length=272, Percent_Identity=26.1029411764706, Blast_Score=72, Evalue=2e-13
Organism=Escherichia coli, GI1786912, Length=248, Percent_Identity=28.2258064516129, Blast_Score=65, Evalue=3e-11
Organism=Escherichia coli, GI145693157, Length=291, Percent_Identity=25.085910652921, Blast_Score=64, Evalue=3e-11
Organism=Escherichia coli, GI1788550, Length=108, Percent_Identity=36.1111111111111, Blast_Score=63, Evalue=8e-11
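
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A possible way to read one line into typed values is sketched below; the function name and the `GI` dictionary key are assumptions for illustration, not part of the record format.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed numeric fields."""
    record = {}
    # A trailing comma, if present, is dropped before splitting.
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1788549, Length=398, "
        "Percent_Identity=26.8844221105528, Blast_Score=118, Evalue=1e-27,")
hit = parse_homologue(line)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 2), hit["Evalue"])
```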

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003594
- InterPro:   IPR011006
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR000700
- InterPro:   IPR013767
- InterPro:   IPR004358
- InterPro:   IPR003661
- InterPro:   IPR005467
- InterPro:   IPR009082
- InterPro:   IPR001789 [H]

Pfam domain/function: PF02518 HATPase_c; PF00512 HisKA; PF00989 PAS; PF00072 Response_reg [H]

EC number: 2.7.13.3 [H]

Molecular weight: Translated: 95738; Mature: 95738

Theoretical pI: Translated: 6.24; Mature: 6.24

Prosite motif: PS50885 HAMP; PS50112 PAS; PS50110 RESPONSE_REGULATORY; PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)
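
The Cys/Met percentages are residue counts divided by protein length. A minimal sketch with a hypothetical function name and a made-up ten-residue input:

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M) and their sum in a one-letter protein sequence."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {"Cys": cys, "Met": met, "Cys+Met": cys + met}


# Toy input, not the real protein: one Cys and one Met in ten residues.
print(cys_met_content("MCAAAAAAAA"))
```

Summing unrounded percentages and rounding afterwards, as this sketch allows, would be one way for the combined figure (4.5) to differ from the sum of the rounded Cys and Met figures (1.0 + 3.4).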

Secondary structure:

>Translated Secondary Structure
MDRFLSLKLRTQMMIMILLMAIGSMGVILYSAERQREADFREAAHLSLSLATAVHNDQNV
CCCEEEHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHEEEHHHHCCCCHH
LLSGAEQLLSTLSYVEAVRKRDAAEVNRILAGLLKKSPQISNLLMIAPDGTIWASAVPMA
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCEEECCCCCC
KPINAAERRYFKAALQSGRFSSGEYTIGKVLSKPALSFGYPILDENGKVQDVAVAAFTLT
CCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCHHHCCCCEECCCCCEEHEEEEEHHHH
NYEKLLDSRNLPRNTSLLLLDHKGTVLFSNKPRELVGKQDRPDLFAAMKKEDLGSFSAKD
HHHHHHHCCCCCCCCEEEEEECCCEEEECCCCHHHCCCCCCCHHHHHHHHHHCCCCCCHH
INGIQKVMAYHQVYLKGESEPYMYVRAGIDKEWVDKKSSLPLMANISAMAAIVLLTALMA
HHHHHHHHHHHHHEEECCCCCEEEEEECCCHHHHCCCCCCCCEECHHHHHHHHHHHHHHH
LYISKRAILDKVRALREGTQKISAGDLTVRVNDNVAGGELGELGAAFDSMAQQLSVDIER
HHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHH
RKRAEAEALAKGEELDRYFNNSLDLLCIADTSGCFRRLNPAWETALGFPLDRLVGQNFLE
HHHHHHHHHHCHHHHHHHHCCCCCEEEEECCHHHHHHCCHHHHHHHCCCHHHHHHHHHHH
MVHPDDVAATADAVAVLERNGEIRSFTNRFLHADGSYRSIEWHSVRPDGNIIFGAARDVT
HCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEECCHHHH
DKILAEEEKLQLERQLLHMQKLESLGVLAGGIAHDFNNILMAIMGNAELALMRIDKDSPA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCC
VDNLQKIEQASTRAADLARQMLAYSGKGRFVVGNTDLNRLLQEMLHMLELSVSKSAIVRL
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHCCCCCEEEEE
DLHPSLLQIEADATQIRQVVMNLVINASEALGDGRGFISIKTLSLKCDRSYLKKAWLDVD
ECCCCEEEEECCHHHHHHHHHHHHHCCHHHCCCCCCEEEEEEEEEECCHHHHHHHHCEEE
VPEGDYVCLEVTDTGCGMDSATRSKIFDPFFTTKFTGRGLGMAATLGIVRGHRGAIIVES
CCCCCEEEEEEECCCCCCCCHHHHHCCCCHHCCEECCCCCCHHHHHHHHCCCCCEEEEEC
EPNQGSCFRVLFPAAGSAAELSEAPCSQENWRGHGTLLLVDDEEAVRCIASEMLQQLGFD
CCCCCCEEEEEECCCCCHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCCE
LLTASDGRTAIELFSANPGIGVVLLDLTMPQMDGERCLVELQKIRPDVKVVMSSGYTEYE
EEECCCCCEEEEEEECCCCCEEEEEEEECCCCCCHHHHHHHHHCCCCCEEHHHCCCCHHH
VTQKLRGKQLAGFIQKPYKLSALVEALRPIA
HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC
>Mature Secondary Structure
MDRFLSLKLRTQMMIMILLMAIGSMGVILYSAERQREADFREAAHLSLSLATAVHNDQNV
CCCEEEHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHEEEHHHHCCCCHH
LLSGAEQLLSTLSYVEAVRKRDAAEVNRILAGLLKKSPQISNLLMIAPDGTIWASAVPMA
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCEEECCCCCC
KPINAAERRYFKAALQSGRFSSGEYTIGKVLSKPALSFGYPILDENGKVQDVAVAAFTLT
CCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHCCCHHHCCCCEECCCCCEEHEEEEEHHHH
NYEKLLDSRNLPRNTSLLLLDHKGTVLFSNKPRELVGKQDRPDLFAAMKKEDLGSFSAKD
HHHHHHHCCCCCCCCEEEEEECCCEEEECCCCHHHCCCCCCCHHHHHHHHHHCCCCCCHH
INGIQKVMAYHQVYLKGESEPYMYVRAGIDKEWVDKKSSLPLMANISAMAAIVLLTALMA
HHHHHHHHHHHHHEEECCCCCEEEEEECCCHHHHCCCCCCCCEECHHHHHHHHHHHHHHH
LYISKRAILDKVRALREGTQKISAGDLTVRVNDNVAGGELGELGAAFDSMAQQLSVDIER
HHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHH
RKRAEAEALAKGEELDRYFNNSLDLLCIADTSGCFRRLNPAWETALGFPLDRLVGQNFLE
HHHHHHHHHHCHHHHHHHHCCCCCEEEEECCHHHHHHCCHHHHHHHCCCHHHHHHHHHHH
MVHPDDVAATADAVAVLERNGEIRSFTNRFLHADGSYRSIEWHSVRPDGNIIFGAARDVT
HCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEECCHHHH
DKILAEEEKLQLERQLLHMQKLESLGVLAGGIAHDFNNILMAIMGNAELALMRIDKDSPA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCC
VDNLQKIEQASTRAADLARQMLAYSGKGRFVVGNTDLNRLLQEMLHMLELSVSKSAIVRL
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHCCCCCEEEEE
DLHPSLLQIEADATQIRQVVMNLVINASEALGDGRGFISIKTLSLKCDRSYLKKAWLDVD
ECCCCEEEEECCHHHHHHHHHHHHHCCHHHCCCCCCEEEEEEEEEECCHHHHHHHHCEEE
VPEGDYVCLEVTDTGCGMDSATRSKIFDPFFTTKFTGRGLGMAATLGIVRGHRGAIIVES
CCCCCEEEEEEECCCCCCCCHHHHHCCCCHHCCEECCCCCCHHHHHHHHCCCCCEEEEEC
EPNQGSCFRVLFPAAGSAAELSEAPCSQENWRGHGTLLLVDDEEAVRCIASEMLQQLGFD
CCCCCCEEEEEECCCCCHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCCE
LLTASDGRTAIELFSANPGIGVVLLDLTMPQMDGERCLVELQKIRPDVKVVMSSGYTEYE
EEECCCCCEEEEEEECCCCCEEEEEEEECCCCCCHHHHHHHHHCCCCCEEHHHCCCCHHH
VTQKLRGKQLAGFIQKPYKLSALVEALRPIA
HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC
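
The state strings above can be summarised as a composition. The sketch below assumes the common convention H = helix, E = strand, C = coil, which the record itself does not define; the function name is hypothetical.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a string of H/E/C codes."""
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state codes: {sorted(unknown)}")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("empty state string")
    return {state: 100.0 * counts.get(state, 0) / total for state in "HEC"}


# Toy input; for the record, join every second line (the state lines) first.
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```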

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA