Definition Mesorhizobium loti MAFF303099 plasmid pMLb, complete sequence.
Accession NC_002682
Length 208,315

Click here to switch to the map view.

The map label for this gene is atoS [C]

Identifier: 13488499

GI number: 13488499

Start: 126527

End: 128341

Strand: Reverse

Name: atoS [C]

Synonym: mll9655

Alternate gene names: 13488499

Gene position: 128341-126527 (Counterclockwise)

Preceding gene: 13488500

Following gene: 13488498

Centisome position: 61.61

GC content: 59.17

Gene sequence:

>1815_bases
GTGGCCCGTTTAGATCGGGGACCTCGAATGACTGAACTACCGCAGGCCGACGTTGCCGCCGTCAAGAAACCGGCAATGGT
CCTGGGCGGTAACCTCGACTATCAGGACTTTTTCGAAAACGGTGCCATCGCGCTCCATCTGGTGAGCGCGGATGGCCTAA
TCTTGCATGCTAACAAGGCGGAGCTCGATCTCCTCGGCTATCCGGCAGAGGACTATGTCGGTCGCCACATTACTGAGTTT
TATCCCGACCGGGACGTGATCACCAACATTTTGAGTAGGCTCTCCCGTGGTGAGCAGATTGCCAGATATCCGGCGCGTTT
GCGCGCCCGTGACGGGTCCATCAAGCACGTCGAACTTACGTCGAGCGGCCATTTCCGCGACGGCAAGCTAGTCAACACCA
GATGCTTTACTGTCGATGTAACTGACCTCGAGCGGACACGGACAGAACTCAGGCAGCAGGACAACGCCTATCACCAGATT
CTCGATGGCCTTCCGGTCGCCATCTACACGACTGACCAGAACGGCATCATTACCTATTACAATCGAGCCGCTGCCGATCT
GGCAGGGCGGGAGCCTCAGGTCGGCAAGGACAAATGGTGCGTAACGTTCAAGCTGTTCACCACTGACGGCAAGGAGCTGC
CGCACGACGAATGCCCGATGGCGGTCGCCCTCAAGGAAAACCGGCCGGTCCGCAACCAGCAGGCCATAGCGCAGCGTCCG
GACGGGTCCTTTTTCCCGTTCATGCCCTACCCCACTCCGCTGCGCGATGAGCAGGGCGGCCTCGTTGGCGCCGTGAATAT
GCTGCTCGATCTGACGGACCGCCAGCGTGCCGAAGAGACCAGGCAGCACCTGTCGGCGATCGTGGAATCCTCATTTGACG
CTATTGTCAGCAAGGATCTCAACACCATCATCAAGAGCTGGAATCGAGGCGCCGAAAAGCTCTTTGGCTATACCGCCAGC
GAAGCGATCGGCAAGTCGGTAACCATGCTTATCCCCGACGACCACCAGGACGAGGAACCTCGCATCCTTGAGCGCCTCCG
CCGTGGCGAGCGCGTCGATACCTATGAAACGATACGTCGGCGCAAGGATGGCAGCCTGGTTCCGGTCTCGCTGACGATAT
CGCCCGTGCGTAACGCAACAGGGCAGATCGTCGGCGCCTCGAAGATTGCCCGGGACATCACATCGGCCAGGGAAAACGAG
CAGCGTATCCGCATGTTGATGCGCGAGGTCAATCACCGGGTGAAGAACCAGTATTCCGTTATCCTGTCGATGATCCGGGA
GACCAACAAACGATCGGAGACGCCCAAGCAGTTTGAAGCCCAGGTGCGCCAGCGCATCATGGCATTGTCGCGCTCCCACG
ACCTGCTTGTGTCGGCCGACTGGAAGGGAGCTACCATCCGTGAACTTTTGGCGGCGCAGGCCCAGCCATTTCCCCGCGGT
GAGATGATCGATATGTCAGGGCCGTCGTTCGTGCTTGGCCCCAATGCGGTCCAATATCTCGGGATTGCCTTCAACGAACT
CGCGACCAACTCTGCGAAGTATGGCGTGCTCTCCGGCGATGACGGCCAGATTTCTGTTACATGGAACATCAGCGGTTCCG
GTACATCGCGGCTCTTCCACCTGACCTGGGCGGAGACCGATGGGCCGCAAGTCACGACCATCAGACAGGGCGGCTTTGGA
ACCGTGGTCCTGGAGCGGGTTGCACCTGAAGCCGTGGGCGGACGGGGAAATCTTGAATATGGTTCGCATGGGATCACATG
GAACCTGGAAGCGCCGTTGGCCGGACTGGACCGTTCCGTTGCCAATCAGGATTAG

Upstream 100 bases:

>100_bases
GTCTGCGTCAGGAGATCGGACCCCTTTACCTTCAGCTAACCACCCCCTTCACTTCCGACATGTAGGCAGCGACTTGGAAC
AATTGTTTAGACGGCGCGTT

Downstream 100 bases:

>100_bases
AAGCCTAAAGTAGCCTTGCTGGAGACGGGCGTTCGTCACTCGTTTGGCCATGCGGAATCTCCAGCCGGAACACCCCCTCG
TCGCCCGGCTGGCGGTCCGC

Product: sensor histidine kinase of two-component

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 604; Mature: 603

Protein sequence:

>604_residues
MARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKAELDLLGYPAEDYVGRHITEF
YPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELTSSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQI
LDGLPVAIYTTDQNGIITYYNRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP
DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDLNTIIKSWNRGAEKLFGYTAS
EAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRRRKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENE
QRIRMLMREVNHRVKNQYSVILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG
EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFHLTWAETDGPQVTTIRQGGFG
TVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSVANQD

Sequences:

>Translated_604_residues
MARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKAELDLLGYPAEDYVGRHITEF
YPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELTSSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQI
LDGLPVAIYTTDQNGIITYYNRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP
DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDLNTIIKSWNRGAEKLFGYTAS
EAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRRRKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENE
QRIRMLMREVNHRVKNQYSVILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG
EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFHLTWAETDGPQVTTIRQGGFG
TVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSVANQD
>Mature_603_residues
ARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKAELDLLGYPAEDYVGRHITEFY
PDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELTSSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQIL
DGLPVAIYTTDQNGIITYYNRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRPD
GSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDLNTIIKSWNRGAEKLFGYTASE
AIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRRRKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENEQ
RIRMLMREVNHRVKNQYSVILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRGE
MIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFHLTWAETDGPQVTTIRQGGFGT
VVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSVANQD

Specific function: Photosensitive kinase that is involved in increased bacterial virulence upon exposure to light. Once ejected from an infected animal host, sunlight acts as an environmental signal that increases the virulence of the bacterium, preparing it for infection o

COG id: NA

COG function: NA

Gene ontology:

Cell location: Integral Membrane Protein. Inner Membrane [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR000700
- InterPro:   IPR013767
- InterPro:   IPR013655
- InterPro:   IPR011102 [H]

Pfam domain/function: PF07536 HWE_HK; PF00989 PAS; PF08447 PAS_3 [H]

EC number: =2.7.13.3 [H]

Molecular weight: Translated: 67250; Mature: 67119

Theoretical pI: Translated: 6.85; Mature: 6.85

Prosite motif: PS50112 PAS ; PS50113 PAC

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKA
CCCCCCCCCCCCCCCHHHHHHCCCCEEECCCCCHHHHHCCCCEEEEEEECCCEEEECCCC
ELDLLGYPAEDYVGRHITEFYPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELT
CEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHCHHHHHCCCCCEEEEEEE
SSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQILDGLPVAIYTTDQNGIITYY
CCCCCCCCEEEEEEEEEEECHHHHHHHHHHHHHCCHHHHHHCCCCEEEEEECCCCEEEEE
NRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP
CCHHHHHCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHHCCC
DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDL
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NTIIKSWNRGAEKLFGYTASEAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRR
HHHHHHHCCCHHHHHCCCHHHHCCCEEEEEECCCCCCCCHHHHHHHHCCCCCCHHHHHHH
RKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENEQRIRMLMREVNHRVKNQYSV
CCCCCEEEEEEEECCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG
HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCCCCC
EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFH
CEEECCCCCEEECCCHHHHHHHHHHHHHCCCCCCCEEECCCCEEEEEEEECCCCCCEEEE
LTWAETDGPQVTTIRQGGFGTVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSV
EEEECCCCCEEEEEECCCCCEEHHHHHCHHHCCCCCCCCCCCCCEEEEECCCHHHCCCHH
ANQD
CCCC
>Mature Secondary Structure 
ARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKA
CCCCCCCCCCCCCCHHHHHHCCCCEEECCCCCHHHHHCCCCEEEEEEECCCEEEECCCC
ELDLLGYPAEDYVGRHITEFYPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELT
CEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHCHHHHHCCCCCEEEEEEE
SSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQILDGLPVAIYTTDQNGIITYY
CCCCCCCCEEEEEEEEEEECHHHHHHHHHHHHHCCHHHHHHCCCCEEEEEECCCCEEEEE
NRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP
CCHHHHHCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHHCCC
DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDL
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NTIIKSWNRGAEKLFGYTASEAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRR
HHHHHHHCCCHHHHHCCCHHHHCCCEEEEEECCCCCCCCHHHHHHHHCCCCCCHHHHHHH
RKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENEQRIRMLMREVNHRVKNQYSV
CCCCCEEEEEEEECCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG
HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCCCCC
EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFH
CEEECCCCCEEECCCHHHHHHHHHHHHHCCCCCCCEEECCCCEEEEEEEECCCCCCEEEE
LTWAETDGPQVTTIRQGGFGTVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSV
EEEECCCCCEEEEEECCCCCEEHHHHHCHHHCCCCCCCCCCCCCEEEEECCCHHHCCCHH
ANQD
CCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA