| Definition | Mesorhizobium loti MAFF303099 plasmid pMLb, complete sequence. |
|---|---|
| Accession | NC_002682 |
| Length | 208,315 |
Click here to switch to the map view.
The map label for this gene is atoS [C]
Identifier: 13488499
GI number: 13488499
Start: 126527
End: 128341
Strand: Reverse
Name: atoS [C]
Synonym: mll9655
Alternate gene names: 13488499
Gene position: 128341-126527 (Counterclockwise)
Preceding gene: 13488500
Following gene: 13488498
Centisome position: 61.61
GC content: 59.17
Gene sequence:
>1815_bases GTGGCCCGTTTAGATCGGGGACCTCGAATGACTGAACTACCGCAGGCCGACGTTGCCGCCGTCAAGAAACCGGCAATGGT CCTGGGCGGTAACCTCGACTATCAGGACTTTTTCGAAAACGGTGCCATCGCGCTCCATCTGGTGAGCGCGGATGGCCTAA TCTTGCATGCTAACAAGGCGGAGCTCGATCTCCTCGGCTATCCGGCAGAGGACTATGTCGGTCGCCACATTACTGAGTTT TATCCCGACCGGGACGTGATCACCAACATTTTGAGTAGGCTCTCCCGTGGTGAGCAGATTGCCAGATATCCGGCGCGTTT GCGCGCCCGTGACGGGTCCATCAAGCACGTCGAACTTACGTCGAGCGGCCATTTCCGCGACGGCAAGCTAGTCAACACCA GATGCTTTACTGTCGATGTAACTGACCTCGAGCGGACACGGACAGAACTCAGGCAGCAGGACAACGCCTATCACCAGATT CTCGATGGCCTTCCGGTCGCCATCTACACGACTGACCAGAACGGCATCATTACCTATTACAATCGAGCCGCTGCCGATCT GGCAGGGCGGGAGCCTCAGGTCGGCAAGGACAAATGGTGCGTAACGTTCAAGCTGTTCACCACTGACGGCAAGGAGCTGC CGCACGACGAATGCCCGATGGCGGTCGCCCTCAAGGAAAACCGGCCGGTCCGCAACCAGCAGGCCATAGCGCAGCGTCCG GACGGGTCCTTTTTCCCGTTCATGCCCTACCCCACTCCGCTGCGCGATGAGCAGGGCGGCCTCGTTGGCGCCGTGAATAT GCTGCTCGATCTGACGGACCGCCAGCGTGCCGAAGAGACCAGGCAGCACCTGTCGGCGATCGTGGAATCCTCATTTGACG CTATTGTCAGCAAGGATCTCAACACCATCATCAAGAGCTGGAATCGAGGCGCCGAAAAGCTCTTTGGCTATACCGCCAGC GAAGCGATCGGCAAGTCGGTAACCATGCTTATCCCCGACGACCACCAGGACGAGGAACCTCGCATCCTTGAGCGCCTCCG CCGTGGCGAGCGCGTCGATACCTATGAAACGATACGTCGGCGCAAGGATGGCAGCCTGGTTCCGGTCTCGCTGACGATAT CGCCCGTGCGTAACGCAACAGGGCAGATCGTCGGCGCCTCGAAGATTGCCCGGGACATCACATCGGCCAGGGAAAACGAG CAGCGTATCCGCATGTTGATGCGCGAGGTCAATCACCGGGTGAAGAACCAGTATTCCGTTATCCTGTCGATGATCCGGGA GACCAACAAACGATCGGAGACGCCCAAGCAGTTTGAAGCCCAGGTGCGCCAGCGCATCATGGCATTGTCGCGCTCCCACG ACCTGCTTGTGTCGGCCGACTGGAAGGGAGCTACCATCCGTGAACTTTTGGCGGCGCAGGCCCAGCCATTTCCCCGCGGT GAGATGATCGATATGTCAGGGCCGTCGTTCGTGCTTGGCCCCAATGCGGTCCAATATCTCGGGATTGCCTTCAACGAACT CGCGACCAACTCTGCGAAGTATGGCGTGCTCTCCGGCGATGACGGCCAGATTTCTGTTACATGGAACATCAGCGGTTCCG GTACATCGCGGCTCTTCCACCTGACCTGGGCGGAGACCGATGGGCCGCAAGTCACGACCATCAGACAGGGCGGCTTTGGA ACCGTGGTCCTGGAGCGGGTTGCACCTGAAGCCGTGGGCGGACGGGGAAATCTTGAATATGGTTCGCATGGGATCACATG GAACCTGGAAGCGCCGTTGGCCGGACTGGACCGTTCCGTTGCCAATCAGGATTAG
Upstream 100 bases:
>100_bases GTCTGCGTCAGGAGATCGGACCCCTTTACCTTCAGCTAACCACCCCCTTCACTTCCGACATGTAGGCAGCGACTTGGAAC AATTGTTTAGACGGCGCGTT
Downstream 100 bases:
>100_bases AAGCCTAAAGTAGCCTTGCTGGAGACGGGCGTTCGTCACTCGTTTGGCCATGCGGAATCTCCAGCCGGAACACCCCCTCG TCGCCCGGCTGGCGGTCCGC
Product: sensor histidine kinase of two-component
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 604; Mature: 603
Protein sequence:
>604_residues MARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKAELDLLGYPAEDYVGRHITEF YPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELTSSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQI LDGLPVAIYTTDQNGIITYYNRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDLNTIIKSWNRGAEKLFGYTAS EAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRRRKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENE QRIRMLMREVNHRVKNQYSVILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFHLTWAETDGPQVTTIRQGGFG TVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSVANQD
Sequences:
>Translated_604_residues MARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKAELDLLGYPAEDYVGRHITEF YPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELTSSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQI LDGLPVAIYTTDQNGIITYYNRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDLNTIIKSWNRGAEKLFGYTAS EAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRRRKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENE QRIRMLMREVNHRVKNQYSVILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFHLTWAETDGPQVTTIRQGGFG TVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSVANQD >Mature_603_residues ARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKAELDLLGYPAEDYVGRHITEFY PDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELTSSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQIL DGLPVAIYTTDQNGIITYYNRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRPD GSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDLNTIIKSWNRGAEKLFGYTASE AIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRRRKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENEQ RIRMLMREVNHRVKNQYSVILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRGE MIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFHLTWAETDGPQVTTIRQGGFGT VVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSVANQD
Specific function: Photosensitive kinase that is involved in increased bacterial virulence upon exposure to light. Once ejected from an infected animal host, sunlight acts as an environmental signal that increases the virulence of the bacterium, preparing it for infection o
COG id: NA
COG function: NA
Gene ontology:
Cell location: Integral Membrane Protein. Inner Membrane [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001610 - InterPro: IPR000014 - InterPro: IPR000700 - InterPro: IPR013767 - InterPro: IPR013655 - InterPro: IPR011102 [H]
Pfam domain/function: PF07536 HWE_HK; PF00989 PAS; PF08447 PAS_3 [H]
EC number: =2.7.13.3 [H]
Molecular weight: Translated: 67250; Mature: 67119
Theoretical pI: Translated: 6.85; Mature: 6.85
Prosite motif: PS50112 PAS ; PS50113 PAC
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein) 2.2 %Met (Translated Protein) 2.6 %Cys+Met (Translated Protein) 0.5 %Cys (Mature Protein) 2.0 %Met (Mature Protein) 2.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKA CCCCCCCCCCCCCCCHHHHHHCCCCEEECCCCCHHHHHCCCCEEEEEEECCCEEEECCCC ELDLLGYPAEDYVGRHITEFYPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELT CEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHCHHHHHCCCCCEEEEEEE SSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQILDGLPVAIYTTDQNGIITYY CCCCCCCCEEEEEEEEEEECHHHHHHHHHHHHHCCHHHHHHCCCCEEEEEECCCCEEEEE NRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP CCHHHHHCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHHCCC DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDL CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH NTIIKSWNRGAEKLFGYTASEAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRR HHHHHHHCCCHHHHHCCCHHHHCCCEEEEEECCCCCCCCHHHHHHHHCCCCCCHHHHHHH RKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENEQRIRMLMREVNHRVKNQYSV CCCCCEEEEEEEECCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH ILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCCCCC EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFH CEEECCCCCEEECCCHHHHHHHHHHHHHCCCCCCCEEECCCCEEEEEEEECCCCCCEEEE LTWAETDGPQVTTIRQGGFGTVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSV EEEECCCCCEEEEEECCCCCEEHHHHHCHHHCCCCCCCCCCCCCEEEEECCCHHHCCCHH ANQD CCCC >Mature Secondary Structure ARLDRGPRMTELPQADVAAVKKPAMVLGGNLDYQDFFENGAIALHLVSADGLILHANKA CCCCCCCCCCCCCCHHHHHHCCCCEEECCCCCHHHHHCCCCEEEEEEECCCEEEECCCC ELDLLGYPAEDYVGRHITEFYPDRDVITNILSRLSRGEQIARYPARLRARDGSIKHVELT CEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHCHHHHHCCCCCEEEEEEE SSGHFRDGKLVNTRCFTVDVTDLERTRTELRQQDNAYHQILDGLPVAIYTTDQNGIITYY CCCCCCCCEEEEEEEEEEECHHHHHHHHHHHHHCCHHHHHHCCCCEEEEEECCCCEEEEE NRAAADLAGREPQVGKDKWCVTFKLFTTDGKELPHDECPMAVALKENRPVRNQQAIAQRP CCHHHHHCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHHCCC DGSFFPFMPYPTPLRDEQGGLVGAVNMLLDLTDRQRAEETRQHLSAIVESSFDAIVSKDL CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH NTIIKSWNRGAEKLFGYTASEAIGKSVTMLIPDDHQDEEPRILERLRRGERVDTYETIRR HHHHHHHCCCHHHHHCCCHHHHCCCEEEEEECCCCCCCCHHHHHHHHCCCCCCHHHHHHH RKDGSLVPVSLTISPVRNATGQIVGASKIARDITSARENEQRIRMLMREVNHRVKNQYSV CCCCCEEEEEEEECCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH ILSMIRETNKRSETPKQFEAQVRQRIMALSRSHDLLVSADWKGATIRELLAAQAQPFPRG HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCCCCC EMIDMSGPSFVLGPNAVQYLGIAFNELATNSAKYGVLSGDDGQISVTWNISGSGTSRLFH CEEECCCCCEEECCCHHHHHHHHHHHHHCCCCCCCEEECCCCEEEEEEEECCCCCCEEEE LTWAETDGPQVTTIRQGGFGTVVLERVAPEAVGGRGNLEYGSHGITWNLEAPLAGLDRSV EEEECCCCCEEEEEECCCCCEEHHHHHCHHHCCCCCCCCCCCCCEEEEECCCHHHCCCHH ANQD CCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA