| Definition | Mesorhizobium loti MAFF303099 chromosome, complete genome. |
|---|---|
| Accession | NC_002678 |
| Length | 7,036,071 |
Click here to switch to the map view.
The map label for this gene is irlS [H]
Identifier: 13473758
GI number: 13473758
Start: 3545087
End: 3546463
Strand: Direct
Name: irlS [H]
Synonym: mlr4459
Alternate gene names: 13473758
Gene position: 3545087-3546463 (Clockwise)
Preceding gene: 13473757
Following gene: 13473759
Centisome position: 50.38
GC content: 63.98
Gene sequence:
>1377_bases ATGACCGGAAGCTGGTCTCGCCTGCTGCGCAGCACGCCGTTCCGGCTCGCCCTGACATTCGGCTTCCTGTTCATGCTGGC CTTCGTGCTGTCGGGCGCCATCGTCTACCAGATGATGAGCGCCGATCTTGCCGAACGGCTCGACGAGAGCATCAAGGAAA CCTATTCGGTCGTCGCCGCCACCTATGCCGCGAACGATCTGGAGGACCTTGTCGCCACGATCGAAAGCCATGCGAAGCTC AGCCCGAAGAAAGAACAGCTGTTTTCGCTGACCGACCCGGCCGGGAACCACCTGGCCGGCAACTTCACCGCCACCGGGCT TCCCGACGGGTTCTCGGGGTTCGACGCCGTGTTGCCAGGCGTGCCTCCGGATACGGAGTATCGCGCCTTTTCCGGTTCGG TCGGCGGCAACAATCTGACGGTCGCCTTCAGCCTTTCCGAAACGGAGGAGCTGGAAACAGTGGCGATGATGAGCTTCGGC TGGGCAACCCTGATCATCACCGGACTGGCGGTCGCCGGCGGCGCCCTGCTTGCCTCGCGCGTCCAGCTCCGTCTCGATGG CATTGCCGCCACCATGGTCGATGTCTCGCATGGCCGGCTCGATACCCGCATTCCGCTGACCGGCACGGGCGACGACATCG ACATCGTCTCCAGCCAGGTCAACGCGGCGCTCGATCGCCTGTCGGGCCTGGTGGACGGCATGAAGCAGGTCAGCGCCAAC ATCGCGCATGATCTGAAGACGCCGCTCAACCGGCTGCAGATGATCCTGGAAGGCGCCGCCGACAAGGCAGCGCGGGATCA GGACGTCTCCGACGATCTGGCCGACGCGCGTGCGGAGGGCCATCAGATCAACGAGACCTTCGATGCGTTGTTGCGCATCG CTCAGATCGAGGCCGGTGCCCGCAAGGCGCGGTTCACCGATCTCGATCTCGGCGAGGTGCTGGAAATCATAGCCGAGATC TACACCGATGTAGCCGAGGACGACGGAAAGTCGCTGGTGTCGACGCAACTGCGTGAGACAGCAGACCCGATTCATGGCGA TCGGGAACTGTTGACGCAGATGTTCGCCAATCTGGTCGAGAATGCGCTGCGCCATTGCCCGCCCGGGACGACCATCAAAC TGTCGGTGACGCGCCAGGCCGAGCGTGTCGTTGCCAGCGTCGCCGACAATGGACCCGGCATTCCTCCGGACGAGCGCGAA CAGGTGTTTCAGCGGCTCTACCGGCTGGACCACAGCCGCTCGACGCCGGGCAACGGCCTTGGCCTCAGCCTGGTGCGGGC CATTGCCGATTTGCATGGCGCATCCATCGCCCTTGACGATTGTCAGCCTGGCCTCGCCGTGGTGGTGAGCTTTCCGCTGG TAAGATCGGCGGCGTGA
Upstream 100 bases:
>100_bases ATATCAGCCGGCTGCGGGCCAAGGTCGACAAGCCGTTCGAGGCCCAGCTCATCCATACCATCCGCAACACCGGCTACAGC CTGCACGCACCGCTTGCGCC
Downstream 100 bases:
>100_bases TAGGGCCGAAAGCTGGTGCCAGAAACTTACCAGAGCGCCAATAGCTTACGAAAGGGTATGAGGCCGGCGACCCGCCCGTA AGCTTGCAGCGGCATAATGA
Product: sensory histidine protein kinase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 458; Mature: 457
Protein sequence:
>458_residues MTGSWSRLLRSTPFRLALTFGFLFMLAFVLSGAIVYQMMSADLAERLDESIKETYSVVAATYAANDLEDLVATIESHAKL SPKKEQLFSLTDPAGNHLAGNFTATGLPDGFSGFDAVLPGVPPDTEYRAFSGSVGGNNLTVAFSLSETEELETVAMMSFG WATLIITGLAVAGGALLASRVQLRLDGIAATMVDVSHGRLDTRIPLTGTGDDIDIVSSQVNAALDRLSGLVDGMKQVSAN IAHDLKTPLNRLQMILEGAADKAARDQDVSDDLADARAEGHQINETFDALLRIAQIEAGARKARFTDLDLGEVLEIIAEI YTDVAEDDGKSLVSTQLRETADPIHGDRELLTQMFANLVENALRHCPPGTTIKLSVTRQAERVVASVADNGPGIPPDERE QVFQRLYRLDHSRSTPGNGLGLSLVRAIADLHGASIALDDCQPGLAVVVSFPLVRSAA
Sequences:
>Translated_458_residues MTGSWSRLLRSTPFRLALTFGFLFMLAFVLSGAIVYQMMSADLAERLDESIKETYSVVAATYAANDLEDLVATIESHAKL SPKKEQLFSLTDPAGNHLAGNFTATGLPDGFSGFDAVLPGVPPDTEYRAFSGSVGGNNLTVAFSLSETEELETVAMMSFG WATLIITGLAVAGGALLASRVQLRLDGIAATMVDVSHGRLDTRIPLTGTGDDIDIVSSQVNAALDRLSGLVDGMKQVSAN IAHDLKTPLNRLQMILEGAADKAARDQDVSDDLADARAEGHQINETFDALLRIAQIEAGARKARFTDLDLGEVLEIIAEI YTDVAEDDGKSLVSTQLRETADPIHGDRELLTQMFANLVENALRHCPPGTTIKLSVTRQAERVVASVADNGPGIPPDERE QVFQRLYRLDHSRSTPGNGLGLSLVRAIADLHGASIALDDCQPGLAVVVSFPLVRSAA >Mature_457_residues TGSWSRLLRSTPFRLALTFGFLFMLAFVLSGAIVYQMMSADLAERLDESIKETYSVVAATYAANDLEDLVATIESHAKLS PKKEQLFSLTDPAGNHLAGNFTATGLPDGFSGFDAVLPGVPPDTEYRAFSGSVGGNNLTVAFSLSETEELETVAMMSFGW ATLIITGLAVAGGALLASRVQLRLDGIAATMVDVSHGRLDTRIPLTGTGDDIDIVSSQVNAALDRLSGLVDGMKQVSANI AHDLKTPLNRLQMILEGAADKAARDQDVSDDLADARAEGHQINETFDALLRIAQIEAGARKARFTDLDLGEVLEIIAEIY TDVAEDDGKSLVSTQLRETADPIHGDRELLTQMFANLVENALRHCPPGTTIKLSVTRQAERVVASVADNGPGIPPDEREQ VFQRLYRLDHSRSTPGNGLGLSLVRAIADLHGASIALDDCQPGLAVVVSFPLVRSAA
Specific function: Member of the two-component regulatory system irlR/irlS. May be involved in invasion of eukaryotic cells and heavy-metal resistance. Probably activates irlR by phosphorylation [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 histidine kinase domain [H]
Homologues:
Organism=Escherichia coli, GI1787894, Length=282, Percent_Identity=30.4964539007092, Blast_Score=94, Evalue=2e-20, Organism=Escherichia coli, GI1786783, Length=368, Percent_Identity=25, Blast_Score=91, Evalue=2e-19, Organism=Escherichia coli, GI1788393, Length=271, Percent_Identity=27.6752767527675, Blast_Score=91, Evalue=2e-19, Organism=Escherichia coli, GI1789808, Length=330, Percent_Identity=28.4848484848485, Blast_Score=88, Evalue=1e-18, Organism=Escherichia coli, GI1786600, Length=205, Percent_Identity=27.8048780487805, Blast_Score=86, Evalue=5e-18, Organism=Escherichia coli, GI1786912, Length=221, Percent_Identity=30.7692307692308, Blast_Score=86, Evalue=6e-18, Organism=Escherichia coli, GI1790346, Length=270, Percent_Identity=28.5185185185185, Blast_Score=84, Evalue=1e-17, Organism=Escherichia coli, GI1788279, Length=208, Percent_Identity=27.8846153846154, Blast_Score=82, Evalue=6e-17, Organism=Escherichia coli, GI1790551, Length=233, Percent_Identity=30.0429184549356, Blast_Score=76, Evalue=4e-15, Organism=Escherichia coli, GI1790861, Length=304, Percent_Identity=25.6578947368421, Blast_Score=74, Evalue=2e-14, Organism=Escherichia coli, GI1789403, Length=255, Percent_Identity=28.2352941176471, Blast_Score=67, Evalue=2e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594 - InterPro: IPR003660 - InterPro: IPR004358 - InterPro: IPR006290 - InterPro: IPR003661 - InterPro: IPR005467 - InterPro: IPR009082 [H]
Pfam domain/function: PF00672 HAMP; PF02518 HATPase_c; PF00512 HisKA [H]
EC number: =2.7.13.3 [H]
Molecular weight: Translated: 48915; Mature: 48784
Theoretical pI: Translated: 4.45; Mature: 4.45
Prosite motif: PS50885 HAMP ; PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein) 2.2 %Met (Translated Protein) 2.6 %Cys+Met (Translated Protein) 0.4 %Cys (Mature Protein) 2.0 %Met (Mature Protein) 2.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTGSWSRLLRSTPFRLALTFGFLFMLAFVLSGAIVYQMMSADLAERLDESIKETYSVVAA CCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH TYAANDLEDLVATIESHAKLSPKKEQLFSLTDPAGNHLAGNFTATGLPDGFSGFDAVLPG HHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCCEECCCCCCCCCHHHHCCC VPPDTEYRAFSGSVGGNNLTVAFSLSETEELETVAMMSFGWATLIITGLAVAGGALLASR CCCCCCCEEEECCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VQLRLDGIAATMVDVSHGRLDTRIPLTGTGDDIDIVSSQVNAALDRLSGLVDGMKQVSAN HHHHHHHHHHHHHECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH IAHDLKTPLNRLQMILEGAADKAARDQDVSDDLADARAEGHQINETFDALLRIAQIEAGA HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC RKARFTDLDLGEVLEIIAEIYTDVAEDDGKSLVSTQLRETADPIHGDRELLTQMFANLVE HHCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH NALRHCPPGTTIKLSVTRQAERVVASVADNGPGIPPDEREQVFQRLYRLDHSRSTPGNGL HHHHHCCCCCEEEEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCH GLSLVRAIADLHGASIALDDCQPGLAVVVSFPLVRSAA HHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHCCC >Mature Secondary Structure TGSWSRLLRSTPFRLALTFGFLFMLAFVLSGAIVYQMMSADLAERLDESIKETYSVVAA CCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH TYAANDLEDLVATIESHAKLSPKKEQLFSLTDPAGNHLAGNFTATGLPDGFSGFDAVLPG HHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCCEECCCCCCCCCHHHHCCC VPPDTEYRAFSGSVGGNNLTVAFSLSETEELETVAMMSFGWATLIITGLAVAGGALLASR CCCCCCCEEEECCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VQLRLDGIAATMVDVSHGRLDTRIPLTGTGDDIDIVSSQVNAALDRLSGLVDGMKQVSAN HHHHHHHHHHHHHECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH IAHDLKTPLNRLQMILEGAADKAARDQDVSDDLADARAEGHQINETFDALLRIAQIEAGA HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC RKARFTDLDLGEVLEIIAEIYTDVAEDDGKSLVSTQLRETADPIHGDRELLTQMFANLVE HHCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH NALRHCPPGTTIKLSVTRQAERVVASVADNGPGIPPDEREQVFQRLYRLDHSRSTPGNGL HHHHHCCCCCEEEEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCH GLSLVRAIADLHGASIALDDCQPGLAVVVSFPLVRSAA HHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9393784 [H]