Definition Mycobacterium tuberculosis H37Ra, complete genome.
Accession NC_009525
Length 4,419,977

The map label for this gene is arlS [H]

Identifier: 148663630

GI number: 148663630

Start: 4218028

End: 4219455

Strand: Reverse

Name: arlS [H]

Synonym: MRA_3802

Alternate gene names: 148663630

Gene position: 4219455-4218028 (Counterclockwise)
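The gene length follows from the Start and End fields as End - Start + 1. A minimal sketch, assuming the coordinates are 1-based and inclusive (the variable names are illustrative, not part of the record):

```python
# Coordinates from the record; assumed to be 1-based and inclusive.
start = 4218028
end = 4219455

gene_length = end - start + 1
print(gene_length)  # 1428, matching the ">1428_bases" header of the gene sequence
```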

Preceding gene: 148663631

Following gene: 148663628

Centisome position: 95.46
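The centisome position expresses the gene's location as a percentage of the genome length. The sketch below assumes it is computed from the first coordinate of the reverse-strand gene (4219455) divided by the genome length; this formula is an assumption, chosen because it reproduces the reported value.

```python
genome_length = 4419977   # "Length" field of NC_009525
gene_start = 4219455      # first coordinate of the reverse-strand gene

# Assumed formula: position as a percentage of the genome length.
centisome = 100 * gene_start / genome_length
print(round(centisome, 2))  # 95.46
```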

GC content: 66.67
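GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of the calculation follows; the `gc_content` helper is hypothetical and shown on a toy input rather than on the full sequence.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)

# Toy input: three of the four bases are G or C.
print(gc_content("GGCA"))  # 75.0
```

For a 1428-base gene, a value of 66.67 would correspond to 952 G or C bases.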

Gene sequence:

>1428_bases
GTGGGAATCACCGCGGCAACCGAAATGGCGCTGCGTCGTCATCTGGTGGCACAACTTGACAACCAACTCGGCGGAACGTC
GTACCGCTCGGTGTTGATGTATCCGGAGAAAATGCCCCGTCCGCCCTGGCGGCACGAGACGCACAACTACATCCGGTCGG
GCCCCGGTCCGAGGTTTCTCGATGCTCCGGGCCAGCCGGCCGGGATGGTGGCGGCGGTGGTCAGCGACGGCACGACGGTC
GCCGCCGGATATCTGACCGGCAGTGGTTCGCGGGCGGCGTTGACGTCAACCGGCCGGTCCCAGCTGGAACGGATCGCCGG
CAGCCGCACACCGCTGACCCTGGATCTCGACGGTCTGGGCCGGTACCGTGTGCTGGCCGCTCCGAGCCGAAACGGGCACG
ACGTCATCGTCACCGGCCTGTCGATGGGCAACGTCGACGCCACGATGTTGCAGATGCTGATCATTTTCGGAATCGTCACG
GTGATTGCGTTGGTCGCCGCGACGACCGCCGGAATCGTCATCATCAAGCGGGCGCTGGCGCCGTTGCGGCGCGTCGCGCA
AACCGCGAGCGAAGTCGTCGACCTACCGTTGGATCGCGGCGAGGTCAAGCTACCGGTCCGGGTGCCCGAACCTGACGCAA
ACCCCTCCACCGAGGTGGGGCAACTCGGGTCGGCGCTCAACCGGATGCTCGACCACATCGCTGCCGCACTGTCGGCGCGG
CAGGCCAGTGAAACCTGTGTGCGCCAGTTCGTTGCCGATGCCAGTCATGAACTGCGAACTCCCCTTGCGGCGATCCGTGG
TTACACGGAATTGACGCAGCGGATAGGGGACGATCCCGAGGCCGTCGCACACGCGATGAGCCGGGTGGCATCGGAGACCG
AGCGGATAACACGTCTCGTCGAGGACCTGCTGCTGCTGGCGCGTCTGGACTCGGGGCGGCCGCTGGAACGCGGACCGGTG
GACATGTCGCGGCTTGCGGTTGACGCGGTCAGCGACGCTCATGTTGCCGGACCAGATCACCAGTGGGCGCTCGACCTGCC
CCCCGAACCGGTGGTCATCCCGGGTGATGCGGCACGGTTGCACCAGGTGGTGACCAACCTGCTGGCCAACGCCCGCGTGC
ACACCGGTCCCGGCACGATCGTGACGACGCGCTTGAGCACCGGGCCGACGCACGTCGTGCTGCAGGTGATCGACAATGGG
CCGGGTATTCCGGCCGCGCTGCAGTCCGAGGTTTTCGAGCGGTTCGCCCGCGGCGATACGTCACGGTCCCGCCAAGCCGG
TAGCACCGGGCTCGGCCTGGCGATCGTCTCCGCTGTGGTCAAGGCGCACAACGGAACGATCACCGTGAGTAGCTCACCCG
GATATACCGAGTTTGCGGTGCGGTTGCCACTTGACGGATGGCAACCGCTCGAATCGTCGCCGCGCTAG

Upstream 100 bases:

>100_bases
GCCGGCTATGTGCTCAAGCCGGCCCGCTAGCAGTCCGCGAATTTGGTCGCTTCGGCTGCGGCTCCTGGTCGGACAGGTTG
TCGTCCTCGCCGTGGTGTGT

Downstream 100 bases:

>100_bases
GCCTGACTGCCCGGCTCCGACGCGCTGTTCACAGCCCGCATCGACACGCTTTAGGTTAGGAACAGGTCACCTCGATTTCG
AACGACTTGTTCACCGGTGA

Product: two component sensor kinase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 475; Mature: 474
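These counts follow from the gene length: 1428 bases give 476 codons, the last of which is a stop codon. The sketch below assumes the mature form differs from the translated form only by removal of the initiator methionine, which is consistent with the Mature sequence in this record lacking the leading M.

```python
gene_length = 1428           # bases, from the ">1428_bases" header
codons = gene_length // 3    # 476 codons, including the stop codon
translated = codons - 1      # the stop codon encodes no residue
mature = translated - 1      # assumes the initiator Met is removed
print(translated, mature)    # 475 474
```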

Protein sequence:

>475_residues
MGITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWRHETHNYIRSGPGPRFLDAPGQPAGMVAAVVSDGTTV
AAGYLTGSGSRAALTSTGRSQLERIAGSRTPLTLDLDGLGRYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVT
VIALVAATTAGIVIIKRALAPLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTEVGQLGSALNRMLDHIAAALSAR
QASETCVRQFVADASHELRTPLAAIRGYTELTQRIGDDPEAVAHAMSRVASETERITRLVEDLLLLARLDSGRPLERGPV
DMSRLAVDAVSDAHVAGPDHQWALDLPPEPVVIPGDAARLHQVVTNLLANARVHTGPGTIVTTRLSTGPTHVVLQVIDNG
PGIPAALQSEVFERFARGDTSRSRQAGSTGLGLAIVSAVVKAHNGTITVSSSPGYTEFAVRLPLDGWQPLESSPR
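
As a consistency check between the gene and protein sequences, the sketch below translates the first 27 and the last 18 bases of the gene with the standard genetic code. It assumes the GTG start codon is read as methionine, as is usual for bacterial initiation codons; the `translate` helper is illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str, is_start: bool = False) -> str:
    """Translate in-frame DNA, stopping at the first stop codon."""
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and is_start:
            aa = "M"  # assumption: an alternative start codon such as GTG initiates with Met
        protein.append(aa)
    return "".join(protein)

print(translate("GTGGGAATCACCGCGGCAACCGAAATG", is_start=True))  # MGITAATEM
print(translate("GAATCGTCGCCGCGCTAG"))                           # ESSPR
```

The two outputs match the first nine and last five residues of the protein sequence above.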

Sequences:

>Translated_475_residues
MGITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWRHETHNYIRSGPGPRFLDAPGQPAGMVAAVVSDGTTV
AAGYLTGSGSRAALTSTGRSQLERIAGSRTPLTLDLDGLGRYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVT
VIALVAATTAGIVIIKRALAPLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTEVGQLGSALNRMLDHIAAALSAR
QASETCVRQFVADASHELRTPLAAIRGYTELTQRIGDDPEAVAHAMSRVASETERITRLVEDLLLLARLDSGRPLERGPV
DMSRLAVDAVSDAHVAGPDHQWALDLPPEPVVIPGDAARLHQVVTNLLANARVHTGPGTIVTTRLSTGPTHVVLQVIDNG
PGIPAALQSEVFERFARGDTSRSRQAGSTGLGLAIVSAVVKAHNGTITVSSSPGYTEFAVRLPLDGWQPLESSPR
>Mature_474_residues
GITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWRHETHNYIRSGPGPRFLDAPGQPAGMVAAVVSDGTTVA
AGYLTGSGSRAALTSTGRSQLERIAGSRTPLTLDLDGLGRYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVTV
IALVAATTAGIVIIKRALAPLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTEVGQLGSALNRMLDHIAAALSARQ
ASETCVRQFVADASHELRTPLAAIRGYTELTQRIGDDPEAVAHAMSRVASETERITRLVEDLLLLARLDSGRPLERGPVD
MSRLAVDAVSDAHVAGPDHQWALDLPPEPVVIPGDAARLHQVVTNLLANARVHTGPGTIVTTRLSTGPTHVVLQVIDNGP
GIPAALQSEVFERFARGDTSRSRQAGSTGLGLAIVSAVVKAHNGTITVSSSPGYTEFAVRLPLDGWQPLESSPR

Specific function: Member of the two-component regulatory system ArlS/ArlR. ArlS probably functions as a sensor protein kinase which is autophosphorylated at a histidine residue and transfers its phosphate group to ArlR [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein [H]

Metabolic importance: Non-essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 histidine kinase domain [H]

Homologues:

Organism=Escherichia coli, GI1790861, Length=444, Percent_Identity=28.6036036036036, Blast_Score=119, Evalue=6e-28
Organism=Escherichia coli, GI1788393, Length=256, Percent_Identity=35.15625, Blast_Score=115, Evalue=6e-27
Organism=Escherichia coli, GI1786783, Length=353, Percent_Identity=29.1784702549575, Blast_Score=110, Evalue=3e-25
Organism=Escherichia coli, GI1786600, Length=239, Percent_Identity=30.9623430962343, Blast_Score=107, Evalue=2e-24
Organism=Escherichia coli, GI145693157, Length=239, Percent_Identity=33.0543933054393, Blast_Score=104, Evalue=1e-23
Organism=Escherichia coli, GI1789149, Length=375, Percent_Identity=27.4666666666667, Blast_Score=104, Evalue=2e-23
Organism=Escherichia coli, GI1790346, Length=308, Percent_Identity=25.3246753246753, Blast_Score=103, Evalue=2e-23
Organism=Escherichia coli, GI1787894, Length=248, Percent_Identity=32.258064516129, Blast_Score=93, Evalue=3e-20
Organism=Escherichia coli, GI1788713, Length=226, Percent_Identity=29.646017699115, Blast_Score=92, Evalue=8e-20
Organism=Escherichia coli, GI1786912, Length=263, Percent_Identity=32.319391634981, Blast_Score=91, Evalue=2e-19
Organism=Escherichia coli, GI1788549, Length=221, Percent_Identity=29.8642533936652, Blast_Score=88, Evalue=1e-18
Organism=Escherichia coli, GI1789403, Length=338, Percent_Identity=28.4023668639053, Blast_Score=86, Evalue=6e-18
Organism=Escherichia coli, GI87081816, Length=352, Percent_Identity=27.8409090909091, Blast_Score=85, Evalue=8e-18
Organism=Escherichia coli, GI1788279, Length=318, Percent_Identity=25.4716981132075, Blast_Score=85, Evalue=1e-17
Organism=Escherichia coli, GI1790300, Length=237, Percent_Identity=29.535864978903, Blast_Score=82, Evalue=5e-17
Organism=Escherichia coli, GI1789808, Length=228, Percent_Identity=28.0701754385965, Blast_Score=80, Evalue=4e-16
Organism=Escherichia coli, GI1790551, Length=260, Percent_Identity=28.4615384615385, Blast_Score=77, Evalue=2e-15
Organism=Escherichia coli, GI1790436, Length=234, Percent_Identity=31.1965811965812, Blast_Score=72, Evalue=1e-13
Organism=Escherichia coli, GI87082128, Length=224, Percent_Identity=27.6785714285714, Blast_Score=69, Evalue=6e-13
Organism=Escherichia coli, GI48994928, Length=214, Percent_Identity=27.5700934579439, Blast_Score=67, Evalue=2e-12
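
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows; the `parse_homologue` helper and the `GI` dictionary key are illustrative choices, and a trailing comma is tolerated if present.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as "GI1788393"
    # Convert the numeric fields from strings.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

line = ("Organism=Escherichia coli, GI1788393, Length=256, "
        "Percent_Identity=35.15625, Blast_Score=115, Evalue=6e-27")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
```

Parsed this way, the hits can be sorted or filtered by `Evalue` or `Percent_Identity`.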

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003594
- InterPro:   IPR003660
- InterPro:   IPR004358
- InterPro:   IPR003661
- InterPro:   IPR005467
- InterPro:   IPR009082 [H]

Pfam domain/function: PF00672 HAMP; PF02518 HATPase_c; PF00512 HisKA [H]

EC number: 2.7.13.3 [H]

Molecular weight: Translated: 50345; Mature: 50214
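
The difference between the two masses is consistent with the loss of a single methionine residue. The check below uses an average Met residue mass of about 131.2 Da, a standard value that is not stated in the record.

```python
translated_mw = 50345
mature_mw = 50214
MET_RESIDUE_MASS = 131.2  # approximate average Met residue mass in Da (not from the record)

difference = translated_mw - mature_mw
print(difference)  # 131
```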

Theoretical pI: Translated: 7.50; Mature: 7.50

Prosite motif: PS50885 HAMP; PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
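
These percentages can be reproduced from residue counts. The sketch below uses counts tallied from the translated sequence in this record (1 Cys, 11 Met) and assumes the mature protein differs only by the missing initiator Met; the `percent` helper is illustrative.

```python
def percent(count: int, length: int) -> float:
    """Residue count as a percentage of sequence length, to one decimal."""
    return round(100 * count / length, 1)

cys, met = 1, 11  # tallied from the 475-residue translated sequence

# Translated protein (475 residues)
print(percent(cys, 475), percent(met, 475), percent(cys + met, 475))          # 0.2 2.3 2.5
# Mature protein (474 residues, one Met fewer)
print(percent(cys, 474), percent(met - 1, 474), percent(cys + met - 1, 474))  # 0.2 2.1 2.3
```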

Secondary structure:

>Translated Secondary Structure
MGITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWRHETHNYIRSGPGPRFL
CCCCHHHHHHHHHHHHHHHHHHCCCCCHHHEEECHHHCCCCCCCCHHHHHHHCCCCCCCC
DAPGQPAGMVAAVVSDGTTVAAGYLTGSGSRAALTSTGRSQLERIAGSRTPLTLDLDGLG
CCCCCCHHHEEEEECCCCEEEEEEEECCCCCEEHHHCCHHHHHHHCCCCCCEEEEECCCC
RYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVTVIALVAATTAGIVIIKRALA
CEEEEECCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTEVGQLGSALNRMLDHIAAALSAR
HHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
QASETCVRQFVADASHELRTPLAAIRGYTELTQRIGDDPEAVAHAMSRVASETERITRLV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH
EDLLLLARLDSGRPLERGPVDMSRLAVDAVSDAHVAGPDHQWALDLPPEPVVIPGDAARL
HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCEEECCCCCCEEECCCHHHH
HQVVTNLLANARVHTGPGTIVTTRLSTGPTHVVLQVIDNGPGIPAALQSEVFERFARGDT
HHHHHHHHHCCEEECCCCCEEEEEECCCCCEEEEEEECCCCCCCHHHHHHHHHHHHCCCC
SRSRQAGSTGLGLAIVSAVVKAHNGTITVSSSPGYTEFAVRLPLDGWQPLESSPR
CHHHHCCCCCHHHHHHHHHHHHCCCEEEEECCCCCEEEEEEECCCCCCCCCCCCC
>Mature Secondary Structure 
GITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWRHETHNYIRSGPGPRFL
CCCHHHHHHHHHHHHHHHHHHCCCCCHHHEEECHHHCCCCCCCCHHHHHHHCCCCCCCC
DAPGQPAGMVAAVVSDGTTVAAGYLTGSGSRAALTSTGRSQLERIAGSRTPLTLDLDGLG
CCCCCCHHHEEEEECCCCEEEEEEEECCCCCEEHHHCCHHHHHHHCCCCCCEEEEECCCC
RYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVTVIALVAATTAGIVIIKRALA
CEEEEECCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTEVGQLGSALNRMLDHIAAALSAR
HHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
QASETCVRQFVADASHELRTPLAAIRGYTELTQRIGDDPEAVAHAMSRVASETERITRLV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH
EDLLLLARLDSGRPLERGPVDMSRLAVDAVSDAHVAGPDHQWALDLPPEPVVIPGDAARL
HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCEEECCCCCCEEECCCHHHH
HQVVTNLLANARVHTGPGTIVTTRLSTGPTHVVLQVIDNGPGIPAALQSEVFERFARGDT
HHHHHHHHHCCEEECCCCCEEEEEECCCCCEEEEEEECCCCCCCHHHHHHHHHHHHCCCC
SRSRQAGSTGLGLAIVSAVVKAHNGTITVSSSPGYTEFAVRLPLDGWQPLESSPR
CHHHHCCCCCHHHHHHHHHHHHCCCEEEEECCCCCEEEEEEECCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA