Definition Mesorhizobium loti MAFF303099 plasmid pMLb, complete sequence.
Accession NC_002682
Length 208,315

Click here to switch to the map view.

The map label for this gene is frzE [H]

Identifier: 13488379

GI number: 13488379

Start: 6931

End: 9204

Strand: Reverse

Name: frzE [H]

Synonym: mll9511

Alternate gene names: 13488379

Gene position: 9204-6931 (Counterclockwise)

Preceding gene: 13488380

Following gene: 13488378

Centisome position: 4.42

GC content: 59.41

Gene sequence:

>2274_bases
ATGACCGACGAAGACTTCAGCCAGGTTTCGATGCTCAGCCTCTTCCAGGCCGAACTGGAGACGCAGTCACAGGCTCTAAC
CTCTGGGCTTCTGGCACTCGAACGCGATCCTATCGCCGCCGATGCCCTGGAAGCATGCATGCGTGCCGCGCATTCCCTGA
AGGGAGCTGCCCGCATCATCGACCTGATGCCCGCGGTCAAAGTTGCAAATGCGATGGAAGACTGCCTGGTCGCCGCGCAA
CGTGGGCATCTCAGGGTGCGACGGGAGCATATAGATACGCTTCTGCAAGGCTCTGACCTTTTGAAATTGATCGCATCGGG
GTCATCGAGCGAGAGCGCCAACTCAGAAGTCGAAACCTTTCTTGCGAAATTCGAGGAATTGTTCCTCCCCGAGGCAAAGC
CCAGACCCCAAGATTCTGCAGCTGATTTCGCAACTCCAGATCATGCCCAGGAGCAGCCGGCGACGACCTCTTCAGCGACC
CTCCTCGTTGGGACGGTACCGCAAGCCCACGGCAATATCGGGTCTGATAGAATGGTTCGGGTGACGGCAGACAGCCTGAA
CCGATTGCTCGGTTTGGCTGGCGAGTCGCTGATAGAAGCTCGGCGGCTCAGGCCATTCGCTGACGGGCTGCTGCGGCTGA
AGCGACTTCAATCGGACCTCGCAAGTGCGTTCGACAATCTTCATGCAGTGCTACCGCCGGGTTCAAACGATGCGGCGGTG
GTGGAGGCGCTGGCTGAAGCGCAGCGAAAGCTTCGGTTGAGCCAAGCGTTCCTGGCGGAGCGTCTCGATGAATTGGATTC
AGTGGATCGGAAAGCGACCGATCTTGCCAACCGATTGTACGACGAAACGTTGGCAAGCCGGATGCGGCCTTTTGAGGACG
GCGTGCGACACTATGCGAGAACGGTCCGCGACCTCGGTCGTACCCTTGGCAAGCGGGTGCGACTGGAGATCGTAGGCGGC
TCGACCGGCATCGACCGCGATATCCTCGAGCAACTCGATGCCCCTTTGGGGCATCTTTTGCGCAACGCTGTAGACCATGG
GCTTGAACCGCCTGAGGAGCGCCTCGCGATGGGGAAACCCGAAGAGGGGCTGATCAGGCTCGAAGCGAGGCATAATGCCG
GGCTGCTGCAGATAGCCGTGGTAGACGACGGCAGGGGGATCGAACTCTCGACGCTGCGCGAGACTGTCGTCGCCCGCCAG
CTTGCAACCAGGGAAAGCGCTGACACACTCAGCGAAACCGAATTGCTCGCATTCCTGTTTCTGCCAGGCTTTTCGATGAA
AGCGGGCCTCACTGACGTCTCCGGTCGCGGTGTGGGGCTAGATGCAGTGCAAGCCATGACGAAGGAGGTGCGGGGAGTCG
CAAGCGTCTCCTCAGAATTTGGGCACGGAACCCGATTTCAACTGCAATTGCCGCTGACGTTGTCAGTCCTGCGCACGCTG
CTCGTCGATGTCGATGGAGAGCCCTATGCTTTTCCACTCGCTGCCATTGCCAAGACTTTGAAACTGCAGCGGACGCAGAT
CAATCTCCTGCAGGGCCGGCCGCATTTTCGCTTGAACGATCGACAGATCGGGCTGGTCACCGCGCGAGAGGTGCTGGATC
GAGGTGAGCCCGGATCGGAACCGGACGAACTCTCGGTCGTCGTCGTGGATGCAGGTCGGGGCGATGCGTACGGCCTGATC
GTTGACCGTTTTCTCGGGGAACGCGAGTTGGTGATCAGACCCCTCGATCCACGGTTTGCCAAGATCAAGGACATCAGCGC
AGCAGCCCTGATGGAAGACGGTTCGCCGGTTCTGATCTTCGACGTCGACGATCTGATCCGATCGGTGGAAAAGCTTGCGT
CGTCGAGCGGATTTCGAGCTCTTCGGCGCGCCGCGGGCGGTCCCGAAGCGAGCAGGCGCAAGCGTGTGCTCGTGATTGAC
GATTCACTGACGGTGCGCGAACTCCAGCGCAAGATGCTGGGCAATTACGGTTATGAGGTCGAAGTGGCGGTCGACGGTAT
GGACGGGTGGAATGCCGTCCGGTCTGGTCCTTTCGATCTCGTCGTCACCGACATAGACATGCCTCGCATGGATGGCATCG
AGCTGGTGAGGTTGATCAGAAAAGACGCTCATCTCAAGAGCACACCCATCATGATCGTTTCATACAAGGATCGGGAAGAA
GATCGCGCGCGCGGGCTCGACGCAGGTGCGGACTATTATCTGACCAAGAGTAGCTTTCAGGACGAGGCGCTGATCCATGC
CGTCGTGGACATGATTGGCGAGGCGGCTGAGTGA

Upstream 100 bases:

>100_bases
CACTGTCAGCCGCTACGGTTGCCGTGCTGCCGTGGCAGGGACGAGCCGTTGGCTGTCTCGATCCACGACAACTGCTCACG
ATGCTGGATCGGAGCATCGC

Downstream 100 bases:

>100_bases
GATCGAAATCGGAAGCCAGCGAATGAGAATCGGCGTAGTCAACGACATGCCGATGGCGGTTGAACTTTTGCGCCGACTGG
TCCTGTCGACCGGCGAACAT

Product: chemotaxis histidine kinase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 757; Mature: 756

Protein sequence:

>757_residues
MTDEDFSQVSMLSLFQAELETQSQALTSGLLALERDPIAADALEACMRAAHSLKGAARIIDLMPAVKVANAMEDCLVAAQ
RGHLRVRREHIDTLLQGSDLLKLIASGSSSESANSEVETFLAKFEELFLPEAKPRPQDSAADFATPDHAQEQPATTSSAT
LLVGTVPQAHGNIGSDRMVRVTADSLNRLLGLAGESLIEARRLRPFADGLLRLKRLQSDLASAFDNLHAVLPPGSNDAAV
VEALAEAQRKLRLSQAFLAERLDELDSVDRKATDLANRLYDETLASRMRPFEDGVRHYARTVRDLGRTLGKRVRLEIVGG
STGIDRDILEQLDAPLGHLLRNAVDHGLEPPEERLAMGKPEEGLIRLEARHNAGLLQIAVVDDGRGIELSTLRETVVARQ
LATRESADTLSETELLAFLFLPGFSMKAGLTDVSGRGVGLDAVQAMTKEVRGVASVSSEFGHGTRFQLQLPLTLSVLRTL
LVDVDGEPYAFPLAAIAKTLKLQRTQINLLQGRPHFRLNDRQIGLVTAREVLDRGEPGSEPDELSVVVVDAGRGDAYGLI
VDRFLGERELVIRPLDPRFAKIKDISAAALMEDGSPVLIFDVDDLIRSVEKLASSSGFRALRRAAGGPEASRRKRVLVID
DSLTVRELQRKMLGNYGYEVEVAVDGMDGWNAVRSGPFDLVVTDIDMPRMDGIELVRLIRKDAHLKSTPIMIVSYKDREE
DRARGLDAGADYYLTKSSFQDEALIHAVVDMIGEAAE

Sequences:

>Translated_757_residues
MTDEDFSQVSMLSLFQAELETQSQALTSGLLALERDPIAADALEACMRAAHSLKGAARIIDLMPAVKVANAMEDCLVAAQ
RGHLRVRREHIDTLLQGSDLLKLIASGSSSESANSEVETFLAKFEELFLPEAKPRPQDSAADFATPDHAQEQPATTSSAT
LLVGTVPQAHGNIGSDRMVRVTADSLNRLLGLAGESLIEARRLRPFADGLLRLKRLQSDLASAFDNLHAVLPPGSNDAAV
VEALAEAQRKLRLSQAFLAERLDELDSVDRKATDLANRLYDETLASRMRPFEDGVRHYARTVRDLGRTLGKRVRLEIVGG
STGIDRDILEQLDAPLGHLLRNAVDHGLEPPEERLAMGKPEEGLIRLEARHNAGLLQIAVVDDGRGIELSTLRETVVARQ
LATRESADTLSETELLAFLFLPGFSMKAGLTDVSGRGVGLDAVQAMTKEVRGVASVSSEFGHGTRFQLQLPLTLSVLRTL
LVDVDGEPYAFPLAAIAKTLKLQRTQINLLQGRPHFRLNDRQIGLVTAREVLDRGEPGSEPDELSVVVVDAGRGDAYGLI
VDRFLGERELVIRPLDPRFAKIKDISAAALMEDGSPVLIFDVDDLIRSVEKLASSSGFRALRRAAGGPEASRRKRVLVID
DSLTVRELQRKMLGNYGYEVEVAVDGMDGWNAVRSGPFDLVVTDIDMPRMDGIELVRLIRKDAHLKSTPIMIVSYKDREE
DRARGLDAGADYYLTKSSFQDEALIHAVVDMIGEAAE
>Mature_756_residues
TDEDFSQVSMLSLFQAELETQSQALTSGLLALERDPIAADALEACMRAAHSLKGAARIIDLMPAVKVANAMEDCLVAAQR
GHLRVRREHIDTLLQGSDLLKLIASGSSSESANSEVETFLAKFEELFLPEAKPRPQDSAADFATPDHAQEQPATTSSATL
LVGTVPQAHGNIGSDRMVRVTADSLNRLLGLAGESLIEARRLRPFADGLLRLKRLQSDLASAFDNLHAVLPPGSNDAAVV
EALAEAQRKLRLSQAFLAERLDELDSVDRKATDLANRLYDETLASRMRPFEDGVRHYARTVRDLGRTLGKRVRLEIVGGS
TGIDRDILEQLDAPLGHLLRNAVDHGLEPPEERLAMGKPEEGLIRLEARHNAGLLQIAVVDDGRGIELSTLRETVVARQL
ATRESADTLSETELLAFLFLPGFSMKAGLTDVSGRGVGLDAVQAMTKEVRGVASVSSEFGHGTRFQLQLPLTLSVLRTLL
VDVDGEPYAFPLAAIAKTLKLQRTQINLLQGRPHFRLNDRQIGLVTAREVLDRGEPGSEPDELSVVVVDAGRGDAYGLIV
DRFLGERELVIRPLDPRFAKIKDISAAALMEDGSPVLIFDVDDLIRSVEKLASSSGFRALRRAAGGPEASRRKRVLVIDD
SLTVRELQRKMLGNYGYEVEVAVDGMDGWNAVRSGPFDLVVTDIDMPRMDGIELVRLIRKDAHLKSTPIMIVSYKDREED
RARGLDAGADYYLTKSSFQDEALIHAVVDMIGEAAE

Specific function: FrzE is involved in a sensory transduction pathway that controls the frequency at which cells reverse their gliding direction. FrzE seems to be capable of autophosphorylating itself on an histidine residue and then to transfer that group to an aspartate r

COG id: COG0643

COG function: function code NT; Chemotaxis protein histidine kinase and related kinases

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 response regulatory domain [H]

Homologues:

Organism=Escherichia coli, GI1788197, Length=370, Percent_Identity=35.1351351351351, Blast_Score=206, Evalue=3e-54,
Organism=Escherichia coli, GI1788191, Length=108, Percent_Identity=35.1851851851852, Blast_Score=79, Evalue=1e-15,
Organism=Escherichia coli, GI87082012, Length=102, Percent_Identity=33.3333333333333, Blast_Score=66, Evalue=9e-12,
Organism=Escherichia coli, GI145693157, Length=101, Percent_Identity=34.6534653465347, Blast_Score=66, Evalue=1e-11,
Organism=Escherichia coli, GI1786784, Length=102, Percent_Identity=32.3529411764706, Blast_Score=65, Evalue=2e-11,
Organism=Escherichia coli, GI1788713, Length=101, Percent_Identity=35.6435643564356, Blast_Score=63, Evalue=7e-11,
Organism=Escherichia coli, GI1786599, Length=125, Percent_Identity=30.4, Blast_Score=63, Evalue=9e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003594
- InterPro:   IPR002545
- InterPro:   IPR011006
- InterPro:   IPR004358
- InterPro:   IPR008207
- InterPro:   IPR005467
- InterPro:   IPR001789 [H]

Pfam domain/function: PF01584 CheW; PF02518 HATPase_c; PF01627 Hpt; PF00072 Response_reg [H]

EC number: =2.7.13.3 [H]

Molecular weight: Translated: 82691; Mature: 82560

Theoretical pI: Translated: 4.94; Mature: 4.94

Prosite motif: PS50851 CHEW ; PS50894 HPT ; PS50110 RESPONSE_REGULATORY ; PS50109 HIS_KIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MTDEDFSQVSMLSLFQAELETQSQALTSGLLALERDPIAADALEACMRAAHSLKGAARII
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH
DLMPAVKVANAMEDCLVAAQRGHLRVRREHIDTLLQGSDLLKLIASGSSSESANSEVETF
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHH
LAKFEELFLPEAKPRPQDSAADFATPDHAQEQPATTSSATLLVGTVPQAHGNIGSDRMVR
HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHCCCCCCCCEEE
VTADSLNRLLGLAGESLIEARRLRPFADGLLRLKRLQSDLASAFDNLHAVLPPGSNDAAV
EEHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCEEECCCCCCHHHH
VEALAEAQRKLRLSQAFLAERLDELDSVDRKATDLANRLYDETLASRMRPFEDGVRHYAR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
TVRDLGRTLGKRVRLEIVGGSTGIDRDILEQLDAPLGHLLRNAVDHGLEPPEERLAMGKP
HHHHHHHHHCCEEEEEEECCCCCCCHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHCCCCC
EEGLIRLEARHNAGLLQIAVVDDGRGIELSTLRETVVARQLATRESADTLSETELLAFLF
CCCEEEEEECCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LPGFSMKAGLTDVSGRGVGLDAVQAMTKEVRGVASVSSEFGHGTRFQLQLPLTLSVLRTL
CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHH
LVDVDGEPYAFPLAAIAKTLKLQRTQINLLQGRPHFRLNDRQIGLVTAREVLDRGEPGSE
HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCEEEHHHHHHHHHCCCCCCC
PDELSVVVVDAGRGDAYGLIVDRFLGERELVIRPLDPRFAKIKDISAAALMEDGSPVLIF
CCCEEEEEEECCCCCCHHHHHHHHCCCCCEEEECCCCCHHHHHCCHHHHHHCCCCCEEEE
DVDDLIRSVEKLASSSGFRALRRAAGGPEASRRKRVLVIDDSLTVRELQRKMLGNYGYEV
EHHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHCCEEEEEECCCHHHHHHHHHHCCCCCEE
EVAVDGMDGWNAVRSGPFDLVVTDIDMPRMDGIELVRLIRKDAHLKSTPIMIVSYKDREE
EEEEECCCCCHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCH
DRARGLDAGADYYLTKSSFQDEALIHAVVDMIGEAAE
HHHCCCCCCCCEEEECCCCCHHHHHHHHHHHHCCCCC
>Mature Secondary Structure 
TDEDFSQVSMLSLFQAELETQSQALTSGLLALERDPIAADALEACMRAAHSLKGAARII
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH
DLMPAVKVANAMEDCLVAAQRGHLRVRREHIDTLLQGSDLLKLIASGSSSESANSEVETF
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHH
LAKFEELFLPEAKPRPQDSAADFATPDHAQEQPATTSSATLLVGTVPQAHGNIGSDRMVR
HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHCCCCCCCCEEE
VTADSLNRLLGLAGESLIEARRLRPFADGLLRLKRLQSDLASAFDNLHAVLPPGSNDAAV
EEHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCEEECCCCCCHHHH
VEALAEAQRKLRLSQAFLAERLDELDSVDRKATDLANRLYDETLASRMRPFEDGVRHYAR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
TVRDLGRTLGKRVRLEIVGGSTGIDRDILEQLDAPLGHLLRNAVDHGLEPPEERLAMGKP
HHHHHHHHHCCEEEEEEECCCCCCCHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHCCCCC
EEGLIRLEARHNAGLLQIAVVDDGRGIELSTLRETVVARQLATRESADTLSETELLAFLF
CCCEEEEEECCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LPGFSMKAGLTDVSGRGVGLDAVQAMTKEVRGVASVSSEFGHGTRFQLQLPLTLSVLRTL
CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHH
LVDVDGEPYAFPLAAIAKTLKLQRTQINLLQGRPHFRLNDRQIGLVTAREVLDRGEPGSE
HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCEEEHHHHHHHHHCCCCCCC
PDELSVVVVDAGRGDAYGLIVDRFLGERELVIRPLDPRFAKIKDISAAALMEDGSPVLIF
CCCEEEEEEECCCCCCHHHHHHHHCCCCCEEEECCCCCHHHHHCCHHHHHHCCCCCEEEE
DVDDLIRSVEKLASSSGFRALRRAAGGPEASRRKRVLVIDDSLTVRELQRKMLGNYGYEV
EHHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHCCEEEEEECCCHHHHHHHHHHCCCCCEE
EVAVDGMDGWNAVRSGPFDLVVTDIDMPRMDGIELVRLIRKDAHLKSTPIMIVSYKDREE
EEEEECCCCCHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCH
DRARGLDAGADYYLTKSSFQDEALIHAVVDMIGEAAE
HHHCCCCCCCCEEEECCCCCHHHHHHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2165608; 2123853 [H]