Definition Xanthomonas axonopodis pv. citri str. 306 chromosome, complete genome.
Accession NC_003919
Length 5,175,554

The map label for this gene is lhr2 [C]

Identifier: 21242093

GI number: 21242093

Start: 1544959

End: 1547460

Strand: Reverse

Name: lhr2 [C]

Synonym: XAC1340

Alternate gene names: 21242093

Gene position: 1547460-1544959 (Counterclockwise)

Preceding gene: 21242094

Following gene: 21242092

Centisome position: 29.9
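
The coordinate fields above are mutually consistent, which can be checked with a short sketch. The function names are hypothetical, and the record does not say which coordinate the centisome value is computed from; using the larger coordinate (the 5' end of a reverse-strand gene) is an assumption. Both coordinates round to the same value here.

```python
# Values copied from the record.
GENOME_LENGTH = 5_175_554
START, END = 1_544_959, 1_547_460


def gene_length(start, end):
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(position, genome_length):
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


print(gene_length(START, END))                    # 2502
print(round(centisome(END, GENOME_LENGTH), 1))    # 29.9
print(round(centisome(START, GENOME_LENGTH), 1))  # 29.9
```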

GC content: 67.63
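
A GC content figure of this kind is presumably the percentage of G and C bases in the gene sequence. A minimal sketch under that assumption follows; `gc_content` is a hypothetical helper, and the example input is only the first six codons of the gene, not the full 2502 bases.

```python
def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 18 bases of the gene sequence (11 of 18 are G or C).
print(round(gc_content("GTGAGCGCCACACAGCAA"), 2))  # 61.11
```

Applied to the full sequence with the line breaks removed, the same function should give a value comparable to the one reported above.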

Gene sequence:

>2502_bases
GTGAGCGCCACACAGCAAGTGCACGGCACACCGTTGCAACAGTGGCGTGCCTGGTTCGCGCAGCGCGGCTGGGCGCCGCT
GCCGTTTCAACGCGAGGTGTGGAAGCGCTATCTGGACGGCAAATCGGGCCTGCTGCACACGCCTACCGGCAGCGGCAAGA
CGCTGGCGGCATTTGGCGGGCCGTTACTGGAGGCATTGGCCGCGCGTGGGCGCAATTCCCCGCGCAAATCGGGCAAACCC
GCCTCGCCTGCGCGCCGCCAGCCGCAACGCAATCTGCAAGTGCTGTGGATCACGCCGTTGCGCGCACTGGCCGCCGACAC
TGCACGCGCGCTGCGCAAGCCGGTCGACGACTTGGGGCTGGACTGGCAGGTCGGCCTGCGTACCGGCGATGCCAGCGCGC
GCGACAAGCGGCTGGCACGCAGCGGCAAGCTCGACGTGCTGGTCACCACGCCCGAATCGCTGGCGTTGCTGTTGTCGTAT
GCGGATACCGCGCCACAACTGTCGGCGCTGCGCTGCGTCATCGTCGACGAGTGGCACGAGCTGCTGGGCAACAAGCGCGG
CGTATTGCTGCAATTGTGTCTTGCGCGCTTGCGCGGATGGACGCCGCAACTGCGCATCTGGGGGCTATCGGCAACGCTGG
GTAATTTGCCGCAGGCACGCGATGTGCTGTTGCCACATCGCCCCGAGGCGGCGCTGGTCTCCGGCGTCAAACCGCGCACC
ATGACGCTGGAAACCCTGCTGCCGCAAAGTGGCGAGCGGTTTCCGTGGGCGGGCCATCTTGGCCTGGCGCAACTGGCGCG
CGTGCTGCAGAAGATCATGCAGCAGCGCACCAGCCTGGTGTTTACCAACACGCGTGCGCAGGCCGAGCTATGGCATCAAG
CCTTGAGTGCGGTGTGGCCGGAGGATCTGGCCACACTGGCGCTGCATCACGGCTCGCTGGATCCAGCGTTACGCGCGGCC
GCCGAACGCGGGCTGGCCGACGGCAGCCTGCGCTGCGTGGTGGCCACCTCCAGCCTGGATCTGGGCGTCGACTTCCCGGC
CGTGGATCAGGTGCTGCAGGTCGGCAGCCCGAAAGGCATCGCACGCCTGCTGCAACGCGCCGGGCGCGCACGCCATCGTC
CCGGCGAATCAGGCCACGTGGTGTGCGTACCCTCGCATGCGCTGGAGCTGGTCGAATATGCGGCGGCACGGCGCGCGCTG
GTGCACAGCCATATCGAAGCGCGTCCACCGCCACGGTTATCGCTGGACGTGCTGGCGCAGCACTGTGTCACCCTCGCCCT
CGGCGGCGGCTTCCACGCCGATGCGTTGTTCGAGGAAGTGCGCGGCACCGATGCCTTCGCCGCGCTGGAAAAAACCACGT
GGAACGCGGTGCTGGATTTCATCGTGCAAGGCGGCAGCGCGCTGGCGCATTACCCGGACTTCCACAAGGTGATGCGCGAC
GACGACGGTCTGTACCGCGTGATCGACCGACGCGTCGCGCTGCGGCATCGGCTGTCCATCGGCACGATCACCAGCGACGG
CAGCGTGCGCGTGCAGTTTCTGCGCGGCGGGCGGCTGGGCGCGGTGGAAGAACAATTCATCGGCCGTCTGCGCCGCGGCG
ACCGCTTCCAGTTCGCCGGCCGCCTGCTGGAACTGGTGCGCCTGGAAGACATGACCGCCTACGTGCGCGTGGCAAAGGGC
GGCAGTGGCGTGGTGCCGAAATGGATGGGCGGGCGCATGCCGTTATCGTCGGCGCTGGGACGCGAGGTGGAAGCGGTGTT
CGCCGATCCCGGCGATGCGCCGGAGATGCAGGCCCTGGCGCCGTTGCTGCACCTGCAAGCGTCGCTTTCTTCGCTGCCTG
GGCCGGACCATCTGCTGGTGGAAAGCGTCAAGGCGCGCGACGGCCGCCACGTCTTCGTCTACCCGTTCGCCGGCAGGCAG
GTCAACGAAGGCCTGGCCGCGTTGCTCGCGGCACGCTGGGGCCGGCGCCACCGCAATACCTTCAGCTTCGCTGCCAACGA
CTATGGCTTTGTGCTGTCGCCAGCGCAGGATGTCGACATCGATGCCGATGCGCTGCAGACCCTGCTGTCACCTGCGGGCC
TGTTCGACGATCTGCGCGACAGCCTCAATCTGGGCGAACTGGCACGCCGGCAGTTCCGCGAAATCGCACGCGTGGCCGGA
TTGTTGTCGCCCTCGTTGCCTGGCCGGGCGCCGCGCAGCCTGCGCCAGCTGCAGGCCTCCAGCGGCCTGCTGTACGACGT
ATTGCAACGCTTCGACCCCGATCACCTGCTGCTCGCCCAGGCCGAACGCGAAGTGTTCGAAGGCCAGCTCGAACTGGCGC
GGCTTGCGCATGCCCTGGAAGATTGCGCGCGACGTGAGCTACGCCTGCGCAAGCCGCGCAGCCTCACGCCCTTATCGTTT
CCGCTCTGGGCCGAACGCGTGCGTGGGCAACTGAGTACCGAAGACTGGAAGGCCCGCGTATTGCGCGCTGCCGAACAACT
GGAACGCAAGCATGGGCGATAG

Upstream 100 bases:

>100_bases
AATCCGGCATCGCAGTGCGCTTCCCGCGCATCCTGCGCTGGCGTCACGACAAGCCGATGGCCGAAGCCGATCATCTGAGC
ACCCTGCAGGCACTGGCGCG

Downstream 100 bases:

>100_bases
CGTGCAACTGCAGCTGGCCGGCGAAACCGTGGAATTGCTCGGCGAACGCGCATTGTATCGGCCGGCACAACGCGCGCTAT
TGATCGCAGACCTGCATCTG

Product: helicase-related protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 833; Mature: 832
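
The residue counts follow from the gene length: 2502 bases are 834 codons, one of which is the stop codon, leaving 833 translated residues. The mature count of 832 is consistent with removal of the initiator methionine, although the record does not state this. The sketch below uses the standard genetic code and reads the first codon as Met (the gene starts with GTG, an alternative bacterial start codon); forcing Met for any first codon is a simplification, and the names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence up to the first stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for n in range(0, usable, 3):
        aa = CODON_TABLE[cds[n:n + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is always read as Met.
        residues.append("M" if n == 0 else aa)
    return "".join(residues)


print(translate("GTGAGCGCCACACAGCAA"))  # MSATQQ (first six codons of the gene)
print(2502 // 3 - 1)                    # 833 translated residues
```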

Protein sequence:

>833_residues
MSATQQVHGTPLQQWRAWFAQRGWAPLPFQREVWKRYLDGKSGLLHTPTGSGKTLAAFGGPLLEALAARGRNSPRKSGKP
ASPARRQPQRNLQVLWITPLRALAADTARALRKPVDDLGLDWQVGLRTGDASARDKRLARSGKLDVLVTTPESLALLLSY
ADTAPQLSALRCVIVDEWHELLGNKRGVLLQLCLARLRGWTPQLRIWGLSATLGNLPQARDVLLPHRPEAALVSGVKPRT
MTLETLLPQSGERFPWAGHLGLAQLARVLQKIMQQRTSLVFTNTRAQAELWHQALSAVWPEDLATLALHHGSLDPALRAA
AERGLADGSLRCVVATSSLDLGVDFPAVDQVLQVGSPKGIARLLQRAGRARHRPGESGHVVCVPSHALELVEYAAARRAL
VHSHIEARPPPRLSLDVLAQHCVTLALGGGFHADALFEEVRGTDAFAALEKTTWNAVLDFIVQGGSALAHYPDFHKVMRD
DDGLYRVIDRRVALRHRLSIGTITSDGSVRVQFLRGGRLGAVEEQFIGRLRRGDRFQFAGRLLELVRLEDMTAYVRVAKG
GSGVVPKWMGGRMPLSSALGREVEAVFADPGDAPEMQALAPLLHLQASLSSLPGPDHLLVESVKARDGRHVFVYPFAGRQ
VNEGLAALLAARWGRRHRNTFSFAANDYGFVLSPAQDVDIDADALQTLLSPAGLFDDLRDSLNLGELARRQFREIARVAG
LLSPSLPGRAPRSLRQLQASSGLLYDVLQRFDPDHLLLAQAEREVFEGQLELARLAHALEDCARRELRLRKPRSLTPLSF
PLWAERVRGQLSTEDWKARVLRAAEQLERKHGR

Sequences:

>Translated_833_residues
MSATQQVHGTPLQQWRAWFAQRGWAPLPFQREVWKRYLDGKSGLLHTPTGSGKTLAAFGGPLLEALAARGRNSPRKSGKP
ASPARRQPQRNLQVLWITPLRALAADTARALRKPVDDLGLDWQVGLRTGDASARDKRLARSGKLDVLVTTPESLALLLSY
ADTAPQLSALRCVIVDEWHELLGNKRGVLLQLCLARLRGWTPQLRIWGLSATLGNLPQARDVLLPHRPEAALVSGVKPRT
MTLETLLPQSGERFPWAGHLGLAQLARVLQKIMQQRTSLVFTNTRAQAELWHQALSAVWPEDLATLALHHGSLDPALRAA
AERGLADGSLRCVVATSSLDLGVDFPAVDQVLQVGSPKGIARLLQRAGRARHRPGESGHVVCVPSHALELVEYAAARRAL
VHSHIEARPPPRLSLDVLAQHCVTLALGGGFHADALFEEVRGTDAFAALEKTTWNAVLDFIVQGGSALAHYPDFHKVMRD
DDGLYRVIDRRVALRHRLSIGTITSDGSVRVQFLRGGRLGAVEEQFIGRLRRGDRFQFAGRLLELVRLEDMTAYVRVAKG
GSGVVPKWMGGRMPLSSALGREVEAVFADPGDAPEMQALAPLLHLQASLSSLPGPDHLLVESVKARDGRHVFVYPFAGRQ
VNEGLAALLAARWGRRHRNTFSFAANDYGFVLSPAQDVDIDADALQTLLSPAGLFDDLRDSLNLGELARRQFREIARVAG
LLSPSLPGRAPRSLRQLQASSGLLYDVLQRFDPDHLLLAQAEREVFEGQLELARLAHALEDCARRELRLRKPRSLTPLSF
PLWAERVRGQLSTEDWKARVLRAAEQLERKHGR
>Mature_832_residues
SATQQVHGTPLQQWRAWFAQRGWAPLPFQREVWKRYLDGKSGLLHTPTGSGKTLAAFGGPLLEALAARGRNSPRKSGKPA
SPARRQPQRNLQVLWITPLRALAADTARALRKPVDDLGLDWQVGLRTGDASARDKRLARSGKLDVLVTTPESLALLLSYA
DTAPQLSALRCVIVDEWHELLGNKRGVLLQLCLARLRGWTPQLRIWGLSATLGNLPQARDVLLPHRPEAALVSGVKPRTM
TLETLLPQSGERFPWAGHLGLAQLARVLQKIMQQRTSLVFTNTRAQAELWHQALSAVWPEDLATLALHHGSLDPALRAAA
ERGLADGSLRCVVATSSLDLGVDFPAVDQVLQVGSPKGIARLLQRAGRARHRPGESGHVVCVPSHALELVEYAAARRALV
HSHIEARPPPRLSLDVLAQHCVTLALGGGFHADALFEEVRGTDAFAALEKTTWNAVLDFIVQGGSALAHYPDFHKVMRDD
DGLYRVIDRRVALRHRLSIGTITSDGSVRVQFLRGGRLGAVEEQFIGRLRRGDRFQFAGRLLELVRLEDMTAYVRVAKGG
SGVVPKWMGGRMPLSSALGREVEAVFADPGDAPEMQALAPLLHLQASLSSLPGPDHLLVESVKARDGRHVFVYPFAGRQV
NEGLAALLAARWGRRHRNTFSFAANDYGFVLSPAQDVDIDADALQTLLSPAGLFDDLRDSLNLGELARRQFREIARVAGL
LSPSLPGRAPRSLRQLQASSGLLYDVLQRFDPDHLLLAQAEREVFEGQLELARLAHALEDCARRELRLRKPRSLTPLSFP
LWAERVRGQLSTEDWKARVLRAAEQLERKHGR

Specific function: Unknown

COG id: COG1201

COG function: function code R; Lhr-like helicases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]


Operon status: Not Known

Operon components: None

Similarity: Contains 1 helicase C-terminal domain [H]

Homologues:

Organism=Escherichia coli, GI1787942, Length=886, Percent_Identity=27.4266365688488, Blast_Score=209, Evalue=5e-55
Organism=Saccharomyces cerevisiae, GI6321020, Length=413, Percent_Identity=25.4237288135593, Blast_Score=89, Evalue=3e-18
Organism=Saccharomyces cerevisiae, GI6320497, Length=310, Percent_Identity=26.7741935483871, Blast_Score=73, Evalue=2e-13
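
Each homologue line above is a comma-separated list of key=value fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch, assuming that layout holds for every line; `parse_homologue` is a hypothetical helper, not part of any database tool.

```python
def parse_homologue(line):
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    # A trailing comma, if present, is ignored.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        key, sep, value = field.partition("=")
        if sep:
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787942, Length=886, "
    "Percent_Identity=27.4266365688488, Blast_Score=209, Evalue=5e-55,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```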

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014001
- InterPro:   IPR013701
- InterPro:   IPR011545
- InterPro:   IPR001650
- InterPro:   IPR014021
- InterPro:   IPR017170 [H]

Pfam domain/function: PF00270 DEAD; PF08494 DEAD_assoc; PF00271 Helicase_C [H]

EC number: 3.6.1.-

Molecular weight: Translated: 91722; Mature: 91590

Theoretical pI: Translated: 10.68; Mature: 10.68

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
0.8 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
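
These percentages are presumably residue counts divided by protein length, rounded to one decimal place. A sketch under that assumption, using a made-up ten-residue peptide rather than the full protein; `residue_percent` is a hypothetical helper, and the mature form is modelled by dropping the first residue.

```python
def residue_percent(protein, residues):
    """Percentage of the protein made up of the given residue letters."""
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


toy = "MCAAAAAAAA"   # hypothetical peptide, not from the record
mature = toy[1:]     # assumption: mature form lacks the initiator Met

print(residue_percent(toy, "C"))      # 10.0
print(residue_percent(toy, "CM"))     # 20.0
print(residue_percent(mature, "CM"))  # 11.1
```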

Secondary structure:

>Translated Secondary Structure
MSATQQVHGTPLQQWRAWFAQRGWAPLPFQREVWKRYLDGKSGLLHTPTGSGKTLAAFGG
CCCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCEEHHHCC
PLLEALAARGRNSPRKSGKPASPARRQPQRNLQVLWITPLRALAADTARALRKPVDDLGL
HHHHHHHHCCCCCCCCCCCCCCCHHCCCCCCCEEEEECHHHHHHHHHHHHHHCCHHHHCC
DWQVGLRTGDASARDKRLARSGKLDVLVTTPESLALLLSYADTAPQLSALRCVIVDEWHE
CEEEEEECCCCCHHHHHHHHCCCEEEEEECHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH
LLGNKRGVLLQLCLARLRGWTPQLRIWGLSATLGNLPQARDVLLPHRPEAALVSGVKPRT
HHCCCCCHHHHHHHHHHCCCCCCEEEEEEHHHHCCCCCCCCCCCCCCCCHHHHCCCCCCE
MTLETLLPQSGERFPWAGHLGLAQLARVLQKIMQQRTSLVFTNTRAQAELWHQALSAVWP
EEHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHEEEECCHHHHHHHHHHHHHHCH
EDLATLALHHGSLDPALRAAAERGLADGSLRCVVATSSLDLGVDFPAVDQVLQVGSPKGI
HHHHHHHHHCCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCHHHHHHHHCCCCHHH
ARLLQRAGRARHRPGESGHVVCVPSHALELVEYAAARRALVHSHIEARPPPRLSLDVLAQ
HHHHHHHHHHCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH
HCVTLALGGGFHADALFEEVRGTDAFAALEKTTWNAVLDFIVQGGSALAHYPDFHKVMRD
HHHHHHHCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCC
DDGLYRVIDRRVALRHRLSIGTITSDGSVRVQFLRGGRLGAVEEQFIGRLRRGDRFQFAG
CCHHHHHHHHHHHHHHHCCEEEEECCCCEEEEEEECCCCCCHHHHHHHHHHCCCHHHHHH
RLLELVRLEDMTAYVRVAKGGSGVVPKWMGGRMPLSSALGREVEAVFADPGDAPEMQALA
HHHHHHHHHHHHEEEEEECCCCCCCCHHCCCCCCHHHHHCCCEEEEECCCCCCCHHHHHH
PLLHLQASLSSLPGPDHLLVESVKARDGRHVFVYPFAGRQVNEGLAALLAARWGRRHRNT
HHHHHHHHHHHCCCCCHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCC
FSFAANDYGFVLSPAQDVDIDADALQTLLSPAGLFDDLRDSLNLGELARRQFREIARVAG
EEEECCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
LLSPSLPGRAPRSLRQLQASSGLLYDVLQRFDPDHLLLAQAEREVFEGQLELARLAHALE
HHCCCCCCCCHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
DCARRELRLRKPRSLTPLSFPLWAERVRGQLSTEDWKARVLRAAEQLERKHGR
HHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
SATQQVHGTPLQQWRAWFAQRGWAPLPFQREVWKRYLDGKSGLLHTPTGSGKTLAAFGG
CCCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCEEHHHCC
PLLEALAARGRNSPRKSGKPASPARRQPQRNLQVLWITPLRALAADTARALRKPVDDLGL
HHHHHHHHCCCCCCCCCCCCCCCHHCCCCCCCEEEEECHHHHHHHHHHHHHHCCHHHHCC
DWQVGLRTGDASARDKRLARSGKLDVLVTTPESLALLLSYADTAPQLSALRCVIVDEWHE
CEEEEEECCCCCHHHHHHHHCCCEEEEEECHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH
LLGNKRGVLLQLCLARLRGWTPQLRIWGLSATLGNLPQARDVLLPHRPEAALVSGVKPRT
HHCCCCCHHHHHHHHHHCCCCCCEEEEEEHHHHCCCCCCCCCCCCCCCCHHHHCCCCCCE
MTLETLLPQSGERFPWAGHLGLAQLARVLQKIMQQRTSLVFTNTRAQAELWHQALSAVWP
EEHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHEEEECCHHHHHHHHHHHHHHCH
EDLATLALHHGSLDPALRAAAERGLADGSLRCVVATSSLDLGVDFPAVDQVLQVGSPKGI
HHHHHHHHHCCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCHHHHHHHHCCCCHHH
ARLLQRAGRARHRPGESGHVVCVPSHALELVEYAAARRALVHSHIEARPPPRLSLDVLAQ
HHHHHHHHHHCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH
HCVTLALGGGFHADALFEEVRGTDAFAALEKTTWNAVLDFIVQGGSALAHYPDFHKVMRD
HHHHHHHCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCC
DDGLYRVIDRRVALRHRLSIGTITSDGSVRVQFLRGGRLGAVEEQFIGRLRRGDRFQFAG
CCHHHHHHHHHHHHHHHCCEEEEECCCCEEEEEEECCCCCCHHHHHHHHHHCCCHHHHHH
RLLELVRLEDMTAYVRVAKGGSGVVPKWMGGRMPLSSALGREVEAVFADPGDAPEMQALA
HHHHHHHHHHHHEEEEEECCCCCCCCHHCCCCCCHHHHHCCCEEEEECCCCCCCHHHHHH
PLLHLQASLSSLPGPDHLLVESVKARDGRHVFVYPFAGRQVNEGLAALLAARWGRRHRNT
HHHHHHHHHHHCCCCCHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCC
FSFAANDYGFVLSPAQDVDIDADALQTLLSPAGLFDDLRDSLNLGELARRQFREIARVAG
EEEECCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
LLSPSLPGRAPRSLRQLQASSGLLYDVLQRFDPDHLLLAQAEREVFEGQLELARLAHALE
HHCCCCCCCCHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
DCARRELRLRKPRSLTPLSFPLWAERVRGQLSTEDWKARVLRAAEQLERKHGR
HHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCC
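
The structure strings above can be summarised as per-state percentages. The record does not define the letters; the sketch assumes the common three-state convention (H = helix, E = strand, C = coil), and `state_fractions` is a hypothetical helper shown on a made-up string.

```python
from collections import Counter


def state_fractions(structure):
    """Percentage of positions in each of the states H, E and C."""
    if not structure:
        raise ValueError("empty structure string")
    counts = Counter(structure)
    total = len(structure)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}


print(state_fractions("CCHHHHEECC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To apply it to the record, the structure lines (every second line of each block) would first be joined into one string.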

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on acid anhydrides [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8688087 [H]