Definition Rhizobium leguminosarum bv. trifolii WSM2304 chromosome, complete genome.
Accession NC_011369
Length 4,537,948

The map label for this gene is 209550227

Identifier: 209550227

GI number: 209550227

Start: 2694662

End: 2695546

Strand: Direct

Name: 209550227

Synonym: Rleg2_2647

Alternate gene names: NA

Gene position: 2694662-2695546 (Clockwise)

Preceding gene: 209550220

Following gene: 209550228

Centisome position: 59.38
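
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, not part of the record itself: the function names `gene_length` and `centisome` are illustrative, and the centisome formula (start coordinate as a percentage of the chromosome length) is an assumption that reproduces the listed value.

```python
GENOME_LENGTH = 4_537_948  # "Length" field of NC_011369


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(2694662, 2695546))                  # 885
print(round(centisome(2694662, GENOME_LENGTH), 2))    # 59.38
```

The computed length of 885 agrees with the `>885_bases` header of the gene sequence.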

GC content: 64.41

Gene sequence:

>885_bases
ATGACGGAGTTTGCAATCGGCATAGACGGCGGTGGAACGAGCTGCCGAGCCGCGGTCGCGGACAGACACGGCAATGTCAT
CGGCCGTGGCAAAGCAGGTCCGGCCAATATCCTGTCCGATCTGGAAAACTCTCTTCTCAACATCGTCGAATCTGCCCGGC
AGGCGCTGATCGATGCAGGGCTTGCCGCTGAAACCATCGCCTCATCTGCGTCAGTGGTCGGCGTTGCCGGCGCCAATGTC
ACGGACTACGGCCAGCGCATCGAAAAGGCCCTGCCCTTTGCCGAAGGCCGCGTCGTCACCGACGCGCTGATTGCCCTGCA
GGGCGCGCTCGGCGATGGCGACGGCATCGTCGGCGCCTTCGGCACCGGCTCGGTCTATAATGCCCGCAGGAACGGCCGGC
TGAACGGCATTGGCGGCTGGGGCTTTATCGTCGGCGACCAGGCAAGCGGCGCCCGCCTCGGCCGCGACCTGATGGAGCGA
TCGCTGCTTGCCCATGACGGCGTGCGCCTGACGTCCCCCGTGACCGAAGCGATCATGGCCGAATACGGCAACGACCCTGA
AAGCATCGTCGAATTCGCACATTCGGCAAGACCGACGGATTTTGCCCGTTATGCGCCCGTCGTCTTCGAACATGCAGCCA
AGGGCGATGCCGTCGCGGTCGGCATCGTCACGGACGCGGCAACGGCAATCGGTGAAAGCCTCGAAGCGCTGCTCTGGCCT
GAATGCCCGTCGATCTGCCTGCTCGGCGGCCTTGCAGGAGCCTATGAGCCGTGGCTTTCCGAACGCTACAGATCACGGCT
TGCCAGGCCGAAGGGCGATGCCCTGCAGGGTGCGGTGGAACTTGCCGTCAAGCTCCTGCACGACGGACAGAGAGGTGCGG
CATGA
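
The GC content field can be checked against the gene sequence. The sketch below is illustrative only: `gc_content` is a hypothetical helper, and it assumes the listed value is the percentage of G and C bases over the full gene. For 885 bases, 64.41% corresponds to 570 G or C bases (570 / 885 = 64.41%).

```python
def gc_content(fasta_text: str) -> float:
    """Percent G+C of a sequence given as FASTA text or as a bare string."""
    stripped = (line.strip() for line in fasta_text.splitlines())
    # Drop FASTA header lines (those starting with ">") and join the rest.
    seq = "".join(line for line in stripped if not line.startswith(">")).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Example on the first 80-base row of the gene sequence only;
# paste all rows to check the whole gene.
first_row = (
    "ATGACGGAGTTTGCAATCGGCATAGACGGCGGTGGAACGAGCTGCCGAGCCGCGGTCGCGGACAGACACGGCAATGTCAT"
)
print(round(gc_content(first_row), 2))
```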

Upstream 100 bases:

>100_bases
ATGATAAAAAGATTGATTTTCTCCATCATTCCCACTTGGTAAATGGTATTTTTTTGGTATTGTATTTAAAAACATAGGGA
ACGGCGTATTCGGAGGCCCG

Downstream 100 bases:

>100_bases
CCGACGATCTCGCCACCCTTCTGTCCCTGGAGCGTCTGCAGGCAGCCGGCACCGGCCCGCTCTACGTCAAGCTGCGCCGC
ACGCTCGAAGAGGCCGTGCG

Product: ATPase BadF/BadG/BcrA/BcrD

Products: NA

Alternate protein names: N-Acetylglucosamine Kinase; BadF/BadG/BcrA/BcrD ATPase Family Protein; BadF/BadG/BcrA/BcrD Type ATPase; ATPase Family Protein; N-Acetylglucosamine Kinase Protein; BadF/BadG/BcrA/BcrD Family ATPase; ATPase BadF/BadG/BcrA/BcrD Type; BadF/BadG/BcrA/BcrD ATPase Family Superfamily; BadF/BadG/BcrA/BcrD ATPase Family; Glucosamine Kinase GpsK; Aryl-Alcohol Dehydrogenase; Glucosamine Kinase; BadF/BadG/BcrA/BcrD ATPase

Number of amino acids: Translated: 294; Mature: 293

Protein sequence:

>294_residues
MTEFAIGIDGGGTSCRAAVADRHGNVIGRGKAGPANILSDLENSLLNIVESARQALIDAGLAAETIASSASVVGVAGANV
TDYGQRIEKALPFAEGRVVTDALIALQGALGDGDGIVGAFGTGSVYNARRNGRLNGIGGWGFIVGDQASGARLGRDLMER
SLLAHDGVRLTSPVTEAIMAEYGNDPESIVEFAHSARPTDFARYAPVVFEHAAKGDAVAVGIVTDAATAIGESLEALLWP
ECPSICLLGGLAGAYEPWLSERYRSRLARPKGDALQGAVELAVKLLHDGQRGAA

Sequences:

>Translated_294_residues
MTEFAIGIDGGGTSCRAAVADRHGNVIGRGKAGPANILSDLENSLLNIVESARQALIDAGLAAETIASSASVVGVAGANV
TDYGQRIEKALPFAEGRVVTDALIALQGALGDGDGIVGAFGTGSVYNARRNGRLNGIGGWGFIVGDQASGARLGRDLMER
SLLAHDGVRLTSPVTEAIMAEYGNDPESIVEFAHSARPTDFARYAPVVFEHAAKGDAVAVGIVTDAATAIGESLEALLWP
ECPSICLLGGLAGAYEPWLSERYRSRLARPKGDALQGAVELAVKLLHDGQRGAA
>Mature_293_residues
TEFAIGIDGGGTSCRAAVADRHGNVIGRGKAGPANILSDLENSLLNIVESARQALIDAGLAAETIASSASVVGVAGANVT
DYGQRIEKALPFAEGRVVTDALIALQGALGDGDGIVGAFGTGSVYNARRNGRLNGIGGWGFIVGDQASGARLGRDLMERS
LLAHDGVRLTSPVTEAIMAEYGNDPESIVEFAHSARPTDFARYAPVVFEHAAKGDAVAVGIVTDAATAIGESLEALLWPE
CPSICLLGGLAGAYEPWLSERYRSRLARPKGDALQGAVELAVKLLHDGQRGAA
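
The translated and mature sequences follow from the gene sequence: 885 bases give 295 codons, the last of which is a stop codon, leaving 294 residues. The mature form is one residue shorter and lacks the leading Met. The sketch below assumes the standard genetic code and assumes that the mature form is produced only by removing the initiator Met; `translate` and `mature` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Remove the initiator Met, if present (assumed processing step)."""
    return protein[1:] if protein.startswith("M") else protein


# First 39 and last 21 bases of the gene sequence.
print(translate("ATGACGGAGTTTGCAATCGGCATAGACGGCGGTGGAACG"))  # MTEFAIGIDGGGT
print(translate("GGACAGAGAGGTGCGGCATGA"))                    # GQRGAA
print(885 // 3 - 1)                                          # 294
```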

Specific function: Unknown

COG id: COG2971

COG function: function code G; Predicted N-acetylglucosamine kinase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 30275; Mature: 30144
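
The two molecular weights differ by the mass of a single methionine residue, which is consistent with the mature form lacking only the initiator Met. A small check, assuming the weights are in daltons (the record does not state the unit) and using the average residue mass of methionine (about 131.19 Da):

```python
MET_RESIDUE_MASS = 131.19  # average mass of a Met residue in Da (free Met minus water)

translated_mw = 30275  # "Translated" value from the record
mature_mw = 30144      # "Mature" value from the record

difference = translated_mw - mature_mw
print(difference)                                # 131
print(abs(difference - MET_RESIDUE_MASS) < 1.0)  # True
```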

Theoretical pI: Translated: 4.79; Mature: 4.79

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
1.7 %Cys+Met (Mature Protein)
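
The Cys/Met percentages can be recomputed from the protein sequence. This is a sketch under the assumption that each value is the residue count divided by the sequence length, rounded to one decimal; `percent_of` is a hypothetical helper name.

```python
# Translated protein sequence as listed in the record (294 residues).
TRANSLATED = (
    "MTEFAIGIDGGGTSCRAAVADRHGNVIGRGKAGPANILSDLENSLLNIVESARQALIDAGLAAETIASSASVVGVAGANV"
    "TDYGQRIEKALPFAEGRVVTDALIALQGALGDGDGIVGAFGTGSVYNARRNGRLNGIGGWGFIVGDQASGARLGRDLMER"
    "SLLAHDGVRLTSPVTEAIMAEYGNDPESIVEFAHSARPTDFARYAPVVFEHAAKGDAVAVGIVTDAATAIGESLEALLWP"
    "ECPSICLLGGLAGAYEPWLSERYRSRLARPKGDALQGAVELAVKLLHDGQRGAA"
)
MATURE = TRANSLATED[1:]  # assumed: initiator Met removed


def percent_of(protein: str, residues: str) -> float:
    """Percentage of positions holding any of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent_of(seq, "C"), percent_of(seq, "M"), percent_of(seq, "CM"))
```

Under these assumptions the translated sequence contains 3 Cys and 3 Met residues, and the mature sequence 3 Cys and 2 Met.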

Secondary structure:

>Translated Secondary Structure
MTEFAIGIDGGGTSCRAAVADRHGNVIGRGKAGPANILSDLENSLLNIVESARQALIDAG
CCCEEEEECCCCCHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC
LAAETIASSASVVGVAGANVTDYGQRIEKALPFAEGRVVTDALIALQGALGDGDGIVGAF
HHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCEEEEC
GTGSVYNARRNGRLNGIGGWGFIVGDQASGARLGRDLMERSLLAHDGVRLTSPVTEAIMA
CCCCEEEHHCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH
EYGNDPESIVEFAHSARPTDFARYAPVVFEHAAKGDAVAVGIVTDAATAIGESLEALLWP
HHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHHHHHHCC
ECPSICLLGGLAGAYEPWLSERYRSRLARPKGDALQGAVELAVKLLHDGQRGAA
CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure 
TEFAIGIDGGGTSCRAAVADRHGNVIGRGKAGPANILSDLENSLLNIVESARQALIDAG
CCEEEEECCCCCHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC
LAAETIASSASVVGVAGANVTDYGQRIEKALPFAEGRVVTDALIALQGALGDGDGIVGAF
HHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCEEEEC
GTGSVYNARRNGRLNGIGGWGFIVGDQASGARLGRDLMERSLLAHDGVRLTSPVTEAIMA
CCCCEEEHHCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH
EYGNDPESIVEFAHSARPTDFARYAPVVFEHAAKGDAVAVGIVTDAATAIGESLEALLWP
HHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHHHHHHCC
ECPSICLLGGLAGAYEPWLSERYRSRLARPKGDALQGAVELAVKLLHDGQRGAA
CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCC
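
The prediction rows above use one state letter per residue. Assuming the usual three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, the composition of a prediction string can be summarised as follows; `ss_composition` is a hypothetical helper name.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of residues in each of the states H, E and C."""
    states = "".join(states.split()).upper()  # tolerate line breaks between rows
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Example on the first prediction row only; join all rows for the full protein.
first_row = "CCCEEEEECCCCCHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC"
print(ss_composition(first_row))
```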

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA