| Definition | Mesorhizobium loti MAFF303099 chromosome, complete genome. |
|---|---|
| Accession | NC_002678 |
| Length | 7,036,071 |
The map label for this gene is ygeK [C]
Identifier: 13474495
GI number: 13474495
Start: 4306476
End: 4308029
Strand: Reverse
Name: ygeK [C]
Synonym: mll5392
Alternate gene names: 13474495
Gene position: 4308029-4306476 (Counterclockwise)
Preceding gene: 13474496
Following gene: 13474494
Centisome position: 61.23
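The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the assumption that the centisome value is computed from the higher coordinate (the 5' end of this reverse-strand gene) is inferred from the numbers in this record, not from a documented definition.

```python
# Values copied from this record.
GENOME_LENGTH = 7_036_071          # Length
START, END = 4_306_476, 4_308_029  # Start, End

def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene spanning start..end, inclusive."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100.0 * position / genome_length, 2)

print(gene_length(START, END))        # 1554
print(centisome(END, GENOME_LENGTH))  # 61.23
```

Both results agree with the listed gene sequence length (1554 bases) and centisome position (61.23).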
GC content: 66.47
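GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the whitespace stripping is an assumption made so that sequences wrapped with spaces, like those in this record, can be passed in directly.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop spaces and newlines left over from line wrapping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)

print(gc_content("AT GC"))  # 50.0
```

Applied to the 1554-base gene sequence in this record, the same function should give a value close to the listed 66.47.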
Gene sequence:
>1554_bases GTGAGGCCGATCGAGACCCGATATGCCCTCAGTGGCGAGGCGCGGATCGCCTATCAGGTGGTCGGCCAGGGTTCGCTTGA TCTTGTCTTCGTGCCGGGCTTCATTTCCAATCTCGACCTGCATTGGGAAGACGAAGGCTATACAAGGCTCTTGAGACGGC TGTCGGCATTCTCGCGGCTGATCCTGTTCGACAAGCGCGGCACCGGCCTTTCCGACCGTGTCGATGCGCACAACCTACCC AGTCTGGAAACGCGTATGGACGATGTGCGCGCGGTGATGGACGCCGCCGGCAGCGGTCGTGCTGCCCTGCTTGGCGCTTC GGAAGGCGCGCCAATGGCCATGCTGTTCGCCGCCACCTATCCCGAGCGGACGCGCGCGCTGGCGCTCTATGGTGGATATG CGCATTTCCACAAATGGGTGATGCCGCCCGAACGCCTCAACGCCTTTATCGCCACGGCCGAGACGGCCTGGGGCACCGGT GCCACCTTGCCGAATTTCGCGCCGGGCCGGGTCGACGACGCGCATTTCACCCAGTGGTGGGCGCGATTCGAGCGGCTGTC GGCGAGCCCGACGGCAGCGGCGGCACTGGCGCGAATGAATGCGGAAATCGATGTGCGTGGTGTGCTGGCGGCGATCAGTG CCCCGACTCTGCTGATCCATCGCCGCAACGATGCGCGGGTCGATCCCGACGCCAGCCGCTTCCTGGCCAAGAAAATCCCG AATGCGCGACTGGTCGAGATTCCGGGTCGTGACCATCCGATCTGGACCGGCGATGTCGACCGGGTCGCCGACCTGATCGA GGAGTTCCTGACCGGCACGCGCGCCGTCGCCGAAGCGGACCGTGTGCTGGCGGCACTTCTGGTGACGCGCATCTACGACA CGACGCGGATGGGCGACCGTATGTGGAGCGAGCGCAGCGAGCGCTTCCAGGAAACCTGGCGGTTGCTGGTCGGGCGCCAT GGCGGCCGAGCGCTGGGCACGCAGGGCGAGATGATGATTTCGCGCTTCGATGGGCCGGCGCGCGCGATACGCTGTGCAGC GGCGCTGCGCGAGGCAGCTCAAGGGATCGGCGTGGCCAGCGCGCAGGGCGTGCATGTCGGGGAGATCGAACTGCGCGGGC CGCCAGTGGGGTTGACGGCCCGCGTGACGATGCAGCTCGCCGCGCACGCCAGCCGGAGCGATATCCTGGCGTCGCGGCTG GTCGCGGATCTGGCAACCGGCTCAGGCCTGCATTTCGAAGACGCCAGCCGGATCACGCTGGACGATCTGGACGAGCCGAT GGCGCTGGTGCGGGCAATGTCGGAACAGCACCTGGAGCCTGACTGCCGCGTCAGGACCAAAACGACCGAACCGGCCGTGC TGACGGCGAGGGAAAGCGAGGTGGTGAGCCTGATTGCCGACGGCAAGAGCAATGCCGCGATTGCAGCCGAGCTCAGGCTG AGCGAACACACGGTCAAGCGCCACGTCGCCAACATATTGCTCAAGCTCGACCTGCCGTCGCGAGCGGCGGCGGCAGTATT CTCGGTCCGACACACTGGCCCGGACGGGCCATGA
Upstream 100 bases:
>100_bases CCGGACATACATTCACCTTCGCTAAAGTTTGAGCTGCCCTATCGAAGCGGCTCTGCTATGGTTGACGATCAGAAGTCGTC AGGCCGGGCGGAAGAAACTC
Downstream 100 bases:
>100_bases GAGCCATGGCGCTTTCGGGCGAAGCGGCCAGACGGATGTAAGCGCTATCAATTCCCCCATGCAGGCCAGGCGGATATCGC CGGAAGGGCACTGCGAACGG
Product: hypothetical protein
Products: NA
Alternate protein names: Transcriptional Regulator CadC; Transcriptional Regulator SARP Family; Lignin Peroxidase LipJ; Hydrolase Alpha/Beta Hydrolase Fold Family Protein; Two-Component System Response Regulator; LuxR Family Transcriptional Regulator; Adenylate/Guanylate Cyclase; Transcriptional Regulator LuxR Family; Alpha/Beta Hydrolase Fold; Protein Kinase; Hydrolase Alpha/Beta Fold Family Protein; Hydrolase And Adenylate/Guanylate Cyclase
Number of amino acids: Translated: 517; Mature: 517
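The amino acid count follows from the gene length if the coding sequence is read in triplets and the final codon is a stop codon (the gene sequence here ends in TGA). A short sketch under that assumption, with a hypothetical function name:

```python
def protein_length(n_bases: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of n_bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    # The stop codon does not code for an amino acid.
    return codons - 1 if includes_stop else codons

print(protein_length(1554))  # 517
```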
Protein sequence:
>517_residues MRPIETRYALSGEARIAYQVVGQGSLDLVFVPGFISNLDLHWEDEGYTRLLRRLSAFSRLILFDKRGTGLSDRVDAHNLP SLETRMDDVRAVMDAAGSGRAALLGASEGAPMAMLFAATYPERTRALALYGGYAHFHKWVMPPERLNAFIATAETAWGTG ATLPNFAPGRVDDAHFTQWWARFERLSASPTAAAALARMNAEIDVRGVLAAISAPTLLIHRRNDARVDPDASRFLAKKIP NARLVEIPGRDHPIWTGDVDRVADLIEEFLTGTRAVAEADRVLAALLVTRIYDTTRMGDRMWSERSERFQETWRLLVGRH GGRALGTQGEMMISRFDGPARAIRCAAALREAAQGIGVASAQGVHVGEIELRGPPVGLTARVTMQLAAHASRSDILASRL VADLATGSGLHFEDASRITLDDLDEPMALVRAMSEQHLEPDCRVRTKTTEPAVLTARESEVVSLIADGKSNAAIAAELRL SEHTVKRHVANILLKLDLPSRAAAAVFSVRHTGPDGP
Sequences:
>Translated_517_residues MRPIETRYALSGEARIAYQVVGQGSLDLVFVPGFISNLDLHWEDEGYTRLLRRLSAFSRLILFDKRGTGLSDRVDAHNLP SLETRMDDVRAVMDAAGSGRAALLGASEGAPMAMLFAATYPERTRALALYGGYAHFHKWVMPPERLNAFIATAETAWGTG ATLPNFAPGRVDDAHFTQWWARFERLSASPTAAAALARMNAEIDVRGVLAAISAPTLLIHRRNDARVDPDASRFLAKKIP NARLVEIPGRDHPIWTGDVDRVADLIEEFLTGTRAVAEADRVLAALLVTRIYDTTRMGDRMWSERSERFQETWRLLVGRH GGRALGTQGEMMISRFDGPARAIRCAAALREAAQGIGVASAQGVHVGEIELRGPPVGLTARVTMQLAAHASRSDILASRL VADLATGSGLHFEDASRITLDDLDEPMALVRAMSEQHLEPDCRVRTKTTEPAVLTARESEVVSLIADGKSNAAIAAELRL SEHTVKRHVANILLKLDLPSRAAAAVFSVRHTGPDGP
>Mature_517_residues MRPIETRYALSGEARIAYQVVGQGSLDLVFVPGFISNLDLHWEDEGYTRLLRRLSAFSRLILFDKRGTGLSDRVDAHNLP SLETRMDDVRAVMDAAGSGRAALLGASEGAPMAMLFAATYPERTRALALYGGYAHFHKWVMPPERLNAFIATAETAWGTG ATLPNFAPGRVDDAHFTQWWARFERLSASPTAAAALARMNAEIDVRGVLAAISAPTLLIHRRNDARVDPDASRFLAKKIP NARLVEIPGRDHPIWTGDVDRVADLIEEFLTGTRAVAEADRVLAALLVTRIYDTTRMGDRMWSERSERFQETWRLLVGRH GGRALGTQGEMMISRFDGPARAIRCAAALREAAQGIGVASAQGVHVGEIELRGPPVGLTARVTMQLAAHASRSDILASRL VADLATGSGLHFEDASRITLDDLDEPMALVRAMSEQHLEPDCRVRTKTTEPAVLTARESEVVSLIADGKSNAAIAAELRL SEHTVKRHVANILLKLDLPSRAAAAVFSVRHTGPDGP
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 56332; Mature: 56332
Theoretical pI: Translated: 7.28; Mature: 7.28
Prosite motif: PS00622 HTH_LUXR_1 ; PS50043 HTH_LUXR_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 0.4% Cys, 2.7% Met, 3.1% Cys+Met
Mature protein: 0.4% Cys, 2.7% Met, 3.1% Cys+Met
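The Cys/Met percentages are residue counts divided by the protein length. The sketch below shows that calculation on a toy sequence; the function name and the one-decimal rounding are assumptions chosen to match the precision listed above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    # Drop spaces and newlines left over from line wrapping.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * count / len(seq), 1)

toy = "MCAAAAAAAA"  # toy 10-residue sequence, not taken from this record
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Passing the 517-residue protein sequence from this record with "C", "M", and "CM" should reproduce the listed figures, assuming they were derived the same way.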
Secondary structure:
>Translated Secondary Structure
MRPIETRYALSGEARIAYQVVGQGSLDLVFVPGFISNLDLHWEDEGYTRLLRRLSAFSRL
CCCCCCCEEECCCCEEEEEEECCCCEEEEECCCHHCCCCEEECCCHHHHHHHHHHHHHHE
ILFDKRGTGLSDRVDAHNLPSLETRMDDVRAVMDAAGSGRAALLGASEGAPMAMLFAATY
EEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEEEEECC
PERTRALALYGGYAHFHKWVMPPERLNAFIATAETAWGTGATLPNFAPGRVDDAHFTQWW
CCHHEEEEEECCHHHHHHCCCCHHHHCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHH
ARFERLSASPTAAAALARMNAEIDVRGVLAAISAPTLLIHRRNDARVDPDASRFLAKKIP
HHHHHCCCCCHHHHHHHHHCCCEEHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHCC
NARLVEIPGRDHPIWTGDVDRVADLIEEFLTGTRAVAEADRVLAALLVTRIYDTTRMGDR
CCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
MWSERSERFQETWRLLVGRHGGRALGTQGEMMISRFDGPARAIRCAAALREAAQGIGVAS
HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHHHHHHCCCCCC
AQGVHVGEIELRGPPVGLTARVTMQLAAHASRSDILASRLVADLATGSGLHFEDASRITL
CCCEEEEEEEECCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCH
DDLDEPMALVRAMSEQHLEPDCRVRTKTTEPAVLTARESEVVSLIADGKSNAAIAAELRL
HCCCHHHHHHHHHHHHCCCCCCEEEECCCCCEEEEECCHHEEEEEECCCCCCEEEEEEHH
SEHTVKRHVANILLKLDLPSRAAAAVFSVRHTGPDGP
HHHHHHHHHHHEEEEECCCCHHHHHHHEEECCCCCCC
>Mature Secondary Structure
MRPIETRYALSGEARIAYQVVGQGSLDLVFVPGFISNLDLHWEDEGYTRLLRRLSAFSRL
CCCCCCCEEECCCCEEEEEEECCCCEEEEECCCHHCCCCEEECCCHHHHHHHHHHHHHHE
ILFDKRGTGLSDRVDAHNLPSLETRMDDVRAVMDAAGSGRAALLGASEGAPMAMLFAATY
EEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEEEEECC
PERTRALALYGGYAHFHKWVMPPERLNAFIATAETAWGTGATLPNFAPGRVDDAHFTQWW
CCHHEEEEEECCHHHHHHCCCCHHHHCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHH
ARFERLSASPTAAAALARMNAEIDVRGVLAAISAPTLLIHRRNDARVDPDASRFLAKKIP
HHHHHCCCCCHHHHHHHHHCCCEEHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHCC
NARLVEIPGRDHPIWTGDVDRVADLIEEFLTGTRAVAEADRVLAALLVTRIYDTTRMGDR
CCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
MWSERSERFQETWRLLVGRHGGRALGTQGEMMISRFDGPARAIRCAAALREAAQGIGVAS
HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHHHHHHCCCCCC
AQGVHVGEIELRGPPVGLTARVTMQLAAHASRSDILASRLVADLATGSGLHFEDASRITL
CCCEEEEEEEECCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCH
DDLDEPMALVRAMSEQHLEPDCRVRTKTTEPAVLTARESEVVSLIADGKSNAAIAAELRL
HCCCHHHHHHHHHHHHCCCCCCEEEECCCCCEEEEECCHHEEEEEECCCCCCEEEEEEHH
SEHTVKRHVANILLKLDLPSRAAAAVFSVRHTGPDGP
HHHHHHHHHHHEEEEECCCCHHHHHHHEEECCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA