Definition | Mesorhizobium loti MAFF303099 chromosome, complete genome. |
---|---|
Accession | NC_002678 |
Length | 7,036,071 |
The map label for this gene is ybeQ [H]
Identifier: 13470794
GI number: 13470794
Start: 455176
End: 456909
Strand: Direct
Name: ybeQ [H]
Synonym: mlr0590
Alternate gene names: 13470794
Gene position: 455176-456909 (Clockwise)
Preceding gene: 13470793
Following gene: 13470796
Centisome position: 6.47
GC content: 66.61
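The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming that the centisome position is the gene start expressed as a percentage of the chromosome length; that definition is an assumption that happens to reproduce the listed value, not something the record states. The function names are hypothetical.

```python
GENOME_LENGTH = 7_036_071       # chromosome length from the record header
START, END = 455_176, 456_909   # gene coordinates from the record


def gene_length(start, end):
    """Gene length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start, genome_length):
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))            # 1734, matching the >1734_bases header
print(centisome(START, GENOME_LENGTH))    # 6.47, matching the Centisome position field
```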
Gene sequence:
>1734_bases ATGACAATCCTGGATCGGCTGACTGGTCTGGCCTTCCCGGCGGCGGTATTGCGTCGAGCGATCAAGCTTGCTGACAGCGG CAAGGCCGCTGAAGCCTTTCCGCTTATGGCGAAAGCGGCCCGCGCGGGGATCGTCGACGCCGAGTACCGGGTCGCGCGTT GCTATCTGGAAGGCGCCGGTGTGCCACCGAGCCGTATGGAAGGCGCGCGATGGCTGCTGCGGGCGGCCGACCACGGGAGC GCGGATGCGCAAGCGCTCCTCGCCGCACTCTACGTCACCGGGCTGGTGACGTCGGAAGCCGACGGCAAGGGACCCTCGGA GCACTTATTCAAACCTGATTCCGCTGGAAAACCAGACTTCACGGCCGCTTTCGACTTTGCCACCAAGGCGGCCGAGGCAG GCTCGGCCACGGGACAGGCGATCCTCGGTTATATCCTTACCAGCGGCCCCGCATCGATGCGCGATGCCGATGCGGCCCAC CGGTGGTACGAGAAGTCGGCTTCTGCCGGCTGCGCCGAAGGGTGCCTTGGCTTTGCCCTGTCGCTGGCTCGCCGCGGCAA ACGCGAAAACAGAGTTCGGATTGCGGAAGAGGTAAGACGTGCAGCCGACGCCGGCCTGCCGACCGCGACCTATCTTCTTG CGGTCCTTACCGAGCACGGGCTCGGCGTAGCACGCGACATGGCAGCAGCCGCTCAACTGTATCAGGCTGCTGCGGAGAAG GGCCTGCCCTCTGCTCAATTCCGGTTGGGATTGGCACTGATCGACGGTGCACTCGTCGGCCAAGATGTCGCTGCTGGGGA GGCGTGGATGCGGCGTGCTGCCCTGGCCGGCAATATCGAGGCAGCCTATCTCCTGGGCGATCGCCATGCGAAGACGCAGC GGCCGGATTTTGCGGAAGCCGCGAACTGGTATCGGCGGGCAGCCGAAGCGGGCCACCAAGCCGCCGCCCGCGCTTTGGCC TCGCTCTATTTGACTGGAAATGGTGTGGCTGAGGATGTCGAGGAAGGAGCACGCTGGCTGCGCTCTTCCGCAAGTGCCGG CAATCAGCAGGCACAGACAGACCTCGCCAATCTTATCCTCGGAGGGGCTGGCGAGCCGGACGACGGTGCAAGCGTTGCCG GATGGTTCGAGGCGGCCGCATCATCGGGCGACCTCATCGCGGCATTCAACCTTGGGCTCTGCTTCGCCAAGGGCGTCGGC GTTCGCCAGGACGAGGGGCAAGCGGCACACTGGCTGCGGCGCGCGGCCGAAGGCGTCGCCGAGGCACAATATATGTATGC CCGCCTCCTCCAGGATGGACGCGGCGTGGCGGCCGACCCAACTCAGGCCCGCGTGTGGTTTGCGCGGGCGGCCGACGCCG GCGTGCTCGACGCACGCGTGGCGCTCGCCGAGATGCTACTCAATGGGCGCGGCGGCATGCCCGAGCCCGAGGCTGCAATG CAGTTGTTTGAACAGGCCGCCGCCGACGGTCACGCCGGGGCGATGTTTGCGATTGGCGCGCTGTACGGAGCCGGCCACGG CCTGCCGCTCGACCAGACAACGGCACAGAAATGGTACGCTGCCGCTGCCGGACGTGGGCATAGCCAAGCCCAATTCATGC TCGGACGTTATCTCTTGAAAGGCCTCGCCGGGGAGCGCGACCCGGTTGCCGCTCGCCTTTGGCTCGAACGTGCGGCTGCG CACGGGATCAGTGAAGCCGCCGACGAACTGGGGGCTGCGGGCGGGTCCGACTAG
Upstream 100 bases:
>100_bases GCCGCACGATACTCAAGTACATTCTCGGCGTCATGCTCCCGATCGGCCAGGAGGCAATGCGGGAGCCTTAGAGTACTGGA GGGATCAGAGGCTCGGATCG
Downstream 100 bases:
>100_bases TGCAGCGGTGGCCGTTGGAGTGAAGCAGCGCTAAGAAGGCTGGAGGACCGTTGGAAATTCGCTGACCGACTGGCTGCGAA AAAAGAAGAGAAGCAGAACT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 577; Mature: 576
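The two residue counts can be derived from the gene length. This is a sketch under two assumptions read off the record rather than stated by it: the 1734-base gene ends in a stop codon (the listed sequence ends in TAG), and the mature form differs from the translated form only by removal of the initiator methionine (the mature sequence begins TILDRL instead of MTILDRL). The function name is hypothetical.

```python
def protein_lengths(cds_bases):
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS includes one terminal stop codon and that the mature
    protein only loses its N-terminal methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = cds_bases // 3 - 1   # drop the stop codon
    mature = translated - 1           # drop the initiator Met
    return translated, mature


print(protein_lengths(1734))   # (577, 576)
```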
Protein sequence:
>577_residues MTILDRLTGLAFPAAVLRRAIKLADSGKAAEAFPLMAKAARAGIVDAEYRVARCYLEGAGVPPSRMEGARWLLRAADHGS ADAQALLAALYVTGLVTSEADGKGPSEHLFKPDSAGKPDFTAAFDFATKAAEAGSATGQAILGYILTSGPASMRDADAAH RWYEKSASAGCAEGCLGFALSLARRGKRENRVRIAEEVRRAADAGLPTATYLLAVLTEHGLGVARDMAAAAQLYQAAAEK GLPSAQFRLGLALIDGALVGQDVAAGEAWMRRAALAGNIEAAYLLGDRHAKTQRPDFAEAANWYRRAAEAGHQAAARALA SLYLTGNGVAEDVEEGARWLRSSASAGNQQAQTDLANLILGGAGEPDDGASVAGWFEAAASSGDLIAAFNLGLCFAKGVG VRQDEGQAAHWLRRAAEGVAEAQYMYARLLQDGRGVAADPTQARVWFARAADAGVLDARVALAEMLLNGRGGMPEPEAAM QLFEQAAADGHAGAMFAIGALYGAGHGLPLDQTTAQKWYAAAAGRGHSQAQFMLGRYLLKGLAGERDPVAARLWLERAAA HGISEAADELGAAGGSD
Sequences:
>Translated_577_residues MTILDRLTGLAFPAAVLRRAIKLADSGKAAEAFPLMAKAARAGIVDAEYRVARCYLEGAGVPPSRMEGARWLLRAADHGS ADAQALLAALYVTGLVTSEADGKGPSEHLFKPDSAGKPDFTAAFDFATKAAEAGSATGQAILGYILTSGPASMRDADAAH RWYEKSASAGCAEGCLGFALSLARRGKRENRVRIAEEVRRAADAGLPTATYLLAVLTEHGLGVARDMAAAAQLYQAAAEK GLPSAQFRLGLALIDGALVGQDVAAGEAWMRRAALAGNIEAAYLLGDRHAKTQRPDFAEAANWYRRAAEAGHQAAARALA SLYLTGNGVAEDVEEGARWLRSSASAGNQQAQTDLANLILGGAGEPDDGASVAGWFEAAASSGDLIAAFNLGLCFAKGVG VRQDEGQAAHWLRRAAEGVAEAQYMYARLLQDGRGVAADPTQARVWFARAADAGVLDARVALAEMLLNGRGGMPEPEAAM QLFEQAAADGHAGAMFAIGALYGAGHGLPLDQTTAQKWYAAAAGRGHSQAQFMLGRYLLKGLAGERDPVAARLWLERAAA HGISEAADELGAAGGSD >Mature_576_residues TILDRLTGLAFPAAVLRRAIKLADSGKAAEAFPLMAKAARAGIVDAEYRVARCYLEGAGVPPSRMEGARWLLRAADHGSA DAQALLAALYVTGLVTSEADGKGPSEHLFKPDSAGKPDFTAAFDFATKAAEAGSATGQAILGYILTSGPASMRDADAAHR WYEKSASAGCAEGCLGFALSLARRGKRENRVRIAEEVRRAADAGLPTATYLLAVLTEHGLGVARDMAAAAQLYQAAAEKG LPSAQFRLGLALIDGALVGQDVAAGEAWMRRAALAGNIEAAYLLGDRHAKTQRPDFAEAANWYRRAAEAGHQAAARALAS LYLTGNGVAEDVEEGARWLRSSASAGNQQAQTDLANLILGGAGEPDDGASVAGWFEAAASSGDLIAAFNLGLCFAKGVGV RQDEGQAAHWLRRAAEGVAEAQYMYARLLQDGRGVAADPTQARVWFARAADAGVLDARVALAEMLLNGRGGMPEPEAAMQ LFEQAAADGHAGAMFAIGALYGAGHGLPLDQTTAQKWYAAAAGRGHSQAQFMLGRYLLKGLAGERDPVAARLWLERAAAH GISEAADELGAAGGSD
Specific function: Unknown
COG id: COG0790
COG function: function code R; FOG: TPR repeat, SEL1 subfamily
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: To E.coli ybeT [H]
Homologues:
Organism=Homo sapiens, GI19923669, Length=541, Percent_Identity=25.3234750462107, Blast_Score=87, Evalue=4e-17
Organism=Homo sapiens, GI151301150, Length=569, Percent_Identity=25.3075571177504, Blast_Score=85, Evalue=1e-16
Organism=Escherichia coli, GI87081769, Length=297, Percent_Identity=29.96632996633, Blast_Score=117, Evalue=2e-27
Organism=Escherichia coli, GI1790515, Length=260, Percent_Identity=28.0769230769231, Blast_Score=64, Evalue=4e-11
Organism=Caenorhabditis elegans, GI17563256, Length=352, Percent_Identity=25.8522727272727, Blast_Score=66, Evalue=5e-11
Organism=Drosophila melanogaster, GI21355295, Length=346, Percent_Identity=26.5895953757225, Blast_Score=76, Evalue=5e-14
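The homologue entries follow a fixed field order, so they can be parsed mechanically. The sketch below is one possible way to do that with a regular expression; the pattern and the function name `parse_homologues` are assumptions based only on the layout seen in this record, and the sample string copies two of the entries above.

```python
import re

# Field order taken from the Homologues entries of this record (assumed stable).
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text):
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


SAMPLE = (
    "Organism=Homo sapiens, GI19923669, Length=541, "
    "Percent_Identity=25.3234750462107, Blast_Score=87, Evalue=4e-17, "
    "Organism=Escherichia coli, GI87081769, Length=297, "
    "Percent_Identity=29.96632996633, Blast_Score=117, Evalue=2e-27,"
)

hits = parse_homologues(SAMPLE)
best = max(hits, key=lambda h: h["score"])   # highest BLAST score in the sample
print(best["organism"], best["gi"], best["score"])
```

Because the pattern is matched with `finditer`, it works whether the entries sit on one comma-separated line or on separate lines.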
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006597
- InterPro: IPR011990 [H]
Pfam domain/function: PF08238 Sel1 [H]
EC number: NA
Molecular weight: Translated: 59910; Mature: 59779
Theoretical pI: Translated: 6.00; Mature: 6.00
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.7 %Cys (Mature Protein)
1.9 %Met (Mature Protein)
2.6 %Cys+Met (Mature Protein)
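Percentages of this kind are residue counts divided by sequence length. The sketch below shows the calculation on the first 60 residues of the translated protein only, so its numbers are not the whole-protein values listed above. The function name `cys_met_content` is hypothetical, and the mature form is assumed to be the translated form without its first residue.

```python
def cys_met_content(protein):
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# First 60 residues of the translated sequence from this record (illustration only).
FIRST_60 = "MTILDRLTGLAFPAAVLRRAIKLADSGKAAEAFPLMAKAARAGIVDAEYRVARCYLEGAG"

translated = cys_met_content(FIRST_60)
mature = cys_met_content(FIRST_60[1:])   # assumed: initiator Met removed
print(translated)
print(mature)
```

Removing the initiator methionine lowers the Met percentage while leaving the Cys count unchanged, which is the same pattern seen between the translated and mature values in the record.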
Secondary structure:
>Translated Secondary Structure
MTILDRLTGLAFPAAVLRRAIKLADSGKAAEAFPLMAKAARAGIVDAEYRVARCYLEGAG
CCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCC
VPPSRMEGARWLLRAADHGSADAQALLAALYVTGLVTSEADGKGPSEHLFKPDSAGKPDF
CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCC
TAAFDFATKAAEAGSATGQAILGYILTSGPASMRDADAAHRWYEKSASAGCAEGCLGFAL
HHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH
SLARRGKRENRVRIAEEVRRAADAGLPTATYLLAVLTEHGLGVARDMAAAAQLYQAAAEK
HHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
GLPSAQFRLGLALIDGALVGQDVAAGEAWMRRAALAGNIEAAYLLGDRHAKTQRPDFAEA
CCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHH
ANWYRRAAEAGHQAAARALASLYLTGNGVAEDVEEGARWLRSSASAGNQQAQTDLANLIL
HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
GGAGEPDDGASVAGWFEAAASSGDLIAAFNLGLCFAKGVGVRQDEGQAAHWLRRAAEGVA
CCCCCCCCCCHHHHHHHHHCCCCCEEEEHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH
EAQYMYARLLQDGRGVAADPTQARVWFARAADAGVLDARVALAEMLLNGRGGMPEPEAAM
HHHHHHHHHHHCCCCCCCCCCHHHEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHH
QLFEQAAADGHAGAMFAIGALYGAGHGLPLDQTTAQKWYAAAAGRGHSQAQFMLGRYLLK
HHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHH
GLAGERDPVAARLWLERAAAHGISEAADELGAAGGSD
HHCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCCCCCC
>Mature Secondary Structure
TILDRLTGLAFPAAVLRRAIKLADSGKAAEAFPLMAKAARAGIVDAEYRVARCYLEGAG
CHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCC
VPPSRMEGARWLLRAADHGSADAQALLAALYVTGLVTSEADGKGPSEHLFKPDSAGKPDF
CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCC
TAAFDFATKAAEAGSATGQAILGYILTSGPASMRDADAAHRWYEKSASAGCAEGCLGFAL
HHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH
SLARRGKRENRVRIAEEVRRAADAGLPTATYLLAVLTEHGLGVARDMAAAAQLYQAAAEK
HHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
GLPSAQFRLGLALIDGALVGQDVAAGEAWMRRAALAGNIEAAYLLGDRHAKTQRPDFAEA
CCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHH
ANWYRRAAEAGHQAAARALASLYLTGNGVAEDVEEGARWLRSSASAGNQQAQTDLANLIL
HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
GGAGEPDDGASVAGWFEAAASSGDLIAAFNLGLCFAKGVGVRQDEGQAAHWLRRAAEGVA
CCCCCCCCCCHHHHHHHHHCCCCCEEEEHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH
EAQYMYARLLQDGRGVAADPTQARVWFARAADAGVLDARVALAEMLLNGRGGMPEPEAAM
HHHHHHHHHHHCCCCCCCCCCHHHEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHH
QLFEQAAADGHAGAMFAIGALYGAGHGLPLDQTTAQKWYAAAAGRGHSQAQFMLGRYLLK
HHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHH
GLAGERDPVAARLWLERAAAHGISEAADELGAAGGSD
HHCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8905232; 9278503 [H]