Definition | Mesorhizobium loti MAFF303099 chromosome, complete genome. |
---|---|
Accession | NC_002678 |
Length | 7,036,071 |
The map label for this gene is ybbP [H]
Identifier: 13473669
GI number: 13473669
Start: 3454332
End: 3456881
Strand: Reverse
Name: ybbP [H]
Synonym: mll4345
Alternate gene names: 13473669
Gene position: 3456881-3454332 (Counterclockwise)
Preceding gene: 13473670
Following gene: 13473668
Centisome position: 49.13
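The coordinate fields above are related by simple arithmetic, sketched below. The sketch assumes 1-based inclusive coordinates. It also assumes that the centisome position is the gene's 5' start (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record does not state this definition, but it reproduces the listed values. The function names are illustrative only.

```python
GENOME_LENGTH = 7_036_071  # "Length" field of NC_002678


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """5' start of the gene as a percentage of the genome length (assumed definition)."""
    five_prime = end if strand.lower() == "reverse" else start
    return round(100.0 * five_prime / genome_length, 2)


length = gene_length(3454332, 3456881)
position = centisome(3454332, 3456881, "Reverse", GENOME_LENGTH)
print(length, position)
```

Under these assumptions the gene spans 2550 bases, i.e. 850 codons: the 849 translated residues plus the stop codon.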
GC content: 67.02
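A GC content value of this kind can be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the whole gene sequence; the record does not state how it was calculated, and `gc_content` is an illustrative helper rather than part of any database tooling.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace and case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)


# Short demo input: the first 30 bases of the gene sequence listed in this record.
# Pass the full 2550-base sequence to compare against the GC content field.
print(gc_content("ATGCCGATGGCGCAGACGCTGAAGCTAGCG"))
```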
Gene sequence:
>2550_bases ATGCCGATGGCGCAGACGCTGAAGCTAGCGGTCCGCTTCTCGCTCAGGGAGATGCGTGGCGGCCTGTCCGGCTTTATGAT CTTCCTCGCCTGCATCGCGCTTGGCGTCGCGGCGATCGGCGGCGTCAATTCGGTTGCCCGCTCGATCAGCGCCGGCGTCG CAGACCAGGGCCAGACGCTGCTTGGCGGCGACTTTCGCTTCCAGATCAACCAACGCGACGCCAGCCAGGCCGAGCGCGGC TTCCTCGTTGGGCTCGGCACCGTTTCGCGCACCGCCAGCATGCGCTCTATGGCGCGGCTGGCCGACGGCACGGACCAGGC GCTGGTCGAGGCCAAGGCGGTCGATGACGCCTATCCGCTCTATGGCGCGCTGGAAACCGAACCAAAGCTATCGAAACGGG AACTGTTCGGCGAAGAATTCGGCGTCTTTGGCGCGGCGGCGCCCGATCTGTTGTTCGAAAGGCTGCATCTCAAGCCCGGC GATCGGCTGAAGGTCGGCACCGCCACCTTCGAACTGCGCGCCAGACTGATCACCGAGCCGGATGCCGTGTCCGAGGGTTT CGGCTTCGCGCCAAGGCTGATGATCTCGACCGAAGGCCTGGCCGCCACCGGGCTGGTGCAACCAGGCAGCCTGGTGGAAA ACGCCTACAAGGTCCGGCTGCCCGCCGACGCCGACCAAGCGCGGCTCAAGGCCATCCAGGATCAGGCGGCGAAGGATTTT CCCGAGGCCGGCTGGTCGATCCGCACGCGCGACAATGCGGCGCCGGCGCTGTCGTCCAACATCGAACGCTTCTCGCAATT CCTGACGCTGGTCGGGCTGACGGCGCTGGTGGTCGGCGGCGTCGGCGTCGCCAATGCGGTGCGCGCCTATCTCGACGGCA AGCGCGGCGTCATCGCCACCTTCAAGAGCCTCGGTGCTTCCGGCGGCTTCGTCTTTGCCGTCTATCTCGTGCAGATCCTG ATCATCGCCGCCCTTGGCATCTTGCTCGGCCTCGTCCTCGGCGCGCTGATGCCGTTCGTGGCGAGTGCTGCCCTCCAGTC GGTCATTCCAGTGCCGGCGCAAGGCGGCTTCTATCCCGGCGCGCTCGCCATGGCGGCGCTGTTCGGCCTGCTGGTGACGC TGGCCTTCGCGCTGCTGCCGCTCGGCCGCGCCCGCGACGTGCCCGCGACGGCACTGTTCCGCGAGATGGGGCTCGAAGGC CGCGGTCAGCCGCGCCTCGTCTATGTCGCTTCGGCGCTCGGCATCGCGCTGCTGCTGGCAGCGCTGGCGATCCTGTTTTC CGGCGATCAGCGCATCGCCTCGATCTTCGCCGGCGCCACTATCTTCGCCTTCCTGGTGCTGCGTCTCGTCGGTGCGCTGG TGCAGTGGGCGGCAAGAAAAAGCCCGCGCGTGCGTTTCGTGGCACTCCGGCTCGCCATCGGCAACATCCACCGGCCGGGC GCCCTGACGCCATCGGTGGTGCTGTCGCTGGGGCTCGGGCTGACGCTCCTGGTGACGCTGGCGCTTATCGATGGCAATCT GCGGCAACAGATCTCCGGCAGCCTGCCGGAACGGGCGCCGAACTTCTTCTTCGTCGACATCCAGGGCAGCGATGTCGATG CGTTCTCCGCTCTGATCGGCAAGGAGGCGCCAAAGGGGACGCTGGCCAAGGTGCCGATGCTGCGCGGCCGGGTGATGGCG CTCAACGGCGTCGATGTCGACAAGGTCAAGGTGCCGGCCGAAGGCGCCTGGGTGCTGAAAGGCGATCGCGGCCTGACCTA CGACGCCAGGCAACCGGAAAATGCGACGCTGACGGAGGGCAGATGGTGGCCGGACAATTATGCCGGCGAGCCGCTGGTTT CCTTCTCGGCGCATGAGGGCCAGGAGATCGGACTGAAGCTCGGCGACACCGTCACCGTCAATGTGCTCGGCCGCAATGTG 
ACGGCGAGGATCGCCAATTTCCGCCAGGTCCAATGGGAAACGATGGGCATCAACTTCGTCATGGTGTTCTCGCCCAACGC ATTCGCTGGCGCCCCGCATGGCTGGATGGCGACGCTGACCGAAAAGAACGCCACCACGGCCGATGATGCACGCATCCTCA ATGCCGTCACCCGCGCCTTTCCCGCGGTCACGACGGTGCGGGTCAAGGATGCGCTCGATGTCGTCAACCGGCTGGTCGGG CAGCTCGGCACGGCGATCCGCGCGGCGGCCGGCGTGGCGCTGATCGCATCGGTGCTGGTGCTGGCCGGCGCGCTCGCCGC GGGAAATCGGGCGCGCATCCATGACGCGGTGGTGCTGAAGACGCTCGGCGCCACCAGGCGAACTCTGATCACGGCGTTTT CCCTCGAATACATGCTGATCGGATTGGCCACCGCCATCTTCGCGCTGGCCGCCGGCGGCATCGCGGCCTGGTACATTGTC GCCCGCATCATGACCCTGCCGTCGCATTTCATGCCGGAGGTGGCGGTGGCGACCATCGTCTTTGCGCTGGTGATCACCGT CGGAATCGGCCTCGCCGGCACCTGGCGGGTGCTCGGCCACAAGGCAGCACCGGTGCTGCGCGAGCTGTAA
Upstream 100 bases:
>100_bases TGGTCACCCACGATCCGGCGCTCGCGGCACGCTGTTCGCGCCAGGTCTCGATGCGCTCGGGCCGGATTGAAGCGCCGGCG CCCCTGAAGGTCACCGCCTG
Downstream 100 bases:
>100_bases TTTCAGGGCCTAAGCGGCCGTGGCGGATTAAATCTTGGTAAGGATGCAAGCTGGAAAGCCCGGGAATCCGCCTTTCGGCC CCTCACGAACCGTTACATCC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 849; Mature: 848
Protein sequence:
>849_residues MPMAQTLKLAVRFSLREMRGGLSGFMIFLACIALGVAAIGGVNSVARSISAGVADQGQTLLGGDFRFQINQRDASQAERG FLVGLGTVSRTASMRSMARLADGTDQALVEAKAVDDAYPLYGALETEPKLSKRELFGEEFGVFGAAAPDLLFERLHLKPG DRLKVGTATFELRARLITEPDAVSEGFGFAPRLMISTEGLAATGLVQPGSLVENAYKVRLPADADQARLKAIQDQAAKDF PEAGWSIRTRDNAAPALSSNIERFSQFLTLVGLTALVVGGVGVANAVRAYLDGKRGVIATFKSLGASGGFVFAVYLVQIL IIAALGILLGLVLGALMPFVASAALQSVIPVPAQGGFYPGALAMAALFGLLVTLAFALLPLGRARDVPATALFREMGLEG RGQPRLVYVASALGIALLLAALAILFSGDQRIASIFAGATIFAFLVLRLVGALVQWAARKSPRVRFVALRLAIGNIHRPG ALTPSVVLSLGLGLTLLVTLALIDGNLRQQISGSLPERAPNFFFVDIQGSDVDAFSALIGKEAPKGTLAKVPMLRGRVMA LNGVDVDKVKVPAEGAWVLKGDRGLTYDARQPENATLTEGRWWPDNYAGEPLVSFSAHEGQEIGLKLGDTVTVNVLGRNV TARIANFRQVQWETMGINFVMVFSPNAFAGAPHGWMATLTEKNATTADDARILNAVTRAFPAVTTVRVKDALDVVNRLVG QLGTAIRAAAGVALIASVLVLAGALAAGNRARIHDAVVLKTLGATRRTLITAFSLEYMLIGLATAIFALAAGGIAAWYIV ARIMTLPSHFMPEVAVATIVFALVITVGIGLAGTWRVLGHKAAPVLREL
Sequences:
>Translated_849_residues MPMAQTLKLAVRFSLREMRGGLSGFMIFLACIALGVAAIGGVNSVARSISAGVADQGQTLLGGDFRFQINQRDASQAERG FLVGLGTVSRTASMRSMARLADGTDQALVEAKAVDDAYPLYGALETEPKLSKRELFGEEFGVFGAAAPDLLFERLHLKPG DRLKVGTATFELRARLITEPDAVSEGFGFAPRLMISTEGLAATGLVQPGSLVENAYKVRLPADADQARLKAIQDQAAKDF PEAGWSIRTRDNAAPALSSNIERFSQFLTLVGLTALVVGGVGVANAVRAYLDGKRGVIATFKSLGASGGFVFAVYLVQIL IIAALGILLGLVLGALMPFVASAALQSVIPVPAQGGFYPGALAMAALFGLLVTLAFALLPLGRARDVPATALFREMGLEG RGQPRLVYVASALGIALLLAALAILFSGDQRIASIFAGATIFAFLVLRLVGALVQWAARKSPRVRFVALRLAIGNIHRPG ALTPSVVLSLGLGLTLLVTLALIDGNLRQQISGSLPERAPNFFFVDIQGSDVDAFSALIGKEAPKGTLAKVPMLRGRVMA LNGVDVDKVKVPAEGAWVLKGDRGLTYDARQPENATLTEGRWWPDNYAGEPLVSFSAHEGQEIGLKLGDTVTVNVLGRNV TARIANFRQVQWETMGINFVMVFSPNAFAGAPHGWMATLTEKNATTADDARILNAVTRAFPAVTTVRVKDALDVVNRLVG QLGTAIRAAAGVALIASVLVLAGALAAGNRARIHDAVVLKTLGATRRTLITAFSLEYMLIGLATAIFALAAGGIAAWYIV ARIMTLPSHFMPEVAVATIVFALVITVGIGLAGTWRVLGHKAAPVLREL >Mature_848_residues PMAQTLKLAVRFSLREMRGGLSGFMIFLACIALGVAAIGGVNSVARSISAGVADQGQTLLGGDFRFQINQRDASQAERGF LVGLGTVSRTASMRSMARLADGTDQALVEAKAVDDAYPLYGALETEPKLSKRELFGEEFGVFGAAAPDLLFERLHLKPGD RLKVGTATFELRARLITEPDAVSEGFGFAPRLMISTEGLAATGLVQPGSLVENAYKVRLPADADQARLKAIQDQAAKDFP EAGWSIRTRDNAAPALSSNIERFSQFLTLVGLTALVVGGVGVANAVRAYLDGKRGVIATFKSLGASGGFVFAVYLVQILI IAALGILLGLVLGALMPFVASAALQSVIPVPAQGGFYPGALAMAALFGLLVTLAFALLPLGRARDVPATALFREMGLEGR GQPRLVYVASALGIALLLAALAILFSGDQRIASIFAGATIFAFLVLRLVGALVQWAARKSPRVRFVALRLAIGNIHRPGA LTPSVVLSLGLGLTLLVTLALIDGNLRQQISGSLPERAPNFFFVDIQGSDVDAFSALIGKEAPKGTLAKVPMLRGRVMAL NGVDVDKVKVPAEGAWVLKGDRGLTYDARQPENATLTEGRWWPDNYAGEPLVSFSAHEGQEIGLKLGDTVTVNVLGRNVT ARIANFRQVQWETMGINFVMVFSPNAFAGAPHGWMATLTEKNATTADDARILNAVTRAFPAVTTVRVKDALDVVNRLVGQ LGTAIRAAAGVALIASVLVLAGALAAGNRARIHDAVVLKTLGATRRTLITAFSLEYMLIGLATAIFALAAGGIAAWYIVA RIMTLPSHFMPEVAVATIVFALVITVGIGLAGTWRVLGHKAAPVLREL
Specific function: Unknown
COG id: COG3127
COG function: function code Q; Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ABC-4 integral membrane protein family [H]
Homologues:
Organism=Escherichia coli, GI1786704, Length=820, Percent_Identity=27.1951219512195, Blast_Score=195, Evalue=8e-51
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003838 [H]
Pfam domain/function: PF02687 FtsX [H]
EC number: NA
Molecular weight: Translated: 89455; Mature: 89324
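The translated and mature molecular weights differ by 131, which is consistent with removal of the initiator methionine (849 versus 848 residues). The check below is a sketch that assumes the values are in daltons (the record gives no unit) and uses the average mass of a methionine residue, taken as free methionine (149.21 Da) minus one water (18.02 Da).

```python
MET_FREE_MASS = 149.21   # average mass of free methionine, Da
WATER_MASS = 18.02       # water lost on peptide bond formation, Da
MET_RESIDUE_MASS = MET_FREE_MASS - WATER_MASS

translated_mw = 89455    # "Translated" value from this record
mature_mw = 89324        # "Mature" value from this record

difference = translated_mw - mature_mw
print(difference, round(MET_RESIDUE_MASS, 2))
```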
Theoretical pI: Translated: 10.38; Mature: 10.38
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 0.1 % Cys; 2.1 % Met; 2.2 % Cys+Met
Mature protein: 0.1 % Cys; 2.0 % Met; 2.1 % Cys+Met
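Percentages like the Cys/Met content can be recomputed from a protein sequence by counting residues. The sketch below assumes that the mature protein is the translated protein without its initiator methionine, and that values are rounded to one decimal place; the database may compute or round them differently. `residue_percent` and `strip_initiator_met` are hypothetical helper names.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


def strip_initiator_met(protein: str) -> str:
    """Drop a leading Met to model the mature protein (assumed processing step)."""
    return protein[1:] if protein.startswith("M") else protein


# First 60 residues of the translated sequence, used only as a short demo input.
n_terminal = "MPMAQTLKLAVRFSLREMRGGLSGFMIFLACIALGVAAIGGVNSVARSISAGVADQGQTL"
for label, seq in (("translated", n_terminal),
                   ("mature", strip_initiator_met(n_terminal))):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```

Running the same functions on the full 849-residue sequence gives figures that can be compared with the values listed in this field.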
Secondary structure:
>Translated Secondary Structure MPMAQTLKLAVRFSLREMRGGLSGFMIFLACIALGVAAIGGVNSVARSISAGVADQGQTL CCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCEE LGGDFRFQINQRDASQAERGFLVGLGTVSRTASMRSMARLADGTDQALVEAKAVDDAYPL ECCCEEEEECCCCHHHHHCCEEEEECHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCC YGALETEPKLSKRELFGEEFGVFGAAAPDLLFERLHLKPGDRLKVGTATFELRARLITEP CCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCEEECCHHHHHHHHHCCCC DAVSEGFGFAPRLMISTEGLAATGLVQPGSLVENAYKVRLPADADQARLKAIQDQAAKDF CHHHCCCCCCCEEEEECCCCCEECCCCCCHHHCCCEEEECCCCCHHHHHHHHHHHHHHCC PEAGWSIRTRDNAAPALSSNIERFSQFLTLVGLTALVVGGVGVANAVRAYLDGKRGVIAT CCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHH FKSLGASGGFVFAVYLVQILIIAALGILLGLVLGALMPFVASAALQSVIPVPAQGGFYPG HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH ALAMAALFGLLVTLAFALLPLGRARDVPATALFREMGLEGRGQPRLVYVASALGIALLLA HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCEEEEHHHHHHHHHHHH ALAILFSGDQRIASIFAGATIFAFLVLRLVGALVQWAARKSPRVRFVALRLAIGNIHRPG HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEHHCCCCCCC ALTPSVVLSLGLGLTLLVTLALIDGNLRQQISGSLPERAPNFFFVDIQGSDVDAFSALIG CCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCCCCCCCEEEEEEECCCHHHHHHHHC KEAPKGTLAKVPMLRGRVMALNGVDVDKVKVPAEGAWVLKGDRGLTYDARQPENATLTEG CCCCCCCHHHCHHHCCEEEEECCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCC RWWPDNYAGEPLVSFSAHEGQEIGLKLGDTVTVNVLGRNVTARIANFRQVQWETMGINFV CCCCCCCCCCCEEEECCCCCCCCCEEECCEEEEEEECCCHHHHHHHHHHEEEEEECEEEE MVFSPNAFAGAPHGWMATLTEKNATTADDARILNAVTRAFPAVTTVRVKDALDVVNRLVG EEECCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEHHHHHHHHHHHHHH QLGTAIRAAAGVALIASVLVLAGALAAGNRARIHDAVVLKTLGATRRTLITAFSLEYMLI HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH GLATAIFALAAGGIAAWYIVARIMTLPSHFMPEVAVATIVFALVITVGIGLAGTWRVLGH HHHHHHHHHHHCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCC KAAPVLREL HHHHHHHCC >Mature Secondary Structure PMAQTLKLAVRFSLREMRGGLSGFMIFLACIALGVAAIGGVNSVARSISAGVADQGQTL CHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCEE LGGDFRFQINQRDASQAERGFLVGLGTVSRTASMRSMARLADGTDQALVEAKAVDDAYPL 
ECCCEEEEECCCCHHHHHCCEEEEECHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCC YGALETEPKLSKRELFGEEFGVFGAAAPDLLFERLHLKPGDRLKVGTATFELRARLITEP CCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCEEECCHHHHHHHHHCCCC DAVSEGFGFAPRLMISTEGLAATGLVQPGSLVENAYKVRLPADADQARLKAIQDQAAKDF CHHHCCCCCCCEEEEECCCCCEECCCCCCHHHCCCEEEECCCCCHHHHHHHHHHHHHHCC PEAGWSIRTRDNAAPALSSNIERFSQFLTLVGLTALVVGGVGVANAVRAYLDGKRGVIAT CCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHH FKSLGASGGFVFAVYLVQILIIAALGILLGLVLGALMPFVASAALQSVIPVPAQGGFYPG HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH ALAMAALFGLLVTLAFALLPLGRARDVPATALFREMGLEGRGQPRLVYVASALGIALLLA HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCEEEEHHHHHHHHHHHH ALAILFSGDQRIASIFAGATIFAFLVLRLVGALVQWAARKSPRVRFVALRLAIGNIHRPG HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEHHCCCCCCC ALTPSVVLSLGLGLTLLVTLALIDGNLRQQISGSLPERAPNFFFVDIQGSDVDAFSALIG CCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCCCCCCCEEEEEEECCCHHHHHHHHC KEAPKGTLAKVPMLRGRVMALNGVDVDKVKVPAEGAWVLKGDRGLTYDARQPENATLTEG CCCCCCCHHHCHHHCCEEEEECCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCC RWWPDNYAGEPLVSFSAHEGQEIGLKLGDTVTVNVLGRNVTARIANFRQVQWETMGINFV CCCCCCCCCCCEEEECCCCCCCCCEEECCEEEEEEECCCHHHHHHHHHHEEEEEECEEEE MVFSPNAFAGAPHGWMATLTEKNATTADDARILNAVTRAFPAVTTVRVKDALDVVNRLVG EEECCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEHHHHHHHHHHHHHH QLGTAIRAAAGVALIASVLVLAGALAAGNRARIHDAVVLKTLGATRRTLITAFSLEYMLI HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH GLATAIFALAAGGIAAWYIVARIMTLPSHFMPEVAVATIVFALVITVGIGLAGTWRVLGH HHHHHHHHHHHCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCC KAAPVLREL HHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]