| Definition | Mesorhizobium loti MAFF303099 chromosome, complete genome. |
|---|---|
| Accession | NC_002678 |
| Length | 7,036,071 |
The map label for this gene is dppE [H]
Identifier: 13473817
GI number: 13473817
Start: 3610043
End: 3611674
Strand: Reverse
Name: dppE [H]
Synonym: mll4539
Alternate gene names: 13473817
Gene position: 3611674-3610043 (Counterclockwise)
Preceding gene: 13473818
Following gene: 13473814
Centisome position: 51.33
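The coordinate fields above can be cross-checked with simple arithmetic. The sketch below is not part of the database record. It assumes that coordinates are 1-based and inclusive, and that the centisome position is the gene's first listed coordinate (the higher one for this reverse-strand gene) expressed as a percentage of the chromosome length. The function names are illustrative only.

```python
GENOME_LENGTH = 7_036_071            # Length field of NC_002678
START, END = 3_610_043, 3_611_674    # Start and End fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    length = gene_length(START, END)
    print(length)            # number of bases in the gene
    print(length // 3 - 1)   # number of codons, excluding the stop codon
    # Assumption: a reverse-strand gene is positioned by its higher coordinate.
    print(centisome(END, GENOME_LENGTH))
```

Under these assumptions the results agree with the record: 1632 bases, 543 amino acids, and a centisome position of 51.33.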
GC content: 64.71
Gene sequence:
>1632_bases ATGCTGACCGACCGCAAACTGCACCCGCGGGCCAAGCCCGTCGCCGAGGATTTCAAAACCGGCGCCATCAGCCGCCGCGA ATATCTGGCGCTGATGGCCGGCCTCGGCGTCAGCGCCGCCGGCGCGTTTGCGCTCGGCGGCCTCGCCCCGACCCCGGCCC GCGCGGCCGAGCCGAAGAAGGGCGGCACCTTGCGCGTCGCCATGAACGTCAAGGGTTTCAAGGACCCACGCACCTTCGAC GGCGTCGAAATGTCCAACGTCGCCCGCCACTGCAACGAATATCTGGTGCGCTGGAACACCGATTTTTCCTTCGAGCCCTG GCTGCTGGAAAAATGGGAGATGAGCGATGACGCCAAGACGCTGACGCTGCATGTGCGCAAAGGCGTCACCTGGTCGAACG GCGACGCCTTCAACGCCGACGATGTCGTCCACAACCTCACCCGCTGGTGCGAGGCCGGCGTGGCCGGCAATTCGGTCGCC GCGCGCATGGGCGCGCTGGTCAATGCCGACACCAAGAAGGCGGTCGATGGCGGCATCGAGAAGGTCGACGACTACACGAT CAAGCTCAACCTGCCGAAGCCCGACATTTCGCTGATCGCCGGCATGGCCGACTATCCCGCGCTGATCATGCACCGCTCCT ATGCCGGCGACGGCGACCCCATGAAGGCGCTGGCCATCACCACCGGCCCCTGCGAACTGGTGAAATGGGACGCCGAGACG GGCGCGCAGGTGAAGCGCAAGGACAAGCCCTGGTGGAAGGGCGAATTCCATCTGGATGGCATCCAGTGGATCGACTACGG CTCCGACCCCAACGCCATGCTGTCGGCCTTCGAATCCGGCGAGATCGACACTGACCACGAAACCGCCTCCGACGCCGTCT CGCAGACCGACAAGATGGGGCTTAAAAATTCCGAGATCGCCACCGGCTCGACCATCGTCGCCCGCTTCAACATCGGCAAT GCGCCCTATGACGACGTCAAGGTCCGCCGTGCCGCCCAGCTCGCCGTCGACAATGCCGCCGTGCTCGCGCTCGGCCTCGG AGGCCGCGGCAAGCCCGCCGACAACCACCATGTCGGCCCCATGCACCCCGAATATGCCGATATCGGCCCAGCCAAGCGCG ACGTGGAGGAAGCCAAGAACCTGCTCGCCGCCTCCGCCAAGCCGGACCATGAATATGAGCTGATCTCGGTCGACGTCGAA TGGCAGAAGAGCACCGGCGACGCGATATCAGCGCAGATGCGCGAGGCCGGCCTCAAGATCAAGCGCACCGTGCTGCCCGC CGCCACCTTCTGGAACGACTGGAGCAAATATCCCTTCTCCTGCACCGAATGGCTCGGCCGCTCGCTCGGCGTCCAGGTGC TGGCGCTGGCGTATAAATCGGGCGCCGCCTGGAACGAAAGCGCCTATGCCAGCAAGGAGTTCGACGACCTGCTCGACAAG GCGCTGGCGACGCCCGACGCCAAGGCGCGCAAGGAGATCATGGCCGGCATCGAGAAGAATTTGCGCGACAGCGGCATCAT CATCCAACCCTACTGGCGCTCGGTGTATCGGACCTACCGCAAGGGCGTCGAGGGCTGCGAACAGCACCAGGCACTGGAGC AGCACTTTGAGAAGGTTTGGATCGAAAGCTGA
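The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases and that the sequence is supplied in the flattened form used here (an optional `>` header token followed by whitespace-separated blocks). The function name `gc_content` is hypothetical, not a database API.

```python
def gc_content(record: str) -> float:
    """Percentage of G/C bases in a flattened FASTA-like record.

    Tokens starting with '>' are treated as header text and skipped;
    all remaining whitespace-separated blocks are joined into one sequence.
    """
    tokens = record.split()
    bases = "".join(t for t in tokens if not t.startswith(">")).upper()
    if not bases:
        raise ValueError("record contains no sequence data")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


if __name__ == "__main__":
    # Toy input; paste the full gene sequence line to check the record's value.
    print(gc_content(">4_bases ATGC"))
```

Applied to the full 1632-base sequence, the result should be comparable to the reported 64.71 if the database used the same definition.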
Upstream 100 bases:
>100_bases CTCGCGTAGGCGATTCGCCGGGACGAGTCTGATTCCGTGCATGTCCCAAAGCGCTTTTGGGCGGCCTGCACCAACAAAAG AGGGAACGACGATGAAATTC
Downstream 100 bases:
>100_bases GGCGCGGCGCTTCTTACCTTCTCCCCTTGTGGGGTGAGGGGCGGTCCGCGTCAGCGGACGGAAAGCCAACTGCTTGGCTT TCCGAGCCTCGAACGCCCTG
Product: dipeptide ABC transporter (dipeptide-binding protein)
Products: ADP; phosphate; dipeptides [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 543; Mature: 543
Protein sequence:
>543_residues MLTDRKLHPRAKPVAEDFKTGAISRREYLALMAGLGVSAAGAFALGGLAPTPARAAEPKKGGTLRVAMNVKGFKDPRTFD GVEMSNVARHCNEYLVRWNTDFSFEPWLLEKWEMSDDAKTLTLHVRKGVTWSNGDAFNADDVVHNLTRWCEAGVAGNSVA ARMGALVNADTKKAVDGGIEKVDDYTIKLNLPKPDISLIAGMADYPALIMHRSYAGDGDPMKALAITTGPCELVKWDAET GAQVKRKDKPWWKGEFHLDGIQWIDYGSDPNAMLSAFESGEIDTDHETASDAVSQTDKMGLKNSEIATGSTIVARFNIGN APYDDVKVRRAAQLAVDNAAVLALGLGGRGKPADNHHVGPMHPEYADIGPAKRDVEEAKNLLAASAKPDHEYELISVDVE WQKSTGDAISAQMREAGLKIKRTVLPAATFWNDWSKYPFSCTEWLGRSLGVQVLALAYKSGAAWNESAYASKEFDDLLDK ALATPDAKARKEIMAGIEKNLRDSGIIIQPYWRSVYRTYRKGVEGCEQHQALEQHFEKVWIES
Sequences:
>Translated_543_residues MLTDRKLHPRAKPVAEDFKTGAISRREYLALMAGLGVSAAGAFALGGLAPTPARAAEPKKGGTLRVAMNVKGFKDPRTFD GVEMSNVARHCNEYLVRWNTDFSFEPWLLEKWEMSDDAKTLTLHVRKGVTWSNGDAFNADDVVHNLTRWCEAGVAGNSVA ARMGALVNADTKKAVDGGIEKVDDYTIKLNLPKPDISLIAGMADYPALIMHRSYAGDGDPMKALAITTGPCELVKWDAET GAQVKRKDKPWWKGEFHLDGIQWIDYGSDPNAMLSAFESGEIDTDHETASDAVSQTDKMGLKNSEIATGSTIVARFNIGN APYDDVKVRRAAQLAVDNAAVLALGLGGRGKPADNHHVGPMHPEYADIGPAKRDVEEAKNLLAASAKPDHEYELISVDVE WQKSTGDAISAQMREAGLKIKRTVLPAATFWNDWSKYPFSCTEWLGRSLGVQVLALAYKSGAAWNESAYASKEFDDLLDK ALATPDAKARKEIMAGIEKNLRDSGIIIQPYWRSVYRTYRKGVEGCEQHQALEQHFEKVWIES >Mature_543_residues MLTDRKLHPRAKPVAEDFKTGAISRREYLALMAGLGVSAAGAFALGGLAPTPARAAEPKKGGTLRVAMNVKGFKDPRTFD GVEMSNVARHCNEYLVRWNTDFSFEPWLLEKWEMSDDAKTLTLHVRKGVTWSNGDAFNADDVVHNLTRWCEAGVAGNSVA ARMGALVNADTKKAVDGGIEKVDDYTIKLNLPKPDISLIAGMADYPALIMHRSYAGDGDPMKALAITTGPCELVKWDAET GAQVKRKDKPWWKGEFHLDGIQWIDYGSDPNAMLSAFESGEIDTDHETASDAVSQTDKMGLKNSEIATGSTIVARFNIGN APYDDVKVRRAAQLAVDNAAVLALGLGGRGKPADNHHVGPMHPEYADIGPAKRDVEEAKNLLAASAKPDHEYELISVDVE WQKSTGDAISAQMREAGLKIKRTVLPAATFWNDWSKYPFSCTEWLGRSLGVQVLALAYKSGAAWNESAYASKEFDDLLDK ALATPDAKARKEIMAGIEKNLRDSGIIIQPYWRSVYRTYRKGVEGCEQHQALEQHFEKVWIES
Specific function: Part of the binding-protein-dependent transport system for dipeptides; probably responsible for binding dipeptides with high affinity. It is expressed to facilitate adaptation to nutrient-deficient conditions, which also induce sporulation [H]
COG id: COG0747
COG function: function code E; ABC-type dipeptide transport system, periplasmic component
Gene ontology:
Cell location: Cell membrane; Lipid-anchor (Probable) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the bacterial solute-binding protein 5 family [H]
Homologues:
- Organism=Escherichia coli, GI1789966, Length=464, Percent_Identity=24.3534482758621, Blast_Score=100, Evalue=2e-22
- Organism=Escherichia coli, GI1787052, Length=486, Percent_Identity=24.4855967078189, Blast_Score=92, Evalue=9e-20
- Organism=Escherichia coli, GI1787495, Length=548, Percent_Identity=22.4452554744526, Blast_Score=83, Evalue=5e-17
- Organism=Escherichia coli, GI1787551, Length=333, Percent_Identity=25.2252252252252, Blast_Score=75, Evalue=1e-14
- Organism=Escherichia coli, GI1789397, Length=494, Percent_Identity=21.2550607287449, Blast_Score=75, Evalue=1e-14
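The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records. The sketch below is one possible parser, assuming every hit carries the same six fields in the same order as in this record. The `Hit` type and `parse_hits` function are hypothetical names introduced for illustration.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Field order is assumed to match the entries in this record.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue hit found in the text, in order of appearance."""
    return [
        Hit(
            organism=m.group("organism").strip(),
            gi=m.group("gi"),
            length=int(m.group("length")),
            percent_identity=float(m.group("identity")),
            blast_score=int(m.group("score")),
            evalue=float(m.group("evalue")),
        )
        for m in HIT_PATTERN.finditer(text)
    ]


if __name__ == "__main__":
    sample = (
        "Organism=Escherichia coli, GI1789966, Length=464, "
        "Percent_Identity=24.3534482758621, Blast_Score=100, Evalue=2e-22,"
    )
    for hit in parse_hits(sample):
        print(hit.gi, round(hit.percent_identity, 2), hit.evalue)
```

Because the pattern is searched anywhere in the text, it works whether the hits are run together on one line or listed one per line.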
Paralogues:
None
Copy number: 660 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 2980 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 40 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000914 [H]
Pfam domain/function: PF00496 SBP_bac_5 [H]
EC number: NA
Molecular weight: Translated: 59597; Mature: 59597
Theoretical pI: Translated: 6.21; Mature: 6.21
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.9 %Cys (Translated Protein)
- 2.6 %Met (Translated Protein)
- 3.5 %Cys+Met (Translated Protein)
- 0.9 %Cys (Mature Protein)
- 2.6 %Met (Mature Protein)
- 3.5 %Cys+Met (Mature Protein)
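The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows, assuming one-letter amino acid codes and rounding to one decimal place as in the record. The function name `residue_percent` is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    Whitespace in the sequence is ignored, so the space-separated blocks
    used in this record can be passed in unchanged.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    wanted = residues.upper()
    count = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * count / len(seq), 1)


if __name__ == "__main__":
    # Toy peptide; paste the 543-residue sequence to check the record's values.
    peptide = "MACM"
    print(residue_percent(peptide, "C"))
    print(residue_percent(peptide, "M"))
    print(residue_percent(peptide, "CM"))
```

The 543-residue sequence in this record contains 5 Cys and 14 Met residues, which corresponds to the listed 0.9 %, 2.6 % and 3.5 %.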
Secondary structure:
>Translated Secondary Structure
MLTDRKLHPRAKPVAEDFKTGAISRREYLALMAGLGVSAAGAFALGGLAPTPARAAEPKK
CCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCCHHHHCCCCCCCCCCCCCCC
GGTLRVAMNVKGFKDPRTFDGVEMSNVARHCNEYLVRWNTDFSFEPWLLEKWEMSDDAKT
CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHEEEECCCCCCCCCHHHCCCCCCCCEE
LTLHVRKGVTWSNGDAFNADDVVHNLTRWCEAGVAGNSVAARMGALVNADTKKAVDGGIE
EEEEEECCCEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHCCCC
KVDDYTIKLNLPKPDISLIAGMADYPALIMHRSYAGDGDPMKALAITTGPCELVKWDAET
CCCCEEEEEECCCCCHHHHHCCCCCCHHHEECCCCCCCCCCEEEEEECCCCEEEEECCCC
GAQVKRKDKPWWKGEFHLDGIQWIDYGSDPNAMLSAFESGEIDTDHETASDAVSQTDKMG
CCHHHCCCCCCCCCEEEECCEEEEECCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC
LKNSEIATGSTIVARFNIGNAPYDDVKVRRAAQLAVDNAAVLALGLGGRGKPADNHHVGP
CCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCC
MHPEYADIGPAKRDVEEAKNLLAASAKPDHEYELISVDVEWQKSTGDAISAQMREAGLKI
CCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHCCCEE
KRTVLPAATFWNDWSKYPFSCTEWLGRSLGVQVLALAYKSGAAWNESAYASKEFDDLLDK
EEHHCCHHHHHCCCCCCCCHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH
ALATPDAKARKEIMAGIEKNLRDSGIIIQPYWRSVYRTYRKGVEGCEQHQALEQHFEKVW
HHCCCCHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IES
CCC
>Mature Secondary Structure
MLTDRKLHPRAKPVAEDFKTGAISRREYLALMAGLGVSAAGAFALGGLAPTPARAAEPKK
CCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCCHHHHCCCCCCCCCCCCCCC
GGTLRVAMNVKGFKDPRTFDGVEMSNVARHCNEYLVRWNTDFSFEPWLLEKWEMSDDAKT
CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHEEEECCCCCCCCCHHHCCCCCCCCEE
LTLHVRKGVTWSNGDAFNADDVVHNLTRWCEAGVAGNSVAARMGALVNADTKKAVDGGIE
EEEEEECCCEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHCCCC
KVDDYTIKLNLPKPDISLIAGMADYPALIMHRSYAGDGDPMKALAITTGPCELVKWDAET
CCCCEEEEEECCCCCHHHHHCCCCCCHHHEECCCCCCCCCCEEEEEECCCCEEEEECCCC
GAQVKRKDKPWWKGEFHLDGIQWIDYGSDPNAMLSAFESGEIDTDHETASDAVSQTDKMG
CCHHHCCCCCCCCCEEEECCEEEEECCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC
LKNSEIATGSTIVARFNIGNAPYDDVKVRRAAQLAVDNAAVLALGLGGRGKPADNHHVGP
CCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCC
MHPEYADIGPAKRDVEEAKNLLAASAKPDHEYELISVDVEWQKSTGDAISAQMREAGLKI
CCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHCCCEE
KRTVLPAATFWNDWSKYPFSCTEWLGRSLGVQVLALAYKSGAAWNESAYASKEFDDLLDK
EEHHCCHHHHHCCCCCCCCHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH
ALATPDAKARKEIMAGIEKNLRDSGIIIQPYWRSVYRTYRKGVEGCEQHQALEQHFEKVW
HHCCCCHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IES
CCC
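The secondary-structure listing alternates blocks of residues with blocks of per-residue state codes. The sketch below separates the two tracks and summarises the state composition. It assumes the conventional meaning of the codes (H = helix, E = strand, C = coil), which the record itself does not define, and it expects the body of one listing without its `>` header line. Both function names are hypothetical.

```python
from collections import Counter
from typing import Dict, Tuple


def split_interleaved(body: str) -> Tuple[str, str]:
    """Separate alternating sequence / structure blocks into two strings."""
    blocks = body.split()
    if len(blocks) % 2:
        raise ValueError("expected an even number of blocks")
    sequence = "".join(blocks[0::2])    # 1st, 3rd, 5th, ... block
    structure = "".join(blocks[1::2])   # 2nd, 4th, 6th, ... block
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure lengths differ")
    return sequence, structure


def state_percent(structure: str) -> Dict[str, float]:
    """Percentage of residues in each of the assumed H/E/C states."""
    if not structure:
        raise ValueError("empty structure string")
    counts = Counter(structure)
    return {s: round(100.0 * counts[s] / len(structure), 1) for s in "HEC"}


if __name__ == "__main__":
    # Toy input in the same alternating layout as the record.
    seq, ss = split_interleaved("MLTD CCHH RK EC")
    print(seq, ss)
    print(state_percent(ss))
```

The length check guards against a block being lost during copy and paste, which would otherwise shift every state code relative to its residue.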
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; dipeptides [Periplasm]; H2O [C]
Specific reaction: ATP + dipeptides [Periplasm] + H2O = ADP + phosphate + dipeptides [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 1766370; 9384377 [H]