Definition Escherichia coli ED1a chromosome, complete genome.
Accession NC_011745
Length 5,209,548

Click here to switch to the map view.

The map label for this gene is ugpB [H]

Identifier: 218691738

GI number: 218691738

Start: 4033131

End: 4034447

Strand: Reverse

Name: ugpB [H]

Synonym: ECED1_4126

Alternate gene names: 218691738

Gene position: 4034447-4033131 (Counterclockwise)

Preceding gene: 218691739

Following gene: 218691737

Centisome position: 77.44

GC content: 53.0

Gene sequence:

>1317_bases
ATGAAACCGTTACGTTATACAGCTTCAGCACTGGCGCTCGGACTGGCTTTAATGGCAAATGCGCAGGCAGTGACGACCAT
TCCGTTCTGGCATTCTATGGAAGGGGAACTGGGTAAAGAGGTGGATTCTCTGGCCCAACGTTTTAACGCCGAAAACCCGG
ATTACAAAATTGTACCGACCTATAAAGGCAACTACGAACAGAATTTAAGCGCGGGGATTGCCGCATTTCGTACCGGCAAC
GCTCCGGCTATTTTGCAGGTTTATGAAGTTGGCACCGCCACCATGATGGCGTCGAAAGCCATTAAACCGGTGTATGACGT
ATTTAAAGAGGCAGGGATTCAATTCGATGAGTCGCAGTTTGTGCCGACGGTTTCAGGCTATTACTCCGACAGCAAAACCG
GCCACTTACTCTCCCAGCCGTTCAACAGCTCGACTCCCGTTCTCTATTACAACAAAGACGCCTTCAAGAAAGCAGGATTA
GACCCGGAACAGCCGCCGAAAACCTGGCAGGATCTGGCGGACTATGCCGCAAAACTGAAAGCCTCCGGTATGAAGTGCGG
CTACGCCAGCGGCTGGCAGGGATGGATCCAACTGGAAAACTTTAGCGCCTGGAACGGTTTGCCGTTTGCCAGCAAAAACA
ACGGCTTTGACGGCACAGACGCGGTGCTTGAGTTCAACAAGCCGGAGCAGGTGAAACACATCGCCATGCTCGAAGAGATG
AACAAGAAGGGCGATTTCAGCTACGTTGGGCGTAAGGATGAATCCACCGAGAAGTTCTATAACGGTGATTGCGCGATGAC
GACCGCCTCTTCCGGTTCTCTTGCCAACATTCGCGAGTACGCCAAATTTAACTATGGCGTAGGCATGATGCCTTACGACG
CCGATGCGAAAGACGCGCCGCAAAACGCCATTATCGGCGGAGCCAGTCTATGGGTAATGCAGGGTAAAGATAAAGAAACC
TACACCGGCGTGGCGAAGTTCCTCGACTTCCTCGCAAAGCCAGAAAACGCTGCCGAGTGGCATCAGAAAACCGGCTATCT
GCCAATCACCAAAGCGGCGTATGACCTGACCCGTGAGCAGGGCTTTTACGAGAAAAACCCAGGAGCGGATATTGCCACGC
GTCAGATGCTGAACAAGCCACCGTTGCCGTTCACCAAAGGGCTGCGTCTGGGCAACATGCCGCAGATCCGCGTGATTGTG
GATGAAGAGCTGGAGAGCGTGTGGACCGGTAAGAAGACACCACAGCAGGCGCTGGATACTGCCGTTGAGCGTGGGAACCA
GTTACTGCGCCGCTTTGAGAAATCGACGAAGTCTTAA

Upstream 100 bases:

>100_bases
TAACAAAATAGTTATTTTTCTGTAATTCGAGCATGTCATGTTACTCCGCGAGCATAAAACGCGTGAATTCGCGCATTCGG
TACAACAAGAGAGATAATCA

Downstream 100 bases:

>100_bases
TCAGTGTAATGTCGGATGCGTTTCGCTTATCTGACCTGGCATCGCGTGTAGGCCGGATAAGCGAAGCGCATCCGGCACAG
TTCAGGAATTAACTGTAATG

Product: glycerol-3-phosphate transporter periplasmic binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 438; Mature: 438

Protein sequence:

>438_residues
MKPLRYTASALALGLALMANAQAVTTIPFWHSMEGELGKEVDSLAQRFNAENPDYKIVPTYKGNYEQNLSAGIAAFRTGN
APAILQVYEVGTATMMASKAIKPVYDVFKEAGIQFDESQFVPTVSGYYSDSKTGHLLSQPFNSSTPVLYYNKDAFKKAGL
DPEQPPKTWQDLADYAAKLKASGMKCGYASGWQGWIQLENFSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEM
NKKGDFSYVGRKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAPQNAIIGGASLWVMQGKDKET
YTGVAKFLDFLAKPENAAEWHQKTGYLPITKAAYDLTREQGFYEKNPGADIATRQMLNKPPLPFTKGLRLGNMPQIRVIV
DEELESVWTGKKTPQQALDTAVERGNQLLRRFEKSTKS

Sequences:

>Translated_438_residues
MKPLRYTASALALGLALMANAQAVTTIPFWHSMEGELGKEVDSLAQRFNAENPDYKIVPTYKGNYEQNLSAGIAAFRTGN
APAILQVYEVGTATMMASKAIKPVYDVFKEAGIQFDESQFVPTVSGYYSDSKTGHLLSQPFNSSTPVLYYNKDAFKKAGL
DPEQPPKTWQDLADYAAKLKASGMKCGYASGWQGWIQLENFSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEM
NKKGDFSYVGRKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAPQNAIIGGASLWVMQGKDKET
YTGVAKFLDFLAKPENAAEWHQKTGYLPITKAAYDLTREQGFYEKNPGADIATRQMLNKPPLPFTKGLRLGNMPQIRVIV
DEELESVWTGKKTPQQALDTAVERGNQLLRRFEKSTKS
>Mature_438_residues
MKPLRYTASALALGLALMANAQAVTTIPFWHSMEGELGKEVDSLAQRFNAENPDYKIVPTYKGNYEQNLSAGIAAFRTGN
APAILQVYEVGTATMMASKAIKPVYDVFKEAGIQFDESQFVPTVSGYYSDSKTGHLLSQPFNSSTPVLYYNKDAFKKAGL
DPEQPPKTWQDLADYAAKLKASGMKCGYASGWQGWIQLENFSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEM
NKKGDFSYVGRKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAPQNAIIGGASLWVMQGKDKET
YTGVAKFLDFLAKPENAAEWHQKTGYLPITKAAYDLTREQGFYEKNPGADIATRQMLNKPPLPFTKGLRLGNMPQIRVIV
DEELESVWTGKKTPQQALDTAVERGNQLLRRFEKSTKS

Specific function: sn-glycerol-3-phosphate and glycerophosphoryl diester- binding protein interacts with the binding protein-dependent transport system ugpACE [H]

COG id: COG1653

COG function: function code G; ABC-type sugar transport system, periplasmic component

Gene ontology:

Cell location: Periplasm (Probable) [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the bacterial solute-binding protein 1 family [H]

Homologues:

Organism=Escherichia coli, GI1789862, Length=438, Percent_Identity=99.3150684931507, Blast_Score=904, Evalue=0.0,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR006059
- InterPro:   IPR006061 [H]

Pfam domain/function: PF01547 SBP_bac_1 [H]

EC number: NA

Molecular weight: Translated: 48494; Mature: 48494

Theoretical pI: Translated: 6.95; Mature: 6.95

Prosite motif: PS01037 SBP_BACTERIAL_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
3.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKPLRYTASALALGLALMANAQAVTTIPFWHSMEGELGKEVDSLAQRFNAENPDYKIVPT
CCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEC
YKGNYEQNLSAGIAAFRTGNAPAILQVYEVGTATMMASKAIKPVYDVFKEAGIQFDESQF
CCCCCHHHHHHHHHHEECCCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
VPTVSGYYSDSKTGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLK
CCCHHCCCCCCCCCCHHHCCCCCCCCEEEECCHHHHHCCCCCCCCCHHHHHHHHHHHHHH
ASGMKCGYASGWQGWIQLENFSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEM
HCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHH
NKKGDFSYVGRKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAP
CCCCCEEECCCCCCCHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCC
QNAIIGGASLWVMQGKDKETYTGVAKFLDFLAKPENAAEWHQKTGYLPITKAAYDLTREQ
CCEEECCCEEEEEECCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHHHC
GFYEKNPGADIATRQMLNKPPLPFTKGLRLGNMPQIRVIVDEELESVWTGKKTPQQALDT
CCCCCCCCCCHHHHHHHCCCCCCHHHCCCCCCCCEEEEEECHHHHHHHCCCCCHHHHHHH
AVERGNQLLRRFEKSTKS
HHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKPLRYTASALALGLALMANAQAVTTIPFWHSMEGELGKEVDSLAQRFNAENPDYKIVPT
CCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEC
YKGNYEQNLSAGIAAFRTGNAPAILQVYEVGTATMMASKAIKPVYDVFKEAGIQFDESQF
CCCCCHHHHHHHHHHEECCCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
VPTVSGYYSDSKTGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLK
CCCHHCCCCCCCCCCHHHCCCCCCCCEEEECCHHHHHCCCCCCCCCHHHHHHHHHHHHHH
ASGMKCGYASGWQGWIQLENFSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEM
HCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHH
NKKGDFSYVGRKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAP
CCCCCEEECCCCCCCHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCC
QNAIIGGASLWVMQGKDKETYTGVAKFLDFLAKPENAAEWHQKTGYLPITKAAYDLTREQ
CCEEECCCEEEEEECCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHHHC
GFYEKNPGADIATRQMLNKPPLPFTKGLRLGNMPQIRVIVDEELESVWTGKKTPQQALDT
CCCCCCCCCCHHHHHHHCCCCCCHHHCCCCCCCCEEEEEECHHHHHHHCCCCCHHHHHHH
AVERGNQLLRRFEKSTKS
HHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]