| Definition | Escherichia coli ED1a chromosome, complete genome. |
|---|---|
| Accession | NC_011745 |
| Length | 5,209,548 |
The map label for this gene is bglG [H]
Identifier: 218692011
GI number: 218692011
Start: 4342628
End: 4343464
Strand: Reverse
Name: bglG [H]
Synonym: ECED1_4413
Alternate gene names: 218692011
Gene position: 4343464-4342628 (Counterclockwise)
Preceding gene: 218692012
Following gene: 218692010
Centisome position: 83.38
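The coordinate fields above are related by simple arithmetic, sketched below. The snippet assumes 1-based inclusive coordinates and assumes that the centisome value is taken at the first base of the gene, which for a reverse-strand gene is the higher coordinate. Neither convention is stated in the record; the function and variable names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_209_548            # Length of NC_011745
START, END = 4_342_628, 4_343_464    # Start / End fields
STRAND = "Reverse"

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length

# Assumption: a reverse-strand gene begins at its higher coordinate.
gene_start = END if STRAND == "Reverse" else START

length_bp = gene_length(START, END)
centisome_pos = centisome(gene_start, GENOME_LENGTH)

print(length_bp)                 # 837
print(f"{centisome_pos:.2f}")    # 83.38
```

Under these assumptions the computed length matches the 837 bases of the gene sequence, and the percentage matches the listed centisome position.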
GC content: 43.49
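A GC content figure like the one above is the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute it; `gc_content` is a hypothetical helper, and the short fragment is only the first 21 bases of the gene, used for illustration.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and case are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# First 21 bases of the gene sequence, for illustration only.
fragment = "ATGAACATGCAAATCACCAAA"
print(f"{gc_content(fragment):.2f}")   # 33.33 (7 of 21 bases are G or C)
```

For the full 837-base gene, a value of 43.49 corresponds to 364 G or C bases (364 / 837 ≈ 0.4349).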
Gene sequence:
>837_bases
ATGAACATGCAAATCACCAAAATTCTCAACAATAATGTTGTGGTGGTTATTGATGATCAACAGCGGGAAAAAGTCGTCAT
GGGGCGCGGAATTGGCTTTCAAAAACGCCCAAGCGAAAGAATTAACTCAAGTGGAATAGAAAAAGAGTATGCCTTGAGCA
GTCATGAACTGAACGGGCGATTAAGCGAACTCTTAAGTCATATGCCTCTTGAGGTGATGGCAACCTGTGATCGTATTATC
TCTCTGGCGCAGGAGCGTCTGGGAAAGTTGCAGGACAGTATTTATATCTCGCTAACTGACCATTGCCAGTTTGCGATTAA
ACGCTTTCAGCAAAACGTGCTACTGCCCAACCCGCTGCTGTGGGATATCCAGCGACTTTACCCGAAAGAGTTCCAGCTAG
GGGAAGAAGCGTTAACCATTATTGATAAACGGTTGGGCGTGCAGTTACCGAAAGATGAAGTGGGCTTTATTGCCATGCAT
CTGGTCAGTGCCCAAATGAGCGGAAATATGGAGGATGTTGCAGGTGTCACGCAATTAATGCGAGAAATGCTGCAATTAAT
AAAATTTCAGTTCAGCCTTAATTACCAGGAAGAAAGCTTGAGTTATCAGCGACTGGTTACGCATCTGAAATTTTTATCCT
GGCGTATTCTTGAACATGCATCGATTAACGATAGTGATGAATCATTACAACAAGCAGTAAAGCAAAATTACCCGCAAGCA
TGGCAATGTGCGGAGCGGATCGCCATTTTTATTGGTTTGCAGTATCAACGTAAAATCTCACCTGCAGAGATTATGTTTTT
AGCCATAAATATAGAGCGCGTGCGCAAAGAACACTGA
Upstream 100 bases:
>100_bases
ATTAATGACTGGATTGTTACTGCATTCGCAGGCAAAACCTGACATAACCAGAGAATACTGGTGAAGTCGGGTTTTTTTGT
TTATAAAAAAGGTCCTTGCT
Downstream 100 bases:
>100_bases
AATATTATTACTGAATAAAGGATTGTTACCGCACTAAGCGGGCAAAACCTGAAAAAAATTGCTTGATTCACGTCAGGCCG
TTTTTTTCAGGTTTTTTTTT
Product: transcriptional antiterminator BglG
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 278; Mature: 278
Protein sequence:
>278_residues
MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRPSERINSSGIEKEYALSSHELNGRLSELLSHMPLEVMATCDRII
SLAQERLGKLQDSIYISLTDHCQFAIKRFQQNVLLPNPLLWDIQRLYPKEFQLGEEALTIIDKRLGVQLPKDEVGFIAMH
LVSAQMSGNMEDVAGVTQLMREMLQLIKFQFSLNYQEESLSYQRLVTHLKFLSWRILEHASINDSDESLQQAVKQNYPQA
WQCAERIAIFIGLQYQRKISPAEIMFLAINIERVRKEH
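The 278-residue protein follows from the 837-base gene sequence: 837 bases are 279 codons, and the last codon (TGA) is a stop. The sketch below translates a coding sequence codon by codon. It assumes the standard genetic code, written here as the usual 64-letter amino acid string in TCAG codon order; `translate` is a hypothetical helper, not part of the record.

```python
from itertools import product

# Standard genetic code in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven and last four codons of the gene sequence above.
print(translate("ATGAACATGCAAATCACCAAA"))   # MNMQITK
print(translate("AAAGAACACTGA"))            # KEH
print((837 - 3) // 3)                       # 278
```

The two fragments reproduce the first seven and last three residues of the protein sequence listed in the record.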
Sequences:
>Translated_278_residues
MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRPSERINSSGIEKEYALSSHELNGRLSELLSHMPLEVMATCDRII
SLAQERLGKLQDSIYISLTDHCQFAIKRFQQNVLLPNPLLWDIQRLYPKEFQLGEEALTIIDKRLGVQLPKDEVGFIAMH
LVSAQMSGNMEDVAGVTQLMREMLQLIKFQFSLNYQEESLSYQRLVTHLKFLSWRILEHASINDSDESLQQAVKQNYPQA
WQCAERIAIFIGLQYQRKISPAEIMFLAINIERVRKEH
>Mature_278_residues
MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRPSERINSSGIEKEYALSSHELNGRLSELLSHMPLEVMATCDRII
SLAQERLGKLQDSIYISLTDHCQFAIKRFQQNVLLPNPLLWDIQRLYPKEFQLGEEALTIIDKRLGVQLPKDEVGFIAMH
LVSAQMSGNMEDVAGVTQLMREMLQLIKFQFSLNYQEESLSYQRLVTHLKFLSWRILEHASINDSDESLQQAVKQNYPQA
WQCAERIAIFIGLQYQRKISPAEIMFLAINIERVRKEH
Specific function: Mediates the positive regulation of the beta-glucoside (bgl) operon by functioning as a transcriptional antiterminator. This is an RNA-binding protein that recognizes a specific sequence located just upstream of two termination sites within the operon [H]
COG id: COG3711
COG function: function code K; Transcriptional antiterminator
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 PRD domains [H]
Homologues:
Organism=Escherichia coli, GI1790160, Length=278, Percent_Identity=98.9208633093525, Blast_Score=566, Evalue=1e-163
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004341
- InterPro: IPR011608
- InterPro: IPR001550 [H]
Pfam domain/function: PF03123 CAT_RBD; PF00874 PRD [H]
EC number: NA
Molecular weight: Translated: 32172; Mature: 32172
Theoretical pI: Translated: 6.88; Mature: 6.88
Prosite motif: PS00654 ANTITERMINATORS_BGLG
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
4.0 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
4.0 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
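The Cys/Met percentages above are residue counts divided by the protein length. A minimal sketch follows; `residue_percent` is a hypothetical helper, and the fragment is only the first 60 residues of the protein. The full-length counts (3 Cys and 11 Met in 278 residues) come from a manual count of the listed sequence and should be treated as an assumption.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)

# First 60 residues of the protein, for illustration only.
fragment = "MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRPSERINSSGIEKEYALSSHELNGR"
print(residue_percent(fragment, "C"))    # 0.0
print(residue_percent(fragment, "M"))    # 5.0

# Full-length figures, assuming 3 Cys and 11 Met in 278 residues.
print(round(100.0 * 3 / 278, 1))         # 1.1
print(round(100.0 * 11 / 278, 1))        # 4.0
print(round(100.0 * 14 / 278, 1))        # 5.0
```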
Secondary structure:
>Translated Secondary Structure
MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRPSERINSSGIEKEYALSSHELNGR
CCCEEEEECCCCEEEEECCCCHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHCCHH
LSELLSHMPLEVMATCDRIISLAQERLGKLQDSIYISLTDHCQFAIKRFQQNVLLPNPLL
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCCCCHH
WDIQRLYPKEFQLGEEALTIIDKRLGVQLPKDEVGFIAMHLVSAQMSGNMEDVAGVTQLM
HHHHHHCCHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
REMLQLIKFQFSLNYQEESLSYQRLVTHLKFLSWRILEHASINDSDESLQQAVKQNYPQA
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHH
WQCAERIAIFIGLQYQRKISPAEIMFLAINIERVRKEH
HHHHHHHHHHHCCHHHHCCCHHHEEEEEEEHHHHHCCC
>Mature Secondary Structure
MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRPSERINSSGIEKEYALSSHELNGR
CCCEEEEECCCCEEEEECCCCHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHCCHH
LSELLSHMPLEVMATCDRIISLAQERLGKLQDSIYISLTDHCQFAIKRFQQNVLLPNPLL
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCCCCHH
WDIQRLYPKEFQLGEEALTIIDKRLGVQLPKDEVGFIAMHLVSAQMSGNMEDVAGVTQLM
HHHHHHCCHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
REMLQLIKFQFSLNYQEESLSYQRLVTHLKFLSWRILEHASINDSDESLQQAVKQNYPQA
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHH
WQCAERIAIFIGLQYQRKISPAEIMFLAINIERVRKEH
HHHHHHHHHHHCCHHHHCCCHHHEEEEEEEHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 3034860; 7686882; 9278503; 3301003; 3309161; 2200123; 2195546; 1698125 [H]