Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is ybgQ [H]

Identifier: 157160198

GI number: 157160198

Start: 775805

End: 778306

Strand: Reverse

Name: ybgQ [H]

Synonym: EcHS_A0766

Alternate gene names: 157160198

Gene position: 778306-775805 (Counterclockwise)

Preceding gene: 157160199

Following gene: 157160197

Centisome position: 16.76
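
The coordinates above are enough to reproduce the gene length and the centisome value. The sketch below assumes (the record does not say so) that the centisome position is the gene's first base expressed as a percentage of the genome length, and that for a reverse-strand gene the first base is the higher coordinate. The function and constant names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538   # Length
LOW, HIGH = 775_805, 778_306  # Start / End as listed (reverse strand)


def gene_length(low: int, high: int) -> int:
    """Number of bases spanned by inclusive coordinates."""
    return abs(high - low) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


# Reverse-strand gene: the coding sequence begins at the higher coordinate.
print(gene_length(LOW, HIGH))                      # 2502
print(round(centisome(HIGH, GENOME_LENGTH), 2))    # 16.76
```

Under this assumption the higher coordinate (778306) reproduces the listed 16.76, whereas the lower coordinate (775805) would give 16.71.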

GC content: 49.6

Gene sequence:

>2502_bases
TTGGCGGCGCTATCGCCGCCTTTCATAAAATATTTATCAGGTATGGACACCGTGAATATTTATCGACTCTCTTTTGTATC
CTGCCTGGTCGTGGCGATGCCTTGCGCATTGGCGGTCGAATTCAACCTTAATGTTCTCGATAAATCGATGCGCGACCGCA
TTGATATTTCATTATTAAAGGAAAAAGGAGTCATTGCTCCCGGTGAGTATTTTGTTAGCGTTGCGGTAAATAATAACCAA
ATCAGTAACGGGCAAAAGATTAACTGGCACAAAAATGACGATAAAACCATTCCGTGCATCAATGATTTACTGGTCGATAA
ATTTGGCTTAAAACCTGAAGTCCGTCAGTCGTTACCATTGATAAATCAGTGCGTCGATTTTAGCTCCCGACCTGAAATGC
TCTTCAATTTCGATCAAGCCAATCAGCAACTAAATATCACCATTCCGCAAGCCTGGCTGGCGTGGCACTCAGAAAACTGG
ACCCCACCCTCCACATGGAAAGAAGGTGTCGCCGGTATCCTGATGGATTACAACTTGTTTGCCAGCAGCTACCGCCCACA
GGACGGCAGCAGCAGCACTAACCTGAACGCCTACGGTACCACCGGAATTAACGCCGGGGCATGGCGCTTACGTAGTGATT
ATCAGTTGAATCAGACTGATAGCGATGATAACCATGAACAGTCAGGCGGAATATCGCGCACCTATCTTTTTCGTCCATTA
CCGCAATTAGGCTCTAAATTAACCCTCGGCGAAACGGATTTTAGTTCCAATATTTTCGACGGTTTTTCTTATACCGGCGC
GGCACTGGCAAGTGACGAGCGAATGTTGCCATGGGAACTACGCGGCTACGCCCCACAAATTAGCGGTATTGCACAGACCA
ATGCCACGGTGACGATCAGTCAATCAGGCCGCGTCATTTACCAGAAAAAAGTCCCACCAGGCCCATTTATCATTGACGAC
CTTAATCAGTCTGTTCAGGGCACACTGGATGTCAAAGTGACGGAAGAAGATGGTCGGGTGAACAATTTCCAGGTTTCGGC
AGCATCGACGCCCTTCCTGACTCGTCAGGGACAGGTTCGCTATAAACTGGCCGCGGGTCAGCCACGGCCCTCCATGTCAC
ATCAAACTGAAAATGAAACCTTTTTTAGCAATGAAGTTTCCTGGGGGATGCTGTCAAACACCTCGCTGTACGGCGGCCTG
CTGCTTTCTGGTGATGACTACCATTCTGCCGCAATGGGTATTGGGCAAAATATGCTGTGGCTTGGTGCGCTGTCGTTTGA
TGTCACGTGGGCCAGTAGCCATTTTGATACTCAGCAGGACGAGCGGGGCTTAAGCTACCGTTTTAATTACAGCAAACAAG
TGGATGCCACTAACAGCACGATTTCCCTCGCCGCTTATCGTTTCTCCGATCGTCATTTTCACAGCTACGCCAACTATCTG
GATCACAAATACAACGACAGCGATGCGCAGGACGAAAAACAGACGATCAGCTTATCTGTGGGCCAACCGATTACCCCACT
AAACCTCAATCTTTACGCCAACCTGCTACATCAAACCTGGTGGAATGCAGACGCCTCCACGACCGCCAACATCACAGCCG
GTTTTAATGTTGATATTGGTGACTGGAGAGATATCTCTATTTCGACGTCATTCAATACAACCCATTACGAAGATAAAGAT
CGCGACAACCAGATTTACCTGTCGATTTCGCTCCCCTTCGGTAATGGTGGTCGGGTTGGTTATGACATGCAAAACAGTAG
CCACAGCACCACACACCGCATGTCGTGGAACGATACGCTGGATGAACGTAATAGCTGGGGCATGTCTGCCGGACTGCAAT
CCGACCGTCCTGACAATGGAGCCCAGGTGAGCGGTAACTATCAGCACCTGAGTTCAGCGGGTGAGTGGGATATTTCTGGT
ACCTATGCCGCCAATGATTACAGTTCCGTCAGCAGCAGCTGGAGCGGTTCTTTCACCGCAACCCAATATGGTGCAGCGTT
TCATCGCCGCAGCTCCACCAATGAACCTCGCCTGATGGTCAGCACCGATGGCGTGGCAGATATTCCGGTTCAGGGCAATC
TCGACTACACCAACCATTTTGGCATTGCGGTGGTGCCGTTGATTTCCAGTTATCAGCCTTCCACCGTGGCGGTGAACATG
AATGACTTACCCGACGGCGTAACAGTTACAGAAAACGTTATCAAAGAAACGTGGATTGAAGGCGCGATAGGTTACAAATC
ACTGGCTTCCCGTTCCGGTAAAGACGTTAACGTCATCATTCGCAACGCCAGCGGTCAGTTTCCTCCCCTCGGAGCGGATA
TCCGCCAGGATGACAGCGGCATTAGCGTGGGGATGGTTGGCGAGGAAGGACATGCCTGGTTAAGCGGAGTCGCTGAAAAT
CAAAAGTTTACCGTGGTCTGGGGTGATAGCCAGCATTGCTCGCTCCATCTTCCTGAACATATGGAAGACACCGCAAATCG
CCTGATTTTACCTTGTCATTAA
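
The sequence as listed starts with TTG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the reverse strand. As a rough check, the sketch below translates the first 24 and last 21 bases with the standard genetic code. It assumes the TTG start codon is read as methionine at the initiation position, which is the usual convention for bacterial alternative start codons; `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code in TCAG order (NCBI table layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, initiator: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    If `initiator` is True, the first codon is read as Met regardless of
    its usual meaning (assumption: alternative start codons such as TTG).
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and initiator:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


head = "TTGGCGGCGCTATCGCCGCCTTTC"   # first 24 bases of the gene sequence
tail = "CTGATTTTACCTTGTCATTAA"      # last 21 bases, in frame
print(translate(head))                    # MAALSPPF
print(translate(tail, initiator=False))   # LILPCH
```

Both fragments agree with the start (MAALSPPF) and end (LILPCH) of the protein sequence given in this record.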

Upstream 100 bases:

>100_bases
TCTTTGCCTGGATGGAACAAATTGATCAGGCTACACCTGTAACGCCAGGCGCAGTTACGGCGAATGCAACCTACGTGCTG
GATTATAAATAAGTATTACT

Downstream 100 bases:

>100_bases
TTAAGGAAATGAAAATGACATTTATGAAAGGACTACCCCTTTTGTTGTTGGTTGCCAGTCTGTGCAGCCACGCTGCACTA
CAACCCGATCGCACTCGTAT

Product: fimbrial usher family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 833; Mature: 832
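
The two residue counts follow from the 2502-base gene length. The sketch below assumes that the final codon is a stop codon and that the mature form differs only by removal of the initiator methionine, which matches the Mature sequence listed in this record (it lacks the leading M). The function name is illustrative.

```python
def residue_counts(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes one terminal stop codon and removal of the initiator Met
    in the mature protein.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = cds_bases // 3 - 1   # drop the stop codon
    mature = translated - 1           # drop the initiator Met
    return translated, mature


print(residue_counts(2502))   # (833, 832)
```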

Protein sequence:

>833_residues
MAALSPPFIKYLSGMDTVNIYRLSFVSCLVVAMPCALAVEFNLNVLDKSMRDRIDISLLKEKGVIAPGEYFVSVAVNNNQ
ISNGQKINWHKNDDKTIPCINDLLVDKFGLKPEVRQSLPLINQCVDFSSRPEMLFNFDQANQQLNITIPQAWLAWHSENW
TPPSTWKEGVAGILMDYNLFASSYRPQDGSSSTNLNAYGTTGINAGAWRLRSDYQLNQTDSDDNHEQSGGISRTYLFRPL
PQLGSKLTLGETDFSSNIFDGFSYTGAALASDERMLPWELRGYAPQISGIAQTNATVTISQSGRVIYQKKVPPGPFIIDD
LNQSVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRYKLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGL
LLSGDDYHSAAMGIGQNMLWLGALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYL
DHKYNDSDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITAGFNVDIGDWRDISISTSFNTTHYEDKD
RDNQIYLSISLPFGNGGRVGYDMQNSSHSTTHRMSWNDTLDERNSWGMSAGLQSDRPDNGAQVSGNYQHLSSAGEWDISG
TYAANDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDYTNHFGIAVVPLISSYQPSTVAVNM
NDLPDGVTVTENVIKETWIEGAIGYKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAEN
QKFTVVWGDSQHCSLHLPEHMEDTANRLILPCH

Sequences:

>Translated_833_residues
MAALSPPFIKYLSGMDTVNIYRLSFVSCLVVAMPCALAVEFNLNVLDKSMRDRIDISLLKEKGVIAPGEYFVSVAVNNNQ
ISNGQKINWHKNDDKTIPCINDLLVDKFGLKPEVRQSLPLINQCVDFSSRPEMLFNFDQANQQLNITIPQAWLAWHSENW
TPPSTWKEGVAGILMDYNLFASSYRPQDGSSSTNLNAYGTTGINAGAWRLRSDYQLNQTDSDDNHEQSGGISRTYLFRPL
PQLGSKLTLGETDFSSNIFDGFSYTGAALASDERMLPWELRGYAPQISGIAQTNATVTISQSGRVIYQKKVPPGPFIIDD
LNQSVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRYKLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGL
LLSGDDYHSAAMGIGQNMLWLGALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYL
DHKYNDSDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITAGFNVDIGDWRDISISTSFNTTHYEDKD
RDNQIYLSISLPFGNGGRVGYDMQNSSHSTTHRMSWNDTLDERNSWGMSAGLQSDRPDNGAQVSGNYQHLSSAGEWDISG
TYAANDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDYTNHFGIAVVPLISSYQPSTVAVNM
NDLPDGVTVTENVIKETWIEGAIGYKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAEN
QKFTVVWGDSQHCSLHLPEHMEDTANRLILPCH
>Mature_832_residues
AALSPPFIKYLSGMDTVNIYRLSFVSCLVVAMPCALAVEFNLNVLDKSMRDRIDISLLKEKGVIAPGEYFVSVAVNNNQI
SNGQKINWHKNDDKTIPCINDLLVDKFGLKPEVRQSLPLINQCVDFSSRPEMLFNFDQANQQLNITIPQAWLAWHSENWT
PPSTWKEGVAGILMDYNLFASSYRPQDGSSSTNLNAYGTTGINAGAWRLRSDYQLNQTDSDDNHEQSGGISRTYLFRPLP
QLGSKLTLGETDFSSNIFDGFSYTGAALASDERMLPWELRGYAPQISGIAQTNATVTISQSGRVIYQKKVPPGPFIIDDL
NQSVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRYKLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGLL
LSGDDYHSAAMGIGQNMLWLGALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYLD
HKYNDSDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITAGFNVDIGDWRDISISTSFNTTHYEDKDR
DNQIYLSISLPFGNGGRVGYDMQNSSHSTTHRMSWNDTLDERNSWGMSAGLQSDRPDNGAQVSGNYQHLSSAGEWDISGT
YAANDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDYTNHFGIAVVPLISSYQPSTVAVNMN
DLPDGVTVTENVIKETWIEGAIGYKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAENQ
KFTVVWGDSQHCSLHLPEHMEDTANRLILPCH

Specific function: Could be involved in the export and assembly of the putative ybgD fimbrial subunit across the outer membrane [H]

COG id: COG3188

COG function: function code NU; P pilus assembly protein, porin PapC

Gene ontology:

Cell location: Cell outer membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the fimbrial export usher family [H]

Homologues:

Organism=Escherichia coli, GI87081778, Length=816, Percent_Identity=95.4656862745098, Blast_Score=1571, Evalue=0.0,
Organism=Escherichia coli, GI1790772, Length=879, Percent_Identity=28.5551763367463, Blast_Score=285, Evalue=1e-77,
Organism=Escherichia coli, GI1789533, Length=789, Percent_Identity=29.404309252218, Blast_Score=271, Evalue=2e-73,
Organism=Escherichia coli, GI1786744, Length=781, Percent_Identity=28.5531370038412, Blast_Score=265, Evalue=1e-71,
Organism=Escherichia coli, GI1786332, Length=875, Percent_Identity=26.5142857142857, Blast_Score=248, Evalue=1e-66,
Organism=Escherichia coli, GI1787172, Length=871, Percent_Identity=25.947187141217, Blast_Score=244, Evalue=1e-65,
Organism=Escherichia coli, GI1788427, Length=805, Percent_Identity=26.0869565217391, Blast_Score=202, Evalue=6e-53,
Organism=Escherichia coli, GI1789610, Length=807, Percent_Identity=23.9157372986369, Blast_Score=107, Evalue=3e-24,
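
Each homologue line above is a comma-separated list of `key=value` fields, plus a bare GI identifier. A minimal parsing sketch follows, assuming that organism names contain no commas (true for the lines in this record); `parse_homologue` is a hypothetical helper name.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Homologues' line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]   # the GI number has no '=' separator
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


lines = [
    "Organism=Escherichia coli, GI87081778, Length=816, "
    "Percent_Identity=95.4656862745098, Blast_Score=1571, Evalue=0.0,",
    "Organism=Escherichia coli, GI1790772, Length=879, "
    "Percent_Identity=28.5551763367463, Blast_Score=285, Evalue=1e-77,",
]
hits = [parse_homologue(line) for line in lines]

# Keep only close homologues (the 90% threshold is arbitrary).
close = [h["GI"] for h in hits if h["Percent_Identity"] > 90.0]
print(close)   # ['87081778']
```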

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000015
- InterPro:   IPR018030 [H]

Pfam domain/function: PF00577 Usher [H]

EC number: NA

Molecular weight: Translated: 92048; Mature: 91916

Theoretical pI: Translated: 4.81; Mature: 4.81

Prosite motif: PS01151 FIMBRIAL_USHER

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
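
The percentages above can be recomputed from the protein sequence in this record. The sketch below counts C and M residues in the 833-residue translated sequence and in the mature form, which is assumed here to be the same sequence without the initiator Met; `percent` is an illustrative helper.

```python
# Translated protein sequence copied from this record (833 residues).
TRANSLATED = (
    "MAALSPPFIKYLSGMDTVNIYRLSFVSCLVVAMPCALAVEFNLNVLDKSMRDRIDISLLKEKGVIAPGEYFVSVAVNNNQ"
    "ISNGQKINWHKNDDKTIPCINDLLVDKFGLKPEVRQSLPLINQCVDFSSRPEMLFNFDQANQQLNITIPQAWLAWHSENW"
    "TPPSTWKEGVAGILMDYNLFASSYRPQDGSSSTNLNAYGTTGINAGAWRLRSDYQLNQTDSDDNHEQSGGISRTYLFRPL"
    "PQLGSKLTLGETDFSSNIFDGFSYTGAALASDERMLPWELRGYAPQISGIAQTNATVTISQSGRVIYQKKVPPGPFIIDD"
    "LNQSVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRYKLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGL"
    "LLSGDDYHSAAMGIGQNMLWLGALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYL"
    "DHKYNDSDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITAGFNVDIGDWRDISISTSFNTTHYEDKD"
    "RDNQIYLSISLPFGNGGRVGYDMQNSSHSTTHRMSWNDTLDERNSWGMSAGLQSDRPDNGAQVSGNYQHLSSAGEWDISG"
    "TYAANDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDYTNHFGIAVVPLISSYQPSTVAVNM"
    "NDLPDGVTVTENVIKETWIEGAIGYKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAEN"
    "QKFTVVWGDSQHCSLHLPEHMEDTANRLILPCH"
)
MATURE = TRANSLATED[1:]   # assumption: mature form lacks only the initiator Met


def percent(seq: str, residues: str) -> float:
    """Percentage of `seq` made up of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for name, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(name, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.7 2.2 2.9
# Mature 0.7 2.0 2.8
```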

Secondary structure:

>Translated Secondary Structure
MAALSPPFIKYLSGMDTVNIYRLSFVSCLVVAMPCALAVEFNLNVLDKSMRDRIDISLLK
CCCCCCHHHHHHCCCCCEEEHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHCCEEEEE
EKGVIAPGEYFVSVAVNNNQISNGQKINWHKNDDKTIPCINDLLVDKFGLKPEVRQSLPL
CCCCCCCCCEEEEEEECCCCCCCCCEEEEECCCCCCCCHHHHHHHHHCCCCHHHHHHCHH
INQCVDFSSRPEMLFNFDQANQQLNITIPQAWLAWHSENWTPPSTWKEGVAGILMDYNLF
HHHHHCCCCCCCEEEEECCCCCEEEEEECHHEEEECCCCCCCCHHHHHHHHEEEEEHHHH
ASSYRPQDGSSSTNLNAYGTTGINAGAWRLRSDYQLNQTDSDDNHEQSGGISRTYLFRPL
HCCCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCCHHCCCCEEEEEECCH
PQLGSKLTLGETDFSSNIFDGFSYTGAALASDERMLPWELRGYAPQISGIAQTNATVTIS
HHHCCEEEECCCCCCCCCCCCCCCCCCEECCCCCCCCHHHCCCCCCCCCEEECCCEEEEE
QSGRVIYQKKVPPGPFIIDDLNQSVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVR
CCCCEEEEECCCCCCEEEECCCCCCCEEEEEEEECCCCCCCEEEEECCCCCEEECCCCEE
YKLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGLLLSGDDYHSAAMGIGQNMLW
EEEECCCCCCCCCCCCCCCEEECCCCCCCEEECCCEECEEEECCCCHHHHHHHCCCCEEE
LGALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYL
EEEEEEEEEECCCCCCCCHHCCCCEEEECCCCCCCCCCCEEEEEEEEECCHHHHHHHHHH
DHKYNDSDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITAGFNVDIG
CCCCCCCCCCCCCEEEEEECCCCCCCEEHHHHHHHHHHHHCCCCCCCEEEEEEEEEECCC
DWRDISISTSFNTTHYEDKDRDNQIYLSISLPFGNGGRVGYDMQNSSHSTTHRMSWNDTL
CCEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCCEEEEECCCCCCCCEEEECCCCCC
DERNSWGMSAGLQSDRPDNGAQVSGNYQHLSSAGEWDISGTYAANDYSSVSSSWSGSFTA
CCCCCCCCCCCCCCCCCCCCCEECCCHHHHCCCCCCCCCCEEECCCHHHHCCCCCCCEEE
TQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDYTNHFGIAVVPLISSYQPSTVAVNM
HHHHHHHHHCCCCCCCEEEEECCCCEECCCCCCCCCCCCCCEEEEEHHCCCCCCEEEEEC
NDLPDGVTVTENVIKETWIEGAIGYKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSG
CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCCC
ISVGMVGEEGHAWLSGVAENQKFTVVWGDSQHCSLHLPEHMEDTANRLILPCH
CEEEEECCCCCHHHHHHCCCCEEEEEECCCCCEEEECCHHHHHHHCCEEEECC
>Mature Secondary Structure 
AALSPPFIKYLSGMDTVNIYRLSFVSCLVVAMPCALAVEFNLNVLDKSMRDRIDISLLK
CCCCCHHHHHHCCCCCEEEHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHCCEEEEE
EKGVIAPGEYFVSVAVNNNQISNGQKINWHKNDDKTIPCINDLLVDKFGLKPEVRQSLPL
CCCCCCCCCEEEEEEECCCCCCCCCEEEEECCCCCCCCHHHHHHHHHCCCCHHHHHHCHH
INQCVDFSSRPEMLFNFDQANQQLNITIPQAWLAWHSENWTPPSTWKEGVAGILMDYNLF
HHHHHCCCCCCCEEEEECCCCCEEEEEECHHEEEECCCCCCCCHHHHHHHHEEEEEHHHH
ASSYRPQDGSSSTNLNAYGTTGINAGAWRLRSDYQLNQTDSDDNHEQSGGISRTYLFRPL
HCCCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCCHHCCCCEEEEEECCH
PQLGSKLTLGETDFSSNIFDGFSYTGAALASDERMLPWELRGYAPQISGIAQTNATVTIS
HHHCCEEEECCCCCCCCCCCCCCCCCCEECCCCCCCCHHHCCCCCCCCCEEECCCEEEEE
QSGRVIYQKKVPPGPFIIDDLNQSVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVR
CCCCEEEEECCCCCCEEEECCCCCCCEEEEEEEECCCCCCCEEEEECCCCCEEECCCCEE
YKLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGLLLSGDDYHSAAMGIGQNMLW
EEEECCCCCCCCCCCCCCCEEECCCCCCCEEECCCEECEEEECCCCHHHHHHHCCCCEEE
LGALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYL
EEEEEEEEEECCCCCCCCHHCCCCEEEECCCCCCCCCCCEEEEEEEEECCHHHHHHHHHH
DHKYNDSDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANITAGFNVDIG
CCCCCCCCCCCCCEEEEEECCCCCCCEEHHHHHHHHHHHHCCCCCCCEEEEEEEEEECCC
DWRDISISTSFNTTHYEDKDRDNQIYLSISLPFGNGGRVGYDMQNSSHSTTHRMSWNDTL
CCEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCCEEEEECCCCCCCCEEEECCCCCC
DERNSWGMSAGLQSDRPDNGAQVSGNYQHLSSAGEWDISGTYAANDYSSVSSSWSGSFTA
CCCCCCCCCCCCCCCCCCCCCEECCCHHHHCCCCCCCCCCEEECCCHHHHCCCCCCCEEE
TQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLDYTNHFGIAVVPLISSYQPSTVAVNM
HHHHHHHHHCCCCCCCEEEEECCCCEECCCCCCCCCCCCCCEEEEEHHCCCCCCEEEEEC
NDLPDGVTVTENVIKETWIEGAIGYKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSG
CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCCC
ISVGMVGEEGHAWLSGVAENQKFTVVWGDSQHCSLHLPEHMEDTANRLILPCH
CEEEEECCCCCHHHHHHCCCCEEEEEECCCCCEEEECCHHHHHHHCCEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8905232; 9278503 [H]