Definition: Escherichia coli O157:H7 str. EC4115, complete genome.
Accession: NC_011353
Length: 5,572,075


The map label for this gene is orfB [H]

Identifier: 209398173

GI number: 209398173

Start: 1303574

End: 1304797

Strand: Direct

Name: orfB [H]

Synonym: ECH74115_1289

Alternate gene names: 209398173

Gene position: 1303574-1304797 (Clockwise)

Preceding gene: 209398258

Following gene: 209399379

Centisome position: 23.39
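
The coordinates above determine both the gene length and the centisome position. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome value is the start coordinate expressed as a percentage of the genome length; the function names are illustrative and not part of this record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome_position(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record
print(gene_length(1303574, 1304797))                      # 1224
print(round(centisome_position(1303574, 5572075), 2))     # 23.39
```

The length of 1224 bases corresponds to 408 codons, that is, 407 residues plus the stop codon.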

GC content: 52.78
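
A GC content figure such as this one is typically the percentage of G and C bases in the gene sequence. A minimal sketch, assuming an unambiguous A/C/G/T sequence pasted with or without line breaks (the function name is hypothetical):

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace so a multi-line FASTA body can be pasted directly
    dna = "".join(dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


print(gc_content("ATGC"))  # 50.0
```

Applied to the 1224-base sequence listed under "Gene sequence", this should give a value close to the 52.78 reported here, provided the record uses the same definition.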

Gene sequence:

>1224_bases
GTGGACAATGCCGCTATTATTTCGGGTTACAGCGTGTGGATGCCGTATGCCGAAGACTGTTCGCAGTTAATTGACAATCT
GAAACAGGGGAAACGCGTTGCGCGTACACCCTGGTTTACCTCCAACGAGGAAGCCATTAAATGCGGGTTTAAGGGTAATC
CGTCAGTGGCGACGTTGAAACAGGTGGACGATAGCGCATTGGACCTGCTGTCTCAGTTGATCGACGAAGCCCTGGAACAG
GCAATGCTGGATAAGCATTGCCTGGCCGGGCGCAACGTTCGCGTTTATCTGACCGGTATTGGGCCGCGTATCGACGGGCT
GGATTACAAATCTTTCTATAATTACAACGATGTAGAAGATATCAACTTAACGCAATCCATCACAAATCTGCATGCCTCAA
AGATGTCGCAGGACACGATTTCAAGCCATCTTGCGCGCAAATATCGCTTGCAGTATTTGCCGCCTAACATGAACTGCACC
AGTAACTCGTCACTGACTGCGGTGCATCTGGCGACGCAGGGCATTGAGCAGGGCGGGATTGATCTCGCGATTGTGCTGAA
CTGTTCGAAAATTAAAACCCAGGACATCTGGTTCCTGGAAACGCAATCGATGCTGGACAGCGAACAAGTGCAGCCGTTTG
GTGAAAACAGTAAAGGTGTGGCGTTCGCAGAAGGTTTTAGCGCGCTCCTACTGGAAAGCGCTCATCACCGCCGGGCGCGT
CAACAAAGTGAGGGCGTAAGGGTGCAAACGACTTACACCCAAATCAGCGCGGGCCGCAGCAACGATGCCTCCTGGCTTAG
CACTAACGTGCACAAGGTCATGCAGGCCGCGATGAAGCAGGCTGAAATTGCGCTGGATGATCTTGCAGCCATTCTTCCAC
ACGGCAACGGTTCGGCAGTGAGCGATAACGCGGAGGCCAAAGCCATCGCCATGTTCGCAGGGGAGCGACAGATCCCCGTT
CTCGCCTATAAAGGGCAGATCGGCTATACCGCAACCGGATCTGGTGTTGTCGATCTGATCATTGGCCACCATTCGTTGAC
GCATCATCAACTGATCGCACCTGTTGGCAACGACGTCATTATCGACAGCATGGCTTCGCTGGTACTTACGGACGGCAGCG
TGACAAACCATAGCAAACGCCATTTGCTGAAGGTTGGTGTGGGTGTCGATGGTTCAGTTATTGGCGTTGTCATGACAAAT
ATGCAAGCGGGGCGCGCGAAATGA

Upstream 100 bases:

>100_bases
TTTGTACGCCAGTCGATGAAACTTGCGCTGAACCATGTCCTGCTTGTTGGCGCAACGGAAGGCGGCAACTACTACGCGTT
CGTCATTAAGGGATAAGACT

Downstream 100 bases:

>100_bases
CGGAAAAGAGTATCTACCTGTCTTCCTGGGCGCTTACGGAACCGGACGACATTTTTTATTTTTATCAGCCGACCTGGCTC
AACGCCTGGGAAGCACATAT

Product: beta-ketoacyl synthase, C-terminal domain protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 407; Mature: 407

Protein sequence:

>407_residues
MDNAAIISGYSVWMPYAEDCSQLIDNLKQGKRVARTPWFTSNEEAIKCGFKGNPSVATLKQVDDSALDLLSQLIDEALEQ
AMLDKHCLAGRNVRVYLTGIGPRIDGLDYKSFYNYNDVEDINLTQSITNLHASKMSQDTISSHLARKYRLQYLPPNMNCT
SNSSLTAVHLATQGIEQGGIDLAIVLNCSKIKTQDIWFLETQSMLDSEQVQPFGENSKGVAFAEGFSALLLESAHHRRAR
QQSEGVRVQTTYTQISAGRSNDASWLSTNVHKVMQAAMKQAEIALDDLAAILPHGNGSAVSDNAEAKAIAMFAGERQIPV
LAYKGQIGYTATGSGVVDLIIGHHSLTHHQLIAPVGNDVIIDSMASLVLTDGSVTNHSKRHLLKVGVGVDGSVIGVVMTN
MQAGRAK
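
The protein sequence begins with M although the gene sequence begins with GTG, which encodes valine at internal positions. This is expected for bacterial alternative start codons, which are read as the initiator methionine. A minimal translation sketch using the standard genetic code, under the assumption that the first codon of the input is a valid start codon (the function name is illustrative):

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to the first stop codon.

    Assumption: the first codon is a start codon, so it is emitted as M
    even when it is GTG or TTG.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate_cds("GTGGACAATGCCGCTATTATTTCGGGTTAC"))  # MDNAAIISGY
```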

Sequences:

>Translated_407_residues
MDNAAIISGYSVWMPYAEDCSQLIDNLKQGKRVARTPWFTSNEEAIKCGFKGNPSVATLKQVDDSALDLLSQLIDEALEQ
AMLDKHCLAGRNVRVYLTGIGPRIDGLDYKSFYNYNDVEDINLTQSITNLHASKMSQDTISSHLARKYRLQYLPPNMNCT
SNSSLTAVHLATQGIEQGGIDLAIVLNCSKIKTQDIWFLETQSMLDSEQVQPFGENSKGVAFAEGFSALLLESAHHRRAR
QQSEGVRVQTTYTQISAGRSNDASWLSTNVHKVMQAAMKQAEIALDDLAAILPHGNGSAVSDNAEAKAIAMFAGERQIPV
LAYKGQIGYTATGSGVVDLIIGHHSLTHHQLIAPVGNDVIIDSMASLVLTDGSVTNHSKRHLLKVGVGVDGSVIGVVMTN
MQAGRAK
>Mature_407_residues
MDNAAIISGYSVWMPYAEDCSQLIDNLKQGKRVARTPWFTSNEEAIKCGFKGNPSVATLKQVDDSALDLLSQLIDEALEQ
AMLDKHCLAGRNVRVYLTGIGPRIDGLDYKSFYNYNDVEDINLTQSITNLHASKMSQDTISSHLARKYRLQYLPPNMNCT
SNSSLTAVHLATQGIEQGGIDLAIVLNCSKIKTQDIWFLETQSMLDSEQVQPFGENSKGVAFAEGFSALLLESAHHRRAR
QQSEGVRVQTTYTQISAGRSNDASWLSTNVHKVMQAAMKQAEIALDDLAAILPHGNGSAVSDNAEAKAIAMFAGERQIPV
LAYKGQIGYTATGSGVVDLIIGHHSLTHHQLIAPVGNDVIIDSMASLVLTDGSVTNHSKRHLLKVGVGVDGSVIGVVMTN
MQAGRAK

Specific function: May be involved in the biosynthesis of the oleandomycin lactone ring [H]

COG id: COG0304

COG function: function code IQ; 3-oxoacyl-(acyl-carrier-protein) synthase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 2 acyl carrier domains [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001227
- InterPro:   IPR009081
- InterPro:   IPR014043
- InterPro:   IPR016035
- InterPro:   IPR000794
- InterPro:   IPR002198
- InterPro:   IPR018201
- InterPro:   IPR014031
- InterPro:   IPR014030
- InterPro:   IPR016036
- InterPro:   IPR016040
- InterPro:   IPR006163
- InterPro:   IPR020842
- InterPro:   IPR020801
- InterPro:   IPR020841
- InterPro:   IPR013968
- InterPro:   IPR020806
- InterPro:   IPR020802
- InterPro:   IPR015083
- InterPro:   IPR006162
- InterPro:   IPR001031
- InterPro:   IPR016039
- InterPro:   IPR016038 [H]

Pfam domain/function: PF00698 Acyl_transf_1; PF00106 adh_short; PF08990 Docking; PF00109 ketoacyl-synt; PF02801 Ketoacyl-synt_C; PF08659 KR; PF00550 PP-binding; PF00975 Thioesterase [H]

EC number: NA

Molecular weight: Translated: 44110; Mature: 44110
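
A molecular weight of this kind can be estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below uses commonly tabulated average residue masses in daltons; which mass table this record's pipeline used is not stated, so small differences from the listed value are possible. The names are illustrative.

```python
# Commonly tabulated average residue masses (Da), i.e. amino acid minus water
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein) + WATER_MASS


print(round(molecular_weight("GG"), 2))  # 132.12 (glycylglycine)
```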

Theoretical pI: Translated: 6.51; Mature: 6.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
4.2 %Cys+Met (Mature Protein)
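
These percentages are residue counts divided by the protein length. The 407-residue sequence above contains 5 Cys and 12 Met, which gives 1.2 %, 2.9 % and 4.2 % when rounded to one decimal place. A minimal sketch of the calculation (the function name is hypothetical):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


# Counts for this record: 5 Cys and 12 Met in 407 residues
print(round(100.0 * 5 / 407, 1))         # 1.2
print(round(100.0 * 12 / 407, 1))        # 2.9
print(round(100.0 * (5 + 12) / 407, 1))  # 4.2
```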

Secondary structure:

>Translated Secondary Structure
MDNAAIISGYSVWMPYAEDCSQLIDNLKQGKRVARTPWFTSNEEAIKCGFKGNPSVATLK
CCCCEEEECHHEECCCHHHHHHHHHHHHHCCHHHCCCCCCCCCCEEEECCCCCCCCHHHH
QVDDSALDLLSQLIDEALEQAMLDKHCLAGRNVRVYLTGIGPRIDGLDYKSFYNYNDVED
HCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCHHHHCCCCCCCC
INLTQSITNLHASKMSQDTISSHLARKYRLQYLPPNMNCTSNSSLTAVHLATQGIEQGGI
CHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCCCCCCCCCEEEEEEEHHCCCCCCC
DLAIVLNCSKIKTQDIWFLETQSMLDSEQVQPFGENSKGVAFAEGFSALLLESAHHRRAR
EEEEEEECCCCCCCCEEEEEHHHHHCHHHCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHH
QQSEGVRVQTTYTQISAGRSNDASWLSTNVHKVMQAAMKQAEIALDDLAAILPHGNGSAV
HHHCCEEEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
SDNAEAKAIAMFAGERQIPVLAYKGQIGYTATGSGVVDLIIGHHSLTHHQLIAPVGNDVI
CCCCCCEEEEEECCCCCCEEEEECCCCCEEECCCCEEEEEECCCCCCCCEEECCCCCCHH
IDSMASLVLTDGSVTNHSKRHLLKVGVGVDGSVIGVVMTNMQAGRAK
HHHHHHHEEECCCCCCCCCCEEEEECCCCCCCEEEEEEECCCCCCCC
>Mature Secondary Structure
MDNAAIISGYSVWMPYAEDCSQLIDNLKQGKRVARTPWFTSNEEAIKCGFKGNPSVATLK
CCCCEEEECHHEECCCHHHHHHHHHHHHHCCHHHCCCCCCCCCCEEEECCCCCCCCHHHH
QVDDSALDLLSQLIDEALEQAMLDKHCLAGRNVRVYLTGIGPRIDGLDYKSFYNYNDVED
HCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCHHHHCCCCCCCC
INLTQSITNLHASKMSQDTISSHLARKYRLQYLPPNMNCTSNSSLTAVHLATQGIEQGGI
CHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCCCCCCCCCEEEEEEEHHCCCCCCC
DLAIVLNCSKIKTQDIWFLETQSMLDSEQVQPFGENSKGVAFAEGFSALLLESAHHRRAR
EEEEEEECCCCCCCCEEEEEHHHHHCHHHCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHH
QQSEGVRVQTTYTQISAGRSNDASWLSTNVHKVMQAAMKQAEIALDDLAAILPHGNGSAV
HHHCCEEEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
SDNAEAKAIAMFAGERQIPVLAYKGQIGYTATGSGVVDLIIGHHSLTHHQLIAPVGNDVI
CCCCCCEEEEEECCCCCCEEEEECCCCCEEECCCCEEEEEECCCCCCCCEEECCCCCCHH
IDSMASLVLTDGSVTNHSKRHLLKVGVGVDGSVIGVVMTNMQAGRAK
HHHHHHHEEECCCCCCCCCCEEEEECCCCCCCEEEEEEECCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8107683 [H]