Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is bioC

Identifier: 157160252

GI number: 157160252

Start: 838125

End: 838880

Strand: Direct

Name: bioC

Synonym: EcHS_A0831

Alternate gene names: 157160252

Gene position: 838125-838880 (Clockwise)

Preceding gene: 157160251

Following gene: 157160253

Centisome position: 18.05
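
The positional fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state how the value was derived, so treat that formula as an assumption that happens to reproduce the reported number. The flanking-window coordinates assume a gene on the direct strand.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538
START, END = 838_125, 838_880
FLANK = 100  # size of the upstream/downstream windows shown in the record

# 1-based inclusive coordinates: length is end - start + 1.
gene_length = END - START + 1

# Assumed definition: start coordinate as a percentage of genome length.
centisome = round(100 * START / GENOME_LENGTH, 2)

# Flanking windows for a direct-strand gene (assumption: inclusive coordinates).
upstream = (START - FLANK, START - 1)
downstream = (END + 1, END + FLANK)

print(gene_length, centisome, upstream, downstream)
```

A length of 756 bases corresponds to 252 codons, that is 251 residues plus the stop codon, consistent with the amino acid count given later in the record.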

GC content: 56.48

Gene sequence:

>756_bases
ATGGCAACGGTTAATAAACAAGCCATTGCAGCGGCATTTGGTCGGGCAGCCGCACACTATGAGCAACATGCAGATCTACA
GCGCCAGAGTGCTGACGCCTTACTGGCAATGCTTCCACAGCGTAAATACACCCACGTACTGGACGCGGGTTGTGGACCTG
GCTGGATGAGCCGCCACTGGCGGGAACGTCACGCGCAGGTGACGGCCTTAGATCTCTCGCCGCCAATGCTTGTTCAGGCA
CGCCAGAAGGATGCCGCAGACCATTATCTGGCGGGAGATATCGAATCCCTGCCGTTAGCGACTGCGACGTTCGATCTTGC
ATGGAGCAATCTCGCAGTGCAGTGGTGCGGTAATTTATCCACGGCACTCCGCGAGCTGTATCGGGTGGTGCGCCCCAAAG
GCGTGGTCGCGTTTACCACGCTGGTGCAGGGATCGTTACCCGAACTGCATCAGGCGTGGCAGGCGGTGGACGAGCGTCCG
CATGCTAATCGCTTTTTACCGCCAGATGAAATCGAACAGTCGCTGAACGGCGTGCATTATCAACATCATATTCAGCCCAT
CACGCTGTGGTTTGATGATGCGCTCAGTGCCATGCGTTCGCTGAAAGGCATCGGTGCCACGCATCTTCATGAAGGGCGCG
ACCCGCGAATATTAACGCGTTCGCAGTTGCAGCGATTGCAACTGGCCTGGCCGCAACAGCAGGGGCGATATCCTCTGACG
TATCATCTTTTTTTGGGAGTGATTGCTCGTGAGTAA
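
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch; the function name `gc_content` is illustrative and not part of any database tooling, and it assumes the input contains only unambiguous bases with any FASTA header line already removed.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, rounded to two decimals."""
    # Drop line breaks and spaces so wrapped FASTA text can be passed directly.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# Small check on the first ten bases of the gene (6 of 10 are G or C).
print(gc_content("ATGGCAACGG"))
```

Passing the full 756-base sequence to the same function should allow the reported value of 56.48 to be verified.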

Upstream 100 bases:

>100_bases
ATTCGCCCGCCAACCGTACCCGCTGGTACTGCGCGACTGCGCTTAACGCTAACCGCTGCGCATGAAATGCAGGATATCGA
CCGTCTGCTGGAGGTGCTGC

Downstream 100 bases:

>100_bases
ACGTTATTTTGTCACCGGAACGGATACCGAAGTGGGGAAAACTGTCGCCAGTTGTGCACTTTTACAAGCCGCAAAGGCAG
CAGGCTACCGGACGGCAGGT

Product: biotin biosynthesis protein BioC

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 251; Mature: 250

Protein sequence:

>251_residues
MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQA
RQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERP
HANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLT
YHLFLGVIARE
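
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard genetic code is sufficient for this gene (the bacterial table differs mainly in alternative start codons, which do not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGCAACGGTTAATAAACAAGCC"))
print(translate("GCTCGTGAGTAA"))
```

The two fragments translate to the first eight and last three residues of the protein sequence shown above.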

Sequences:

>Translated_251_residues
MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQA
RQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERP
HANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLT
YHLFLGVIARE
>Mature_250_residues
ATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQAR
QKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERPH
ANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLTY
HLFLGVIARE

Specific function: BioC is involved in an early, but chemically unexplored, step in the conversion of pimelic acid to biotin.

COG id: COG0500

COG function: function code QR; SAM-dependent methyltransferases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the methyltransferase superfamily

Homologues:

Organism=Escherichia coli, GI1786994, Length=251, Percent_Identity=100, Blast_Score=514, Evalue=1e-147,
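
Homologue entries use a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. The sketch below shows one way such a line could be turned into a dictionary; the function name `parse_homologue` is hypothetical, and the handling of the bare GI token is an assumption based only on the single entry in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dict of string fields."""
    fields = {}
    # Tolerate a trailing comma and surrounding whitespace.
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            # Assumption: a bare token such as "GI1786994" carries the GI number.
            fields["GI"] = token[2:]
    return fields


entry = parse_homologue(
    "Organism=Escherichia coli, GI1786994, Length=251, "
    "Percent_Identity=100, Blast_Score=514, Evalue=1e-147,"
)
print(entry["Organism"], entry["GI"], float(entry["Evalue"]))
```

Values are kept as strings so that the caller can decide which fields to convert to numbers.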

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): BIOC_ECOLI (P12999)

Other databases:

- EMBL:   J04423
- EMBL:   A11534
- EMBL:   U00096
- EMBL:   AP009048
- PIR:   A64814
- RefSeq:   AP_001408.1
- RefSeq:   NP_415298.1
- ProteinModelPortal:   P12999
- SMR:   P12999
- STRING:   P12999
- EnsemblBacteria:   EBESCT00000002795
- EnsemblBacteria:   EBESCT00000018394
- GeneID:   945388
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW0760
- KEGG:   eco:b0777
- EchoBASE:   EB0117
- EcoGene:   EG10119
- eggNOG:   COG0500
- GeneTree:   EBGT00050000011109
- HOGENOM:   HBG678410
- OMA:   THLHQGR
- ProtClustDB:   PRK10258
- BioCyc:   EcoCyc:EG10119-MONOMER
- Genevestigator:   P12999
- InterPro:   IPR011814
- InterPro:   IPR013216
- TIGRFAMs:   TIGR02072

Pfam domain/function: PF08241 Methyltransf_11

EC number: NA

Molecular weight: Translated: 28277; Mature: 28145

Theoretical pI: Translated: 8.30; Mature: 8.30

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
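
The Cys/Met percentages above follow directly from residue counts. The sketch below recomputes them from the protein sequence given in this record, taking the mature form to be the translated sequence without its initiator methionine, as the Mature_250_residues entry shows. The function name `cys_met_percent` is illustrative.

```python
# Translated protein sequence copied from this record.
TRANSLATED = (
    "MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQA"
    "RQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERP"
    "HANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLT"
    "YHLFLGVIARE"
)
# Mature form: initiator methionine removed.
MATURE = TRANSLATED[1:]


def cys_met_percent(seq: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal."""
    n = len(seq)
    cys = 100 * seq.count("C") / n
    met = 100 * seq.count("M") / n
    return round(cys, 1), round(met, 1), round(cys + met, 1)


print(cys_met_percent(TRANSLATED))
print(cys_met_percent(MATURE))
```

The translated sequence contains two cysteines and five methionines; removing the initiator methionine accounts for the lower Met percentage of the mature protein.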

Secondary structure:

>Translated Secondary Structure
MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHW
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCHHHHHHH
RERHAQVTALDLSPPMLVQARQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLS
HHHCCEEEEEECCCCHHHHHHHHHCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCHH
TALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERPHANRFLPPDEIEQSLNGVHY
HHHHHHHHHHCCCCCEEHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCH
QHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLT
HHCCCEEEEHHHHHHHHHHHHHCCCCHHCCCCCCCCEECHHHHHHHHHCCCHHCCCCCCH
YHLFLGVIARE
HHHHHHHHHCC
>Mature Secondary Structure 
ATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHW
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCHHHHHHH
RERHAQVTALDLSPPMLVQARQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLS
HHHCCEEEEEECCCCHHHHHHHHHCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCHH
TALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERPHANRFLPPDEIEQSLNGVHY
HHHHHHHHHHCCCCCEEHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCH
QHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLT
HHCCCEEEEHHHHHHHHHHHHHCCCCHHCCCCCCCCEECHHHHHHHHHCCCHHCCCCCCH
YHLFLGVIARE
HHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 3058702; 9278503