Definition Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome.
Accession NC_008769
Length 4,374,522

The map label for this gene is embR

Identifier: 121637195

GI number: 121637195

Start: 1442805

End: 1443971

Strand: Reverse

Name: embR

Synonym: BCG_1326c

Alternate gene names: 121637195

Gene position: 1443971-1442805 (Counterclockwise)

Preceding gene: 121637196

Following gene: 121637194

Centisome position: 33.01
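
The gene length and centisome position can be reproduced from the coordinates in this record. The following is a minimal Python sketch, not the database's own method. It assumes 1-based inclusive coordinates, and it assumes the centisome is the gene's 5' coordinate (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The variable names are illustrative only.

```python
# Values copied from the record.
GENOME_LENGTH = 4_374_522          # NC_008769
START, END = 1_442_805, 1_443_971  # gene coordinates
STRAND = "Reverse"

# 1-based inclusive coordinates: length = end - start + 1.
gene_length = END - START + 1

# Assumption: for a reverse-strand gene the 5' end is the higher coordinate.
five_prime = END if STRAND == "Reverse" else START
centisome = 100 * five_prime / GENOME_LENGTH

print(gene_length)          # 1167
print(round(centisome, 2))  # 33.01
```

Under these assumptions the result agrees with the 1167-base gene sequence and the listed centisome of 33.01. Using the lower coordinate instead would give 32.98.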

GC content: 65.04

Gene sequence:

>1167_bases
ATGGCTGGTAGCGCGACAGTGGAGAAGCGGCTCGACTTCGGCCTGCTTGGACCATTGCAGATGACTATCGACGGCACCCC
GGTGCCATCGGGCACCCCCAAGCAACGGGCTGTGCTAGCCATGTTGGTCATCAACCGCAACAGGCCCGTAGGAGTCGACG
CCCTAATCACCGCCCTCTGGGAGGAGTGGCCACCCTCGGGCGCACGCGCGAGTATCCACTCCTACGTGTCTAATCTGCGT
AAGCTCCTCGGTGGCGCCGGGATCGACCCACGGGTGGTGTTGGCCGCAGCGCCGCCGGGTTATCGGCTCAGCATCCCCGA
CAACACTTGCGATCTGGGGCGGTTTGTTGCCGAAAAAACCGCGGGCGTGCACGCGGCCGCCGCCGGCCGGTTCGAACAAG
CCAGCCGCCACCTGTCGGCCGCATTGAGAGAATGGCGTGGGCCGGTGCTCGATGACCTGCGCGACTTCCAGTTCGTCGAA
CCCTTTGCCACGGCGCTGGTAGAAGACAAGGTTCTTGCCCATACCGCCAAGGCGGAGGCCGAAATCGCGTGTGGGCGGGC
CAGCGCAGTGATCGCCGAGCTCGAGGCTCTGACATTCGAACACCCCTACCGGGAGCCGCTGTGGACACAGCTGATCACCG
CCTACTACCTCTCCGACCGGCAATCCGATGCGCTGGGCGCCTATCGCCGGGTGAAGACAACACTGGCCGACGACCTCGGC
ATCGACCCCGGTCCGACGTTGCGCGCTCTCAACGAGCGGATTCTGCGTCAGCAACCGCTGGATGCCAAGAAGTCCGCCAA
AACCACCGCTGCCGGCACCGTCACGGTGCTCGATCAGCGCACCATGGCGTCGGGCCAGCAGGCGGTGGCCTACCTGCACG
ACATCGCCTCGGGTCGCGGCTACCCACTGCAAGCCGCGGCGACCCGGATCGGGCGTCTGCATGACAACGACATCGTCCTA
GACAGCGCCAACGTCAGCCGCCACCACGCCGTCATCGTCGACACGGGCACCAACTACGTCATCAACGACCTCCGATCGTC
CAACGGCGTGCATGTGCAGCACGAGCGAATCCGCTCCGCGGTCACGCTGAACGACGGCGACCACATTCGCATCTGTGACC
ATGAATTCACGTTCCAGATCAGCGCGGGGACGCATGGCGGCACGTAG
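
The GC content field is the percentage of G and C bases in the gene sequence; 65.04 over 1167 bases corresponds to 759 G or C bases. The following is a minimal sketch of that calculation, with `gc_percent` as a hypothetical helper name. It is shown on the first 27 bases only; the full sequence can be pasted in with its line breaks.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    return 100 * sum(base in "GC" for base in seq) / len(seq)

# First 27 bases (nine codons) of the gene sequence above.
print(round(gc_percent("ATGGCTGGTAGCGCGACAGTGGAGAAG"), 2))  # 59.26

# The record's value for the whole gene implies 759 G+C out of 1167 bases.
print(round(100 * 759 / 1167, 2))  # 65.04
```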

Upstream 100 bases:

>100_bases
AGATCCTGGGTTCTTACCGCATCGGCGAAACGTGGTATCACAATGACCGGTAGCGACACCCTATTGGCACCTTGGCACCG
CAAGCCACGGAGGACCCGCA

Downstream 100 bases:

>100_bases
ATCCGTGGACACAAGCCCTTAACACCGATGTCCACGGATATGACTGCTACACAAAGCTTTTCGTGGCAACGTCGCGCGCT
GGCCGCCGGCGCTGGGCTGC

Product: putative transcriptional regulatory protein embR

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 388; Mature: 387

Protein sequence:

>388_residues
MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALWEEWPPSGARASIHSYVSNLR
KLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVE
PFATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLG
IDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVL
DSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT
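
The protein sequence follows from the gene sequence: 1167 bases make 389 codons, that is 388 residues plus a stop codon. The following sketch translates the first and last few codons as a spot check. It assumes the standard codon assignments; `translate` and `CODON_TABLE` are illustrative names, not part of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTGGTAGCGCGACAGTGGAGAAG"))  # MAGSATVEK (first nine codons)
print(translate("ACGCATGGCGGCACGTAG"))           # THGGT (last five codons and TAG stop)
print(1167 // 3 - 1)                             # 388 residues
```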

Sequences:

>Translated_388_residues
MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALWEEWPPSGARASIHSYVSNLR
KLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVE
PFATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLG
IDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVL
DSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT
>Mature_387_residues
AGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALWEEWPPSGARASIHSYVSNLRK
LLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVEP
FATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLGI
DPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVLD
SANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT

Specific function: Unknown

COG id: COG3629

COG function: function code T; DNA-binding transcriptional activator of the SARP family

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 FHA domain

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): EMBR_MYCBO (P66800)

Other databases:

- EMBL:   BX248338
- RefSeq:   NP_854952.1
- ProteinModelPortal:   P66800
- SMR:   P66800
- EnsemblBacteria:   EBMYCT00000016221
- GeneID:   1090580
- GenomeReviews:   BX248333_GR
- KEGG:   mbo:Mb1298c
- GeneTree:   EBGT00050000014812
- HOGENOM:   HBG700221
- OMA:   LLGPLEM
- ProtClustDB:   CLSK791063
- BioCyc:   MBOV233413:MB1298C-MONOMER
- GO:   GO:0005622
- GO:   GO:0006350
- InterPro:   IPR005158
- InterPro:   IPR000253
- InterPro:   IPR001867
- InterPro:   IPR016032
- InterPro:   IPR008984
- InterPro:   IPR011990
- InterPro:   IPR011991
- Gene3D:   G3DSA:2.60.200.20
- Gene3D:   G3DSA:1.25.40.10
- Gene3D:   G3DSA:1.10.10.10
- SMART:   SM00240
- SMART:   SM00862

Pfam domain/function: PF03704 BTAD; PF00498 FHA; PF00486 Trans_reg_C; SSF46894 Bipartite_resp_reg_C-effector; SSF49879 SMAD_FHA

EC number: NA

Molecular weight: Translated: 41934; Mature: 41803
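
The translated and mature forms differ by one residue (388 versus 387), and the mature sequence is the translated sequence without its first methionine. The following sketch checks that the weight difference matches a single methionine residue. The masses are approximate average values supplied here as assumptions, not taken from the record.

```python
# Approximate average masses in Da (assumed values, not from the record).
METHIONINE = 149.21
WATER = 18.02
met_residue = METHIONINE - WATER  # mass lost when one Met residue is removed

translated_mw, mature_mw = 41934, 41803  # from the record
difference = translated_mw - mature_mw

print(difference)             # 131
print(round(met_residue, 2))  # 131.19
```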

Theoretical pI: Translated: 7.23; Mature: 7.23

Prosite motif: PS50006 FHA_DOMAIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
0.8 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
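
These percentages can be recomputed by counting residues in the protein sequence. The following is a minimal sketch, assuming the mature form is the translated sequence without its initiator methionine; `percent` is a hypothetical helper.

```python
# Translated protein sequence copied from the record (388 residues).
TRANSLATED = (
    "MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALWEEWPPSGARASIHSYVSNLR"
    "KLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVE"
    "PFATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLG"
    "IDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVL"
    "DSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT"
)
MATURE = TRANSLATED[1:]  # assumption: initiator Met removed

def percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of the given one-letter residues, to 1 decimal."""
    return round(100 * sum(seq.count(r) for r in residues) / len(seq), 1)

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.8 1.0 1.8
# Mature 0.8 0.8 1.6
```

The counts behind these values are 3 Cys and 4 Met in the translated protein, and 3 Cys and 3 Met in the mature protein.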

Secondary structure:

>Translated Secondary Structure
MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALW
CCCCCCHHHHCCCCCCCCEEEEECCCCCCCCCCHHHHEEEEEEEECCCCCCHHHHHHHHH
EEWPPSGARASIHSYVSNLRKLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKT
HHCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEECCCCCHHHHHHHHHHH
AGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVEPFATALVEDKVLAHTAKAEA
CCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
EIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLG
HHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCC
IDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRG
CCCCHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEECCHHHHCHHHHHHHHHHHHCCCC
YPLQAAATRIGRLHDNDIVLDSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSA
CCHHHHHHHHCCCCCCCEEEECCCCCCCEEEEEECCCCHHHHHHCCCCCCEEEHHHHHHE
VTLNDGDHIRICDHEFTFQISAGTHGGT
EEECCCCEEEEECCEEEEEEECCCCCCC
>Mature Secondary Structure 
AGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALW
CCCCCHHHHCCCCCCCCEEEEECCCCCCCCCCHHHHEEEEEEEECCCCCCHHHHHHHHH
EEWPPSGARASIHSYVSNLRKLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKT
HHCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEECCCCCHHHHHHHHHHH
AGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVEPFATALVEDKVLAHTAKAEA
CCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
EIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLG
HHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCC
IDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRG
CCCCHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEECCHHHHCHHHHHHHHHHHHCCCC
YPLQAAATRIGRLHDNDIVLDSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSA
CCHHHHHHHHCCCCCCCEEEECCCCCCCEEEEEECCCCHHHHHHCCCCCCEEEHHHHHHE
VTLNDGDHIRICDHEFTFQISAGTHGGT
EEECCCCEEEEECCEEEEEEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: PMID 12788972