Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is nemR

Identifier: 157161112

GI number: 157161112

Start: 1747976

End: 1748575

Strand: Direct

Name: nemR

Synonym: EcHS_A1726

Alternate gene names: 157161112

Gene position: 1747976-1748575 (Clockwise)

Preceding gene: 157161111

Following gene: 157161113

Centisome position: 37.64

GC content: 52.33

Gene sequence:

>600_bases
ATGAACAAACACACCGAACATGATACTCGCGAACATCTCCTGGCGACGGGCGAGCAACTTTGCCTGCAACGTGGATTCAC
CGGGATGGGGCTAAGCGAATTACTAAAAACCGCTGAAGTGCCGAAAGGGTCCTTCTATCACTACTTTCGCTCTAAAGAAG
CGTTTGGCGTTGCCATGCTTGAGCGTCATTACGCCGCATATCACCAGCGACTGACTGAGTTGCTGCAATCCGGCGAAGGT
AACTACCGCGACCGCATACTGGCTTATTACCAGCAAACACTGAACCAGTTTTGCCAACATGGAACCATCAGTGGTTGCCT
GACAGTAAAACTCTCTGCCGAAGTGTGCGATCTGTCAGAAGATATGCGCAGCGCGATGGATAAAGGTGCTCGCGGCGTGA
TCGCCCTGCTCTCTCAGGCGCTGGAAAATGGCCGTGAGAACCATTGTTTAACCTTTTGTGGCGAACCGCTGCAACAGGCA
CAAGTGCTTTACGCACTGTGGCTTGGCGCGAATCTGCAGGCCAAAATTTCGCGCAGTTTCGAGCCACTGGAAAACGCGCT
GGCCCATGTAAAAAACATTATTGCGACGCCTGCCGTTTAG

Upstream 100 bases:

>100_bases
CGCCCTCCTCAGATAAGATTATTACCATTATTGAAGCTGTTAATGTCCAAAGTAGCAACTTTGCTTGCACTAGACCGACT
GGTCTACTACACTCCAACGC

Downstream 100 bases:

>100_bases
CAGGCATTTTTTCACCAGACGACCGGGAGCCTTTATGTCATCTGAAAAACTGTATTCCCCACTGAAAGTGGGCGCGATCA
CGGCGGCAAACCGTATTTTT

Product: TetR family transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 199; Mature: 199

Protein sequence:

>199_residues
MNKHTEHDTREHLLATGEQLCLQRGFTGMGLSELLKTAEVPKGSFYHYFRSKEAFGVAMLERHYAAYHQRLTELLQSGEG
NYRDRILAYYQQTLNQFCQHGTISGCLTVKLSAEVCDLSEDMRSAMDKGARGVIALLSQALENGRENHCLTFCGEPLQQA
QVLYALWLGANLQAKISRSFEPLENALAHVKNIIATPAV

Sequences:

>Translated_199_residues
MNKHTEHDTREHLLATGEQLCLQRGFTGMGLSELLKTAEVPKGSFYHYFRSKEAFGVAMLERHYAAYHQRLTELLQSGEG
NYRDRILAYYQQTLNQFCQHGTISGCLTVKLSAEVCDLSEDMRSAMDKGARGVIALLSQALENGRENHCLTFCGEPLQQA
QVLYALWLGANLQAKISRSFEPLENALAHVKNIIATPAV
>Mature_199_residues
MNKHTEHDTREHLLATGEQLCLQRGFTGMGLSELLKTAEVPKGSFYHYFRSKEAFGVAMLERHYAAYHQRLTELLQSGEG
NYRDRILAYYQQTLNQFCQHGTISGCLTVKLSAEVCDLSEDMRSAMDKGARGVIALLSQALENGRENHCLTFCGEPLQQA
QVLYALWLGANLQAKISRSFEPLENALAHVKNIIATPAV

Specific function: Represses the transcription of the nemRA operon by binding to the nemR box

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH tetR-type DNA-binding domain

Homologues:

Organism=Escherichia coli, GI145693141, Length=199, Percent_Identity=100, Blast_Score=414, Evalue=1e-117,

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): NEMR_ECOL6 (P67431)

Other databases:

- EMBL:   AE014075
- RefSeq:   NP_753937.1
- ProteinModelPortal:   P67431
- SMR:   P67431
- EnsemblBacteria:   EBESCT00000045186
- GeneID:   1036152
- GenomeReviews:   AE014075_GR
- KEGG:   ecc:c2042
- GeneTree:   EBGT00050000009067
- HOGENOM:   HBG658459
- OMA:   TAGVPKG
- ProtClustDB:   CLSK880155
- InterPro:   IPR009057
- InterPro:   IPR015893
- InterPro:   IPR011075
- InterPro:   IPR001647
- Gene3D:   G3DSA:1.10.357.10

Pfam domain/function: PF00440 TetR_N; SSF46689 Homeodomain_like; SSF48498 TetR_like_C

EC number: NA

Molecular weight: Translated: 22276; Mature: 22276

Theoretical pI: Translated: 6.99; Mature: 6.99

Prosite motif: PS01081 HTH_TETR_1; PS50977 HTH_TETR_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

3.0 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
5.5 %Cys+Met (Translated Protein)
3.0 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
5.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNKHTEHDTREHLLATGEQLCLQRGFTGMGLSELLKTAEVPKGSFYHYFRSKEAFGVAML
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCHHHHHHHCCHHHHHHHH
ERHYAAYHQRLTELLQSGEGNYRDRILAYYQQTLNQFCQHGTISGCLTVKLSAEVCDLSE
HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHH
DMRSAMDKGARGVIALLSQALENGRENHCLTFCGEPLQQAQVLYALWLGANLQAKISRSF
HHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCHHHHHHCH
EPLENALAHVKNIIATPAV
HHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MNKHTEHDTREHLLATGEQLCLQRGFTGMGLSELLKTAEVPKGSFYHYFRSKEAFGVAML
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCHHHHHHHCCHHHHHHHH
ERHYAAYHQRLTELLQSGEGNYRDRILAYYQQTLNQFCQHGTISGCLTVKLSAEVCDLSE
HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHH
DMRSAMDKGARGVIALLSQALENGRENHCLTFCGEPLQQAQVLYALWLGANLQAKISRSF
HHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCHHHHHHCH
EPLENALAHVKNIIATPAV
HHHHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 12471157