Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is nsr [H]

Identifier: 157161645

GI number: 157161645

Start: 2297531

End: 2298190

Strand: Direct

Name: nsr [H]

Synonym: EcHS_A2299

Alternate gene names: 157161645

Gene position: 2297531-2298190 (Clockwise)

Preceding gene: 157161642

Following gene: 157161651

Centisome position: 49.48
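
The gene length and centisome position can be cross-checked from the coordinates above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome value is the start coordinate expressed as a percentage of the genome length. The function names are illustrative and not part of the record.

```python
GENOME_LENGTH = 4_643_538            # Length field of NC_009800
START, END = 2_297_531, 2_298_190    # Start and End fields for this gene


def gene_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # 660
print(round(centisome(START, GENOME_LENGTH), 2))   # 49.48
```

Both values agree with the 660-base gene sequence and the centisome position listed in this record.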

GC content: 44.55

Gene sequence:

>660_bases
ATGAGTGAATCCGCGTTTAAGGATTGCTTTTTAACGGATGTTTCAGCCGATACGCGGCTGTTTCATTTTTTAGCGCGTGA
CTACATTGTGCAGGAAGGGCAACAGCCGTCCTGGCTGTTTTATCTGACGCGAGGCCGCGCCAGGCTTTACGCCACGCTTG
CTAATGGTCGCGTGTCGCTGATCGATTTCTTTGCCGCCCCCTGTTTTATTGGTAAGATTGAGTTAATCGACAAAGACCAT
GAACCGCGTGCGGTGCAGGCTATTGAAGAGTGTTGGTGCCTTGCGCTCTCTATGAAACATTACCGCCCGCTGTTATTAAA
CGACACGCTATTTTTACGAAAACTCTGCGTCACCTTAAGTCATAAAAATTATCGTAATATTGTTTCTTTAACTCAGAATC
AATCATTTCCGTTAGTTAATCGCCTGGCAGCTTTTATATTACTCTCGCAGGAAGGTGATCTTTATCACGAAAAGCATACG
CAAGCGGCAGAGTATTTAGGCGTTTCTTATCGACATCTTTTATATGTTCTCGCGCAATTCATTCACGACGGTTTATTAAC
TAAAAGCAAGAAAGGGTATCTCATTAAAAACAGAAAGCAGTTGTCAGGACTGGCGCTGGAGATGGACCCGGAGAATAAAT
TCTCCGGGATGATGCAGTAA
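
The GC content field above is presumably the percentage of G and C bases in this gene sequence. The sketch below shows one way to compute such a value from a FASTA-style block. The helper name `gc_content` is hypothetical, and the input is a toy sequence rather than the gene above.

```python
def gc_content(fasta_text: str) -> float:
    """Percentage of G and C among the bases of a FASTA-style record."""
    # Drop header lines (starting with '>') and join the wrapped sequence lines.
    seq = "".join(
        line.strip() for line in fasta_text.splitlines() if not line.startswith(">")
    ).upper()
    if not seq:
        raise ValueError("no sequence found")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Toy input, not the gene above: 4 of 8 bases are G or C.
example = ">toy\nATGC\nGGAT\n"
print(gc_content(example))  # 50.0
```

Applying the same function to the 660-base block above should give a value comparable to the reported GC content, provided the record uses this definition.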

Upstream 100 bases:

>100_bases
TTGAATCTTACCAACCGCGTACGTATGCTAAATATGAGAAATCTCATAGCGGATAAACATCGTGAAAGAAATCCACAATA
ATGATCTTAAGCAGCAATTG

Downstream 100 bases:

>100_bases
AAATTATTTGCAATAGCGCGATTGCCGGATGCAACGCTTAATACGTTTTATCCGGTCTACAAATCGAGCATTACGCCAGA
CCAATAAAGAACCCGGCAAT

Product: DNA-binding transcriptional activator YeiL

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 219; Mature: 218
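
The residue counts follow from the gene length: 660 bases form 220 codons, the last of which is the stop codon TAA, leaving 219 translated residues. The mature count of 218 matches the listed mature sequence, which lacks the initiator methionine. The sketch below illustrates this with the standard genetic code. It assumes the standard code applies here, and `translate` is an illustrative helper, not a tool used by the database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence in this record.
print(translate("ATGAGTGAATCCGCGTTTAAG"))  # MSESAFK
print(translate("ATGATGCAGTAA"))           # MMQ

# 660 bases = 220 codons; one is the stop codon, leaving 219 residues.
print(660 // 3 - 1)  # 219
```

The two translated fragments match the start (MSESAFK) and end (MMQ) of the protein sequence given in this record.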

Protein sequence:

>219_residues
MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDH
EPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT
QAAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ

Sequences:

>Translated_219_residues
MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDH
EPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT
QAAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
>Mature_218_residues
SESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDHE
PRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHTQ
AAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ

Specific function: Transcription regulator involved in mid-term, stationary-phase viability under nitrogen starvation. Might control expression of the salvage pathways or in some other way repress the recycling of nucleobases to nucleic acids and enhance their use as genera

COG id: COG0664

COG function: function code T; cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH crp-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1788487, Length=219, Percent_Identity=98.6301369863014, Blast_Score=450, Evalue=1e-128

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018490
- InterPro:   IPR000595
- InterPro:   IPR012318
- InterPro:   IPR014710 [H]

Pfam domain/function: PF00027 cNMP_binding [H]

EC number: NA

Molecular weight: Translated: 25272; Mature: 25140

Theoretical pI: Translated: 8.69; Mature: 8.69

Prosite motif: PS50042 CNMP_BINDING_3 ; PS51063 HTH_CRP_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.3 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
4.6 %Cys+Met (Translated Protein)
2.3 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)
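
The percentages above can be reproduced by counting residues in the protein sequence from this record. This is a minimal sketch, assuming the mature form differs from the translated form only by loss of the initiator Met, as the listed sequences suggest. The helper `percent` is illustrative.

```python
TRANSLATED = (
    "MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDH"
    "EPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT"
    "QAAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ"
)
MATURE = TRANSLATED[1:]  # assumption: mature form lacks only the initiator Met


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return round(100.0 * sum(aa in residues for aa in seq) / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 2.3 2.3 4.6
# Mature 2.3 1.8 4.1
```

The translated protein contains 5 Cys and 5 Met in 219 residues. Removing the N-terminal Met leaves 4 Met in 218 residues, which accounts for the lower Met and Cys+Met values for the mature protein.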

Secondary structure:

>Translated Secondary Structure
MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSL
CCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCEEEEEEECCCCCHH
IDFFAAPCFIGKIELIDKDHEPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLS
HHHHHHHHHHCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC
HKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHTQAAEYLGVSYRHLLYVLAQF
CCCHHHHHHHHCCCCCCHHHHHHHHHEECCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHH
IHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
HHHHHHCCCCCCCEEECCHHHCCEEEEECCCCCCCCCCC
>Mature Secondary Structure 
SESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSL
CCCHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCEEEEEEECCCCCHH
IDFFAAPCFIGKIELIDKDHEPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLS
HHHHHHHHHHCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC
HKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHTQAAEYLGVSYRHLLYVLAQF
CCCHHHHHHHHCCCCCCHHHHHHHHHEECCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHH
IHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
HHHHHHCCCCCCCEEECCHHHCCEEEEECCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]