| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
The map label for this gene is nsr [H]
Identifier: 157161645
GI number: 157161645
Start: 2297531
End: 2298190
Strand: Direct
Name: nsr [H]
Synonym: EcHS_A2299
Alternate gene names: 157161645
Gene position: 2297531-2298190 (Clockwise)
Preceding gene: 157161642
Following gene: 157161651
Centisome position: 49.48
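The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of the genome length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538   # Length
GENE_START = 2_297_531      # Start
GENE_END = 2_298_190        # End


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))
    print(round(centisome(GENE_START, GENOME_LENGTH), 2))
```

Under these assumptions the length agrees with the 660 bases listed for the gene sequence, and the percentage rounds to the listed centisome position.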
GC content: 44.55
Gene sequence:
>660_bases ATGAGTGAATCCGCGTTTAAGGATTGCTTTTTAACGGATGTTTCAGCCGATACGCGGCTGTTTCATTTTTTAGCGCGTGA CTACATTGTGCAGGAAGGGCAACAGCCGTCCTGGCTGTTTTATCTGACGCGAGGCCGCGCCAGGCTTTACGCCACGCTTG CTAATGGTCGCGTGTCGCTGATCGATTTCTTTGCCGCCCCCTGTTTTATTGGTAAGATTGAGTTAATCGACAAAGACCAT GAACCGCGTGCGGTGCAGGCTATTGAAGAGTGTTGGTGCCTTGCGCTCTCTATGAAACATTACCGCCCGCTGTTATTAAA CGACACGCTATTTTTACGAAAACTCTGCGTCACCTTAAGTCATAAAAATTATCGTAATATTGTTTCTTTAACTCAGAATC AATCATTTCCGTTAGTTAATCGCCTGGCAGCTTTTATATTACTCTCGCAGGAAGGTGATCTTTATCACGAAAAGCATACG CAAGCGGCAGAGTATTTAGGCGTTTCTTATCGACATCTTTTATATGTTCTCGCGCAATTCATTCACGACGGTTTATTAAC TAAAAGCAAGAAAGGGTATCTCATTAAAAACAGAAAGCAGTTGTCAGGACTGGCGCTGGAGATGGACCCGGAGAATAAAT TCTCCGGGATGATGCAGTAA
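The GC content field can be recomputed from a sequence such as the one above. This is a minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper, and the example uses only the first 24 bases of the gene rather than the full 660.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First 24 bases of the gene sequence listed above.
    print(round(gc_percent("ATGAGTGAATCCGCGTTTAAGGAT"), 2))
```

Passing the complete gene sequence to the same function should give a value comparable to the listed GC content, provided the record uses the same definition.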
Upstream 100 bases:
>100_bases TTGAATCTTACCAACCGCGTACGTATGCTAAATATGAGAAATCTCATAGCGGATAAACATCGTGAAAGAAATCCACAATA ATGATCTTAAGCAGCAATTG
Downstream 100 bases:
>100_bases AAATTATTTGCAATAGCGCGATTGCCGGATGCAACGCTTAATACGTTTTATCCGGTCTACAAATCGAGCATTACGCCAGA CCAATAAAGAACCCGGCAAT
Product: DNA-binding transcriptional activator YeiL
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 219; Mature: 218
Protein sequence:
>219_residues MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDH EPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT QAAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
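The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes the standard codon table and that the mature form simply lacks the initiator methionine, which is consistent with the 219 and 218 residue counts above; `translate` is a hypothetical helper, and only the first and last few codons of the gene are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 0; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


if __name__ == "__main__":
    print(translate("ATGAGTGAATCCGCGTTTAAGGAT"))  # first 8 codons of the gene
    print(translate("ATGATGCAGTAA"))              # last 4 codons, incl. stop
```

The 660-base gene corresponds to 220 codons: 219 residues plus the stop codon.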
Sequences:
>Translated_219_residues MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDH EPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT QAAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
>Mature_218_residues SESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDHE PRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHTQ AAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
Specific function: Transcription regulator involved in mid-term, stationary-phase viability under nitrogen starvation. Might control expression of the salvage pathways or in some other way repress the recycling of nucleobases to nucleic acids and enhance their use as genera
COG id: COG0664
COG function: function code T; cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases
Gene ontology:
Cell location: Cytoplasm (Probable) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH crp-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1788487, Length=219, Percent_Identity=98.6301369863014, Blast_Score=450, Evalue=1e-128,
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR018490
- InterPro: IPR000595
- InterPro: IPR012318
- InterPro: IPR014710 [H]
Pfam domain/function: PF00027 cNMP_binding [H]
EC number: NA
Molecular weight: Translated: 25272; Mature: 25140
Theoretical pI: Translated: 8.69; Mature: 8.69
Prosite motif: PS50042 CNMP_BINDING_3 ; PS51063 HTH_CRP_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.3 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
4.6 %Cys+Met (Translated Protein)
2.3 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
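The Cys/Met percentages can be reproduced by counting residues in the protein sequence listed earlier in the record. The sketch below assumes each percentage is the residue count divided by the sequence length, rounded to one decimal place, and that the mature protein is the translated protein without its initiator methionine; `residue_percent` is a hypothetical helper.

```python
# Translated protein sequence copied from the record (219 residues).
TRANSLATED = (
    "MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGKIELIDKDH"
    "EPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT"
    "QAAEYLGVSYRHLLYVLAQFIHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ"
)
# Assumption: the mature form lacks only the initiator methionine.
MATURE = TRANSLATED[1:]


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(1 for aa in seq if aa in residues)
    return round(100.0 * hits / len(seq), 1)


if __name__ == "__main__":
    for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
        print(label, residue_percent(seq, "C"), residue_percent(seq, "M"),
              residue_percent(seq, "CM"))
```

The drop in %Met between the two forms reflects the single removed N-terminal methionine.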
Secondary structure:
>Translated Secondary Structure
MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSL
CCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCEEEEEEECCCCCHH
IDFFAAPCFIGKIELIDKDHEPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLS
HHHHHHHHHHCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC
HKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHTQAAEYLGVSYRHLLYVLAQF
CCCHHHHHHHHCCCCCCHHHHHHHHHEECCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHH
IHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
HHHHHHCCCCCCCEEECCHHHCCEEEEECCCCCCCCCCC
>Mature Secondary Structure
SESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSL
CCCHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCEEEEEEECCCCCHH
IDFFAAPCFIGKIELIDKDHEPRAVQAIEECWCLALSMKHYRPLLLNDTLFLRKLCVTLS
HHHHHHHHHHCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC
HKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHTQAAEYLGVSYRHLLYVLAQF
CCCHHHHHHHHCCCCCCHHHHHHHHHEECCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHH
IHDGLLTKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
HHHHHHCCCCCCCEEECCHHHCCEEEEECCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]