Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is pinR [H]
Identifier: 157161021
GI number: 157161021
Start: 1656688
End: 1657278
Strand: Direct
Name: pinR [H]
Synonym: EcHS_A1635
Alternate gene names: 157161021
Gene position: 1656688-1657278 (Clockwise)
Preceding gene: 157161020
Following gene: 157161036
Centisome position: 35.68
GC content: 45.01
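The positional fields above are related by simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the centisome position is the gene start expressed as a percentage of the genome length, which is consistent with the values in this record but is not stated by it.

```python
# Coordinates copied from this record (1-based, inclusive).
GENOME_LENGTH = 4_643_538
GENE_START = 1_656_688
GENE_END = 1_657_278


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(GENE_START, GENE_END)
print("Gene length (bases):", length)
# One codon is the stop codon, so the coding capacity is length / 3 - 1.
print("Encoded residues:", length // 3 - 1)
print("Centisome position:", round(centisome(GENE_START, GENOME_LENGTH), 2))
```

Under that assumption the values agree with the 591-base gene sequence and the 196-residue translated protein listed in this record.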
Gene sequence:
>591_bases ATGTCTCGAATTTTTGCTTACTGTCGGATATCAACGCTGGATCAGACCACCGAAAATCAACGCCGGGAAATCGAAAGTGC AGGTTTTAAAATCAAACCTCAGCAAATAATCGAAGAACACATTAGCGGCTCAGCAGCAACCAGTGAGCGTCCTGGTTTTA ACCGGTTGCTTGCTCGCCTGAAATGTGGTGATCAATTGATTGTGACAAAACTGGATCGCCTTGGTTGTAATGCAATGGAT ATCAGGAAAACAGTGGAACAACTGACCGAAACAGGTATCAGAGTGCATTGCTTAGCATTGGGGGGCATTGACCTGACCAG TCCAACAGGAAAAATGATGATGCACGTAATTTCAGCAGTCGCTGAATTTGAACGAGACCTTTTACTTGAACGCACTCATT CCGGGATAGTAAGAGCCCGCGGCGCAGGGAAACGTTTTGGTCGACCACCTGTGTTAAATGAAGAACAGAAACAGGTGGTA TTCGAACGAATTAAGTCAGGTGTAAGTATAAGTGCCATTGCCCGGGAATTCAAAACCTCGCGGCAAACCATTTTAAGAGC CAAAGCAAAACTTCAGACACCTGACATATAA
Upstream 100 bases:
>100_bases TTATTTTCGAACGTGTACATACAAATATGCACAAAAATAATCAAAATTATTTTCTGAGATGCATTATGATATGAACACCA ATTTCGTATAGAGTCTCACT
Downstream 100 bases:
>100_bases AAAATAATCTCGGTGTGAGATGCTTTACGTCTTCCAAGCCCCCTTCCTTGCCGTAAATGGAAAGATACATCTAATTATAG AATTTATATGTTTTACCCTA
Product: site-specific recombinase resolvase family protein PinR
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 196; Mature: 195
Protein sequence:
>196_residues
MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMD
IRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMHVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVV
FERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
Sequences:
>Translated_196_residues
MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMD
IRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMHVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVV
FERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
>Mature_195_residues
SRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDI
RKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMHVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVVF
ERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
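The translated and mature sequences follow from the gene sequence by codon-by-codon translation, with the mature form lacking the initiator methionine. A minimal sketch, assuming the standard genetic code (the record does not state which translation table was used); the helper names are hypothetical, and only the first 30 and last 15 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def mature_form(protein: str) -> str:
    """Drop a leading initiator methionine, if present."""
    return protein[1:] if protein.startswith("M") else protein


# Excerpts copied from the gene sequence in this record.
gene_first_30 = "ATGTCTCGAATTTTTGCTTACTGTCGGATA"
gene_last_15 = "ACACCTGACATATAA"

n_terminus = translate(gene_first_30)
print(n_terminus)               # start of the translated protein
print(mature_form(n_terminus))  # start of the mature protein
print(translate(gene_last_15))  # last residues before the stop codon
```

Passing the full 591-base gene sequence to `translate` in the same way should give the complete translated sequence.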
Specific function: Unknown
COG id: COG1961
COG function: function code L; Site-specific recombinases, DNA invertase Pin homologs
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the site-specific recombinase resolvase family [H]
Homologues:
Organism=Escherichia coli, GI1787638, Length=196, Percent_Identity=99.4897959183673, Blast_Score=397, Evalue=1e-112
Organism=Escherichia coli, GI1787827, Length=196, Percent_Identity=98.469387755102, Blast_Score=393, Evalue=1e-111
Organism=Escherichia coli, GI1787404, Length=183, Percent_Identity=38.2513661202186, Blast_Score=112, Evalue=1e-26
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR006118
- InterPro: IPR006119
- InterPro: IPR006120 [H]
Pfam domain/function: PF02796 HTH_7; PF00239 Resolvase [H]
EC number: NA
Molecular weight: Translated: 21918; Mature: 21787
Theoretical pI: Translated: 10.52; Mature: 10.52
Prosite motif: PS00398 RECOMBINASES_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
4.6 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
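The Cys/Met percentages can be reproduced by counting residues in the protein sequences given in this record. A minimal sketch, assuming each percentage is a residue count divided by sequence length and rounded to one decimal place; the function name is hypothetical.

```python
# Translated protein sequence, copied from the "Protein sequence" field.
TRANSLATED = (
    "MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMD"
    "IRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMHVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVV"
    "FERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI"
)
# The mature form in this record lacks the initiator methionine.
MATURE = TRANSLATED[1:]


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(sequence.count(r) for r in residues)
    return round(100.0 * count / len(sequence), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        "Cys:", residue_percent(seq, "C"),
        "Met:", residue_percent(seq, "M"),
        "Cys+Met:", residue_percent(seq, "CM"),
    )
```

The Met percentage drops in the mature protein because the removed residue is itself a methionine, while the Cys count is unchanged over a slightly shorter sequence.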
Secondary structure:
>Translated Secondary Structure
MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARL
CCCCEEHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHH
KCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMHVISAV
HCCCHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHH
AEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVVFERIKSGVSISAIAREFKTS
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH
RQTILRAKAKLQTPDI
HHHHHHHHHHCCCCCC
>Mature Secondary Structure
SRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARL
CCCEEHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHH
KCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMHVISAV
HCCCHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHH
AEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVVFERIKSGVSISAIAREFKTS
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH
RQTILRAKAKLQTPDI
HHHHHHHHHHCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9097039; 9278503 [H]