Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is hinT

Identifier: 157160630

GI number: 157160630

Start: 1225565

End: 1225924

Strand: Direct

Name: hinT

Synonym: EcHS_A1226

Alternate gene names: 157160630

Gene position: 1225565-1225924 (Clockwise)

Preceding gene: 157160627

Following gene: 157160631

Centisome position: 26.39
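
The gene length and centisome position listed above can be reproduced from the coordinates in this record. The sketch below assumes that the centisome value is the gene start expressed as a percentage of the genome length; that definition is an assumption that matches the listed figure, and the function names are illustrative only.

```python
# Coordinates taken from this record.
GENOME_LENGTH = 4_643_538   # bp, Escherichia coli HS (NC_009800)
GENE_START = 1_225_565      # 1-based, inclusive
GENE_END = 1_225_924        # 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Length of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length_bp = gene_length(GENE_START, GENE_END)
position = round(centisome(GENE_START, GENOME_LENGTH), 2)
print(length_bp, position)
```

Using the end coordinate instead of the start would round to a different second decimal, so the start coordinate appears to be the one used here.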

GC content: 50.83

Gene sequence:

>360_bases
GTGGCAGAAGAAACTATATTCAGCAAAATTATTCGTCGTGAGATCCCCTCCGATATCGTCTACCAGGATGATCTGGTAAC
GGCGTTTCGCGATATTTCGCCGCAAGCGCCAACGCATATTCTGATCATTCCGAATATCCTCATACCGACTGTGAACGACG
TGTCAGCTGAGCATGAGCAGGCGCTGGGACGCATGATCACCGTAGCGGCAAAAATTGCTGAGCAAGAAGGTATTGCCGAA
GATGGCTATCGTCTGATCATGAACACCAACCGCCATGGCGGACAAGAGGTTTACCACATCCATATGCACTTGTTGGGTGG
CCGTCCGCTGGGACCAATGCTGGCGCATAAAGGTCTGTAA
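
As a cross-check on the GC content field, the percentage can be recomputed directly from the gene sequence. This is a minimal sketch that assumes GC content is defined as the count of G and C bases divided by the total length; `gc_percent` is an illustrative helper, not part of any database tooling.

```python
# Gene sequence copied from the record above (360 bases).
GENE = (
    "GTGGCAGAAGAAACTATATTCAGCAAAATTATTCGTCGTGAGATCCCCTCCGATATCGTCTACCAGGATGATCTGGTAAC"
    "GGCGTTTCGCGATATTTCGCCGCAAGCGCCAACGCATATTCTGATCATTCCGAATATCCTCATACCGACTGTGAACGACG"
    "TGTCAGCTGAGCATGAGCAGGCGCTGGGACGCATGATCACCGTAGCGGCAAAAATTGCTGAGCAAGAAGGTATTGCCGAA"
    "GATGGCTATCGTCTGATCATGAACACCAACCGCCATGGCGGACAAGAGGTTTACCACATCCATATGCACTTGTTGGGTGG"
    "CCGTCCGCTGGGACCAATGCTGGCGCATAAAGGTCTGTAA"
)


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    seq = seq.upper()
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


print(len(GENE), round(gc_percent(GENE), 2))
```

Rounded to two decimals, the result should agree with the GC content listed in this record.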

Upstream 100 bases:

>100_bases
AAGGTTGAAAGAGCAGGTTTAACTCGACCATACTCTATACTCGCAGTGTGGCGCGGCGTAGCATGGCGCAACGCATGGCT
ATTTGAAAAAGGAAAATGTC

Downstream 100 bases:

>100_bases
CGATGAGAAAAGGATGCTTTGGGCTGGTGTCTCTGGCGTTGTTACTGCTGGTGGGCTGTCGTTCACATCCGGAAATTCCG
GTGAATGATGAGCAATCGCT

Product: purine nucleoside phosphoramidase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 119; Mature: 118

Protein sequence:

>119_residues
MAEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQALGRMITVAAKIAEQEGIAE
DGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL

Sequences:

>Translated_119_residues
MAEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQALGRMITVAAKIAEQEGIAE
DGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL
>Mature_118_residues
AEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQALGRMITVAAKIAEQEGIAED
GYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL
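
The translated and mature sequences can be derived from the gene sequence with the standard codon table. The sketch below makes two assumptions: the GTG start codon is read as an initiator methionine, as is usual for bacterial genes, and the mature form differs only by removal of that methionine, which matches the 119 and 118 residue counts given here. `translate_cds` is an illustrative name.

```python
from itertools import product

# Gene sequence copied from this record (360 bases, GTG start codon).
GENE = (
    "GTGGCAGAAGAAACTATATTCAGCAAAATTATTCGTCGTGAGATCCCCTCCGATATCGTCTACCAGGATGATCTGGTAAC"
    "GGCGTTTCGCGATATTTCGCCGCAAGCGCCAACGCATATTCTGATCATTCCGAATATCCTCATACCGACTGTGAACGACG"
    "TGTCAGCTGAGCATGAGCAGGCGCTGGGACGCATGATCACCGTAGCGGCAAAAATTGCTGAGCAAGAAGGTATTGCCGAA"
    "GATGGCTATCGTCTGATCATGAACACCAACCGCCATGGCGGACAAGAGGTTTACCACATCCATATGCACTTGTTGGGTGG"
    "CCGTCCGCTGGGACCAATGCTGGCGCATAAAGGTCTGTAA"
)

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # An alternative start codon (GTG, TTG) still encodes the initiator Met.
    if residues and cds[:3] in ("ATG", "GTG", "TTG"):
        residues[0] = "M"
    return "".join(residues)


translated = translate_cds(GENE)
mature = translated[1:]  # assumption: only the initiator Met is removed
print(len(translated), len(mature))
```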

Specific function: Unknown

COG id: COG0537

COG function: function code FGR; Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HIT domain

Homologues:

Organism=Homo sapiens, GI14211923, Length=109, Percent_Identity=46.7889908256881, Blast_Score=114, Evalue=1e-26
Organism=Homo sapiens, GI4885413, Length=107, Percent_Identity=47.6635514018692, Blast_Score=110, Evalue=2e-25
Organism=Escherichia coli, GI1787346, Length=119, Percent_Identity=100, Blast_Score=242, Evalue=5e-66
Organism=Caenorhabditis elegans, GI17506713, Length=109, Percent_Identity=44.0366972477064, Blast_Score=110, Evalue=1e-25
Organism=Drosophila melanogaster, GI28574010, Length=109, Percent_Identity=44.954128440367, Blast_Score=112, Evalue=4e-26
Organism=Drosophila melanogaster, GI24581222, Length=109, Percent_Identity=44.954128440367, Blast_Score=112, Evalue=5e-26

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): HINT_ECO57 (P0ACE8)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   A99814
- PIR:   E85673
- RefSeq:   NP_287237.1
- RefSeq:   NP_309508.2
- ProteinModelPortal:   P0ACE8
- SMR:   P0ACE8
- MINT:   MINT-1266090
- EnsemblBacteria:   EBESCT00000024019
- EnsemblBacteria:   EBESCT00000057256
- GeneID:   912424
- GeneID:   959454
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1742
- KEGG:   ecs:ECs1481
- GeneTree:   EBGT00050000011176
- HOGENOM:   HBG743217
- OMA:   NCNRHGG
- ProtClustDB:   PRK10687
- BioCyc:   ECOL83334:ECS1481-MONOMER
- InterPro:   IPR011146
- InterPro:   IPR011151
- InterPro:   IPR019808
- InterPro:   IPR001310
- Gene3D:   G3DSA:3.30.428.10
- PANTHER:   PTHR23089
- PRINTS:   PR00332

Pfam domain/function: PF01230 HIT; SSF54197 His_triad-like_motif

EC number: NA

Molecular weight: Translated: 13242; Mature: 13110

Theoretical pI: Translated: 6.14; Mature: 6.14

Prosite motif: PS00892 HIT_1; PS51084 HIT_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
4.2 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
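
The Cys/Met percentages above follow from simple residue counts on the protein sequence. A minimal sketch, assuming each percentage is the residue count divided by sequence length and rounded to one decimal; `residue_percent` is an illustrative helper.

```python
# Translated protein sequence copied from this record (119 residues).
PROTEIN = (
    "MAEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQALGRMITVAAKIAEQEGIAE"
    "DGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of any of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


# The mature form is assumed to lack only the initiator Met.
for label, seq in (("Translated", PROTEIN), ("Mature", PROTEIN[1:])):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
```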

Secondary structure:

>Translated Secondary Structure
MAEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQ
CCHHHHHHHHHHHHCCCCCEECHHHHHHHHHCCCCCCEEEEEECCEECCCCCCCCHHHHH
ALGRMITVAAKIAEQEGIAEDGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL
HHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCEEEEEEEEEECCCCCCHHHHCCCC
>Mature Secondary Structure 
AEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQ
CHHHHHHHHHHHHCCCCCEECHHHHHHHHHCCCCCCEEEEEECCEECCCCCCCCHHHHH
ALGRMITVAAKIAEQEGIAEDGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL
HHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCEEEEEEEEEECCCCCCHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796