Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is yjiP [H]
Identifier: 157163784
GI number: 157163784
Start: 4573994
End: 4574938
Strand: Direct
Name: yjiP [H]
Synonym: EcHS_A4566
Alternate gene names: 157163784
Gene position: 4573994-4574938 (Clockwise)
Preceding gene: 157163777
Following gene: 157163787
Centisome position: 98.5
GC content: 49.42
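The positional fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length; the function names are hypothetical and not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the record: Start, End, and genome Length.
length = gene_length(4573994, 4574938)
position = centisome(4573994, 4643538)

# One codon is the stop codon, so residues = codons - 1.
residues = length // 3 - 1

print(length, residues, round(position, 1))  # prints: 945 314 98.5
```

Under these assumptions the results agree with the 945-base gene sequence, the 314 translated residues, and the centisome position of 98.5 listed in the record.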
Gene sequence:
>945_bases ATGACAAACTTCACAACCAGCACGCCGCATGACGCATTATTTAAATCCTTTCTCACGCACCCTGACACCGCGCGGGATTT TATGGAGATCCACTTACCCAAAGATTTACGTGAACTGTGCGATCTCGACAGCTTAAAACTGGAATCCGCCAGCTTCGTCG ATGAAAAATTGCGGGCGCTACACTCCGATATTCTGTGGTCGGTAAAGACCCGTGAAGGTGATGGTTATATTTACGTAGTG ATTGAACATCAGAGCCGCGAGGATATCCATATGGCCTTTCGCCTGATGCGATATTCCATGGCGGTGATGCAGCGCCATAT CGAGCATGATAAACGCCGGCCGCTACCGCTGGTCATCCCGATGCTGTTTTATCACGGTAGCCGTAGTCCTTATCCCTGGT CCCTGTGCTGGCTGGACGAATTTGCTGACCCGACCACCGCACGGAAGCTTTATACCGCAGCGTTCCCGCTGGTGGATGTC ACTGTCGTGCCAGACGACGAGATTGTGCAGCACCGCAGAGTCGCCCTGTTGGAGTTGATCCAAAAGCATATTCGCCAGCG CGATCTGATGGGGCTTATCGATCAACTGGTAATATTACTGGTTACAGAGTGTGCTAATGACAGCCAGATAACTGCGCTGT TAAATTACATTTTACTGACTGGCGATGAAGCGCGTTTTAAGAAGTTTATCAGCGAACTTACCCGTCGAATGCCACAACAC AGGGAGCGAATAATGACGATTGCAGAGCGAATTTATAATGATGGATGGCTGTTGGGGATGGAAAAGGGGAAAGAAGAAGG GGAACAACGCCTCCTTAGATTGTTGTTGCAGAATGGGGCAGATCCTGAATGGATACAAAAGATTACCGGACTTTCGACAG AGCAAATGCAGGCATTAGAGCAGCCCTTGCCTGAAATCAAGCGCGATCCATGGATCGAATACTAA
Upstream 100 bases:
>100_bases ATAGCCTGGAAAGCGCCTCGGGGAACGAGAAATTGCCGGGTGAGAATGGTTTTGTTAGTCGCTACAGTCGGGCCATCTTA TCTACAGGTGACGGATCGCC
Downstream 100 bases:
>100_bases TCAGAGACGGATGACAAACGCAAAGCTGCCTGATGCGCTACGCTTATCAGACCTACATTTCCTCTGCAATCTATTGAATT TGCGCGGTTTGTAGGCCGGA
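The GC content field can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, and it strips whitespace because the sequences in this record are broken by spaces every 80 characters.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces/newlines used to wrap the sequence, normalise case.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy inputs only; paste the 945-base gene sequence to check the record.
print(gc_content("ATGC"))                # 2 of 4 bases are G or C
print(round(gc_content("atg cgc"), 2))   # whitespace and case are ignored
```

Passing the gene sequence without its `>945_bases` header should give a value that can be compared with the 49.42 listed under GC content.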
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 314; Mature: 313
Protein sequence:
>314_residues MTNFTTSTPHDALFKSFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVV IEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRRPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYTAAFPLVDV TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVILLVTECANDSQITALLNYILLTGDEARFKKFISELTRRMPQH RERIMTIAERIYNDGWLLGMEKGKEEGEQRLLRLLLQNGADPEWIQKITGLSTEQMQALEQPLPEIKRDPWIEY
Sequences:
>Translated_314_residues MTNFTTSTPHDALFKSFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVV IEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRRPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYTAAFPLVDV TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVILLVTECANDSQITALLNYILLTGDEARFKKFISELTRRMPQH RERIMTIAERIYNDGWLLGMEKGKEEGEQRLLRLLLQNGADPEWIQKITGLSTEQMQALEQPLPEIKRDPWIEY
>Mature_313_residues TNFTTSTPHDALFKSFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVVI EHQSREDIHMAFRLMRYSMAVMQRHIEHDKRRPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYTAAFPLVDVT VVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVILLVTECANDSQITALLNYILLTGDEARFKKFISELTRRMPQHR ERIMTIAERIYNDGWLLGMEKGKEEGEQRLLRLLLQNGADPEWIQKITGLSTEQMQALEQPLPEIKRDPWIEY
Specific function: Unknown
COG id: COG5464
COG function: function code S; Uncharacterized conserved protein
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the yadD/yfaD/yhgA/yjiP family [H]
Homologues:
Organism=Escherichia coli, GI1788643, Length=298, Percent_Identity=62.0805369127517, Blast_Score=382, Evalue=1e-107
Organism=Escherichia coli, GI1789816, Length=305, Percent_Identity=55.4098360655738, Blast_Score=350, Evalue=5e-98
Organism=Escherichia coli, GI1788577, Length=292, Percent_Identity=56.5068493150685, Blast_Score=347, Evalue=7e-97
Organism=Escherichia coli, GI1786324, Length=303, Percent_Identity=48.1848184818482, Blast_Score=291, Evalue=4e-80
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR010106
- InterPro: IPR006842 [H]
Pfam domain/function: PF04754 Transposase_31 [H]
EC number: NA
Molecular weight: Translated: 36837; Mature: 36706
Theoretical pI: Translated: 6.09; Mature: 6.09
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
3.8 %Met (Translated Protein)
4.8 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
3.5 %Met (Mature Protein)
4.5 %Cys+Met (Mature Protein)
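The Cys/Met percentages can be recomputed from the protein sequences. This is a minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place; `cys_met_content` is a hypothetical helper, not the database's own procedure.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    # Remove the spaces used to wrap the sequence, normalise case.
    protein = "".join(protein.split()).upper()
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Toy input only: 1 Cys and 2 Met in 4 residues.
translated = "MACM"
print(cys_met_content(translated))

# In this record the mature form is the translated form without its
# initial Met, so it can be derived by slicing.
print(cys_met_content(translated[1:]))
```

Dropping the initial Met lowers the Met percentage while leaving the Cys count unchanged, which is the pattern seen between the translated and mature values in the record.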
Secondary structure:
>Translated Secondary Structure
MTNFTTSTPHDALFKSFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL
CCCCCCCCCHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHCCCCHHCCHHHHHHHHHHHHH
HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRRPLPLVIP
HHHHHEEEEEECCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH
MLFYHGSRSPYPWSLCWLDEFADPTTARKLYTAAFPLVDVTVVPDDEIVQHRRVALLELI
HHHHCCCCCCCCEEEEEHHHHCCCHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHH
QKHIRQRDLMGLIDQLVILLVTECANDSQITALLNYILLTGDEARFKKFISELTRRMPQH
HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH
RERIMTIAERIYNDGWLLGMEKGKEEGEQRLLRLLLQNGADPEWIQKITGLSTEQMQALE
HHHHHHHHHHHHCCCEEEECHHCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHH
QPLPEIKRDPWIEY
CCHHHHCCCCCCCC
>Mature Secondary Structure
TNFTTSTPHDALFKSFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRAL
CCCCCCCCHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHCCCCHHCCHHHHHHHHHHHHH
HSDILWSVKTREGDGYIYVVIEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRRPLPLVIP
HHHHHEEEEEECCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH
MLFYHGSRSPYPWSLCWLDEFADPTTARKLYTAAFPLVDVTVVPDDEIVQHRRVALLELI
HHHHCCCCCCCCEEEEEHHHHCCCHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHH
QKHIRQRDLMGLIDQLVILLVTECANDSQITALLNYILLTGDEARFKKFISELTRRMPQH
HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH
RERIMTIAERIYNDGWLLGMEKGKEEGEQRLLRLLLQNGADPEWIQKITGLSTEQMQALE
HHHHHHHHHHHHCCCEEEECHHCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHH
QPLPEIKRDPWIEY
CCHHHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7610040; 9278503 [H]