Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is ybiP
Identifier: 157160293
GI number: 157160293
Start: 877617
End: 879200
Strand: Reverse
Name: ybiP
Synonym: EcHS_A0872
Alternate gene names: 157160293
Gene position: 879200-877617 (Counterclockwise)
Preceding gene: 157160294
Following gene: 157160291
Centisome position: 18.93
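The centisome value appears to be the gene's start coordinate expressed as a percentage of the genome length. For this reverse-strand gene the start is taken to be the higher coordinate (879200), which is an assumption inferred from the numbers in this record. A minimal sketch, with `centisome` as a hypothetical helper name:

```python
def centisome(position: int, genome_length: int) -> float:
    """Express a genome coordinate as a percentage of the genome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 4_643_538  # "Length" field of NC_009800
GENE_START = 879_200       # first coordinate of "Gene position" (assumed start on the reverse strand)

print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 18.93
```

Using the lower coordinate (877617) instead gives about 18.90, so the listed 18.93 is consistent with the higher coordinate being used.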
GC content: 48.36
Gene sequence:
>1584_bases ATGAATTTAACCCTCAAAGAATCGCTTGTTACCCGTAGCCGGGTATTTAGCCCGTGGACTGCGTTCTACTTTTTACAGTC GCTATTAATTAACCTCGGCTTAGGTTACCCCTTCAGTTTGCTCTACACCGCTGCGTTTACGGCTATTTTGCTTTTGCTAT GGCGAACATTGCCTCGCGTACAAAAAGTTCTGGTCGGTGTCAGTTCGCTGGTGGCGGCTTGTTATTTCCCTTTTGCTCAG GCCTACGGCGCGCCTAATTTCAATACATTGCTGGCATTGCACTCCACCAATATGGAAGAGTCGACCGAAATCCTGACGAT TTTTCCGTGGTACAGCTACCTGGTCGGCTTATTTATTTTTGCGCTCGGCGTAATAGCAATCAGGCGAAAAAAAGAGAATG AAAAAGCGCGCTGGAATACCTTCGACAGCCTGTGTCTGGTATTCAGTGTGGCGACATTTTTTGTTGCTCCCGTGCAAAAC CTGGCCTGGGGTGGCGTATTTAAACTGAAAGATACTGGCTATCCGGTATTTCGTTTTGCTAAGGATGTCATCGTCAATAA TAACGAGGTGATTGAAGAGCAAGAACGGATGGCAAAACTTTCCGGAATGAAAGATACCTGGACGGTCACTGCCGTTAAGC CGAAGTATCAGACCTATGTGGTGGTGATCGGTGAAAGCGCGCGTCGCGATGCCCTCGGTGCCTTTGGCGGTCACTGGGAC AATACCCCGTTTGCCAGCAGCGTTAACGGTTTGATATTTGCTGACTACATTGCCGCCAGTGGCTCCACGCAGAAATCGCT TGGCTTAACGCTCAATCGCGTTGTCGATGGCAAACCACAGTTTCAGGATAACTTTGTCACCCTGGCAAATCGCGCGGGCT TCCAGACCTGGTGGTTTTCCAACCAGGGTCAAATCGGCGAATACGATACCGCTATCGCCAGCATCGCCAAACGAGCAGAT GAAGTGTACTTCCTGAAAGAAGGTAATTTTGAAGCAGATAAAAACACCAAAGACGAAGCGTTACTGGATATGACCGCTCA AGTGCTGGCGCAAGAGCACTCGCAACCGCAGCTGATTGTTCTACATCTGATGGGCTCACATCCGCAGGCCTGCGACAGGA CACAAGGAAAATACGAAACCTTTGTGCAATCGAAAGAAACGTCGTGCTATCTCTATACCATGACGCAAACGGACGATTTA CTGCGCAAGCTGTACGATCAGTTACGCAACAGCGGCAGCAGCTTCTCGCTGGTTTACTTTTCTGACCACGGTCTGGCCTT TAAAGAGCGCGGTAAAGACGTGCAATACCTTGCCCATGATGATAAATATCAGCAAAATTTCCAGGTGCCTTTTATGGTCA TTTCCAGCGACGATAAAGCGCATCGTGTGATTAAAGCCCGCCGCTCAGCCAATGACTTCTTAGGCTTTTTCTCCCAGTGG ACGGGGATTAAAGCGAAGGAAATTAACATCAAATACCCGTTTATATCTGAGAAGAAAGCCGGGCCGATATACATCACCAA CTTCCAGTTACAGAAGGTGGATTACAACCATCTCGGAACCGATATTTTCGACCCGAAACCTTAA
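The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch, assuming the sequence is pasted as plain text with the space-separated chunks used above; `gc_content` is a hypothetical helper, not part of any library:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks and any other non-nucleotide characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input (first 12 bases of the gene, split by a space to mimic the chunking).
# Paste the full 1584-base sequence to compare against the "GC content" field.
print(round(gc_content("ATGAAT TTAACC"), 2))  # 25.0
```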
Upstream 100 bases:
>100_bases GGCTCGCGGATGCGGACCCCTTTCCACTCTTCACGCACTCTTGCAGGTATTGACCCTTGACGCCAGGGTAAGCACATGGC GTTTGTTACGATAGTGGCAT
Downstream 100 bases:
>100_bases AAACAAAAATCCGCCCCGAGAGGCGGATTTTTTATATCACCAAAGTGATTAGAAGCGGTAACCAACACCGGCAATCCAGG TGCCTACGTCAACGCTACGA
Product: sulfatase family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 527; Mature: 527
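The coordinates, gene length and residue count in this record can be cross-checked with simple arithmetic: End - Start + 1 gives the number of bases, and dividing by three and dropping the stop codon gives the number of amino acids. A minimal sketch, assuming inclusive coordinates and a single terminal stop codon; both helper names are hypothetical:

```python
def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates, in either order."""
    return abs(end - start) + 1


def protein_length(n_bases: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of n_bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    return codons - 1 if has_stop else codons


n_bases = gene_length(877_617, 879_200)
print(n_bases, protein_length(n_bases))  # 1584 527
```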
Protein sequence:
>527_residues MNLTLKESLVTRSRVFSPWTAFYFLQSLLINLGLGYPFSLLYTAAFTAILLLLWRTLPRVQKVLVGVSSLVAACYFPFAQ AYGAPNFNTLLALHSTNMEESTEILTIFPWYSYLVGLFIFALGVIAIRRKKENEKARWNTFDSLCLVFSVATFFVAPVQN LAWGGVFKLKDTGYPVFRFAKDVIVNNNEVIEEQERMAKLSGMKDTWTVTAVKPKYQTYVVVIGESARRDALGAFGGHWD NTPFASSVNGLIFADYIAASGSTQKSLGLTLNRVVDGKPQFQDNFVTLANRAGFQTWWFSNQGQIGEYDTAIASIAKRAD EVYFLKEGNFEADKNTKDEALLDMTAQVLAQEHSQPQLIVLHLMGSHPQACDRTQGKYETFVQSKETSCYLYTMTQTDDL LRKLYDQLRNSGSSFSLVYFSDHGLAFKERGKDVQYLAHDDKYQQNFQVPFMVISSDDKAHRVIKARRSANDFLGFFSQW TGIKAKEINIKYPFISEKKAGPIYITNFQLQKVDYNHLGTDIFDPKP
Sequences:
>Translated_527_residues MNLTLKESLVTRSRVFSPWTAFYFLQSLLINLGLGYPFSLLYTAAFTAILLLLWRTLPRVQKVLVGVSSLVAACYFPFAQ AYGAPNFNTLLALHSTNMEESTEILTIFPWYSYLVGLFIFALGVIAIRRKKENEKARWNTFDSLCLVFSVATFFVAPVQN LAWGGVFKLKDTGYPVFRFAKDVIVNNNEVIEEQERMAKLSGMKDTWTVTAVKPKYQTYVVVIGESARRDALGAFGGHWD NTPFASSVNGLIFADYIAASGSTQKSLGLTLNRVVDGKPQFQDNFVTLANRAGFQTWWFSNQGQIGEYDTAIASIAKRAD EVYFLKEGNFEADKNTKDEALLDMTAQVLAQEHSQPQLIVLHLMGSHPQACDRTQGKYETFVQSKETSCYLYTMTQTDDL LRKLYDQLRNSGSSFSLVYFSDHGLAFKERGKDVQYLAHDDKYQQNFQVPFMVISSDDKAHRVIKARRSANDFLGFFSQW TGIKAKEINIKYPFISEKKAGPIYITNFQLQKVDYNHLGTDIFDPKP
>Mature_527_residues MNLTLKESLVTRSRVFSPWTAFYFLQSLLINLGLGYPFSLLYTAAFTAILLLLWRTLPRVQKVLVGVSSLVAACYFPFAQ AYGAPNFNTLLALHSTNMEESTEILTIFPWYSYLVGLFIFALGVIAIRRKKENEKARWNTFDSLCLVFSVATFFVAPVQN LAWGGVFKLKDTGYPVFRFAKDVIVNNNEVIEEQERMAKLSGMKDTWTVTAVKPKYQTYVVVIGESARRDALGAFGGHWD NTPFASSVNGLIFADYIAASGSTQKSLGLTLNRVVDGKPQFQDNFVTLANRAGFQTWWFSNQGQIGEYDTAIASIAKRAD EVYFLKEGNFEADKNTKDEALLDMTAQVLAQEHSQPQLIVLHLMGSHPQACDRTQGKYETFVQSKETSCYLYTMTQTDDL LRKLYDQLRNSGSSFSLVYFSDHGLAFKERGKDVQYLAHDDKYQQNFQVPFMVISSDDKAHRVIKARRSANDFLGFFSQW TGIKAKEINIKYPFISEKKAGPIYITNFQLQKVDYNHLGTDIFDPKP
Specific function: Unknown
COG id: COG2194
COG function: function code R; Predicted membrane-associated, metal-dependent hydrolase
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the phosphoethanolamine transferase family
Homologues:
Organism=Escherichia coli, GI1787035, Length=527, Percent_Identity=100, Blast_Score=1094, Evalue=0.0
Organism=Escherichia coli, GI87082223, Length=268, Percent_Identity=30.2238805970149, Blast_Score=122, Evalue=4e-29
Organism=Escherichia coli, GI1790392, Length=503, Percent_Identity=21.272365805169, Blast_Score=77, Evalue=4e-15
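Each homologue entry is a comma-separated run of fields that starts with `Organism=`; most fields are `key=value` pairs, while the GI number is written as a bare `GI…` token. A minimal parsing sketch under those assumptions; `parse_homologues` and the `"GI"` dictionary key are hypothetical names chosen for illustration:

```python
def parse_homologues(text: str) -> list[dict[str, str]]:
    """Split homologue text into one dict per 'Organism=' entry."""
    records = []
    # Every entry begins with "Organism=", so split on that marker.
    for chunk in text.split("Organism=")[1:]:
        fields = [f.strip() for f in chunk.split(",") if f.strip()]
        record = {"Organism": fields[0]}
        for field in fields[1:]:
            if "=" in field:
                key, value = field.split("=", 1)
                record[key] = value
            elif field.startswith("GI"):
                # The GI number has no "=" separator in this record format.
                record["GI"] = field[2:]
        records.append(record)
    return records


line = (
    "Organism=Escherichia coli, GI1787035, Length=527, Percent_Identity=100, "
    "Blast_Score=1094, Evalue=0.0, "
    "Organism=Escherichia coli, GI87082223, Length=268, "
    "Percent_Identity=30.2238805970149, Blast_Score=122, Evalue=4e-29,"
)
for rec in parse_homologues(line):
    print(rec["GI"], rec["Length"], rec["Evalue"])
```

Values are kept as strings so that fields such as `Evalue=0.0` and `Evalue=4e-29` can be converted by the caller as needed.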
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): YBIP_ECOLI (P75785)
Other databases:
- EMBL: U00096
- EMBL: AP009048
- PIR: G64818
- RefSeq: AP_001446.1
- RefSeq: NP_415336.1
- ProteinModelPortal: P75785
- STRING: P75785
- EnsemblBacteria: EBESCT00000000447
- EnsemblBacteria: EBESCT00000014675
- GeneID: 945360
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW0800
- KEGG: eco:b0815
- EchoBASE: EB3105
- EcoGene: EG13321
- GeneTree: EBGT00050000009020
- HOGENOM: HBG644083
- OMA: DKNTRDE
- ProtClustDB: CLSK879772
- BioCyc: EcoCyc:G6418-MONOMER
- Genevestigator: P75785
- InterPro: IPR017849
- InterPro: IPR017850
- InterPro: IPR000917
- Gene3D: G3DSA:3.40.720.10
Pfam domain/function: PF00884 Sulfatase; SSF53649 Alkaline_phosphatase_core
EC number: NA
Molecular weight: Translated: 59707; Mature: 59707
Theoretical pI: Translated: 8.68; Mature: 8.68
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
4 regions listed (coordinates not available)
Cys/Met content:
0.8% Cys (Translated Protein)
1.5% Met (Translated Protein)
2.3% Cys+Met (Translated Protein)
0.8% Cys (Mature Protein)
1.5% Met (Mature Protein)
2.3% Cys+Met (Mature Protein)
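The Cys/Met figures are presumably residue counts divided by the protein length, rounded to one decimal place. A minimal sketch under that assumption; `residue_percent` is a hypothetical helper:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    Non-letter characters (spaces, line breaks) are ignored; the result is
    rounded to one decimal place to match the record's formatting.
    """
    seq = [aa for aa in protein.upper() if aa.isalpha()]
    if not seq:
        raise ValueError("empty protein sequence")
    wanted = set(residues.upper())
    hits = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * hits / len(seq), 1)


# Toy input (first ten residues of the protein); paste the full 527-residue
# sequence to compare against the values listed above.
fragment = "MNLTLKESLV"
print(residue_percent(fragment, "C"), residue_percent(fragment, "M"))  # 0.0 10.0
```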
Secondary structure:
>Translated Secondary Structure
MNLTLKESLVTRSRVFSPWTAFYFLQSLLINLGLGYPFSLLYTAAFTAILLLLWRTLPRV
CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH
QKVLVGVSSLVAACYFPFAQAYGAPNFNTLLALHSTNMEESTEILTIFPWYSYLVGLFIF
HHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHHH
ALGVIAIRRKKENEKARWNTFDSLCLVFSVATFFVAPVQNLAWGGVFKLKDTGYPVFRFA
HHHHHHHHHCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHH
KDVIVNNNEVIEEQERMAKLSGMKDTWTVTAVKPKYQTYVVVIGESARRDALGAFGGHWD
HHHHCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCEEEEEEECCCCCCHHHHCCCCCCC
NTPFASSVNGLIFADYIAASGSTQKSLGLTLNRVVDGKPQFQDNFVTLANRAGFQTWWFS
CCCCCCCCCCEEEEEEHHCCCCCCHHHCCEEEHHCCCCCCCCCCEEEEECCCCCEEEEEC
NQGQIGEYDTAIASIAKRADEVYFLKEGNFEADKNTKDEALLDMTAQVLAQEHSQPQLIV
CCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEE
LHLMGSHPQACDRTQGKYETFVQSKETSCYLYTMTQTDDLLRKLYDQLRNSGSSFSLVYF
EEEECCCCCHHHCCCCHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHHCCCCCEEEEEE
SDHGLAFKERGKDVQYLAHDDKYQQNFQVPFMVISSDDKAHRVIKARRSANDFLGFFSQW
ECCCCCHHHCCCCEEEEECCCHHCCCCCCCEEEEECCCHHHHHHHHHHCHHHHHHHHHHH
TGIKAKEINIKYPFISEKKAGPIYITNFQLQKVDYNHLGTDIFDPKP
CCCEEEEEEEECCCCCCCCCCCEEEEEEEEEEECHHHCCCCCCCCCH
>Mature Secondary Structure
MNLTLKESLVTRSRVFSPWTAFYFLQSLLINLGLGYPFSLLYTAAFTAILLLLWRTLPRV
CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH
QKVLVGVSSLVAACYFPFAQAYGAPNFNTLLALHSTNMEESTEILTIFPWYSYLVGLFIF
HHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHHH
ALGVIAIRRKKENEKARWNTFDSLCLVFSVATFFVAPVQNLAWGGVFKLKDTGYPVFRFA
HHHHHHHHHCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHH
KDVIVNNNEVIEEQERMAKLSGMKDTWTVTAVKPKYQTYVVVIGESARRDALGAFGGHWD
HHHHCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCEEEEEEECCCCCCHHHHCCCCCCC
NTPFASSVNGLIFADYIAASGSTQKSLGLTLNRVVDGKPQFQDNFVTLANRAGFQTWWFS
CCCCCCCCCCEEEEEEHHCCCCCCHHHCCEEEHHCCCCCCCCCCEEEEECCCCCEEEEEC
NQGQIGEYDTAIASIAKRADEVYFLKEGNFEADKNTKDEALLDMTAQVLAQEHSQPQLIV
CCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEE
LHLMGSHPQACDRTQGKYETFVQSKETSCYLYTMTQTDDLLRKLYDQLRNSGSSFSLVYF
EEEECCCCCHHHCCCCHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHHCCCCCEEEEEE
SDHGLAFKERGKDVQYLAHDDKYQQNFQVPFMVISSDDKAHRVIKARRSANDFLGFFSQW
ECCCCCHHHCCCCEEEEECCCHHCCCCCCCEEEEECCCHHHHHHHHHHCHHHHHHHHHHH
TGIKAKEINIKYPFISEKKAGPIYITNFQLQKVDYNHLGTDIFDPKP
CCCEEEEEEEECCCCCCCCCCCEEEEEEEEEEECHHHCCCCCCCCCH
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8905232; 9278503