| Definition | Yersinia pestis CO92 chromosome, complete genome. |
|---|---|
| Accession | NC_003143 |
| Length | 4,653,728 |
The map label for this gene is ydeN [H]
Identifier: 218928018
GI number: 218928018
Start: 922103
End: 923710
Strand: Direct
Name: ydeN [H]
Synonym: YPO0842
Alternate gene names: 218928018
Gene position: 922103-923710 (Clockwise)
Preceding gene: 218928017
Following gene: 218928019
Centisome position: 19.81
GC content: 48.63
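The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes (this is not stated in the record) that the gene length is End - Start + 1, that the centisome position is the start coordinate as a percentage of the chromosome length, and that the final codon is a stop codon; the function names are illustrative only.

```python
# Values copied from the record; the formulas are assumptions, not a documented method.
GENOME_LENGTH = 4_653_728   # Length of NC_003143
START, END = 922_103, 923_710


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome, to two decimals."""
    return round(100 * start / genome_length, 2)


length = gene_length(START, END)   # number of bases in the gene
codons = length // 3               # includes the terminal stop codon
residues = codons - 1              # translated residues, stop excluded

print(length, residues, centisome(START, GENOME_LENGTH))
```

Under these assumptions the results agree with the 1608 bases, 535 residues, and centisome position 19.81 reported in the record.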
Gene sequence:
>1608_bases ATGAAGTTACCCGCAGGAAAAAGAAGCCTGTTGGCAGGGATGATCGCTGCCGCTGGTATGAGTATGACACCTGTGACTCT GGCGGCACCGGCAGAAAAACCCAATGTATTGCTGGTAATCATGGATGATCTGGGTACCGGGCAGTTAGATTTCACCCTCA ATAATCTGGATAAAAAAGCACTAAGCCAGCGCCCAGTTCCCGTACGCTATCAAGGCGATCTGGACAAGATGATCGATGCG GCACAGCGGGCGATGCCGAATGTGTCTTTGTTGGCCAAAAACGGGGTCAAAATGACCAATGCGTTTGTGGCGCATCCGGT ATGCGGGCCTTCGCGCGCGGGTATTTATACAGGTCGCCACCCAACCAGTTTTGGTACTTACAGTAATGATGATGCCATGC AGGGGATCCCACTGGATATTAAACTGCTGCCCGCCTTGTTTCAGGAGCATGGCTATGCAACCGCAAATATCGGGAAATGG CACAACGCACGCATAGAGAAAAAAGCGTTCGTCGCCGATGAGGTCAAAAGCCGCGATTATCACGACAACATGATCTCCGT CAGCGCCCCCGGATATGCACCTGAAAAACGGGGTTTTGACTATTCCTACAGTTATTACGCCTCAGGCGCGGCATTGTGGC ACTCTCCAGCCATCTGGCAAAACAGCAAAAATATTGCCGCCCCAGGCTATCTGACCCATAACCTGACGGATGAAACGCTG AAATTTATTGATGACTCAGGGAAAAAACCGTTTTTCATCAGCCTGGCTTACAGCGTGCCACATATTCCATTAGAGCAAGC ATCACCCGCGAAATATATGGATCGGTTTAATACCGGCAACGTTGAAGCAGATAAATATTTTGCTGCCATTAATGCCGCAG ACGAGGGGATTGGTAGAATTGTTCAGCACTTACAAGAAAAAGGTGAGCTGGATAACACACTGATTTTCTTCATTTCGGAT AACGGGGCGGTTCATGAATCCCCAATGCCAATGAATGGCATGGACCGTGGACATAAAGGACAAATGTATAACGGGGGGGT GCATATTCCCTTCGTCGCTTACTGGCCAAAACAGATCCCCGCAGGTACGCAAAGTGATGCATTGGTGAGTGCATTAGATA TTTTACCGACGGCATTGAAAGCCGCGGGTATTGCCATCCCAGCGGAGATGAGAGTGGATGGTAAAGATATTCTGCCGGTA CTGGCAGGTAAGGAACAAACCTCGCCGCATCAATATATGTACTGGGCTGGGCCGGGGGCAAAGCATTACAGCGATGAGAA TCAGTCATTCTGGCATGACTACTGGAAATGGATCACTTACGAACATCAACAGGCGCCTAAAAATGATCATGTAGAGACAT TATCGAAAGCCTCTTGGGCAATCCGCGATCAGGAGTGGGCACTCTACTTCTATGATGACGGCACCAATACGCCAAAATTA TTTAATGATAAGCATGATCCCATGGAATCAAAGGATTTAGCTGATCAGTACCCTGAGCGTGTCAGTGCAATGAAAGCGGC ATTCTATGATTGGATCAAAGATAAACCCAAACCCGTGGCTTGGGGGCAAGATCGCTATCAGATCTTAGCAAGCTCCGCGA AAAGTTAA
Upstream 100 bases:
>100_bases TGATGCAGGCCATGTGCCAGATTATTCAACGCGGGGGCATTGCTGCGGACATTATGCCGCAACTGACACAGCCTAAATAA CGCTTGAGGAACACATAATA
Downstream 100 bases:
>100_bases GCTCGTGCGGTGTAACGGGGAGATAAGGTGGAGCGAGTGTTCTTAGCCCCCTCTTTTATCCCCTCTTACCCCTCTTTTTA TTACCGTTCTCCTTATCCTA
Product: putative sulfatase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 535; Mature: 535
Protein sequence:
>535_residues MKLPAGKRSLLAGMIAAAGMSMTPVTLAAPAEKPNVLLVIMDDLGTGQLDFTLNNLDKKALSQRPVPVRYQGDLDKMIDA AQRAMPNVSLLAKNGVKMTNAFVAHPVCGPSRAGIYTGRHPTSFGTYSNDDAMQGIPLDIKLLPALFQEHGYATANIGKW HNARIEKKAFVADEVKSRDYHDNMISVSAPGYAPEKRGFDYSYSYYASGAALWHSPAIWQNSKNIAAPGYLTHNLTDETL KFIDDSGKKPFFISLAYSVPHIPLEQASPAKYMDRFNTGNVEADKYFAAINAADEGIGRIVQHLQEKGELDNTLIFFISD NGAVHESPMPMNGMDRGHKGQMYNGGVHIPFVAYWPKQIPAGTQSDALVSALDILPTALKAAGIAIPAEMRVDGKDILPV LAGKEQTSPHQYMYWAGPGAKHYSDENQSFWHDYWKWITYEHQQAPKNDHVETLSKASWAIRDQEWALYFYDDGTNTPKL FNDKHDPMESKDLADQYPERVSAMKAAFYDWIKDKPKPVAWGQDRYQILASSAKS
Sequences:
>Translated_535_residues MKLPAGKRSLLAGMIAAAGMSMTPVTLAAPAEKPNVLLVIMDDLGTGQLDFTLNNLDKKALSQRPVPVRYQGDLDKMIDA AQRAMPNVSLLAKNGVKMTNAFVAHPVCGPSRAGIYTGRHPTSFGTYSNDDAMQGIPLDIKLLPALFQEHGYATANIGKW HNARIEKKAFVADEVKSRDYHDNMISVSAPGYAPEKRGFDYSYSYYASGAALWHSPAIWQNSKNIAAPGYLTHNLTDETL KFIDDSGKKPFFISLAYSVPHIPLEQASPAKYMDRFNTGNVEADKYFAAINAADEGIGRIVQHLQEKGELDNTLIFFISD NGAVHESPMPMNGMDRGHKGQMYNGGVHIPFVAYWPKQIPAGTQSDALVSALDILPTALKAAGIAIPAEMRVDGKDILPV LAGKEQTSPHQYMYWAGPGAKHYSDENQSFWHDYWKWITYEHQQAPKNDHVETLSKASWAIRDQEWALYFYDDGTNTPKL FNDKHDPMESKDLADQYPERVSAMKAAFYDWIKDKPKPVAWGQDRYQILASSAKS
>Mature_535_residues MKLPAGKRSLLAGMIAAAGMSMTPVTLAAPAEKPNVLLVIMDDLGTGQLDFTLNNLDKKALSQRPVPVRYQGDLDKMIDA AQRAMPNVSLLAKNGVKMTNAFVAHPVCGPSRAGIYTGRHPTSFGTYSNDDAMQGIPLDIKLLPALFQEHGYATANIGKW HNARIEKKAFVADEVKSRDYHDNMISVSAPGYAPEKRGFDYSYSYYASGAALWHSPAIWQNSKNIAAPGYLTHNLTDETL KFIDDSGKKPFFISLAYSVPHIPLEQASPAKYMDRFNTGNVEADKYFAAINAADEGIGRIVQHLQEKGELDNTLIFFISD NGAVHESPMPMNGMDRGHKGQMYNGGVHIPFVAYWPKQIPAGTQSDALVSALDILPTALKAAGIAIPAEMRVDGKDILPV LAGKEQTSPHQYMYWAGPGAKHYSDENQSFWHDYWKWITYEHQQAPKNDHVETLSKASWAIRDQEWALYFYDDGTNTPKL FNDKHDPMESKDLADQYPERVSAMKAAFYDWIKDKPKPVAWGQDRYQILASSAKS
Specific function: Unknown
COG id: COG3119
COG function: function code P; Arylsulfatase A and related enzymes
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Non Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sulfatase family [H]
Homologues:
Organism=Homo sapiens, GI4503899, Length=407, Percent_Identity=28.992628992629, Blast_Score=143, Evalue=4e-34
Organism=Homo sapiens, GI59797060, Length=476, Percent_Identity=27.7310924369748, Blast_Score=116, Evalue=5e-26
Organism=Homo sapiens, GI45430057, Length=420, Percent_Identity=25.7142857142857, Blast_Score=112, Evalue=8e-25
Organism=Homo sapiens, GI146229331, Length=402, Percent_Identity=26.865671641791, Blast_Score=111, Evalue=2e-24
Organism=Homo sapiens, GI146229329, Length=402, Percent_Identity=26.865671641791, Blast_Score=111, Evalue=2e-24
Organism=Homo sapiens, GI6005990, Length=402, Percent_Identity=26.865671641791, Blast_Score=111, Evalue=2e-24
Organism=Homo sapiens, GI146229324, Length=402, Percent_Identity=26.865671641791, Blast_Score=111, Evalue=2e-24
Organism=Homo sapiens, GI109389362, Length=480, Percent_Identity=25.4166666666667, Blast_Score=107, Evalue=2e-23
Organism=Homo sapiens, GI53831991, Length=297, Percent_Identity=27.6094276094276, Blast_Score=107, Evalue=3e-23
Organism=Homo sapiens, GI38569405, Length=539, Percent_Identity=23.0055658627087, Blast_Score=96, Evalue=8e-20
Organism=Homo sapiens, GI38569407, Length=397, Percent_Identity=24.6851385390428, Blast_Score=95, Evalue=2e-19
Organism=Homo sapiens, GI31742482, Length=279, Percent_Identity=25.8064516129032, Blast_Score=84, Evalue=4e-16
Organism=Homo sapiens, GI146229327, Length=298, Percent_Identity=27.1812080536913, Blast_Score=79, Evalue=1e-14
Organism=Homo sapiens, GI157266309, Length=252, Percent_Identity=25, Blast_Score=78, Evalue=3e-14
Organism=Homo sapiens, GI58743319, Length=197, Percent_Identity=24.3654822335025, Blast_Score=70, Evalue=4e-12
Organism=Homo sapiens, GI71852584, Length=197, Percent_Identity=22.8426395939086, Blast_Score=69, Evalue=9e-12
Organism=Escherichia coli, GI87081924, Length=514, Percent_Identity=39.4941634241245, Blast_Score=395, Evalue=1e-111
Organism=Escherichia coli, GI1790233, Length=417, Percent_Identity=26.1390887290168, Blast_Score=100, Evalue=2e-22
Organism=Escherichia coli, GI1790112, Length=361, Percent_Identity=23.8227146814404, Blast_Score=85, Evalue=1e-17
Organism=Caenorhabditis elegans, GI115533416, Length=526, Percent_Identity=23.0038022813688, Blast_Score=103, Evalue=3e-22
Organism=Caenorhabditis elegans, GI115533418, Length=462, Percent_Identity=23.3766233766234, Blast_Score=96, Evalue=6e-20
Organism=Drosophila melanogaster, GI24666109, Length=415, Percent_Identity=26.0240963855422, Blast_Score=128, Evalue=9e-30
Organism=Drosophila melanogaster, GI281366397, Length=423, Percent_Identity=26.241134751773, Blast_Score=128, Evalue=1e-29
Organism=Drosophila melanogaster, GI281366395, Length=423, Percent_Identity=26.241134751773, Blast_Score=128, Evalue=1e-29
Organism=Drosophila melanogaster, GI24666163, Length=423, Percent_Identity=26.241134751773, Blast_Score=128, Evalue=1e-29
Organism=Drosophila melanogaster, GI24666175, Length=411, Percent_Identity=25.0608272506083, Blast_Score=125, Evalue=1e-28
Organism=Drosophila melanogaster, GI281363223, Length=437, Percent_Identity=24.4851258581236, Blast_Score=103, Evalue=4e-22
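The homologue entries above follow a fixed key=value pattern, so they can be loaded into structured records for sorting or filtering. The following is a minimal sketch: the `Homologue` type, the regular expression, and `parse_homologues` are hypothetical helpers written for this record's layout, not part of any database tool. The sample text reuses two entries from the list.

```python
import re
from typing import List, NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI<digits>, Length=..., Percent_Identity=...,
# Blast_Score=..., Evalue=...  (\s* also tolerates line breaks inside an entry)
_ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[^,\s]+)"
)


def parse_homologues(text: str) -> List[Homologue]:
    """Extract every homologue entry found in the given text."""
    return [
        Homologue(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in _ENTRY.finditer(text)
    ]


SAMPLE = (
    "Organism=Escherichia coli, GI87081924, Length=514, "
    "Percent_Identity=39.4941634241245, Blast_Score=395, Evalue=1e-111, "
    "Organism=Homo sapiens, GI157266309, Length=252, Percent_Identity=25, "
    "Blast_Score=78, Evalue=3e-14,"
)

hits = parse_homologues(SAMPLE)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi, best.blast_score)
```

Applied to the full list, ranking by `blast_score` would put the Escherichia coli entry GI87081924 (score 395) first, since it has the highest score in the record.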
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR017849
- InterPro: IPR017850
- InterPro: IPR000917 [H]
Pfam domain/function: PF00884 Sulfatase [H]
EC number: 3.1.6.- [C]
Molecular weight: Translated: 59365; Mature: 59365
Theoretical pI: Translated: 6.80; Mature: 6.80
Prosite motif: PS00149 SULFATASE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
3.6 %Met (Translated Protein)
3.7 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
3.6 %Met (Mature Protein)
3.7 %Cys+Met (Mature Protein)
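These percentages can be reproduced from the protein sequence. The sketch below assumes (the record does not say how the values were derived) that each figure is a plain residue count divided by the sequence length, rounded to one decimal place; `residue_percent` is an illustrative helper, and the sequence is copied from the Protein sequence field.

```python
# Protein sequence copied from the record (translated and mature forms are identical).
PROTEIN = (
    "MKLPAGKRSLLAGMIAAAGMSMTPVTLAAPAEKPNVLLVIMDDLGTGQLDFTLNNLDKKALSQRPVPVRYQGDLDKMIDA"
    "AQRAMPNVSLLAKNGVKMTNAFVAHPVCGPSRAGIYTGRHPTSFGTYSNDDAMQGIPLDIKLLPALFQEHGYATANIGKW"
    "HNARIEKKAFVADEVKSRDYHDNMISVSAPGYAPEKRGFDYSYSYYASGAALWHSPAIWQNSKNIAAPGYLTHNLTDETL"
    "KFIDDSGKKPFFISLAYSVPHIPLEQASPAKYMDRFNTGNVEADKYFAAINAADEGIGRIVQHLQEKGELDNTLIFFISD"
    "NGAVHESPMPMNGMDRGHKGQMYNGGVHIPFVAYWPKQIPAGTQSDALVSALDILPTALKAAGIAIPAEMRVDGKDILPV"
    "LAGKEQTSPHQYMYWAGPGAKHYSDENQSFWHDYWKWITYEHQQAPKNDHVETLSKASWAIRDQEWALYFYDDGTNTPKL"
    "FNDKHDPMESKDLADQYPERVSAMKAAFYDWIKDKPKPVAWGQDRYQILASSAKS"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq holding any of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100 * count / len(seq), 1)


cys = residue_percent(PROTEIN, "C")
met = residue_percent(PROTEIN, "M")
cys_met = residue_percent(PROTEIN, "CM")
print(len(PROTEIN), cys, met, cys_met)
```

Because each value is rounded independently, the combined Cys+Met figure (3.7) need not equal the sum of the two rounded parts (0.2 + 3.6).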
Secondary structure:
>Translated Secondary Structure
MKLPAGKRSLLAGMIAAAGMSMTPVTLAAPAEKPNVLLVIMDDLGTGQLDFTLNNLDKKA
CCCCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCCEEEEEECCCCCCEEEEEECHHHHHH
LSQRPVPVRYQGDLDKMIDAAQRAMPNVSLLAKNGVKMTNAFVAHPVCGPSRAGIYTGRH
HHCCCCCEEECCCHHHHHHHHHHHCCCEEEEECCCCEEEHHEEECCCCCCCCCCEECCCC
PTSFGTYSNDDAMQGIPLDIKLLPALFQEHGYATANIGKWHNARIEKKAFVADEVKSRDY
CCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCEEECCCCCCCCCCCHHHHHHHHHHCCCC
HDNMISVSAPGYAPEKRGFDYSYSYYASGAALWHSPAIWQNSKNIAAPGYLTHNLTDETL
CCCEEEEECCCCCCHHCCCCCCCCEEECCCCEECCCCCCCCCCCCCCCCEEECCCCHHHH
KFIDDSGKKPFFISLAYSVPHIPLEQASPAKYMDRFNTGNVEADKYFAAINAADEGIGRI
HHHHCCCCCCEEEEEECCCCCCCCCCCCCHHHHHCCCCCCCCCCCEEEEECCCHHHHHHH
VQHLQEKGELDNTLIFFISDNGAVHESPMPMNGMDRGHKGQMYNGGVHIPFVAYWPKQIP
HHHHHHCCCCCCEEEEEECCCCCEECCCCCCCCCCCCCCCCEECCCEEEEEEEECCCCCC
AGTQSDALVSALDILPTALKAAGIAIPAEMRVDGKDILPVLAGKEQTSPHQYMYWAGPGA
CCCCCHHHHHHHHHHHHHHHHCCEECCHHEEECCCCCHHEECCCCCCCCCCEEEECCCCC
KHYSDENQSFWHDYWKWITYEHQQAPKNDHVETLSKASWAIRDQEWALYFYDDGTNTPKL
CCCCCCCCHHHHHHHHHHEEEHHCCCCCHHHHHHHHHHHEEECCCEEEEEEECCCCCCCC
FNDKHDPMESKDLADQYPERVSAMKAAFYDWIKDKPKPVAWGQDRYQILASSAKS
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCC
>Mature Secondary Structure
MKLPAGKRSLLAGMIAAAGMSMTPVTLAAPAEKPNVLLVIMDDLGTGQLDFTLNNLDKKA
CCCCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCCEEEEEECCCCCCEEEEEECHHHHHH
LSQRPVPVRYQGDLDKMIDAAQRAMPNVSLLAKNGVKMTNAFVAHPVCGPSRAGIYTGRH
HHCCCCCEEECCCHHHHHHHHHHHCCCEEEEECCCCEEEHHEEECCCCCCCCCCEECCCC
PTSFGTYSNDDAMQGIPLDIKLLPALFQEHGYATANIGKWHNARIEKKAFVADEVKSRDY
CCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCEEECCCCCCCCCCCHHHHHHHHHHCCCC
HDNMISVSAPGYAPEKRGFDYSYSYYASGAALWHSPAIWQNSKNIAAPGYLTHNLTDETL
CCCEEEEECCCCCCHHCCCCCCCCEEECCCCEECCCCCCCCCCCCCCCCEEECCCCHHHH
KFIDDSGKKPFFISLAYSVPHIPLEQASPAKYMDRFNTGNVEADKYFAAINAADEGIGRI
HHHHCCCCCCEEEEEECCCCCCCCCCCCCHHHHHCCCCCCCCCCCEEEEECCCHHHHHHH
VQHLQEKGELDNTLIFFISDNGAVHESPMPMNGMDRGHKGQMYNGGVHIPFVAYWPKQIP
HHHHHHCCCCCCEEEEEECCCCCEECCCCCCCCCCCCCCCCEECCCEEEEEEEECCCCCC
AGTQSDALVSALDILPTALKAAGIAIPAEMRVDGKDILPVLAGKEQTSPHQYMYWAGPGA
CCCCCHHHHHHHHHHHHHHHHCCEECCHHEEECCCCCHHEECCCCCCCCCCEEEECCCCC
KHYSDENQSFWHDYWKWITYEHQQAPKNDHVETLSKASWAIRDQEWALYFYDDGTNTPKL
CCCCCCCCHHHHHHHHHHEEEHHCCCCCHHHHHHHHHHHEEECCCEEEEEEECCCCCCCC
FNDKHDPMESKDLADQYPERVSAMKAAFYDWIKDKPKPVAWGQDRYQILASSAKS
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9097039; 9278503 [H]