Definition | Escherichia coli ED1a chromosome, complete genome. |
---|---|
Accession | NC_011745 |
Length | 5,209,548 |
The map label for this gene is srfJ
Identifier: 218688155
GI number: 218688155
Start: 323372
End: 324715
Strand: Direct
Name: srfJ
Synonym: ECED1_0305
Alternate gene names: NA
Gene position: 323372-324715 (Clockwise)
Preceding gene: 218688154
Following gene: 218688156
Centisome position: 6.21
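The coordinate-derived fields above are consistent with one another, which a short calculation can illustrate. The sketch below assumes that coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Neither convention is stated in this record, and the function names are illustrative only.

```python
# Values copied from this record.
CHROMOSOME_LENGTH = 5_209_548   # "Length" of NC_011745
START, END = 323_372, 324_715   # "Start" and "End" of srfJ


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed definition)."""
    return 100.0 * start / chromosome_length


def protein_length(n_bases: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    return n_bases // 3 - 1


if __name__ == "__main__":
    n = gene_length(START, END)
    print(n)                                              # 1344
    print(round(centisome(START, CHROMOSOME_LENGTH), 2))  # 6.21
    print(protein_length(n))                              # 447
```

Under these assumptions the results match the 1344-base gene sequence, the centisome position of 6.21, and the 447 amino acids reported in this record.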
GC content: 53.2
Gene sequence:
>1344_bases ATGAAAGGCAGACTCATCTCTTCCGATCCGTATCGTCAGCAATTCCTTGTTGAGCGTGCGGTCTCTTTTTCGCATCGTCA GCGTGATTGCAGTGAATTAATCAGCGTCTTACCGCGCCACACGTTACAGCAGATTGACGGATTCGGCGGCAGCTTTACCG AAGGTGCGGGCGTGGTATTCAACAGCATGAGCGAAAAGACGAAGGCGCAATTTCTTTCCCTTTATTTTTCTGATCAGGAA CATAATTACACTCTGGCGCGGATGCCAATTCAGAGCTGTGATTTTTCCCTGGGCAATTACGCGTATGTCGATTCCAGCGC TGACCTGCAGCAGGGACGGCTCTCCTTTTCCCGCGATGAAGCGCATTTAATACCGCTGATTTCCGGGGCGTTGCGGTTAA ATCCACACATGAAGCTGATGGCTTCGCCGTGGAGTCCGCCGGCGTTTATGAAAACTAATAACGATATGAACGGTGGCGGC AAGCTGCGGCGCGAATGCTACGCCGACTGGGCCGATATCATTATCAACTACCTGCTGGAATACCGCCGCCACGGCATTAA TGTGCAGGCGCTCTCCGTGCAGAATGAGCCGGTGGCGGTAAAAACCTGGGACTCCTGTCTGTATAGCGTGGAAGAGGAGA CAGCCTTTGCCGTGCAGTATCTGCGTCCGCGCCTCGCCCGGCAGGGTATGGATGAGATGGAGATCTATATCTGGGATCAC GATAAAGATGGCCTGGTAGACTGGGCTGAACGCGCCTTTGCTGACGAAGCTAATTATAAGGGAATTAACGGGCTGGCATT CCACTGGTATACCGGCGACCATTTTTCGCAAATACAGTATCTGGCCCAGTGCCTGCCGGATAAAAAACTCCTGTTTTCCG AAGGCTGTGTGCCAATGGAGAGCGATGCCGGTAGCCAGATTCGCCACTGGCATACCTATCTCCATGACATGATTGGTAAT TTCAAATCGGGTTGTAGCGGGTTTATCGACTGGAATCTGCTGTTGAACAGTGAGGGTGGGCCGAATCATCAGGGTAATCT GTGTGAAGCGCCCATTCAATACGATGCGCAAAACGACGTGCTGCGGCGTAACCACTCCTGGTATGGTATTGGCCACTTCT GCCGCTATGTGCGTCCGGGGGCGAGGGTCATGCTTTCTTCAAGTTACGATAATCTTCTGGAAGAGGTGGGATTTGTGAAT CCCGACGGCGAGCGTGTGCTGGTGGTGTATAACCGCGATGTCCAGGAAAGGCGTTGCCGGGTGCTGGATGGCGATAAAGA GATCGCGTTAACGCTGCCGCCGTCAGGCGCCAGTACGTTGCTATGGCGGCAGGAGTCGATCTGA
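A GC content figure such as the one reported above is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, not necessarily the method used to produce this record; `gc_content` is a hypothetical helper name, and the whitespace stripping is there because the sequence is printed in space-separated blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces/newlines used to break the sequence into blocks.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Toy input: the first three codons of the gene sequence above.
    print(round(gc_content("ATG AAA GGC"), 1))
```

Passing the full gene sequence (without its `>1344_bases` header) to the function gives a value that can be compared with the GC content field.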
Upstream 100 bases:
>100_bases TAAACCTTCCGCGTATTAAAACCTTCTGCCGGATAGCGATGCTTAAGCGTCCTATCCGGCATATCACAATTTATTATCTT CTTTTATCCGGAACTTCCCT
Downstream 100 bases:
>100_bases ATGATGAAGCTGGGATTTAATGAAGCGACCTGTATGCGAAACTCCACGCTGGCACAGGATGTTGTGTTGGCGGAACGTTT TGGCTATGACTACATTGAAA
Product: putative glucosylceramidase
Products: NA
Alternate protein names: Glucan Endo-1,6-Beta-Glucosidase; Glycoside Hydrolase Family; O-Glycosyl Hydrolase Family; Glycoside Hydrolase Family Protein; O-Glycosyl Hydrolase; Glycosyl Hydrolase; Ricin B Lectin; O-Glycosyl Hydrolase Family Protein; Glycoside Hydrolase; Lysosomal Glucosyl Ceramidase-Like Protein; Glycosyl Hydrolase Family Protein; Beta-Glycosidase; Helix-Turn-Helix AraC Type; Glycosyl Hydrolase Family; LPXTG-Motif Cell Wall Anchor Domain Protein; Glycosyl Hydrolases Family; Endo-1,6-Beta-D-Glucanase
Number of amino acids: Translated: 447; Mature: 447
Protein sequence:
>447_residues MKGRLISSDPYRQQFLVERAVSFSHRQRDCSELISVLPRHTLQQIDGFGGSFTEGAGVVFNSMSEKTKAQFLSLYFSDQE HNYTLARMPIQSCDFSLGNYAYVDSSADLQQGRLSFSRDEAHLIPLISGALRLNPHMKLMASPWSPPAFMKTNNDMNGGG KLRRECYADWADIIINYLLEYRRHGINVQALSVQNEPVAVKTWDSCLYSVEEETAFAVQYLRPRLARQGMDEMEIYIWDH DKDGLVDWAERAFADEANYKGINGLAFHWYTGDHFSQIQYLAQCLPDKKLLFSEGCVPMESDAGSQIRHWHTYLHDMIGN FKSGCSGFIDWNLLLNSEGGPNHQGNLCEAPIQYDAQNDVLRRNHSWYGIGHFCRYVRPGARVMLSSSYDNLLEEVGFVN PDGERVLVVYNRDVQERRCRVLDGDKEIALTLPPSGASTLLWRQESI
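The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with the standard genetic code, whose amino-acid assignments for these codons are the same as in the bacterial table. It is an illustration under that assumption, not the pipeline used to build this record, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(translate("ATGAAAGGCAGACTC"))  # first five codons of srfJ -> MKGRL
    print(translate("GAGTCGATCTGA"))     # last four codons of srfJ  -> ESI
```

The two toy inputs are the first and last codons of the gene sequence in this record; their translations match the start (`MKGRL`) and end (`ESI`) of the protein sequence.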
Sequences:
>Translated_447_residues MKGRLISSDPYRQQFLVERAVSFSHRQRDCSELISVLPRHTLQQIDGFGGSFTEGAGVVFNSMSEKTKAQFLSLYFSDQE HNYTLARMPIQSCDFSLGNYAYVDSSADLQQGRLSFSRDEAHLIPLISGALRLNPHMKLMASPWSPPAFMKTNNDMNGGG KLRRECYADWADIIINYLLEYRRHGINVQALSVQNEPVAVKTWDSCLYSVEEETAFAVQYLRPRLARQGMDEMEIYIWDH DKDGLVDWAERAFADEANYKGINGLAFHWYTGDHFSQIQYLAQCLPDKKLLFSEGCVPMESDAGSQIRHWHTYLHDMIGN FKSGCSGFIDWNLLLNSEGGPNHQGNLCEAPIQYDAQNDVLRRNHSWYGIGHFCRYVRPGARVMLSSSYDNLLEEVGFVN PDGERVLVVYNRDVQERRCRVLDGDKEIALTLPPSGASTLLWRQESI >Mature_447_residues MKGRLISSDPYRQQFLVERAVSFSHRQRDCSELISVLPRHTLQQIDGFGGSFTEGAGVVFNSMSEKTKAQFLSLYFSDQE HNYTLARMPIQSCDFSLGNYAYVDSSADLQQGRLSFSRDEAHLIPLISGALRLNPHMKLMASPWSPPAFMKTNNDMNGGG KLRRECYADWADIIINYLLEYRRHGINVQALSVQNEPVAVKTWDSCLYSVEEETAFAVQYLRPRLARQGMDEMEIYIWDH DKDGLVDWAERAFADEANYKGINGLAFHWYTGDHFSQIQYLAQCLPDKKLLFSEGCVPMESDAGSQIRHWHTYLHDMIGN FKSGCSGFIDWNLLLNSEGGPNHQGNLCEAPIQYDAQNDVLRRNHSWYGIGHFCRYVRPGARVMLSSSYDNLLEEVGFVN PDGERVLVVYNRDVQERRCRVLDGDKEIALTLPPSGASTLLWRQESI
Specific function: Unknown
COG id: COG5520
COG function: function code M; O-Glycosyl hydrolase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | 284807150 | 432 | 30.787037037037 | 211 | 2e-54 |
Homo sapiens | 54607043 | 432 | 30.787037037037 | 210 | 2e-54 |
Homo sapiens | 54607045 | 432 | 30.787037037037 | 210 | 2e-54 |
Homo sapiens | 54607047 | 432 | 30.787037037037 | 210 | 2e-54 |
Homo sapiens | 284807152 | 382 | 31.151832460733 | 181 | 1e-45 |
Caenorhabditis elegans | 17539758 | 429 | 29.3706293706294 | 201 | 6e-52 |
Caenorhabditis elegans | 17539756 | 437 | 28.8329519450801 | 200 | 1e-51 |
Caenorhabditis elegans | 17542884 | 440 | 27.7272727272727 | 183 | 2e-46 |
Caenorhabditis elegans | 25151335 | 367 | 30.7901907356948 | 177 | 1e-44 |
Caenorhabditis elegans | 115532470 | 423 | 27.8959810874704 | 164 | 1e-40 |
Caenorhabditis elegans | 193204210 | 391 | 28.9002557544757 | 164 | 1e-40 |
Caenorhabditis elegans | 115532472 | 423 | 27.8959810874704 | 164 | 1e-40 |
Drosophila melanogaster | 21355305 | 456 | 26.0964912280702 | 151 | 7e-37 |
Drosophila melanogaster | 161078544 | 443 | 26.410835214447 | 137 | 1e-32 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 50983; Mature: 50983
Theoretical pI: Translated: 5.85; Mature: 5.85
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 2.2 % Cys; 2.7 % Met; 4.9 % Cys+Met
Mature protein: 2.2 % Cys; 2.7 % Met; 4.9 % Cys+Met
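Residue-content figures of this kind are simple frequency counts over the protein sequence. The sketch below shows one way to compute them; `residue_percent` is a hypothetical helper, and the rounding used by the source database is not stated, so small differences in the last digit are possible.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    # Remove the spaces/newlines used to break the sequence into blocks.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return 100.0 * count / len(seq)


if __name__ == "__main__":
    toy = "MKCA"  # toy peptide, not taken from this record
    print(residue_percent(toy, "C"))   # Cys content
    print(residue_percent(toy, "M"))   # Met content
    print(residue_percent(toy, "CM"))  # Cys+Met content
```

Applying the function to the 447-residue protein sequence with `"C"`, `"M"` and `"CM"` gives values that can be compared with the Cys/Met content fields.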
Secondary structure:
>Translated Secondary Structure
MKGRLISSDPYRQQFLVERAVSFSHRQRDCSELISVLPRHTLQQIDGFGGSFTEGAGVVF
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEE
NSMSEKTKAQFLSLYFSDQEHNYTLARMPIQSCDFSLGNYAYVDSSADLQQGRLSFSRDE
HHHHHHHHHHHHHEEECCCCCCEEEEECCHHHCCCCCCCEEEECCCCCHHHCCCCCCCCC
AHLIPLISGALRLNPHMKLMASPWSPPAFMKTNNDMNGGGKLRRECYADWADIIINYLLE
CEEEHHHHCCEEECCCEEEEECCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHH
YRRHGINVQALSVQNEPVAVKTWDSCLYSVEEETAFAVQYLRPRLARQGMDEMEIYIWDH
HHHCCCCEEEEEECCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEC
DKDGLVDWAERAFADEANYKGINGLAFHWYTGDHFSQIQYLAQCLPDKKLLFSEGCVPME
CCCCHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHCCCCHHHHHCCCCCCC
SDAGSQIRHWHTYLHDMIGNFKSGCSGFIDWNLLLNSEGGPNHQGNLCEAPIQYDAQNDV
CCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCHH
LRRNHSWYGIGHFCRYVRPGARVMLSSSYDNLLEEVGFVNPDGERVLVVYNRDVQERRCR
HHCCCCEEEHHHHHHHHCCCCEEEEECCHHHHHHHCCCCCCCCCEEEEEECCCHHHHHHE
VLDGDKEIALTLPPSGASTLLWRQESI
EECCCCEEEEEECCCCCCEEEEECCCC
>Mature Secondary Structure
MKGRLISSDPYRQQFLVERAVSFSHRQRDCSELISVLPRHTLQQIDGFGGSFTEGAGVVF
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEE
NSMSEKTKAQFLSLYFSDQEHNYTLARMPIQSCDFSLGNYAYVDSSADLQQGRLSFSRDE
HHHHHHHHHHHHHEEECCCCCCEEEEECCHHHCCCCCCCEEEECCCCCHHHCCCCCCCCC
AHLIPLISGALRLNPHMKLMASPWSPPAFMKTNNDMNGGGKLRRECYADWADIIINYLLE
CEEEHHHHCCEEECCCEEEEECCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHH
YRRHGINVQALSVQNEPVAVKTWDSCLYSVEEETAFAVQYLRPRLARQGMDEMEIYIWDH
HHHCCCCEEEEEECCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEC
DKDGLVDWAERAFADEANYKGINGLAFHWYTGDHFSQIQYLAQCLPDKKLLFSEGCVPME
CCCCHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHCCCCHHHHHCCCCCCC
SDAGSQIRHWHTYLHDMIGNFKSGCSGFIDWNLLLNSEGGPNHQGNLCEAPIQYDAQNDV
CCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCHH
LRRNHSWYGIGHFCRYVRPGARVMLSSSYDNLLEEVGFVNPDGERVLVVYNRDVQERRCR
HHCCCCEEEHHHHHHHHCCCCEEEEECCHHHHHHHCCCCCCCCCEEEEEECCCHHHHHHE
VLDGDKEIALTLPPSGASTLLWRQESI
EECCCCEEEEEECCCCCCEEEEECCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA