| Definition | Escherichia coli 55989, complete genome. |
|---|---|
| Accession | NC_011748 |
| Length | 5,154,862 |
Click here to switch to the map view.
The map label for this gene is treA
Identifier: 218694711
GI number: 218694711
Start: 1341513
End: 1343210
Strand: Reverse
Name: treA
Synonym: EC55989_1293
Alternate gene names: 218694711
Gene position: 1343210-1341513 (Counterclockwise)
Preceding gene: 218694712
Following gene: 218694708
Centisome position: 26.06
GC content: 52.18
Gene sequence:
>1698_bases ATGAAATCCCCCGCACCTTCTCGCCCGCAAAAAATGGCGTTAATTCCAGCCTGTATCTTTTTGTGTTTCGCTGCGCTATC GGTGCAGGCAGAAGAAACACCGGTAACACCACAGCCGCCTGATATTTTATTAGGGCCGCTGTTTAATGATGTGCAAAACG CCAAACTTTTTCCGGACCAAAAAACCTTTGCCGATGCCATGCCGAACAGCGATCCGCTGATGATCCTTGCTGATTATCGG ATGCAGCAAAACCAGAGCGGATTTGATCTGCGCCATTTCGTTAACGTCAATTTCACCCTGCCGAAAGAAGGCGAGAAATA TGTTCCGCCAGAGGGGCAGTCACTGCGCGAACATATTGACGGACTTTGGCCGGTATTAACGCGTTCTACCGAAAACACCG AAAAATGGGATTCTCTGTTACCGCTGCCGGAACCTTATGTCGTGCCGGGCGGACGTTTTCGCGAGGTATATTACTGGGAC AGTTACTTCACCATGTTAGGACTTGCCGAAAGCGGTCACTGGGATAAAGTCGCGGATATGGTGGCCAATTTTGCTCATGA AATAGACACTTACGGTCATATTCCCAACGGCAACCGCAGTTACTATTTAAGCCGCTCGCAACCGCCCTTCTTTGCCCTGA TGGTAGAGTTACTGGCGCAGCATGAAGGCGATGCCGCGTTGAAGCAATACCTGCCGCAAATGCAAAAAGAATATGCTTAC TGGATGGACGGTGTTGAAAACCTGCAAGCCGGACAACAGGAAAAACGCGTTGTCAAACTTCAGGATGGTACCCTTCTCAA CCGCTACTGGGACGATCGCGATACGCCACGACCAGAGTCATGGGTGGAAGATATTGCCACCGCCAAAAGCAATCCGAATC GACCTGCCACTGAAATTTACCGCGACCTGCGCTCTGCCGCTGCGTCTGGCTGGGATTTCAGCTCGCGCTGGATGGACAAC CCGCAGCAGTTAAATACCTTACGCACCACCAGCATCGTACCGGTCGATCTGAACAGCCTGATGTTTAAAATGGAAAAAAT CCTCGCCCGCGCCAGCAAAGCTGCCGGAGATAACGCGATGGCAAACCAGTACGAAACGCTGGCAAATGCCCGTCAAAAAG GGATCGAAAAATACCTGTGGAACGATCAACAAGGCTGGTATGCCGATTACGACCTGAAAAGTCATAAAGTGCGCAATCAG TTAACCGCGGCCGCCCTGTTCCCGCTGTACGTCAATGCGGCAGCGAAAGATCGCGCCAACAAAATGGCGACGGCGACGAA AACACATCTGCTGCAACCCGGCGGCCTGAACACCACGTCGGTGAAAAGTGGGCAACAATGGGATGCGCCAAATGGCTGGG CACCGTTACAGTGGGTCGCGACAGAAGGATTACAAAACTACGGGCAAAAAGAGGTGGCGATGGACATTAGCTGGCACTTC CTGACCAATGTTCAGCACACCTATGACCGGGAGAAAAAGCTGGTGGAAAAATATGATGTCAGCACCACCGGAACGGGGGG CGGCGGTGGCGAATATCCATTACAGGATGGCTTTGGCTGGACCAATGGCGTGACGCTGAAAATGCTGGATTTGATCTGCC CGAAAGAGCAACCGTGTGACAATGTTCCGGCGACGCGTCCGACCGTTAAGTCAGCAACGACGCAACCCTCAACCAAAGAG GCACAACCCACACCTTAA
Upstream 100 bases:
>100_bases GCAGAATGAGATTTCGATCATGCAGCTAGTGCGATCCTGAACTAAGGTTTTCTGATACTTGAATACCGTTTTATTCGGTT TCGCCAAAGGAGAATGATTG
Downstream 100 bases:
>100_bases CCAGCGCTTACTCCGACAACGTCATTCGGCTTCTCCGGATGGCGTTGTCGGCAGATAACACCATAGCTATGGTTTTTTCT ACACAGACGGTTTTTCAAGT
Product: trehalase
Products: NA
Alternate protein names: Alpha,alpha-trehalase; Alpha,alpha-trehalose glucohydrolase [H]
Number of amino acids: Translated: 565; Mature: 565
Protein sequence:
>565_residues MKSPAPSRPQKMALIPACIFLCFAALSVQAEETPVTPQPPDILLGPLFNDVQNAKLFPDQKTFADAMPNSDPLMILADYR MQQNQSGFDLRHFVNVNFTLPKEGEKYVPPEGQSLREHIDGLWPVLTRSTENTEKWDSLLPLPEPYVVPGGRFREVYYWD SYFTMLGLAESGHWDKVADMVANFAHEIDTYGHIPNGNRSYYLSRSQPPFFALMVELLAQHEGDAALKQYLPQMQKEYAY WMDGVENLQAGQQEKRVVKLQDGTLLNRYWDDRDTPRPESWVEDIATAKSNPNRPATEIYRDLRSAAASGWDFSSRWMDN PQQLNTLRTTSIVPVDLNSLMFKMEKILARASKAAGDNAMANQYETLANARQKGIEKYLWNDQQGWYADYDLKSHKVRNQ LTAAALFPLYVNAAAKDRANKMATATKTHLLQPGGLNTTSVKSGQQWDAPNGWAPLQWVATEGLQNYGQKEVAMDISWHF LTNVQHTYDREKKLVEKYDVSTTGTGGGGGEYPLQDGFGWTNGVTLKMLDLICPKEQPCDNVPATRPTVKSATTQPSTKE AQPTP
Sequences:
>Translated_565_residues MKSPAPSRPQKMALIPACIFLCFAALSVQAEETPVTPQPPDILLGPLFNDVQNAKLFPDQKTFADAMPNSDPLMILADYR MQQNQSGFDLRHFVNVNFTLPKEGEKYVPPEGQSLREHIDGLWPVLTRSTENTEKWDSLLPLPEPYVVPGGRFREVYYWD SYFTMLGLAESGHWDKVADMVANFAHEIDTYGHIPNGNRSYYLSRSQPPFFALMVELLAQHEGDAALKQYLPQMQKEYAY WMDGVENLQAGQQEKRVVKLQDGTLLNRYWDDRDTPRPESWVEDIATAKSNPNRPATEIYRDLRSAAASGWDFSSRWMDN PQQLNTLRTTSIVPVDLNSLMFKMEKILARASKAAGDNAMANQYETLANARQKGIEKYLWNDQQGWYADYDLKSHKVRNQ LTAAALFPLYVNAAAKDRANKMATATKTHLLQPGGLNTTSVKSGQQWDAPNGWAPLQWVATEGLQNYGQKEVAMDISWHF LTNVQHTYDREKKLVEKYDVSTTGTGGGGGEYPLQDGFGWTNGVTLKMLDLICPKEQPCDNVPATRPTVKSATTQPSTKE AQPTP >Mature_565_residues MKSPAPSRPQKMALIPACIFLCFAALSVQAEETPVTPQPPDILLGPLFNDVQNAKLFPDQKTFADAMPNSDPLMILADYR MQQNQSGFDLRHFVNVNFTLPKEGEKYVPPEGQSLREHIDGLWPVLTRSTENTEKWDSLLPLPEPYVVPGGRFREVYYWD SYFTMLGLAESGHWDKVADMVANFAHEIDTYGHIPNGNRSYYLSRSQPPFFALMVELLAQHEGDAALKQYLPQMQKEYAY WMDGVENLQAGQQEKRVVKLQDGTLLNRYWDDRDTPRPESWVEDIATAKSNPNRPATEIYRDLRSAAASGWDFSSRWMDN PQQLNTLRTTSIVPVDLNSLMFKMEKILARASKAAGDNAMANQYETLANARQKGIEKYLWNDQQGWYADYDLKSHKVRNQ LTAAALFPLYVNAAAKDRANKMATATKTHLLQPGGLNTTSVKSGQQWDAPNGWAPLQWVATEGLQNYGQKEVAMDISWHF LTNVQHTYDREKKLVEKYDVSTTGTGGGGGEYPLQDGFGWTNGVTLKMLDLICPKEQPCDNVPATRPTVKSATTQPSTKE AQPTP
Specific function: Provides the cells with the ability to utilize trehalose at high osmolarity by splitting it into glucose molecules that can subsequently be taken up by the phosphotransferase-mediated uptake system [H]
COG id: COG1626
COG function: function code G; Neutral trehalase
Gene ontology:
Cell location: Periplasm [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 37 family [H]
Homologues:
Organism=Homo sapiens, GI116284412, Length=527, Percent_Identity=34.3453510436433, Blast_Score=265, Evalue=1e-70, Organism=Escherichia coli, GI1787447, Length=565, Percent_Identity=99.8230088495575, Blast_Score=1177, Evalue=0.0, Organism=Escherichia coli, GI1789936, Length=487, Percent_Identity=51.129363449692, Blast_Score=512, Evalue=1e-146, Organism=Caenorhabditis elegans, GI17542196, Length=525, Percent_Identity=34.0952380952381, Blast_Score=270, Evalue=2e-72, Organism=Caenorhabditis elegans, GI25148109, Length=574, Percent_Identity=31.7073170731707, Blast_Score=248, Evalue=6e-66, Organism=Caenorhabditis elegans, GI25141398, Length=527, Percent_Identity=31.6888045540797, Blast_Score=242, Evalue=5e-64, Organism=Caenorhabditis elegans, GI17565078, Length=545, Percent_Identity=31.5596330275229, Blast_Score=234, Evalue=8e-62, Organism=Caenorhabditis elegans, GI71987755, Length=403, Percent_Identity=29.0322580645161, Blast_Score=183, Evalue=2e-46, Organism=Saccharomyces cerevisiae, GI6320204, Length=436, Percent_Identity=30.045871559633, Blast_Score=159, Evalue=9e-40, Organism=Saccharomyces cerevisiae, GI6319473, Length=429, Percent_Identity=31.002331002331, Blast_Score=153, Evalue=7e-38, Organism=Drosophila melanogaster, GI24656680, Length=528, Percent_Identity=34.280303030303, Blast_Score=273, Evalue=2e-73, Organism=Drosophila melanogaster, GI24656675, Length=528, Percent_Identity=34.280303030303, Blast_Score=273, Evalue=2e-73, Organism=Drosophila melanogaster, GI24656661, Length=528, Percent_Identity=34.280303030303, Blast_Score=273, Evalue=3e-73, Organism=Drosophila melanogaster, GI17933716, Length=528, Percent_Identity=34.280303030303, Blast_Score=273, Evalue=3e-73, Organism=Drosophila melanogaster, GI24656670, Length=528, Percent_Identity=34.280303030303, Blast_Score=273, Evalue=3e-73, Organism=Drosophila melanogaster, GI24656685, Length=502, Percent_Identity=33.6653386454183, Blast_Score=255, Evalue=6e-68, Organism=Drosophila melanogaster, GI22024178, Length=527, Percent_Identity=31.1195445920304, Blast_Score=237, Evalue=2e-62, Organism=Drosophila melanogaster, GI45551104, Length=367, Percent_Identity=32.425068119891, Blast_Score=172, Evalue=4e-43, Organism=Drosophila melanogaster, GI28573474, Length=314, Percent_Identity=28.343949044586, Blast_Score=121, Evalue=1e-27,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008928 - InterPro: IPR001661 - InterPro: IPR018232 [H]
Pfam domain/function: PF01204 Trehalase [H]
EC number: =3.2.1.28 [H]
Molecular weight: Translated: 63669; Mature: 63669
Theoretical pI: Translated: 5.69; Mature: 5.69
Prosite motif: PS00927 TREHALASE_1 ; PS00928 TREHALASE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 3.0 %Met (Translated Protein) 3.7 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 3.0 %Met (Mature Protein) 3.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKSPAPSRPQKMALIPACIFLCFAALSVQAEETPVTPQPPDILLGPLFNDVQNAKLFPDQ CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECHHHHHHHCCCCCCCH KTFADAMPNSDPLMILADYRMQQNQSGFDLRHFVNVNFTLPKEGEKYVPPEGQSLREHID HHHHHCCCCCCCEEEEECCHHHCCCCCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHH GLWPVLTRSTENTEKWDSLLPLPEPYVVPGGRFREVYYWDSYFTMLGLAESGHWDKVADM HHHHHHHCCCCCHHHHHHCCCCCCCEECCCCCEEEEEEECHHHHHHHCCCCCCHHHHHHH VANFAHEIDTYGHIPNGNRSYYLSRSQPPFFALMVELLAQHEGDAALKQYLPQMQKEYAY HHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH WMDGVENLQAGQQEKRVVKLQDGTLLNRYWDDRDTPRPESWVEDIATAKSNPNRPATEIY HHHHHHHHHCCCHHCCEEEECCCEEHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHH RDLRSAAASGWDFSSRWMDNPQQLNTLRTTSIVPVDLNSLMFKMEKILARASKAAGDNAM HHHHHHHCCCCCCHHHHCCCHHHHHHHHHCEEEEECHHHHHHHHHHHHHHHHHHCCCCHH ANQYETLANARQKGIEKYLWNDQQGWYADYDLKSHKVRNQLTAAALFPLYVNAAAKDRAN HHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH KMATATKTHLLQPGGLNTTSVKSGQQWDAPNGWAPLQWVATEGLQNYGQKEVAMDISWHF HHHHHHHHHEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEEEHHH LTNVQHTYDREKKLVEKYDVSTTGTGGGGGEYPLQDGFGWTNGVTLKMLDLICPKEQPCD HHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCC NVPATRPTVKSATTQPSTKEAQPTP CCCCCCCCCCCCCCCCCCCCCCCCC >Mature Secondary Structure MKSPAPSRPQKMALIPACIFLCFAALSVQAEETPVTPQPPDILLGPLFNDVQNAKLFPDQ CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECHHHHHHHCCCCCCCH KTFADAMPNSDPLMILADYRMQQNQSGFDLRHFVNVNFTLPKEGEKYVPPEGQSLREHID HHHHHCCCCCCCEEEEECCHHHCCCCCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHH GLWPVLTRSTENTEKWDSLLPLPEPYVVPGGRFREVYYWDSYFTMLGLAESGHWDKVADM HHHHHHHCCCCCHHHHHHCCCCCCCEECCCCCEEEEEEECHHHHHHHCCCCCCHHHHHHH VANFAHEIDTYGHIPNGNRSYYLSRSQPPFFALMVELLAQHEGDAALKQYLPQMQKEYAY HHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH WMDGVENLQAGQQEKRVVKLQDGTLLNRYWDDRDTPRPESWVEDIATAKSNPNRPATEIY HHHHHHHHHCCCHHCCEEEECCCEEHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHH RDLRSAAASGWDFSSRWMDNPQQLNTLRTTSIVPVDLNSLMFKMEKILARASKAAGDNAM HHHHHHHCCCCCCHHHHCCCHHHHHHHHHCEEEEECHHHHHHHHHHHHHHHHHHCCCCHH ANQYETLANARQKGIEKYLWNDQQGWYADYDLKSHKVRNQLTAAALFPLYVNAAAKDRAN HHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH KMATATKTHLLQPGGLNTTSVKSGQQWDAPNGWAPLQWVATEGLQNYGQKEVAMDISWHF HHHHHHHHHEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEEEHHH LTNVQHTYDREKKLVEKYDVSTTGTGGGGGEYPLQDGFGWTNGVTLKMLDLICPKEQPCD HHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCC NVPATRPTVKSATTQPSTKEAQPTP CCCCCCCCCCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA