| Definition | Xanthomonas campestris pv. campestris str. 8004 chromosome, complete genome. |
|---|---|
| Accession | NC_007086 |
| Length | 5,148,708 |
The map label for this gene is treA [H]
Identifier: 77761100
GI number: 77761100
Start: 750746
End: 752419
Strand: Reverse
Name: treA [H]
Synonym: XC_0631
Alternate gene names: 77761100
Gene position: 752419-750746 (Counterclockwise)
Preceding gene: 66766970
Following gene: 66766968
Centisome position: 14.61
GC content: 67.62
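The coordinate fields above are mutually consistent, and a short calculation shows how. This is a minimal sketch, not the database's own code: it assumes the coordinates are 1-based and inclusive, and that the centisome position is the gene's 5' end (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The function names are hypothetical.

```python
GENOME_LENGTH = 5_148_708        # "Length" field of the record
START, END = 750_746, 752_419    # "Start" / "End" fields; strand is Reverse


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
# Assumption: on the reverse strand the gene begins at the End coordinate.
five_prime = END
print(length, centisome(five_prime, GENOME_LENGTH))  # prints: 1674 14.61
```

Under these assumptions the 1674 bases correspond to 558 codons, that is, 557 amino acids plus the stop codon, which agrees with the "Number of amino acids" field.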
Gene sequence:
>1674_bases ATGAGCGCTGCTGCGCCCCCGTGCTGCACCTCGTTGCTTGGTCTGTCGCTGAGCATGTTCGTTGCGCCCTGCGCCCTGGC GGCCACGCCGCTGGAAGGTGCGGTGGTCAGCGCGCCCGCGCCCACGCCGCCCACGCCCGACCTGGCGTATCCGGAGCTGT TCCAGGCCGTGCAGCGCGGGGAGCTGTTCGACGACCAGAAGCATTTCGTCGACTTTCTGCCGCTGCGCGACCCAGCCCTG ATCAACGCCGACTATCTGGCGCAGCACGAGCATGCCGGCTTTGACCTGCGCAAGTTCGTGGATGCCAACTTCGAGGAATC GCCGCCGGTACAGACCGATGCGATCCGCCAGGACACCGCGCTGCGCGAGCACATCGACGCGCTGTGGCCCAAGCTGGTAC GCAGCCAGACCAACGTGCCCGCACACAGCAGCCTGCTGGCGTTGCCGCACCCGTACGTGGTGCCGGGCGGACGCTTCCGC GAGGTGTATTACTGGGATTCGTACTTCACCATGCTGGGGCTGGTGAAAAGTGGCGAAACCACGCTGAGCCGGCAGATGCT GGACAACTTCGCCTACCTGATCGACACCTACGGGCACATCCCCAACGGCAACCGCACCTACTACCTGAGCCGCTCGCAGC CGCCGTTATTCTCCTACATGGTGGAACTGCAGGCCGGCGTGGAAGGCGAGGCGGTGTACCAGCGCTACCTGCCGCAGCTG CAGAAGGAATACGCGTACTGGATGCAGGGCGGCGACGATCTGCAACCCGGCCAGGCCGCACGCCATGTGGTGCGCCTGGC CGATGGCAGCGTGCTCAATCGCTATTGGGACGAGCGCGATACCCCGCGCCCGGAAGCCTGGCTGCACGACACCCGCACCG CTGCCGAGGCGCATGACCGCCCGGCCGCCGATGTCTACCGCGACCTGCGCGCCGGCGCCGAAAGTGGCTGGGACTACACC AGCCGCTGGCTGGCCGACGGCAAGACGCTGAGCACCATCCGCACCACCGCGATTGTGCCGATCGATCTCAACAGCCTGCT GTATCACCTGGAACGCACCCTGGCGCAGGCCTGCGCGCACACCGGCACCGCCTGCAGCCAGGACTACGCTGCGCTCGCAC AGCAGCGCAAGCAGGCCATCGACGCGCACCTGTGGAATGCAGCCGGCTACTACGCCGACTACGACTGGCAGACCCGCACG CTGAGCAACCAGGTCACCGCGGCGGCGCTGTACCCGCTGTTCGCCGGCCTGGCTTCGGATGACCACGCCAAGCGCACTGC CACCAGCGTGCGCGCCCGCCTGCTGCGTCCCGGCGGCCTGGCCACCACCGCGTTGAAGACCGGCCAGCAGTGGGACGAAC CCAACGGCTGGGCGCCATTGCAATGGGTGGCCGTGGACGGCCTGCGTCGCTACGGCGAAGACGGCCTGGCCCGCACCATC GGCGAGCGCTTCCTCACCCAGGTGCAGGCGCTATTCGCGCGCGAGCACAAGCTGGTCGAAAAATACGGCCTGGACGCCGA TGCAGCCGGCGGCGGCGGTGGCGAATATGCATTGCAGGACGGCTTTGGCTGGACCAATGGCGTCACGTTGATGCTGTTGA ACCTGTACCCCTCACAGGGCGCCACCCAGGCTCCGGCCAAGACCAAGCGCAAGCCCGAGCCCGCCGCTCCCTGA
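The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T bases; `gc_content` is a hypothetical helper, and whitespace and FASTA-style chunk separators are ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G+C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First nine codons of the gene sequence, used here only as a short demo.
fragment = "ATGAGCGCTGCTGCGCCCCCGTGCTGC"
print(f"{gc_content(fragment):.2f}")
```

Passing the full 1674-base sequence (without the `>1674_bases` header) should give a value comparable to the one reported in the record.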
Upstream 100 bases:
>100_bases CCGCGTGCGGCGTGCCGGTCCCCCGACACAGCGCCTGCGGCTCCACCCGCCACTTCGGTCGTTTACCATGCCGCACGCCC CCGCCCGTTCTGGAGATGCC
Downstream 100 bases:
>100_bases GCGTGGCGGTGGTTGGCGCATGCCATGCCAGCAGCGCCAGCAGCATTGCGGCCTGATCGACGACGGCTGTTGGTAGCGGG CCGATGCGCTTGACCGAGAC
Product: trehalase
Products: NA
Alternate protein names: Alpha,alpha-trehalase; Alpha,alpha-trehalose glucohydrolase [H]
Number of amino acids: Translated: 557; Mature: 556
Protein sequence:
>557_residues MSAAAPPCCTSLLGLSLSMFVAPCALAATPLEGAVVSAPAPTPPTPDLAYPELFQAVQRGELFDDQKHFVDFLPLRDPAL INADYLAQHEHAGFDLRKFVDANFEESPPVQTDAIRQDTALREHIDALWPKLVRSQTNVPAHSSLLALPHPYVVPGGRFR EVYYWDSYFTMLGLVKSGETTLSRQMLDNFAYLIDTYGHIPNGNRTYYLSRSQPPLFSYMVELQAGVEGEAVYQRYLPQL QKEYAYWMQGGDDLQPGQAARHVVRLADGSVLNRYWDERDTPRPEAWLHDTRTAAEAHDRPAADVYRDLRAGAESGWDYT SRWLADGKTLSTIRTTAIVPIDLNSLLYHLERTLAQACAHTGTACSQDYAALAQQRKQAIDAHLWNAAGYYADYDWQTRT LSNQVTAAALYPLFAGLASDDHAKRTATSVRARLLRPGGLATTALKTGQQWDEPNGWAPLQWVAVDGLRRYGEDGLARTI GERFLTQVQALFAREHKLVEKYGLDADAAGGGGGEYALQDGFGWTNGVTLMLLNLYPSQGATQAPAKTKRKPEPAAP
Sequences:
>Translated_557_residues MSAAAPPCCTSLLGLSLSMFVAPCALAATPLEGAVVSAPAPTPPTPDLAYPELFQAVQRGELFDDQKHFVDFLPLRDPAL INADYLAQHEHAGFDLRKFVDANFEESPPVQTDAIRQDTALREHIDALWPKLVRSQTNVPAHSSLLALPHPYVVPGGRFR EVYYWDSYFTMLGLVKSGETTLSRQMLDNFAYLIDTYGHIPNGNRTYYLSRSQPPLFSYMVELQAGVEGEAVYQRYLPQL QKEYAYWMQGGDDLQPGQAARHVVRLADGSVLNRYWDERDTPRPEAWLHDTRTAAEAHDRPAADVYRDLRAGAESGWDYT SRWLADGKTLSTIRTTAIVPIDLNSLLYHLERTLAQACAHTGTACSQDYAALAQQRKQAIDAHLWNAAGYYADYDWQTRT LSNQVTAAALYPLFAGLASDDHAKRTATSVRARLLRPGGLATTALKTGQQWDEPNGWAPLQWVAVDGLRRYGEDGLARTI GERFLTQVQALFAREHKLVEKYGLDADAAGGGGGEYALQDGFGWTNGVTLMLLNLYPSQGATQAPAKTKRKPEPAAP >Mature_556_residues SAAAPPCCTSLLGLSLSMFVAPCALAATPLEGAVVSAPAPTPPTPDLAYPELFQAVQRGELFDDQKHFVDFLPLRDPALI NADYLAQHEHAGFDLRKFVDANFEESPPVQTDAIRQDTALREHIDALWPKLVRSQTNVPAHSSLLALPHPYVVPGGRFRE VYYWDSYFTMLGLVKSGETTLSRQMLDNFAYLIDTYGHIPNGNRTYYLSRSQPPLFSYMVELQAGVEGEAVYQRYLPQLQ KEYAYWMQGGDDLQPGQAARHVVRLADGSVLNRYWDERDTPRPEAWLHDTRTAAEAHDRPAADVYRDLRAGAESGWDYTS RWLADGKTLSTIRTTAIVPIDLNSLLYHLERTLAQACAHTGTACSQDYAALAQQRKQAIDAHLWNAAGYYADYDWQTRTL SNQVTAAALYPLFAGLASDDHAKRTATSVRARLLRPGGLATTALKTGQQWDEPNGWAPLQWVAVDGLRRYGEDGLARTIG ERFLTQVQALFAREHKLVEKYGLDADAAGGGGGEYALQDGFGWTNGVTLMLLNLYPSQGATQAPAKTKRKPEPAAP
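The relationship between the gene sequence and the translated and mature proteins can be sketched as follows. This is an illustration under stated assumptions: the record does not name a genetic code, so the standard codon assignments are assumed (the bacterial table uses the same assignments and differs only in permitted start codons), and the mature form is assumed to be the translated form without its initial methionine, as the 557 and 556 residue counts suggest. `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the gene sequence listed in the record.
translated = translate("ATGAGCGCTGCTGCGCCCCCGTGCTGC")
mature = translated[1:]  # assumption: mature form lacks the initial Met
print(translated, mature)  # prints: MSAAAPPCC SAAAPPCC
```

The gene sequence in the record is already given in the coding orientation, so no reverse complement is needed even though the gene lies on the reverse strand.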
Specific function: Provides the cells with the ability to utilize trehalose at high osmolarity by splitting it into glucose molecules that can subsequently be taken up by the phosphotransferase-mediated uptake system [H]
COG id: COG1626
COG function: function code G; Neutral trehalase
Gene ontology:
Cell location: Periplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 37 family [H]
Homologues:
Organism=Homo sapiens, GI116284412, Length=547, Percent_Identity=35.1005484460695, Blast_Score=257, Evalue=2e-68
Organism=Escherichia coli, GI1787447, Length=537, Percent_Identity=53.2588454376164, Blast_Score=568, Evalue=1e-163
Organism=Escherichia coli, GI1789936, Length=518, Percent_Identity=50.7722007722008, Blast_Score=499, Evalue=1e-142
Organism=Caenorhabditis elegans, GI17542196, Length=536, Percent_Identity=31.5298507462687, Blast_Score=246, Evalue=3e-65
Organism=Caenorhabditis elegans, GI25148109, Length=566, Percent_Identity=31.2720848056537, Blast_Score=243, Evalue=1e-64
Organism=Caenorhabditis elegans, GI25141398, Length=541, Percent_Identity=31.9778188539741, Blast_Score=237, Evalue=1e-62
Organism=Caenorhabditis elegans, GI17565078, Length=521, Percent_Identity=30.1343570057582, Blast_Score=231, Evalue=8e-61
Organism=Caenorhabditis elegans, GI71987755, Length=424, Percent_Identity=29.0094339622642, Blast_Score=176, Evalue=3e-44
Organism=Saccharomyces cerevisiae, GI6319473, Length=440, Percent_Identity=30.9090909090909, Blast_Score=150, Evalue=8e-37
Organism=Saccharomyces cerevisiae, GI6320204, Length=468, Percent_Identity=27.3504273504274, Blast_Score=146, Evalue=9e-36
Organism=Drosophila melanogaster, GI24656680, Length=529, Percent_Identity=32.5141776937618, Blast_Score=234, Evalue=8e-62
Organism=Drosophila melanogaster, GI24656675, Length=529, Percent_Identity=32.5141776937618, Blast_Score=234, Evalue=8e-62
Organism=Drosophila melanogaster, GI24656661, Length=518, Percent_Identity=32.8185328185328, Blast_Score=234, Evalue=9e-62
Organism=Drosophila melanogaster, GI17933716, Length=518, Percent_Identity=32.8185328185328, Blast_Score=234, Evalue=9e-62
Organism=Drosophila melanogaster, GI24656670, Length=518, Percent_Identity=32.8185328185328, Blast_Score=234, Evalue=9e-62
Organism=Drosophila melanogaster, GI24656685, Length=467, Percent_Identity=33.6188436830835, Blast_Score=228, Evalue=6e-60
Organism=Drosophila melanogaster, GI22024178, Length=531, Percent_Identity=28.6252354048964, Blast_Score=203, Evalue=2e-52
Organism=Drosophila melanogaster, GI45551104, Length=373, Percent_Identity=29.4906166219839, Blast_Score=147, Evalue=2e-35
Organism=Drosophila melanogaster, GI28573474, Length=298, Percent_Identity=28.8590604026846, Blast_Score=119, Evalue=4e-27
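The homologue entries follow a regular `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be read into structured records. The parser below is a minimal sketch; the `Hit` type, the `parse_hits` function and the regular expression are hypothetical and assume only the field order seen in this record. It tolerates entries separated by commas, spaces or line breaks.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in text, in order of appearance."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI116284412, Length=547, "
    "Percent_Identity=35.1005484460695, Blast_Score=257, Evalue=2e-68, "
    "Organism=Escherichia coli, GI1787447, Length=537, "
    "Percent_Identity=53.2588454376164, Blast_Score=568, Evalue=1e-163,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi, best.blast_score)  # prints: Escherichia coli 1787447 568
```

Sorting the parsed hits by `blast_score` or `evalue` gives a quick ranking of the closest homologues.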
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008928
- InterPro: IPR001661
- InterPro: IPR018232 [H]
Pfam domain/function: PF01204 Trehalase [H]
EC number: 3.2.1.28 [H]
Molecular weight (Da): Translated: 61602; Mature: 61470
Theoretical pI: Translated: 5.86; Mature: 5.86
Prosite motif: PS00927 TREHALASE_1 ; PS00928 TREHALASE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9% Cys (Translated Protein)
1.3% Met (Translated Protein)
2.2% Cys+Met (Translated Protein)
0.9% Cys (Mature Protein)
1.1% Met (Mature Protein)
2.0% Cys+Met (Mature Protein)
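The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows; `residue_percent` is a hypothetical helper, and the demo uses only the first 20 residues of the translated protein, so its values differ from the whole-protein figures in the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


fragment = "MSAAAPPCCTSLLGLSLSMF"  # first 20 residues of the translated protein
print(residue_percent(fragment, "C"),
      residue_percent(fragment, "M"),
      residue_percent(fragment, "CM"))  # prints: 10.0 10.0 20.0
```

Removing the initial methionine lowers the Met count by one, which is why the mature protein shows a lower Met percentage than the translated protein.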
Secondary structure:
>Translated Secondary Structure
MSAAAPPCCTSLLGLSLSMFVAPCALAATPLEGAVVSAPAPTPPTPDLAYPELFQAVQRG
CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHC
ELFDDQKHFVDFLPLRDPALINADYLAQHEHAGFDLRKFVDANFEESPPVQTDAIRQDTA
CCCCCHHHHHHHCCCCCCCEECHHHHHHHHCCCCHHHHHHCCCCCCCCCCCHHHHHHHHH
LREHIDALWPKLVRSQTNVPAHSSLLALPHPYVVPGGRFREVYYWDSYFTMLGLVKSGET
HHHHHHHHHHHHHHHCCCCCCCCCEEECCCCEECCCCCEEEEEEEHHHHHHHHHHHCCCH
TLSRQMLDNFAYLIDTYGHIPNGNRTYYLSRSQPPLFSYMVELQAGVEGEAVYQRYLPQL
HHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHH
QKEYAYWMQGGDDLQPGQAARHVVRLADGSVLNRYWDERDTPRPEAWLHDTRTAAEAHDR
HHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCC
PAADVYRDLRAGAESGWDYTSRWLADGKTLSTIRTTAIVPIDLNSLLYHLERTLAQACAH
CHHHHHHHHHHCCCCCCCHHHHHHHCCCCHHEEHEEEEEEECHHHHHHHHHHHHHHHHHH
TGTACSQDYAALAQQRKQAIDAHLWNAAGYYADYDWQTRTLSNQVTAAALYPLFAGLASD
CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCC
DHAKRTATSVRARLLRPGGLATTALKTGQQWDEPNGWAPLQWVAVDGLRRYGEDGLARTI
HHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHH
GERFLTQVQALFAREHKLVEKYGLDADAAGGGGGEYALQDGFGWTNGVTLMLLNLYPSQG
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCEEEEEEECCCCC
ATQAPAKTKRKPEPAAP
CCCCCHHCCCCCCCCCC
>Mature Secondary Structure
SAAAPPCCTSLLGLSLSMFVAPCALAATPLEGAVVSAPAPTPPTPDLAYPELFQAVQRG
CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHC
ELFDDQKHFVDFLPLRDPALINADYLAQHEHAGFDLRKFVDANFEESPPVQTDAIRQDTA
CCCCCHHHHHHHCCCCCCCEECHHHHHHHHCCCCHHHHHHCCCCCCCCCCCHHHHHHHHH
LREHIDALWPKLVRSQTNVPAHSSLLALPHPYVVPGGRFREVYYWDSYFTMLGLVKSGET
HHHHHHHHHHHHHHHCCCCCCCCCEEECCCCEECCCCCEEEEEEEHHHHHHHHHHHCCCH
TLSRQMLDNFAYLIDTYGHIPNGNRTYYLSRSQPPLFSYMVELQAGVEGEAVYQRYLPQL
HHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHH
QKEYAYWMQGGDDLQPGQAARHVVRLADGSVLNRYWDERDTPRPEAWLHDTRTAAEAHDR
HHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCC
PAADVYRDLRAGAESGWDYTSRWLADGKTLSTIRTTAIVPIDLNSLLYHLERTLAQACAH
CHHHHHHHHHHCCCCCCCHHHHHHHCCCCHHEEHEEEEEEECHHHHHHHHHHHHHHHHHH
TGTACSQDYAALAQQRKQAIDAHLWNAAGYYADYDWQTRTLSNQVTAAALYPLFAGLASD
CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCC
DHAKRTATSVRARLLRPGGLATTALKTGQQWDEPNGWAPLQWVAVDGLRRYGEDGLARTI
HHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHH
GERFLTQVQALFAREHKLVEKYGLDADAAGGGGGEYALQDGFGWTNGVTLMLLNLYPSQG
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCEEEEEEECCCCC
ATQAPAKTKRKPEPAAP
CCCCCHHCCCCCCCCCC
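The secondary structure prediction pairs each residue with a one-letter state. The sketch below summarises such a state string as percentages. It assumes the conventional three-state alphabet (H for helix, E for strand, C for coil), which the record itself does not define, and `ss_composition` is a hypothetical helper. The demo input is the first structure line of the translated prediction.

```python
from collections import Counter
from typing import Dict


def ss_composition(states: str) -> Dict[str, float]:
    """Percentage of each secondary-structure state in a state string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in counts.items()}


first_line = "CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHC"
for state, pct in sorted(ss_composition(first_line).items()):
    print(f"{state}: {pct:.1f}%")
```

Concatenating all structure lines for one form of the protein before calling the function gives the composition of the whole prediction.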
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12024217 [H]