Definition Shewanella amazonensis SB2B chromosome, complete genome.
Accession NC_008700
Length 4,306,142

The map label for this gene is 119773441

Identifier: 119773441

GI number: 119773441

Start: 362780

End: 364558

Strand: Direct

Name: 119773441

Synonym: Sama_0300

Alternate gene names: NA

Gene position: 362780-364558 (Clockwise)

Preceding gene: 119773440

Following gene: 119773442

Centisome position: 8.42
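
The Start, End, Length, and Centisome position fields can be cross-checked against each other. The sketch below is a minimal illustration, assuming that coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(362780, 364558))            # 1779
print(round(centisome(362780, 4306142), 2))   # 8.42
```

Under these assumptions the results agree with the 1779-base gene sequence and the centisome value listed in this record.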

GC content: 57.0

Gene sequence:

>1779_bases
ATGAAGAAATCCGCTCTCTCTCTCGCTTTGCTGCTTGCTGGCCTTTGCTTGCCCGCTGTGACCTTTGCTGATCCCCAGCC
GGGAACAGAAAAGGTGGCCGGAGGCATTCTCGACAGTCAGCCTCCCTTTGCCCTGACCATAGCCCAGGCCAAGGCCTGGT
CTCCAAAAGGCGCCACGGCAGACAGCCGCAATGTGTCTCAGGTGCCCTTGGCATCGCGCTTCAGCGCACCCCTTGCAGGG
CAGAAGGTGTCTGAGGCCAAGGTGCTCTACGCCCCCGATGGCATGAATAACTTTGCCAATTACCTGGATACTCAGGGCCA
GTTCAACCTGTACAACTTTACCCACTGGTCGCAAATCGATGTGCTCAATTGGTTTGCTGGCACTGCCGACCTTGGGGTGC
AAATTCCCGCCCGCCCCTGGGTGGAAACCGCCCATAAAAACGGGGTGAAGGTGATAGGCTCTGTCTTTCTCGGGGTGGCC
CAATGGGGCGGCAGCGCCGACAAGGTGGAGGCCTTGCTTGAGCAGGACGCCACAGGAAACTTTATTGTTGCCGACAAACT
TATCCAGATGGCCCGGTTCTATGGCTTTGATGGCTGGTTGATTAATCAGGAAACCGACCTGACCGCCGTGAAGGATGCCG
ACAACAACCTGCTGGCGGATAAAAGCGACCCGGTACGTGGCAGGGAATTGGCTGCGCGTATGCTCGCCTTCGTGCAATAC
CTTACCGCCAACGCCCCTCAGGGCATGGAAATCCATTGGTACGATGCCATGATTGCCAGCGGCGAGGTGCGTTGGCAAAA
CCAGCTTAATGAACATAACCGGGTTTACCTGCAAGACAGTGCGCAGGACGGCTCGCAGGGCGGTTTACAAAATAAGGTGC
GTTCATCCGATGCCATCTTTCTGAATTACTGGTGGAACAAAGACATGGTCCGTGCGTCGGTGCAAGAGGCCAAAGCCTTG
GGCCGTTCTCCCTATGATGTGTATGTGGGAGCCGACCTCTGGCCGAGCCGCAACGCTCAGCGCGCCTTCAGCCGCCATCA
GTGGCTCGATTGGTTATTCGATGACGGTGAGGCGCTGACCTCCATTGCCCTGTTTGCCCCCAATGTGAACTTCAATTTTG
ACGGCGAGCCCCATACGCCGCCGTTCAGCAACTTCCGAAATGACCCAGGCGATGCGGCGCGCTTTTATGCCACAGAAGTG
CGGCTGTTCGCCGGGGACGATATGAATCTCGCCACTGCGGATGAAGCCGGTTGGAAGGGCGTGGGTGCTTATCTGCCTGC
CAAGTCCACCCTGAACAGCCTGCCGTTTCGCACCAGCTTTAATACCGGACAGGGTAAGCAGTGGGTGGAAAAGGGTGAGG
TTAAGGGCGGCGCCTGGACCGACATGGGCCGTCAGGATTTCTTGCCCACATGGCAATTTGCCGGTGAAGGGGCGCTGAAG
CTCAGTTTTGATTTTGATACCGTCTACCAGGGCGGCAGCAGTCTGGCGGTAACAGCCAAAGGAGCTGCCACAGCGCCGCT
TTATGCACTTGATGTGATGCTGAGCGAATCAAGCCAGCTAACCCTGATAAGCCAGGGGCAGGCCAAGGGCCTGAGTCTCT
ATGTTGAAACCGCCGACGGTGAAAAGCTCTCTTTAGTCCTTGGCGACCATGCTGACTGGACGAGTCAGTCACTCGGGCTT
TCGGGTCTTAAAGGGAAAAAGCTGGTGGTTATCGCACTGCAAGCCGATGGCAGCGCGAGCATCAATGCCCATCTTGGCCA
TTTGGAGCTGTTGCCATGA
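
The GC content and protein sequence fields are derived from the gene sequence above. As a hedged illustration, the sketch below computes a GC percentage and translates a coding sequence using the standard codon assignments. The helper names are hypothetical, and the record does not say which tool or translation table produced its values.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = dna.upper()
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five and last three codons of the gene sequence in this record.
print(translate("ATGAAGAAATCCGCT"))  # MKKSA
print(translate("TTGCCATGA"))        # LP
```

The two fragments match the start (MKKSA) and end (LP) of the 592-residue protein listed below; the final TGA codon is a stop and is not translated.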

Upstream 100 bases:

>100_bases
AATACGGCGCCTACTGCATCCAGAGTGGTTTCAATTACGCGGTGCGCCAGCAATCGGGTGCAACCCTGATTGACGGCAAG
AGTCTGCTGGGGAGTAATGC

Downstream 100 bases:

>100_bases
TAGCGCTGGAAGTGTGCATCGATGCCGATGACCTGCTCGCGCTGCCTGCGGATGTGGCAGCTGCCAAAGCGGGCGGCGCC
GTGCGCATCGAGCTCTGCGG

Product: glycoside hydrolase family protein

Products: NA

Alternate protein names: Mannosyl-Glycoprotein Endo-Beta-N-Acetylglucosaminidase; Glycoside Hydrolase Family Protein; Glycosyl Hydrolase Family; Glycoside Hydrolase Family; Endo-Beta-N-Acetylglucosaminidase D; Endo-Beta-N-Acetylglucosaminidase Family Protein; Glycosyl Hydrolase Family LPXTG Cell Wall Surface Protein

Number of amino acids: Translated: 592; Mature: 592

Protein sequence:

>592_residues
MKKSALSLALLLAGLCLPAVTFADPQPGTEKVAGGILDSQPPFALTIAQAKAWSPKGATADSRNVSQVPLASRFSAPLAG
QKVSEAKVLYAPDGMNNFANYLDTQGQFNLYNFTHWSQIDVLNWFAGTADLGVQIPARPWVETAHKNGVKVIGSVFLGVA
QWGGSADKVEALLEQDATGNFIVADKLIQMARFYGFDGWLINQETDLTAVKDADNNLLADKSDPVRGRELAARMLAFVQY
LTANAPQGMEIHWYDAMIASGEVRWQNQLNEHNRVYLQDSAQDGSQGGLQNKVRSSDAIFLNYWWNKDMVRASVQEAKAL
GRSPYDVYVGADLWPSRNAQRAFSRHQWLDWLFDDGEALTSIALFAPNVNFNFDGEPHTPPFSNFRNDPGDAARFYATEV
RLFAGDDMNLATADEAGWKGVGAYLPAKSTLNSLPFRTSFNTGQGKQWVEKGEVKGGAWTDMGRQDFLPTWQFAGEGALK
LSFDFDTVYQGGSSLAVTAKGAATAPLYALDVMLSESSQLTLISQGQAKGLSLYVETADGEKLSLVLGDHADWTSQSLGL
SGLKGKKLVVIALQADGSASINAHLGHLELLP

Sequences:

>Translated_592_residues
MKKSALSLALLLAGLCLPAVTFADPQPGTEKVAGGILDSQPPFALTIAQAKAWSPKGATADSRNVSQVPLASRFSAPLAG
QKVSEAKVLYAPDGMNNFANYLDTQGQFNLYNFTHWSQIDVLNWFAGTADLGVQIPARPWVETAHKNGVKVIGSVFLGVA
QWGGSADKVEALLEQDATGNFIVADKLIQMARFYGFDGWLINQETDLTAVKDADNNLLADKSDPVRGRELAARMLAFVQY
LTANAPQGMEIHWYDAMIASGEVRWQNQLNEHNRVYLQDSAQDGSQGGLQNKVRSSDAIFLNYWWNKDMVRASVQEAKAL
GRSPYDVYVGADLWPSRNAQRAFSRHQWLDWLFDDGEALTSIALFAPNVNFNFDGEPHTPPFSNFRNDPGDAARFYATEV
RLFAGDDMNLATADEAGWKGVGAYLPAKSTLNSLPFRTSFNTGQGKQWVEKGEVKGGAWTDMGRQDFLPTWQFAGEGALK
LSFDFDTVYQGGSSLAVTAKGAATAPLYALDVMLSESSQLTLISQGQAKGLSLYVETADGEKLSLVLGDHADWTSQSLGL
SGLKGKKLVVIALQADGSASINAHLGHLELLP
>Mature_592_residues
MKKSALSLALLLAGLCLPAVTFADPQPGTEKVAGGILDSQPPFALTIAQAKAWSPKGATADSRNVSQVPLASRFSAPLAG
QKVSEAKVLYAPDGMNNFANYLDTQGQFNLYNFTHWSQIDVLNWFAGTADLGVQIPARPWVETAHKNGVKVIGSVFLGVA
QWGGSADKVEALLEQDATGNFIVADKLIQMARFYGFDGWLINQETDLTAVKDADNNLLADKSDPVRGRELAARMLAFVQY
LTANAPQGMEIHWYDAMIASGEVRWQNQLNEHNRVYLQDSAQDGSQGGLQNKVRSSDAIFLNYWWNKDMVRASVQEAKAL
GRSPYDVYVGADLWPSRNAQRAFSRHQWLDWLFDDGEALTSIALFAPNVNFNFDGEPHTPPFSNFRNDPGDAARFYATEV
RLFAGDDMNLATADEAGWKGVGAYLPAKSTLNSLPFRTSFNTGQGKQWVEKGEVKGGAWTDMGRQDFLPTWQFAGEGALK
LSFDFDTVYQGGSSLAVTAKGAATAPLYALDVMLSESSQLTLISQGQAKGLSLYVETADGEKLSLVLGDHADWTSQSLGL
SGLKGKKLVVIALQADGSASINAHLGHLELLP

Specific function: Unknown

COG id: COG4724

COG function: function code G; Endo-beta-N-acetylglucosaminidase D

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI110431350, Length=471, Percent_Identity=27.6008492569002, Blast_Score=151, Evalue=2e-36
Organism=Caenorhabditis elegans, GI71983972, Length=263, Percent_Identity=31.9391634980989, Blast_Score=102, Evalue=4e-22
Organism=Caenorhabditis elegans, GI71983968, Length=263, Percent_Identity=31.9391634980989, Blast_Score=102, Evalue=7e-22
Organism=Caenorhabditis elegans, GI71983964, Length=263, Percent_Identity=31.9391634980989, Blast_Score=100, Evalue=2e-21
Organism=Drosophila melanogaster, GI24642794, Length=427, Percent_Identity=25.2927400468384, Blast_Score=113, Evalue=5e-25
Organism=Drosophila melanogaster, GI28571242, Length=427, Percent_Identity=25.2927400468384, Blast_Score=112, Evalue=6e-25
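
Each homologue entry is a comma-separated list of key=value fields plus a bare GI identifier. The following sketch shows one way such a line could be turned into a typed record. The function name `parse_homologue` and the choice of types are assumptions for illustration, not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict; tolerates a trailing comma."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # empty field left by a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI110431350
    # Convert numeric fields (types chosen here for illustration).
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI110431350, Length=471, "
    "Percent_Identity=27.6008492569002, Blast_Score=151, Evalue=2e-36"
)
print(hit["Organism"], hit["GI"], hit["Length"])  # Homo sapiens 110431350 471
```

Parsed records can then be sorted or filtered, for example by `Evalue`, to rank the hits.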

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 64258; Mature: 64258
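
A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below follows that approach; it is an assumption that the record's value was computed in a comparable way, and the mass table and function name are supplied here for illustration only.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER_MASS


# Sanity check on a single residue: free glycine is about 75.07 Da.
print(round(molecular_weight("G"), 2))  # 75.07
```

Applying `molecular_weight` to the full 592-residue sequence should give a value close to the one reported above, although small differences are possible if the database used a different mass table.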

Theoretical pI: Translated: 4.94; Mature: 4.94

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)
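
The Cys/Met figures are residue percentages over the protein length. A minimal sketch of that calculation follows, using a short made-up peptide rather than the full sequence; the function name is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    wanted = set(residues.upper())
    return 100.0 * sum(1 for aa in protein if aa in wanted) / len(protein)


toy = "MACMK"  # illustrative peptide, not taken from this record
print(residue_percent(toy, "C"))   # 20.0
print(residue_percent(toy, "M"))   # 40.0
print(residue_percent(toy, "CM"))  # 60.0
```

Calling `residue_percent` on the 592-residue sequence with "C", "M", and "CM" would reproduce the three rows of the table, subject to rounding to one decimal place.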

Secondary structure:

>Translated Secondary Structure
MKKSALSLALLLAGLCLPAVTFADPQPGTEKVAGGILDSQPPFALTIAQAKAWSPKGATA
CCHHHHHHHHHHHHHHHHHEECCCCCCCHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCC
DSRNVSQVPLASRFSAPLAGQKVSEAKVLYAPDGMNNFANYLDTQGQFNLYNFTHWSQID
CCCCCCCCCCHHHCCCCCCCCCCCCCEEEECCCCCHHHHHHHCCCCCEEEEEECCCCCCC
VLNWFAGTADLGVQIPARPWVETAHKNGVKVIGSVFLGVAQWGGSADKVEALLEQDATGN
HHHHHCCCCCCCEECCCCHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCC
FIVADKLIQMARFYGFDGWLINQETDLTAVKDADNNLLADKSDPVRGRELAARMLAFVQY
EEHHHHHHHHHHHCCCCCEEECCCCCCEEEECCCCCEECCCCCCCCHHHHHHHHHHHHHH
LTANAPQGMEIHWYDAMIASGEVRWQNQLNEHNRVYLQDSAQDGSQGGLQNKVRSSDAIF
HHCCCCCCCEEEEEEEEEECCCEEEHHHCCCCCEEEEECCCCCCCCCCCHHHHCCCCEEE
LNYWWNKDMVRASVQEAKALGRSPYDVYVGADLWPSRNAQRAFSRHQWLDWLFDDGEALT
EEEECCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCHHHE
SIALFAPNVNFNFDGEPHTPPFSNFRNDPGDAARFYATEVRLFAGDDMNLATADEAGWKG
EEEEEECCCCCCCCCCCCCCCCHHCCCCCCCHHEEEEEEEEEEECCCCCEEECCCCCCCC
VGAYLPAKSTLNSLPFRTSFNTGQGKQWVEKGEVKGGAWTDMGRQDFLPTWQFAGEGALK
CCCCCCCHHHHHCCCCEECCCCCCCHHHHHHCCCCCCCCCCCCCHHCCCCCEECCCCEEE
LSFDFDTVYQGGSSLAVTAKGAATAPLYALDVMLSESSQLTLISQGQAKGLSLYVETADG
EEECHHHHHCCCCEEEEEECCCCCCCEEEEEEEECCCCCEEEEECCCCCCEEEEEEECCC
EKLSLVLGDHADWTSQSLGLSGLKGKKLVVIALQADGSASINAHLGHLELLP
CEEEEEECCCCCCCCHHCCCCCCCCCEEEEEEEECCCCCEEEEECCEEEECC
>Mature Secondary Structure
MKKSALSLALLLAGLCLPAVTFADPQPGTEKVAGGILDSQPPFALTIAQAKAWSPKGATA
CCHHHHHHHHHHHHHHHHHEECCCCCCCHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCC
DSRNVSQVPLASRFSAPLAGQKVSEAKVLYAPDGMNNFANYLDTQGQFNLYNFTHWSQID
CCCCCCCCCCHHHCCCCCCCCCCCCCEEEECCCCCHHHHHHHCCCCCEEEEEECCCCCCC
VLNWFAGTADLGVQIPARPWVETAHKNGVKVIGSVFLGVAQWGGSADKVEALLEQDATGN
HHHHHCCCCCCCEECCCCHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCC
FIVADKLIQMARFYGFDGWLINQETDLTAVKDADNNLLADKSDPVRGRELAARMLAFVQY
EEHHHHHHHHHHHCCCCCEEECCCCCCEEEECCCCCEECCCCCCCCHHHHHHHHHHHHHH
LTANAPQGMEIHWYDAMIASGEVRWQNQLNEHNRVYLQDSAQDGSQGGLQNKVRSSDAIF
HHCCCCCCCEEEEEEEEEECCCEEEHHHCCCCCEEEEECCCCCCCCCCCHHHHCCCCEEE
LNYWWNKDMVRASVQEAKALGRSPYDVYVGADLWPSRNAQRAFSRHQWLDWLFDDGEALT
EEEECCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCHHHE
SIALFAPNVNFNFDGEPHTPPFSNFRNDPGDAARFYATEVRLFAGDDMNLATADEAGWKG
EEEEEECCCCCCCCCCCCCCCCHHCCCCCCCHHEEEEEEEEEEECCCCCEEECCCCCCCC
VGAYLPAKSTLNSLPFRTSFNTGQGKQWVEKGEVKGGAWTDMGRQDFLPTWQFAGEGALK
CCCCCCCHHHHHCCCCEECCCCCCCHHHHHHCCCCCCCCCCCCCHHCCCCCEECCCCEEE
LSFDFDTVYQGGSSLAVTAKGAATAPLYALDVMLSESSQLTLISQGQAKGLSLYVETADG
EEECHHHHHCCCCEEEEEECCCCCCCEEEEEEEECCCCCEEEEECCCCCCEEEEEEECCC
EKLSLVLGDHADWTSQSLGLSGLKGKKLVVIALQADGSASINAHLGHLELLP
CEEEEEECCCCCCCCHHCCCCCCCCCEEEEEEEECCCCCEEEEECCEEEECC
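
The prediction lines pair each residue with a one-letter state. Assuming the usual convention (H for helix, E for strand, C for coil, which the record itself does not define), the composition of a prediction string can be summarized as sketched below. The function name is hypothetical.

```python
from collections import Counter


def secondary_structure_composition(states: str) -> dict:
    """Percentage of each state letter in a prediction string."""
    counts = Counter(states.upper())
    total = sum(counts.values())
    return {state: 100.0 * n / total for state, n in counts.items()}


# Short illustrative string, not the full prediction from this record.
composition = secondary_structure_composition("CCHHHE")
print(round(composition["H"], 1))  # 50.0
print(round(composition["C"], 1))  # 33.3
```

To analyze the full prediction, concatenate the state lines (every second line of the block above) before calling the function. A mix of H and E states is consistent with the "Alpha Beta" structure class listed below.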

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA