Definition: Geobacter bemidjiensis Bem chromosome, complete genome.
Accession: NC_011146
Length: 4,615,150

The map label for this gene is endo I [H]

Identifier: 197118183

GI number: 197118183

Start: 2076906

End: 2079389

Strand: Direct

Name: endo I [H]

Synonym: Gbem_1798

Alternate gene names: 197118183

Gene position: 2076906-2079389 (Clockwise)

Preceding gene: 197118182

Following gene: 197118184

Centisome position: 45.0
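
The gene length and centisome position follow from the coordinates above. The sketch below assumes 1-based, inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values copied from this record.
START, END, GENOME_LENGTH = 2076906, 2079389, 4615150

print(gene_length(START, END))                    # 2484
print(round(centisome(START, GENOME_LENGTH), 1))  # 45.0
```

Under these assumptions the results agree with the 2484-base gene sequence and the centisome position of 45.0 reported in this record.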

GC content: 62.64
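
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; it assumes the value above was computed over the gene sequence alone, which the record does not confirm, and the function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy inputs; pass the full gene sequence from this record to check the reported value.
print(gc_content("ATGC"))  # 50.0
print(gc_content("GGCC"))  # 100.0
```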

Gene sequence:

>2484_bases
ATGAAGAAGTTGAAGGCAATAACGGCAACAGCGTTAGGAGTCCTGATGTACACGAGTGTCGCCTCCGCGGCAGTACTGTT
CTCAGAGGATTTCAACGCGCAACCAGACTGGAGCCCTTCACAGGGGACCTCCGGTGCTACCTGCATTCCCGGGCAAAACT
GCGCCACTCCCATCCCGACCGGTTTTTACGACTACCGCCTGGCCGGCACGGAAGCTTGCAGCAACCTCGACGGCAAGCAC
AACACCCTGAACATAAACGGCCTGCACCCGAGAGGGACGGGGAAGAGCTTCATGATGTGGAACGAGCCCTGCTATAGCAG
AAGCGGCAGCTGGGGCTCCGATGGCCTCCTCGGGATCGATTTCGCCCCTCAAAACGAGGTGTACGTTAGGTACTGGATCC
AGTTCCAGCCCGACTGGAGGTGGGATGGCGACGGTTCGGTCGGCGGGCGCGCCGTGGGTACGGCGACCACGAGCCCGATG
GAAAAGTTCATGCACATCTCCCACCTGAACACCGGGAACACGAACTTCTGGGACTTCTTCAGCGGCACCCAGAACAAACC
CCGCTTCACCCCGCAGCTCGCCAAGTTCGGCGGCGGCAGCTATCGGCTCCAGTTCAACCTCCCTCATTCCCCTCTGACTG
CGGCACGTGATAGTTCTGCCTCGTTCACTACTAATGTATTCCTTGGCGCTGCCCCCCTCGACATGAACGTACCGGGCCCC
GGTGGTTTGCCGGCAGCTCCCCGGGACGGCAAGTGGCACTGCTTCGAGTTCTACGTCAAGCTGAACAGCGCAGGGGGGGT
GGCCGACGGCGTCTCGAAGGTCTGGTACGATGGCGCCCTGGTCGGCTCGACCACTAACGCCGTATGGATTCCTTTAGGCG
ATGATCCGGCCCAGTGGAAGTGGAACCACGCCTGGCTCGGCGGCAACAACGCCAACCTCTACCTGCCCGCCAACGAGCAG
TGGTACGCCATAGACGACGTTGTGGTGAGCACCACCTACAGCGGCCCGCCGGCGCAGCCGGGGAGCGTCACCGCCACCGC
CAGCGCGGCGAACACCGTGAGCCTGCGCTGGAGCCCCGGCAGCAACGGGGTCCCGTTCTCCCTGAACGGCTACCGGATTT
ACTACGGCAAGGACGCAACCAACCTGAACATGAAAGTGGACGTGGGGAACGTGCAGCAGTACAGCATCTCCTCCCTCGAC
CCCTCGACCAAGTACTACTTCGCGGTCTCCGCCTACAACAAGGGGAGCTACGACAGTAATGACAACGAAGGGATGCCCTC
GGTAACGGCAAGCGCGACCACCGTTTCTAGCAGTACCACTACCACGACCACTGCAGATGCCGTCGCGCCGGTAGCTTCCA
TCTCCTCGCCGGCTACCGGCTCCACCGTGAGCGGCAACGTCACTATCAACGTGGCAGCCAGCGACAACGTGGCGGTCAGC
AAGGTAGAGCTGTACCTGAACGGCTCCATCTTCGGCGTGGTGGGGTCGGCCCCCTACACCCTGAGCTGGAATACGGCCAA
TAACCCCAACGCCACCTACACCCTGACCGCGAAGGCCTACGACGCAGCCGGCAACGTGGGGCAGGCTTCGAGCTCGGTGA
CCGTGAAAAACGCAGTCGCAGTAACCGACGCTTCGGCCCCTGTGATCAGCTCCTTCACCATGCCCGCGTCCGCCACATCG
CTTACCGTTCCGGTTACCGCGTTCGCCGCCACCGACGACGTCGGCGTCACCGGCTACCAGATCACCGAGAGCGCGACCGC
TCCCGCGGCGGGCGCCACCACCTGGAAAGCAACCGCGCCGACCTCCTGCACCTTCTCGGCAGCAGGTATCCGCACCGCTT
ACGCCTGGTGCAAGGACGCCTCCGGCAAGGTATCGGCAGCAAAAACCGCCCAGGTCAACATCACCACCGCTACCACCACG
ACCGGGGACACGACCGCCCCGGTGGTCAGCATCGCCTCGCCGGTGACCGGTTCCACGGTCAAGGGGGCAGTGACTGTATC
CGCCAACGCCACCGACAACGTCGGTGTGAAGAAGGCCGAGTTCTACGTGAACGGGGTGCTCAAGCTGACCTCCACTACCG
CGCCCTACAGCGTCACCTGGGGGACCACCAACTACCCTAACGGCTCCAACAGCGTCACCGTAAAGGCGTACGACGCAGCC
GGCAACGTCGGCCAGGCAACTTCGACCGTCACCATCATGAACGGCGACACTACTGCCCCGAAGGTTTCCATAACCTCCCC
GACCTCCGGGTCCACCGCCAAGGGGGTAGTGACAGTATCCGCTAACGTCACCGACAACGTCGGCGTAAAAAAGGTCGAGT
TCTACGTGAACGGCGTGCTCAAGCTGACCTCCACCGCGGCTCCCTACACCGTCACCTGGGGCACCACGAATTACCCGAAC
GGCTCCAACAGCGTGACCATCAAGGGGTATGACGCGGCCGGCAACGTCGGCCAGGCCACAACGACTGTAACCGTGGCGAA
TTGA

Upstream 100 bases:

>100_bases
TCTCCCTGAGACAGCCGGGTCGCCGATTCTACGCGATGGAACCCCCATCGTAGACCGGCGACCCGGCTTTTTTTTGTATC
GGGGATCAAAGGAGAGTTAA

Downstream 100 bases:

>100_bases
GAAGACAGCCCCGGCCATAAAGGGCTCCGTGGCAAGCAAGTCCGTGACCAGGCTGTATTGAGCTAGCTGATACCGCCTTG
TATGTGGAAGCAGCAGCAAA

Product: fibronectin type III domain-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 827; Mature: 827
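
The residue count can be cross-checked against the gene length: a 2484-base coding sequence holds 828 codons, and the final codon (TGA) is a stop, leaving 827 residues. The sketch below assumes the gene sequence is a single uninterrupted reading frame ending in one stop codon; the function name is illustrative only.

```python
def expected_residues(cds_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


print(expected_residues(2484))  # 827
```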

Protein sequence:

>827_residues
MKKLKAITATALGVLMYTSVASAAVLFSEDFNAQPDWSPSQGTSGATCIPGQNCATPIPTGFYDYRLAGTEACSNLDGKH
NTLNINGLHPRGTGKSFMMWNEPCYSRSGSWGSDGLLGIDFAPQNEVYVRYWIQFQPDWRWDGDGSVGGRAVGTATTSPM
EKFMHISHLNTGNTNFWDFFSGTQNKPRFTPQLAKFGGGSYRLQFNLPHSPLTAARDSSASFTTNVFLGAAPLDMNVPGP
GGLPAAPRDGKWHCFEFYVKLNSAGGVADGVSKVWYDGALVGSTTNAVWIPLGDDPAQWKWNHAWLGGNNANLYLPANEQ
WYAIDDVVVSTTYSGPPAQPGSVTATASAANTVSLRWSPGSNGVPFSLNGYRIYYGKDATNLNMKVDVGNVQQYSISSLD
PSTKYYFAVSAYNKGSYDSNDNEGMPSVTASATTVSSSTTTTTTADAVAPVASISSPATGSTVSGNVTINVAASDNVAVS
KVELYLNGSIFGVVGSAPYTLSWNTANNPNATYTLTAKAYDAAGNVGQASSSVTVKNAVAVTDASAPVISSFTMPASATS
LTVPVTAFAATDDVGVTGYQITESATAPAAGATTWKATAPTSCTFSAAGIRTAYAWCKDASGKVSAAKTAQVNITTATTT
TGDTTAPVVSIASPVTGSTVKGAVTVSANATDNVGVKKAEFYVNGVLKLTSTTAPYSVTWGTTNYPNGSNSVTVKAYDAA
GNVGQATSTVTIMNGDTTAPKVSITSPTSGSTAKGVVTVSANVTDNVGVKKVEFYVNGVLKLTSTAAPYTVTWGTTNYPN
GSNSVTIKGYDAAGNVGQATTTVTVAN

Sequences:

>Translated_827_residues
MKKLKAITATALGVLMYTSVASAAVLFSEDFNAQPDWSPSQGTSGATCIPGQNCATPIPTGFYDYRLAGTEACSNLDGKH
NTLNINGLHPRGTGKSFMMWNEPCYSRSGSWGSDGLLGIDFAPQNEVYVRYWIQFQPDWRWDGDGSVGGRAVGTATTSPM
EKFMHISHLNTGNTNFWDFFSGTQNKPRFTPQLAKFGGGSYRLQFNLPHSPLTAARDSSASFTTNVFLGAAPLDMNVPGP
GGLPAAPRDGKWHCFEFYVKLNSAGGVADGVSKVWYDGALVGSTTNAVWIPLGDDPAQWKWNHAWLGGNNANLYLPANEQ
WYAIDDVVVSTTYSGPPAQPGSVTATASAANTVSLRWSPGSNGVPFSLNGYRIYYGKDATNLNMKVDVGNVQQYSISSLD
PSTKYYFAVSAYNKGSYDSNDNEGMPSVTASATTVSSSTTTTTTADAVAPVASISSPATGSTVSGNVTINVAASDNVAVS
KVELYLNGSIFGVVGSAPYTLSWNTANNPNATYTLTAKAYDAAGNVGQASSSVTVKNAVAVTDASAPVISSFTMPASATS
LTVPVTAFAATDDVGVTGYQITESATAPAAGATTWKATAPTSCTFSAAGIRTAYAWCKDASGKVSAAKTAQVNITTATTT
TGDTTAPVVSIASPVTGSTVKGAVTVSANATDNVGVKKAEFYVNGVLKLTSTTAPYSVTWGTTNYPNGSNSVTVKAYDAA
GNVGQATSTVTIMNGDTTAPKVSITSPTSGSTAKGVVTVSANVTDNVGVKKVEFYVNGVLKLTSTAAPYTVTWGTTNYPN
GSNSVTIKGYDAAGNVGQATTTVTVAN
>Mature_827_residues
MKKLKAITATALGVLMYTSVASAAVLFSEDFNAQPDWSPSQGTSGATCIPGQNCATPIPTGFYDYRLAGTEACSNLDGKH
NTLNINGLHPRGTGKSFMMWNEPCYSRSGSWGSDGLLGIDFAPQNEVYVRYWIQFQPDWRWDGDGSVGGRAVGTATTSPM
EKFMHISHLNTGNTNFWDFFSGTQNKPRFTPQLAKFGGGSYRLQFNLPHSPLTAARDSSASFTTNVFLGAAPLDMNVPGP
GGLPAAPRDGKWHCFEFYVKLNSAGGVADGVSKVWYDGALVGSTTNAVWIPLGDDPAQWKWNHAWLGGNNANLYLPANEQ
WYAIDDVVVSTTYSGPPAQPGSVTATASAANTVSLRWSPGSNGVPFSLNGYRIYYGKDATNLNMKVDVGNVQQYSISSLD
PSTKYYFAVSAYNKGSYDSNDNEGMPSVTASATTVSSSTTTTTTADAVAPVASISSPATGSTVSGNVTINVAASDNVAVS
KVELYLNGSIFGVVGSAPYTLSWNTANNPNATYTLTAKAYDAAGNVGQASSSVTVKNAVAVTDASAPVISSFTMPASATS
LTVPVTAFAATDDVGVTGYQITESATAPAAGATTWKATAPTSCTFSAAGIRTAYAWCKDASGKVSAAKTAQVNITTATTT
TGDTTAPVVSIASPVTGSTVKGAVTVSANATDNVGVKKAEFYVNGVLKLTSTTAPYSVTWGTTNYPNGSNSVTVKAYDAA
GNVGQATSTVTIMNGDTTAPKVSITSPTSGSTAKGVVTVSANVTDNVGVKKVEFYVNGVLKLTSTAAPYTVTWGTTNYPN
GSNSVTIKGYDAAGNVGQATTTVTVAN

Specific function: Hydrolyzes chitin oligosaccharides: (GlcNAc)4 to (GlcNAc)2, and (GlcNAc)5,6 to (GlcNAc)2 and (GlcNAc)3. Inactive towards chitin, glucosamine oligosaccharides, and glycoproteins and glycopeptides containing (GlcNAc)2 [H]

COG id: COG3979

COG function: function code R; Uncharacterized protein containing a chitin-binding domain type 3

Gene ontology:

Cell location: Periplasm (Probable) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 18 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003610
- InterPro:   IPR009470
- InterPro:   IPR011583
- InterPro:   IPR001223
- InterPro:   IPR017853
- InterPro:   IPR013781 [H]

Pfam domain/function: PF02839 CBM_5_12; PF06483 ChiC; PF00704 Glyco_hydro_18 [H]

EC number: 3.2.1.14 [H]

Molecular weight: Translated: 85690; Mature: 85690

Theoretical pI: Translated: 6.86; Mature: 6.86

Prosite motif: PS50853 FN3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)
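
The percentages above are simple residue frequencies. A minimal sketch of the calculation follows; it assumes each value is the count of the named residues divided by the total protein length and rounded to one decimal place, and the function name is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy 10-residue peptide with one Cys and one Met.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Passing the full 827-residue sequence from this record in place of the toy peptide gives the values to compare with the table.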

Secondary structure:

>Translated Secondary Structure
MKKLKAITATALGVLMYTSVASAAVLFSEDFNAQPDWSPSQGTSGATCIPGQNCATPIPT
CCCHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCCCCCCCCCCCEECCCCCCCCCCCC
GFYDYRLAGTEACSNLDGKHNTLNINGLHPRGTGKSFMMWNEPCYSRSGSWGSDGLLGID
CCEEEEEECHHHHHCCCCCCCEEEEECCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEEE
FAPQNEVYVRYWIQFQPDWRWDGDGSVGGRAVGTATTSPMEKFMHISHLNTGNTNFWDFF
ECCCCCEEEEEEEEECCCCCCCCCCCCCCEEEECCCCCHHHHHHEEEECCCCCCCEEEEC
SGTQNKPRFTPQLAKFGGGSYRLQFNLPHSPLTAARDSSASFTTNVFLGAAPLDMNVPGP
CCCCCCCCCCHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCC
GGLPAAPRDGKWHCFEFYVKLNSAGGVADGVSKVWYDGALVGSTTNAVWIPLGDDPAQWK
CCCCCCCCCCCEEEEEEEEEECCCCCCHHCCHHHEECCEEECCCCCEEEEECCCCCCCEE
WNHAWLGGNNANLYLPANEQWYAIDDVVVSTTYSGPPAQPGSVTATASAANTVSLRWSPG
EEEEEECCCCCEEEECCCCCEEEEEEEEEEEECCCCCCCCCCEEEEECCCEEEEEEECCC
SNGVPFSLNGYRIYYGKDATNLNMKVDVGNVQQYSISSLDPSTKYYFAVSAYNKGSYDSN
CCCCCEEECCEEEEECCCCCCEEEEEEECCCEEEEECCCCCCCEEEEEEEECCCCCCCCC
DNEGMPSVTASATTVSSSTTTTTTADAVAPVASISSPATGSTVSGNVTINVAASDNVAVS
CCCCCCCEEEEEEEECCCCCEEECHHHHHHHHHCCCCCCCCEEECCEEEEEECCCCEEEE
KVELYLNGSIFGVVGSAPYTLSWNTANNPNATYTLTAKAYDAAGNVGQASSSVTVKNAVA
EEEEEECCEEEEEECCCCEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEE
VTDASAPVISSFTMPASATSLTVPVTAFAATDDVGVTGYQITESATAPAAGATTWKATAP
EECCCCCEEEEEECCCCCCEEEEEEEEEEECCCCCCCEEEEECCCCCCCCCCCEEECCCC
TSCTFSAAGIRTAYAWCKDASGKVSAAKTAQVNITTATTTTGDTTAPVVSIASPVTGSTV
CCEEEECCCCEEEEHHHCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEECCCCCCCEE
KGAVTVSANATDNVGVKKAEFYVNGVLKLTSTTAPYSVTWGTTNYPNGSNSVTVKAYDAA
EEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCCEEEECCCCCCCCCCCEEEEEEEECC
GNVGQATSTVTIMNGDTTAPKVSITSPTSGSTAKGVVTVSANVTDNVGVKKVEFYVNGVL
CCCCCCEEEEEEEECCCCCCEEEEECCCCCCCCCEEEEEECCCCCCCCCEEEEEEEEEEE
KLTSTAAPYTVTWGTTNYPNGSNSVTIKGYDAAGNVGQATTTVTVAN
EEECCCCCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEEC
>Mature Secondary Structure
MKKLKAITATALGVLMYTSVASAAVLFSEDFNAQPDWSPSQGTSGATCIPGQNCATPIPT
CCCHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCCCCCCCCCCCEECCCCCCCCCCCC
GFYDYRLAGTEACSNLDGKHNTLNINGLHPRGTGKSFMMWNEPCYSRSGSWGSDGLLGID
CCEEEEEECHHHHHCCCCCCCEEEEECCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEEE
FAPQNEVYVRYWIQFQPDWRWDGDGSVGGRAVGTATTSPMEKFMHISHLNTGNTNFWDFF
ECCCCCEEEEEEEEECCCCCCCCCCCCCCEEEECCCCCHHHHHHEEEECCCCCCCEEEEC
SGTQNKPRFTPQLAKFGGGSYRLQFNLPHSPLTAARDSSASFTTNVFLGAAPLDMNVPGP
CCCCCCCCCCHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCC
GGLPAAPRDGKWHCFEFYVKLNSAGGVADGVSKVWYDGALVGSTTNAVWIPLGDDPAQWK
CCCCCCCCCCCEEEEEEEEEECCCCCCHHCCHHHEECCEEECCCCCEEEEECCCCCCCEE
WNHAWLGGNNANLYLPANEQWYAIDDVVVSTTYSGPPAQPGSVTATASAANTVSLRWSPG
EEEEEECCCCCEEEECCCCCEEEEEEEEEEEECCCCCCCCCCEEEEECCCEEEEEEECCC
SNGVPFSLNGYRIYYGKDATNLNMKVDVGNVQQYSISSLDPSTKYYFAVSAYNKGSYDSN
CCCCCEEECCEEEEECCCCCCEEEEEEECCCEEEEECCCCCCCEEEEEEEECCCCCCCCC
DNEGMPSVTASATTVSSSTTTTTTADAVAPVASISSPATGSTVSGNVTINVAASDNVAVS
CCCCCCCEEEEEEEECCCCCEEECHHHHHHHHHCCCCCCCCEEECCEEEEEECCCCEEEE
KVELYLNGSIFGVVGSAPYTLSWNTANNPNATYTLTAKAYDAAGNVGQASSSVTVKNAVA
EEEEEECCEEEEEECCCCEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEE
VTDASAPVISSFTMPASATSLTVPVTAFAATDDVGVTGYQITESATAPAAGATTWKATAP
EECCCCCEEEEEECCCCCCEEEEEEEEEEECCCCCCCEEEEECCCCCCCCCCCEEECCCC
TSCTFSAAGIRTAYAWCKDASGKVSAAKTAQVNITTATTTTGDTTAPVVSIASPVTGSTV
CCEEEECCCCEEEEHHHCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEECCCCCCCEE
KGAVTVSANATDNVGVKKAEFYVNGVLKLTSTTAPYSVTWGTTNYPNGSNSVTVKAYDAA
EEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCCEEEECCCCCCCCCCCEEEEEEEECC
GNVGQATSTVTIMNGDTTAPKVSITSPTSGSTAKGVVTVSANVTDNVGVKKVEFYVNGVL
CCCCCCEEEEEEEECCCCCCEEEEECCCCCCCCCEEEEEECCCCCCCCCEEEEEEEEEEE
KLTSTAAPYTVTWGTTNYPNGSNSVTIKGYDAAGNVGQATTTVTVAN
EEECCCCCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCEEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969204 [H]