Definition Bacteroides vulgatus ATCC 8482 chromosome, complete genome.
Accession NC_009614
Length 5,163,189

The map label for this gene is 150005560

Identifier: 150005560

GI number: 150005560

Start: 3852185

End: 3854050

Strand: Reverse

Name: 150005560

Synonym: BVU_3045

Alternate gene names: NA

Gene position: 3854050-3852185 (Counterclockwise)

Preceding gene: 150005564

Following gene: 150005559

Centisome position: 74.64
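
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome is the gene's first base (the higher coordinate, since the gene lies on the reverse strand) expressed as a percentage of the chromosome length. The function names are illustrative and not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


# Values taken from this record.
length = gene_length(3852185, 3854050)
print(length, length // 3)                     # bases, codons (including stop)
print(round(centisome(3854050, 5163189), 2))   # percentage of the chromosome
```

Under these assumptions the gene spans 1866 bases, that is 622 codons: 621 amino acids plus the stop codon, which agrees with the sequence headers in this record.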

GC content: 44.48
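
GC content is conventionally the percentage of G and C bases in the sequence. The sketch below assumes that definition and that the value reported here refers to the gene sequence listed in this record; the helper name gc_content is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop whitespace and line breaks so multi-line FASTA bodies can be passed in.
    seq = "".join(seq.split()).upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Small illustrative input: the first ten bases of the gene sequence.
print(gc_content("GTGAATAAAA"))
```

Passing the full 1866-base gene sequence to the same function should give a value comparable to the figure reported above, provided the database uses the same definition.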

Gene sequence:

>1866_bases
GTGAATAAAACAGTTTTTTTTGTACTTTTGCCACTCAGTTTAATCAACACACTGCAAATGAAATTTATTCGAGCATTATA
TCTTATCGCAGCCTTGTCTGCCAGCAGCACCGGAATGATGGCTCAAACACAGTCTTCCGGCTATTTCCTGCACACCATTA
CCAAAGGGCAAAGTCTTTATTCCATAGCCAGCATGTATAATGTCACTACCGGTGATATCGTTAAAATGAACCCCGGCAGC
GACCAGAAAATTAAAACCGGTGAAACATTGAAGATTCCACAAAAGAACATCGGAACGGAACAGCAGATGTTCCATACCAT
CCAGTCGGGTGAAACCTTGTACAAGCTGACACAACGCTATGGAGTAACCGCACAACGCATCTGCCAGGCAAATCCCGGAC
TGAGCGCGGAGAATTTCCGTATCGGACAGGTTATCGTAATTCCCGCCAAAGTGACGGACAGCGAGGAGATAATCATGAAC
GAGGTGAAAGCCGCACAGACTATCAGACCGGCCACCACTTCCACCCCACTGAAACCCAATTGCAGGGATATGCATAAAGT
GGAACGCAAGGAAACGATATTCAGCATCAGCCGCCTGTATGGCATTACAGAAGCCGAACTGATTGCCGCCAATCCTGAGT
TGCGTACCGAAAAACTGAAAAAAGGAAGGTTCCTTTGTATTCCATATCCTAAGGATACAAAGACTGAAACGCCTGTGGAT
AATACGCCTGCCGTAATCCCCACGGACGACCAGCTGTTTAATGAAAGCAAGAAGGAGGCACGCAAAATATCTACCATCAA
AGCTGCCGTAATGCTGCCATTCATGACAGACGGCAAGGGCAACCGTGACGAACAGACACGTATGGTGGAATACTATGAAG
GCTTCCTGATGGCAGTAGACAGCTTGAAGGAAAAAGGAGTATCCATCGACCTGTACTCATACGATACCCATAACAATACT
TCTTCCATCAAGAACATTCTGGATAGAAGTGAATTGAAGAGCATGGATATTATCTTCGGCCCGGCTTATCCCGACCAGGT
GAAACCGGTTGCGGAATTCGCCAAGAAAAACAATATCCGGCTGGTAGTACCTTTTACCTCCAAAGGGAATGAAGTTTTCA
GCAATCCCGCCATCTATCAGATAAATACTCCCCAATCTTATCTGTACTCGGAAGTTTACGAGCATTTTACCCGCAAGTTT
ACAACCGCCAATGTTATCTTCCTGGATGCAGAAGATGGTGACAAAGACAAAGTGGATTTTATAAAAGGACTGAAAGAGGA
GTTAAAAACCAAACGCATTCCTTTTACCGAATTAAAAGGGGAAAATATTACTCCGGAATCATTGAAAGGCGCGATGAACC
ATAGTATGGATAATGTATTCATCCCGACTTCCGGCACCAACGTGGCATTGATAAAACTGTTACCACAACTGATTGTGACC
TCCCGTGACAATCCCGACTACCGTATGCAGTTATTCGGCTATCCGGAATGGCAGACCTATACCAACGACCATCTGGCCAG
TTTCTATGAACTGGACACCTATTTCTATTCTTCGTTTTACACAAACAATTTATTTCCGGAAGCCGTTCAGTTCTCATCGG
CTTACCGCAAATGGTACAGCAAGGATATGCTCAACTCATTCCCTAAATATGGTATGCTGGGATTCGATACAGGATATTTC
TTCCTGAAGGGCCTGTCCCAATACGGCAACAAACTGGAAGACAAACTGGACAAGGTGGCTGTCACCCCTATCCAGACCGG
ATTTAAATTTGAACGCGTAAACAACTGGGGTGGATTTATCAATCGTAAAGTGTTCTTCGTTCATTTTACCAAGGACTTTG
AACTGATTAAACTTGATTTTGAATAA
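
The gene sequence is given in coding orientation (it begins with the alternative start codon GTG and ends with the stop codon TAA), so it can be translated directly. The sketch below assumes the standard bacterial genetic code, in which an alternative start codon at the first position is read as methionine; the names CODON_TABLE, START_CODONS and translate are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: common bacterial start codons, all read as Met when initiating.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First 18 and last 12 bases of the gene sequence above.
print(translate("GTGAATAAAACAGTTTTT"))
print(translate("GATTTTGAATAA"))
```

The two fragments translate to the beginning (MNKTVF) and the end (DFE) of the protein sequence listed in this record.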

Upstream 100 bases:

>100_bases
CATAATCTTCTTACTCTGCGTAATCTTTGCAAAATTAATGCTTCCCGACGAAATATTTAATTAAAAAGCCTAATAATATT
TTCTATTCTTTCTATATAAT

Downstream 100 bases:

>100_bases
TGAAAAAGTATAAGAACTTCGGGCTGCTTGTACTGGCATTGCTCTTTGCATTGCCGGCAGCTGCACAGTTGGGTGAAGAA
CGTCATAATTTTGCCGTCGG

Product: LysM repeat-containing protein

Products: NA

Alternate protein names: LysM Domain-Containing Protein; N-Acetylmuramoyl-L-Alanine Amidase; LysM-Repeat Protein; LysM Domain-Containing Proteins; Family; LysM-Repeat Domain Protein; LysM Repeat-Containing Protein; LysM-Repeat Domain-Containing Protein; NLP/P60 Protein; ErfK/YbiS/YcfS/YnhG; LysM-Repeat-Containing Protein

Number of amino acids: Translated: 621; Mature: 621

Protein sequence:

>621_residues
MNKTVFFVLLPLSLINTLQMKFIRALYLIAALSASSTGMMAQTQSSGYFLHTITKGQSLYSIASMYNVTTGDIVKMNPGS
DQKIKTGETLKIPQKNIGTEQQMFHTIQSGETLYKLTQRYGVTAQRICQANPGLSAENFRIGQVIVIPAKVTDSEEIIMN
EVKAAQTIRPATTSTPLKPNCRDMHKVERKETIFSISRLYGITEAELIAANPELRTEKLKKGRFLCIPYPKDTKTETPVD
NTPAVIPTDDQLFNESKKEARKISTIKAAVMLPFMTDGKGNRDEQTRMVEYYEGFLMAVDSLKEKGVSIDLYSYDTHNNT
SSIKNILDRSELKSMDIIFGPAYPDQVKPVAEFAKKNNIRLVVPFTSKGNEVFSNPAIYQINTPQSYLYSEVYEHFTRKF
TTANVIFLDAEDGDKDKVDFIKGLKEELKTKRIPFTELKGENITPESLKGAMNHSMDNVFIPTSGTNVALIKLLPQLIVT
SRDNPDYRMQLFGYPEWQTYTNDHLASFYELDTYFYSSFYTNNLFPEAVQFSSAYRKWYSKDMLNSFPKYGMLGFDTGYF
FLKGLSQYGNKLEDKLDKVAVTPIQTGFKFERVNNWGGFINRKVFFVHFTKDFELIKLDFE

Sequences:

>Translated_621_residues
MNKTVFFVLLPLSLINTLQMKFIRALYLIAALSASSTGMMAQTQSSGYFLHTITKGQSLYSIASMYNVTTGDIVKMNPGS
DQKIKTGETLKIPQKNIGTEQQMFHTIQSGETLYKLTQRYGVTAQRICQANPGLSAENFRIGQVIVIPAKVTDSEEIIMN
EVKAAQTIRPATTSTPLKPNCRDMHKVERKETIFSISRLYGITEAELIAANPELRTEKLKKGRFLCIPYPKDTKTETPVD
NTPAVIPTDDQLFNESKKEARKISTIKAAVMLPFMTDGKGNRDEQTRMVEYYEGFLMAVDSLKEKGVSIDLYSYDTHNNT
SSIKNILDRSELKSMDIIFGPAYPDQVKPVAEFAKKNNIRLVVPFTSKGNEVFSNPAIYQINTPQSYLYSEVYEHFTRKF
TTANVIFLDAEDGDKDKVDFIKGLKEELKTKRIPFTELKGENITPESLKGAMNHSMDNVFIPTSGTNVALIKLLPQLIVT
SRDNPDYRMQLFGYPEWQTYTNDHLASFYELDTYFYSSFYTNNLFPEAVQFSSAYRKWYSKDMLNSFPKYGMLGFDTGYF
FLKGLSQYGNKLEDKLDKVAVTPIQTGFKFERVNNWGGFINRKVFFVHFTKDFELIKLDFE
>Mature_621_residues
MNKTVFFVLLPLSLINTLQMKFIRALYLIAALSASSTGMMAQTQSSGYFLHTITKGQSLYSIASMYNVTTGDIVKMNPGS
DQKIKTGETLKIPQKNIGTEQQMFHTIQSGETLYKLTQRYGVTAQRICQANPGLSAENFRIGQVIVIPAKVTDSEEIIMN
EVKAAQTIRPATTSTPLKPNCRDMHKVERKETIFSISRLYGITEAELIAANPELRTEKLKKGRFLCIPYPKDTKTETPVD
NTPAVIPTDDQLFNESKKEARKISTIKAAVMLPFMTDGKGNRDEQTRMVEYYEGFLMAVDSLKEKGVSIDLYSYDTHNNT
SSIKNILDRSELKSMDIIFGPAYPDQVKPVAEFAKKNNIRLVVPFTSKGNEVFSNPAIYQINTPQSYLYSEVYEHFTRKF
TTANVIFLDAEDGDKDKVDFIKGLKEELKTKRIPFTELKGENITPESLKGAMNHSMDNVFIPTSGTNVALIKLLPQLIVT
SRDNPDYRMQLFGYPEWQTYTNDHLASFYELDTYFYSSFYTNNLFPEAVQFSSAYRKWYSKDMLNSFPKYGMLGFDTGYF
FLKGLSQYGNKLEDKLDKVAVTPIQTGFKFERVNNWGGFINRKVFFVHFTKDFELIKLDFE

Specific function: Unknown

COG id: COG1388

COG function: function code M; FOG: LysM repeat

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 70729; Mature: 70729

Theoretical pI: Translated: 8.88; Mature: 8.88

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
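
The percentages above can be reproduced by counting residues in the protein sequence. This is a minimal sketch, assuming the values are residue counts divided by sequence length and rounded to one decimal place; residue_percent is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Tiny illustrative sequence, not taken from this record.
example = "MCAA"
print(residue_percent(example, "C"))    # Cys
print(residue_percent(example, "M"))    # Met
print(residue_percent(example, "CM"))   # Cys + Met
```

For instance, three cysteines in 621 residues would round to 0.5 %.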

Secondary structure:

>Translated Secondary Structure
MNKTVFFVLLPLSLINTLQMKFIRALYLIAALSASSTGMMAQTQSSGYFLHTITKGQSLY
CCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCEEEEEECCCCHHH
SIASMYNVTTGDIVKMNPGSDQKIKTGETLKIPQKNIGTEQQMFHTIQSGETLYKLTQRY
HHHHHHCCCCCCEEEECCCCCCCCCCCCEEECCCCCCCCHHHHHHHHHCCHHHHHHHHHH
GVTAQRICQANPGLSAENFRIGQVIVIPAKVTDSEEIIMNEVKAAQTIRPATTSTPLKPN
CCCHHHHHCCCCCCCCCCEEECEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCC
CRDMHKVERKETIFSISRLYGITEAELIAANPELRTEKLKKGRFLCIPYPKDTKTETPVD
HHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHCCCEEEEECCCCCCCCCCCC
NTPAVIPTDDQLFNESKKEARKISTIKAAVMLPFMTDGKGNRDEQTRMVEYYEGFLMAVD
CCCEEECCCHHHHHHHHHHHHHHHHHHHEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHH
SLKEKGVSIDLYSYDTHNNTSSIKNILDRSELKSMDIIFGPAYPDQVKPVAEFAKKNNIR
HHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHCCCCE
LVVPFTSKGNEVFSNPAIYQINTPQSYLYSEVYEHFTRKFTTANVIFLDAEDGDKDKVDF
EEEEECCCCCEECCCCEEEEECCCHHHHHHHHHHHHHHHCCEEEEEEEECCCCCCHHHHH
IKGLKEELKTKRIPFTELKGENITPESLKGAMNHSMDNVFIPTSGTNVALIKLLPQLIVT
HHHHHHHHHHCCCCCHHCCCCCCCHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHC
SRDNPDYRMQLFGYPEWQTYTNDHLASFYELDTYFYSSFYTNNLFPEAVQFSSAYRKWYS
CCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH
KDMLNSFPKYGMLGFDTGYFFLKGLSQYGNKLEDKLDKVAVTPIQTGFKFERVNNWGGFI
HHHHHCCCCCCCEECCHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCEEEEECCCCCCC
NRKVFFVHFTKDFELIKLDFE
CCEEEEEEEECCEEEEEEECC
>Mature Secondary Structure
MNKTVFFVLLPLSLINTLQMKFIRALYLIAALSASSTGMMAQTQSSGYFLHTITKGQSLY
CCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCEEEEEECCCCHHH
SIASMYNVTTGDIVKMNPGSDQKIKTGETLKIPQKNIGTEQQMFHTIQSGETLYKLTQRY
HHHHHHCCCCCCEEEECCCCCCCCCCCCEEECCCCCCCCHHHHHHHHHCCHHHHHHHHHH
GVTAQRICQANPGLSAENFRIGQVIVIPAKVTDSEEIIMNEVKAAQTIRPATTSTPLKPN
CCCHHHHHCCCCCCCCCCEEECEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCC
CRDMHKVERKETIFSISRLYGITEAELIAANPELRTEKLKKGRFLCIPYPKDTKTETPVD
HHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHCCCEEEEECCCCCCCCCCCC
NTPAVIPTDDQLFNESKKEARKISTIKAAVMLPFMTDGKGNRDEQTRMVEYYEGFLMAVD
CCCEEECCCHHHHHHHHHHHHHHHHHHHEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHH
SLKEKGVSIDLYSYDTHNNTSSIKNILDRSELKSMDIIFGPAYPDQVKPVAEFAKKNNIR
HHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHCCCCE
LVVPFTSKGNEVFSNPAIYQINTPQSYLYSEVYEHFTRKFTTANVIFLDAEDGDKDKVDF
EEEEECCCCCEECCCCEEEEECCCHHHHHHHHHHHHHHHCCEEEEEEEECCCCCCHHHHH
IKGLKEELKTKRIPFTELKGENITPESLKGAMNHSMDNVFIPTSGTNVALIKLLPQLIVT
HHHHHHHHHHCCCCCHHCCCCCCCHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHC
SRDNPDYRMQLFGYPEWQTYTNDHLASFYELDTYFYSSFYTNNLFPEAVQFSSAYRKWYS
CCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH
KDMLNSFPKYGMLGFDTGYFFLKGLSQYGNKLEDKLDKVAVTPIQTGFKFERVNNWGGFI
HHHHHCCCCCCCEECCHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCEEEEECCCCCCC
NRKVFFVHFTKDFELIKLDFE
CCEEEEEEEECCEEEEEEECC
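
The secondary structure blocks interleave one line of residues with one line of per-residue states. The sketch below shows one way to split such a block and summarise the states, assuming the usual convention that H is helix, E is strand and C is coil; the function names are illustrative.

```python
from collections import Counter


def parse_interleaved(lines):
    """Split alternating sequence/state lines into two aligned strings."""
    rows = [ln.strip() for ln in lines]
    rows = [ln for ln in rows if ln and not ln.startswith(">")]
    sequence = "".join(rows[0::2])   # residue lines
    states = "".join(rows[1::2])     # state lines
    if len(sequence) != len(states):
        raise ValueError("sequence and state strings differ in length")
    return sequence, states


def state_fractions(states: str) -> dict:
    """Fraction of residues assigned to each state."""
    counts = Counter(states)
    return {s: counts[s] / len(states) for s in "HEC"}


# Tiny illustrative block in the same layout as the record.
block = [">Example", "MNKT", "CCHH", "VF", "EE"]
seq, ss = parse_interleaved(block)
print(seq, ss, state_fractions(ss))
```

A mix of helix and strand states in the output is consistent with the "Alpha Beta" structure class reported for this protein.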

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA