Definition: Ruegeria sp. TM1040, complete genome.
Accession: NC_008044
Length: 3,200,938 bp

The map label for this gene is ygaU [C]

Identifier: 99080820

GI number: 99080820

Start: 1045222

End: 1047177

Strand: Direct

Name: ygaU [C]

Synonym: TM1040_0979

Alternate gene names: 99080820

Gene position: 1045222-1047177 (Clockwise)

Preceding gene: 99080818

Following gene: 99080821

Centisome position: 32.65
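
The coordinates above fix both the gene length and the centisome position. The following is a minimal sketch, assuming the centisome value is the start coordinate expressed as a percentage of the genome length (the record does not state its formula, and the function names are illustrative only):

```python
# Values copied from the record above.
GENOME_LENGTH = 3_200_938            # bp, from the Length field
START, END = 1_045_222, 1_047_177    # 1-based, inclusive coordinates

def gene_length(start, end):
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start, genome_length):
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                      # 1956
print(round(centisome(START, GENOME_LENGTH), 2))    # 32.65
```

Both results agree with the ">1956_bases" header of the gene sequence and with the reported centisome position.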

GC content: 65.7%

Gene sequence:

>1956_bases
ATGACGAAGACAAGCGGGATAGGCGCGGGGGCCAGTTTGGCCATTGGCACCGTCGCCACGGTGGTGGTCGTTGGAGGCGG
GGTCTTTCTCGCACGGGGTGGGATCTTGGGCGAAGGGGCCAGATCCATGGTCGAGCAGCAGCTGGTGGCCCTGGGCCTTG
CAGCGCCGCCGGCGCCCGAAGTGGTGCCCGTGAAGCCTGTGGTGACACAGCCGCAGACGGCCGATCCCGAGACGCGCGTG
GTCGAGCCTGAGCCGACGCCCGAAGCCACGGCTGGCGAGACGGTAGAAACGCAACAGGACGCACCAACCGCAGAGCCCGC
TTTTGTACTGCAGGCGCCCAAGCTGGAGATCGCCCGGTTTGAGCCAGACGGCTCCGGTATCGTAGCGGCGTCCGCTCAGG
CGGGGGTCGAGGTGCAGGTGCTTCTTGACGATGAGGTTCTCGATACGCAAACCGTGCCCGCTGCGGGGGAGTTCGTGTCC
TTTGTGACCATCGACCTCAGTGACAAGCCGCGGCTGCTGACGCTGCTGGCGCGCCACAACGGGCAGGAGCTGGCCTCGGA
AGACAGCTTTATCCTTGCGCCGATGCCCGCGCCCGCCGCGCCGGAACCGCAGGTCGATCAGCTTGCCGCGGCGCAGACCG
ATTCTGATATCGCTGCCCCCGAGGAGGAGCCAATTGAGCTCGCCGAGGCAACCGAAACCGCCGATCCGAATGTGGCAGAT
CAGGCGACTGATGCGCCAGATCCAGACGCGCCAGGCGACGGCGCAGCGGAGGGGAGCACAACGGTGACGGCGCAGTCCGA
AGAGGTCGCATTGGCTGATGTCGCAGTCGATAGCACCGATCCGGACGCGGAGGGCGATGCCTCCTCGACGGAGTCGGCTG
CGAGTGGTGCCGCTGACAATGGAGTGGCAACTGATATGGCCGCCGTCGAAAACACCGGTGATCAACTCCCCGATGCTGCC
TCTGAGGCCGTATCTGAAGCGTCACCCGAAGCGGTAGACCCCTCTGTAGATGTGGCCGAGGCCACCAGTGCTTTGCCGGA
GACCGAGGTCACAGCGGAAGACGCGCCCGCAGCAGAGGCACCGGAAGAGACAGTCGAGACCGCCGCCTTGGAACAGGCGA
TCGACGACGAGTCCGCTCGCGAATCTTCTGAAGACTCGGTCCCGGCTCCTGAGCCTGAGGTGGCAGCTGTTGCAGACACA
TCAGAGCCGCCCGCTCCGGACACGACACCTGCACCGCAGTCCCCGGTCGAGGTTGCCGAGGCCGTTGACACACCCGAGGT
GCCATCTTCCAAGACGGACACAATGGCTGCCGTAGAAGAGGTGCAGCAGCCGCAGCCGCAAGACCCCGACACAGAGAGCC
CCTCTGGCGAGGCGGCTCCCGCGCCGCAGGCGACTTCCTCTGTCGCGGTGCTGCGCGCTGGTCGCGATGGGGTGACGCTG
GTTCAACCTGCGGCCCCAGCCGCACCAGAGCTGGTGGGCAAGGTGGCGCTCGATACGATCAGCTACACCGAGACGGGCGA
TGTTCAGCTTGCGGGACGGGCCAGGCCCGAGGCCCTGGTGCGTGTCTACCTCGACAACAGCCCTGTGGCCGAGCTTGCCG
CCGCGTCCGATGGTCAATGGAGCGGCAGCCTCACCTCGGTGGCGCCGGGGATCTACACCCTGCGCCTTGATGAGATCGAC
CCTGTTGACGGTATCGTCCTGAGCCGCCTTGAGACCCCGTTCAAACGCGAGGCTCCAGAGGTCCTGCAGCCTGCGGTGAC
GGCGGATCAGGCGCCAGATCAGGCTGCGCCTGTGGTGCGCGCCGTGACGGTGCAGGAAGGCGATACGCTCTGGGCGATTT
CCCAGCAGCGCTATGGCAGCGGTTTTCTCTATGTGCGGGTGTTTGAGGCCAACAAGGGCGATATCCGCGATCCAGACCTG
ATCTACCCTGGTCAGATCTTCACTCTGCCCGAGTAA
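
The GC content reported above can be recomputed from the gene sequence. This is a small sketch, assuming GC content is the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, not part of any tool behind this record:

```python
def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so that a wrapped FASTA
    body can be pasted in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Toy inputs; paste the 1956-base sequence above to check the reported figure.
print(gc_content("ATGC"))      # 50.0
print(gc_content("GG\nCC"))    # 100.0
```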

Upstream 100 bases:

>100_bases
ATCGTGCTTTGACCCCTCATAGCGGTATTGCTAGGTTTGCTCAACCGAGAGCGCGGCAACAGTGCTCCGGTACAAAGGGA
CCTAAAGGAAAAGTCGCAAG

Downstream 100 bases:

>100_bases
AGGCATAGATAGATTGATGAAATGGCGCGCCCCCGTCGTGGGGCGCGTTTTCTCTTGTCCGACCATGGCGCAGACCCAAG
TCGGACCAAACTGACAGAGG

Product: peptidoglycan-binding LysM

Products: NA

Alternate protein names: LysM Domain-Containing Protein

Number of amino acids: Translated: 651; Mature: 650
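
The two residue counts follow from the gene length. As a sketch, assuming the 1956-base gene ends in one stop codon and that the mature form differs only by removal of the initiator methionine (which is what the Translated and Mature sequences in this record suggest):

```python
GENE_LENGTH_BP = 1956  # from the ">1956_bases" header

codons, remainder = divmod(GENE_LENGTH_BP, 3)  # 652 codons, no leftover bases
translated_residues = codons - 1               # the stop codon encodes no residue
mature_residues = translated_residues - 1      # assumed loss of the initiator Met

print(remainder, translated_residues, mature_residues)  # 0 651 650
```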

Protein sequence:

>651_residues
MTKTSGIGAGASLAIGTVATVVVVGGGVFLARGGILGEGARSMVEQQLVALGLAAPPAPEVVPVKPVVTQPQTADPETRV
VEPEPTPEATAGETVETQQDAPTAEPAFVLQAPKLEIARFEPDGSGIVAASAQAGVEVQVLLDDEVLDTQTVPAAGEFVS
FVTIDLSDKPRLLTLLARHNGQELASEDSFILAPMPAPAAPEPQVDQLAAAQTDSDIAAPEEEPIELAEATETADPNVAD
QATDAPDPDAPGDGAAEGSTTVTAQSEEVALADVAVDSTDPDAEGDASSTESAASGAADNGVATDMAAVENTGDQLPDAA
SEAVSEASPEAVDPSVDVAEATSALPETEVTAEDAPAAEAPEETVETAALEQAIDDESARESSEDSVPAPEPEVAAVADT
SEPPAPDTTPAPQSPVEVAEAVDTPEVPSSKTDTMAAVEEVQQPQPQDPDTESPSGEAAPAPQATSSVAVLRAGRDGVTL
VQPAAPAAPELVGKVALDTISYTETGDVQLAGRARPEALVRVYLDNSPVAELAAASDGQWSGSLTSVAPGIYTLRLDEID
PVDGIVLSRLETPFKREAPEVLQPAVTADQAPDQAAPVVRAVTVQEGDTLWAISQQRYGSGFLYVRVFEANKGDIRDPDL
IYPGQIFTLPE
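
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below builds the standard genetic code in plain Python and stops at the first stop codon. Using the standard code is an assumption, since the record does not name a translation table, and `translate` is an illustrative helper rather than the tool that produced this record:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last five codons of the gene sequence in this record.
print(translate("ATGACGAAGACAAGCGGG"))  # MTKTSG
print(translate("ACTCTGCCCGAGTAA"))     # TLPE
```

The two fragments reproduce the first and last residues of the protein sequence listed above.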

Sequences:

>Translated_651_residues
MTKTSGIGAGASLAIGTVATVVVVGGGVFLARGGILGEGARSMVEQQLVALGLAAPPAPEVVPVKPVVTQPQTADPETRV
VEPEPTPEATAGETVETQQDAPTAEPAFVLQAPKLEIARFEPDGSGIVAASAQAGVEVQVLLDDEVLDTQTVPAAGEFVS
FVTIDLSDKPRLLTLLARHNGQELASEDSFILAPMPAPAAPEPQVDQLAAAQTDSDIAAPEEEPIELAEATETADPNVAD
QATDAPDPDAPGDGAAEGSTTVTAQSEEVALADVAVDSTDPDAEGDASSTESAASGAADNGVATDMAAVENTGDQLPDAA
SEAVSEASPEAVDPSVDVAEATSALPETEVTAEDAPAAEAPEETVETAALEQAIDDESARESSEDSVPAPEPEVAAVADT
SEPPAPDTTPAPQSPVEVAEAVDTPEVPSSKTDTMAAVEEVQQPQPQDPDTESPSGEAAPAPQATSSVAVLRAGRDGVTL
VQPAAPAAPELVGKVALDTISYTETGDVQLAGRARPEALVRVYLDNSPVAELAAASDGQWSGSLTSVAPGIYTLRLDEID
PVDGIVLSRLETPFKREAPEVLQPAVTADQAPDQAAPVVRAVTVQEGDTLWAISQQRYGSGFLYVRVFEANKGDIRDPDL
IYPGQIFTLPE
>Mature_650_residues
TKTSGIGAGASLAIGTVATVVVVGGGVFLARGGILGEGARSMVEQQLVALGLAAPPAPEVVPVKPVVTQPQTADPETRVV
EPEPTPEATAGETVETQQDAPTAEPAFVLQAPKLEIARFEPDGSGIVAASAQAGVEVQVLLDDEVLDTQTVPAAGEFVSF
VTIDLSDKPRLLTLLARHNGQELASEDSFILAPMPAPAAPEPQVDQLAAAQTDSDIAAPEEEPIELAEATETADPNVADQ
ATDAPDPDAPGDGAAEGSTTVTAQSEEVALADVAVDSTDPDAEGDASSTESAASGAADNGVATDMAAVENTGDQLPDAAS
EAVSEASPEAVDPSVDVAEATSALPETEVTAEDAPAAEAPEETVETAALEQAIDDESARESSEDSVPAPEPEVAAVADTS
EPPAPDTTPAPQSPVEVAEAVDTPEVPSSKTDTMAAVEEVQQPQPQDPDTESPSGEAAPAPQATSSVAVLRAGRDGVTLV
QPAAPAAPELVGKVALDTISYTETGDVQLAGRARPEALVRVYLDNSPVAELAAASDGQWSGSLTSVAPGIYTLRLDEIDP
VDGIVLSRLETPFKREAPEVLQPAVTADQAPDQAAPVVRAVTVQEGDTLWAISQQRYGSGFLYVRVFEANKGDIRDPDLI
YPGQIFTLPE

Specific function: Unknown

COG id: COG1652

COG function: function code S; Uncharacterized protein containing LysM domain

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: 4340 molecules/cell in early stationary phase, rich media (based on E. coli). [C]

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight (Da): Translated: 66473; Mature: 66342

Theoretical pI: Translated: 3.50; Mature: 3.50
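
The translated and mature molecular weights reported above differ by 131 Da, which matches the mass of one methionine residue. The sketch below estimates a molecular weight from average residue masses plus one water molecule. The mass table holds commonly quoted average values and the function name is illustrative; the method actually used for this record is not stated:

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini

def molecular_weight(protein):
    """Approximate average molecular weight of an unmodified peptide."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER

# Removing an N-terminal Met lowers the weight by one Met residue mass.
full = molecular_weight("MTKTSG")
mature = molecular_weight("TKTSG")
print(round(full - mature, 1))  # 131.2
```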

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
0.8 %Met     (Translated Protein)
0.8 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
0.6 %Met     (Mature Protein)
0.6 %Cys+Met (Mature Protein)
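
The percentages above are residue counts divided by sequence length. A minimal sketch with an illustrative helper name follows; the figures in the record appear consistent with five methionines and no cysteines among the 651 translated residues, and four methionines among the 650 mature residues:

```python
def cys_met_percent(protein):
    """Return (%Cys, %Met, %Cys+Met) for a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return cys, met, cys + met

print(cys_met_percent("MACM"))      # (25.0, 50.0, 75.0)
print(round(100.0 * 5 / 651, 1))    # 0.8, translated protein
print(round(100.0 * 4 / 650, 1))    # 0.6, mature protein
```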

Secondary structure:

>Translated Secondary Structure
MTKTSGIGAGASLAIGTVATVVVVGGGVFLARGGILGEGARSMVEQQLVALGLAAPPAPE
CCCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCC
VVPVKPVVTQPQTADPETRVVEPEPTPEATAGETVETQQDAPTAEPAFVLQAPKLEIARF
EEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCEEEEEE
EPDGSGIVAASAQAGVEVQVLLDDEVLDTQTVPAAGEFVSFVTIDLSDKPRLLTLLARHN
CCCCCCEEEECCCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEECCCCCCEEEHHHHCC
GQELASEDSFILAPMPAPAAPEPQVDQLAAAQTDSDIAAPEEEPIELAEATETADPNVAD
CHHHHCCCCEEEECCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCC
QATDAPDPDAPGDGAAEGSTTVTAQSEEVALADVAVDSTDPDAEGDASSTESAASGAADN
CCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCCCCHHHHCCCCCC
GVATDMAAVENTGDQLPDAASEAVSEASPEAVDPSVDVAEATSALPETEVTAEDAPAAEA
CCHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCC
PEETVETAALEQAIDDESARESSEDSVPAPEPEVAAVADTSEPPAPDTTPAPQSPVEVAE
CHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHH
AVDTPEVPSSKTDTMAAVEEVQQPQPQDPDTESPSGEAAPAPQATSSVAVLRAGRDGVTL
HCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEE
VQPAAPAAPELVGKVALDTISYTETGDVQLAGRARPEALVRVYLDNSPVAELAAASDGQW
ECCCCCCCHHHHHHHHHHHCCCCCCCCEEEECCCCCCEEEEEEECCCCHHHHHHCCCCCC
SGSLTSVAPGIYTLRLDEIDPVDGIVLSRLETPFKREAPEVLQPAVTADQAPDQAAPVVR
CCCHHHHCCCEEEEEECCCCCCHHHHHHHHCCCHHHCCHHHHHCHHHCCCCCCCCCCEEE
AVTVQEGDTLWAISQQRYGSGFLYVRVFEANKGDIRDPDLIYPGQIFTLPE
EEEEECCCEEEEEEHHCCCCCEEEEEEEECCCCCCCCCCEECCCEEEECCC
>Mature Secondary Structure
TKTSGIGAGASLAIGTVATVVVVGGGVFLARGGILGEGARSMVEQQLVALGLAAPPAPE
CCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCC
VVPVKPVVTQPQTADPETRVVEPEPTPEATAGETVETQQDAPTAEPAFVLQAPKLEIARF
EEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCEEEEEE
EPDGSGIVAASAQAGVEVQVLLDDEVLDTQTVPAAGEFVSFVTIDLSDKPRLLTLLARHN
CCCCCCEEEECCCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEECCCCCCEEEHHHHCC
GQELASEDSFILAPMPAPAAPEPQVDQLAAAQTDSDIAAPEEEPIELAEATETADPNVAD
CHHHHCCCCEEEECCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCC
QATDAPDPDAPGDGAAEGSTTVTAQSEEVALADVAVDSTDPDAEGDASSTESAASGAADN
CCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEEEECCCCCCCCCCCCCCHHHHCCCCCC
GVATDMAAVENTGDQLPDAASEAVSEASPEAVDPSVDVAEATSALPETEVTAEDAPAAEA
CCHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCC
PEETVETAALEQAIDDESARESSEDSVPAPEPEVAAVADTSEPPAPDTTPAPQSPVEVAE
CHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHH
AVDTPEVPSSKTDTMAAVEEVQQPQPQDPDTESPSGEAAPAPQATSSVAVLRAGRDGVTL
HCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEE
VQPAAPAAPELVGKVALDTISYTETGDVQLAGRARPEALVRVYLDNSPVAELAAASDGQW
ECCCCCCCHHHHHHHHHHHCCCCCCCCEEEECCCCCCEEEEEEECCCCHHHHHHCCCCCC
SGSLTSVAPGIYTLRLDEIDPVDGIVLSRLETPFKREAPEVLQPAVTADQAPDQAAPVVR
CCCHHHHCCCEEEEEECCCCCCHHHHHHHHCCCHHHCCHHHHHCHHHCCCCCCCCCCEEE
AVTVQEGDTLWAISQQRYGSGFLYVRVFEANKGDIRDPDLIYPGQIFTLPE
EEEEECCCEEEEEEHHCCCCCEEEEEEEECCCCCCCCCCEECCCEEEECCC
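
The state strings above can be summarised as a composition. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out; `state_fractions` is a hypothetical helper:

```python
from collections import Counter

def state_fractions(states):
    """Percentage of H, E and C in a secondary-structure state string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts[s] / len(states) for s in "HEC"}

# Toy input; concatenate the H/E/C lines above to summarise the prediction.
print(state_fractions("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

A mix of helix and strand states in the result would be in line with the "Alpha Beta" structure class listed in this record.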

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA