The gene/protein map for NC_004741 is currently unavailable.
Definition Shigella flexneri 2a str. 2457T, complete genome.
Accession NC_004741
Length 4,599,354

Click here to switch to the map view.

The map label for this gene is mdoC

Identifier: 30062572

GI number: 30062572

Start: 1087980

End: 1089137

Strand: Reverse

Name: mdoC

Synonym: S1117

Alternate gene names: 30062572

Gene position: 1089137-1087980 (Counterclockwise)

Preceding gene: 30062575

Following gene: 30062564

Centisome position: 23.68

GC content: 43.7

Gene sequence:

>1158_bases
ATGAATCCAGTACCCGCGCAACGTGAATATTTCCTCGACTCCATCCGCGCCTGGCTGATGTTGTTAGGGATCCCTTTTCA
TATTTCTTTAATCTATTCGAGCCATACATGGCATGTGAATAGCGCCGAACCGTCATTGTGGCTAACCCTTTTTAATGACT
TCATCCACTCGTTCCGCATGCAGGTATTTTTCGTTATATCCGGATACTTTTCCTACATGCTTTTTTTACGCTATCCCTTG
AAAAAATGGTGGAAAGTACGTGTCGAACGTGTAGGTATCCCGATGTTAACAGCCATCCCCCTACTGACATTACCGCAATT
TATTATGCTGCAATATGTCAAAGGGAAAGCGGAAAGTTGGCCTGGGCTATCATTGTATGACAAATATAATACGTTGGCCT
GGGAATTAATATCACACCTGTGGTTTTTACTGGTGTTAGTAGTCATGACGACGCTGTGTGTATGGATATTTAAGCGCATC
AGAAATAATTTAGAAAATTCTGATAAAACGAATAAAAAATTCTCGATGGTAAAACTATCGGTGATTTTTTTATGCCTCGG
CATCGGCTATGCGGTAATAAGAAGAACGATTTTTATTGTGTATCCACCCATTCTGAGTAATGGCATGTTCAATTTTATTG
TCATGCAAACGCTATTTTATTTACCGTTCTTTATCCTCGGCGCACTGGCTTTCATTTTCCCTCATCTTAAAGCCTTGTTT
ACCACGCCGTCTCGTGGCTGTACCCTCGCTGCAGCATTGGCGTTTGTCGCTTATTTACTCAACCAGCGCTATGGCAGTGG
CGATGCCTGGATGTACGAAACCGAGTCGGTGATCACCATGGTCCTCGGCCTGTGGATGGTGAATGTGGTCTTCTCCTTTG
GCCACCGTTTGCTTAACTTCCAGTCAGCGCGGGTGACTTATTTTGTTAACGCATCGCTGTTTATCTATCTGGTTCACCAC
CCGTTAACGCTGTTTTTCGGCGCATACATTACACCGCACATCACCTCCAACTGGCTTGGTTTTCTCTGTGGCCTGATATT
CGTAGTAGGGATTGCGATAATTCTGTATGAAATTCATTTACGCATCCCGTTACTGAAGTTTTTGTTCTCTGGTAAACCGG
TTGTTAAGCGTGAGAACGATAAAGCACCAGCCCGTTAA

Upstream 100 bases:

>100_bases
CCTGCGCCTGGCGCGGTCAGGTCGGTGTCGCTATGGCAACAAGCCTAAAGGCATAACCCGACAGGGATAAGACGAAAAAT
CCGAGATTAGTTAACCATAT

Downstream 100 bases:

>100_bases
GCCACATTTACAATAACCATTCCACGGGCAATATCGACGCCAGTCTGACCATAACCCGCTTCCAGAAACTGGTGGCGGGT
TCTTTTTTGAGAATAACCTC

Product: glucans biosynthesis protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 385; Mature: 385

Protein sequence:

>385_residues
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTLFNDFIHSFRMQVFFVISGYFSYMLFLRYPL
KKWWKVRVERVGIPMLTAIPLLTLPQFIMLQYVKGKAESWPGLSLYDKYNTLAWELISHLWFLLVLVVMTTLCVWIFKRI
RNNLENSDKTNKKFSMVKLSVIFLCLGIGYAVIRRTIFIVYPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALF
TTPSRGCTLAAALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNFQSARVTYFVNASLFIYLVHH
PLTLFFGAYITPHITSNWLGFLCGLIFVVGIAIILYEIHLRIPLLKFLFSGKPVVKRENDKAPAR

Sequences:

>Translated_385_residues
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTLFNDFIHSFRMQVFFVISGYFSYMLFLRYPL
KKWWKVRVERVGIPMLTAIPLLTLPQFIMLQYVKGKAESWPGLSLYDKYNTLAWELISHLWFLLVLVVMTTLCVWIFKRI
RNNLENSDKTNKKFSMVKLSVIFLCLGIGYAVIRRTIFIVYPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALF
TTPSRGCTLAAALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNFQSARVTYFVNASLFIYLVHH
PLTLFFGAYITPHITSNWLGFLCGLIFVVGIAIILYEIHLRIPLLKFLFSGKPVVKRENDKAPAR
>Mature_385_residues
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTLFNDFIHSFRMQVFFVISGYFSYMLFLRYPL
KKWWKVRVERVGIPMLTAIPLLTLPQFIMLQYVKGKAESWPGLSLYDKYNTLAWELISHLWFLLVLVVMTTLCVWIFKRI
RNNLENSDKTNKKFSMVKLSVIFLCLGIGYAVIRRTIFIVYPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALF
TTPSRGCTLAAALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNFQSARVTYFVNASLFIYLVHH
PLTLFFGAYITPHITSNWLGFLCGLIFVVGIAIILYEIHLRIPLLKFLFSGKPVVKRENDKAPAR

Specific function: Necessary for the succinyl substitution of periplasmic glucans. Could catalyze the transfer of succinyl residues from the cytoplasmic side of the membrane to the nascent glucan backbones on the periplasmic side of the membrane [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the acyltransferase 3 family. OpgC subfamily [H]

Homologues:

Organism=Escherichia coli, GI1787285, Length=385, Percent_Identity=99.7402597402597, Blast_Score=779, Evalue=0.0,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002656 [H]

Pfam domain/function: PF01757 Acyl_transf_3 [H]

EC number: 2.1.-.- [C]

Molecular weight: Translated: 44701; Mature: 44701

Theoretical pI: Translated: 10.13; Mature: 10.13

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTLFNDFIHSFRM
CCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEECCCCCHHHHHHHHHHHHHHHH
QVFFVISGYFSYMLFLRYPLKKWWKVRVERVGIPMLTAIPLLTLPQFIMLQYVKGKAESW
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCC
PGLSLYDKYNTLAWELISHLWFLLVLVVMTTLCVWIFKRIRNNLENSDKTNKKFSMVKLS
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH
VIFLCLGIGYAVIRRTIFIVYPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALF
HHHHHHHHHHHHHHHHHHEEECHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
TTPSRGCTLAAALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNF
CCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCC
QSARVTYFVNASLFIYLVHHPLTLFFGAYITPHITSNWLGFLCGLIFVVGIAIILYEIHL
CCCEEEEEECHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
RIPLLKFLFSGKPVVKRENDKAPAR
HHHHHHHHHCCCCCEECCCCCCCCC
>Mature Secondary Structure
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTLFNDFIHSFRM
CCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEECCCCCHHHHHHHHHHHHHHHH
QVFFVISGYFSYMLFLRYPLKKWWKVRVERVGIPMLTAIPLLTLPQFIMLQYVKGKAESW
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCC
PGLSLYDKYNTLAWELISHLWFLLVLVVMTTLCVWIFKRIRNNLENSDKTNKKFSMVKLS
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH
VIFLCLGIGYAVIRRTIFIVYPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALF
HHHHHHHHHHHHHHHHHHEEECHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
TTPSRGCTLAAALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNF
CCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCC
QSARVTYFVNASLFIYLVHHPLTLFFGAYITPHITSNWLGFLCGLIFVVGIAIILYEIHL
CCCEEEEEECHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
RIPLLKFLFSGKPVVKRENDKAPAR
HHHHHHHHHCCCCCEECCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA