Definition: Shigella boydii CDC 3083-94 chromosome, complete genome.
Accession: NC_010658
Length: 4,615,997 bp

The map label for this gene is wcaD

Identifier: 187734147

GI number: 187734147

Start: 1079296

End: 1080513

Strand: Direct

Name: wcaD

Synonym: SbBS512_E1176

Alternate gene names: 187734147

Gene position: 1079296-1080513 (Clockwise)

Preceding gene: 187731835

Following gene: 187733816

Centisome position: 23.38
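
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state either definition, and the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and the chromosome Length.
print(gene_length(1079296, 1080513))           # 1218
print(round(centisome(1079296, 4615997), 2))   # 23.38
```

Both results agree with the record: the gene sequence header reads 1218 bases and the centisome position is listed as 23.38.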

GC content: 42.04

Gene sequence:

>1218_bases
ATGTCAACTTCTATCAGAATCTGTAGCTACCTGTTGCTGCCGCTGATTTATCTGCTGGTTAACGTCAAAATCGCCCAGCT
TGGCGAAAGTTTCCCCATCACCATCGTCACTTTTTTACCTGTCTTGTTACTGCTGTTTTTAGAACGCATCAGCGTTAAAA
AATTGATGATTGCCTTAGGGATTGGCGCGGGACTCACGGCGTTTAACTATCTGTTTGGTCAGTCGCTGGATGCCAGTAAA
TACGTCACTTCAACTATGCTGTTTGTCTATATTGTGATCATTATTGGCATGGTGTGGAGTATTCGTTTTAAAACAATTTC
GCCACACAACCATCGCAAGATATTACGTTTCTTTTATCTGGTGGTCGGGCTGGTGGTGGCGCTGGCGGCGGTGGAGATGG
CACAAATTATCCTCACCGGTGGCAGCAGTATTATGGAGTCGATTTCGAAATATCTGATTTACAGCAACAGCTATGTGCTG
AATTTCATTAAATTCGGCGGCAAGCGCACGACAGCACTTTATTTCGAACCGGCATTTTTCGCTCTGGCATTAATCTCAAT
TTGGCTCAGCATCAAACAGTTTGGTATCAAAACGCCTAAAACAGATGCTATGATTCTCGCAGGGATAATATTATCCGGAT
CGTTTTCAGGGGTTATGACCTTTATCCTGTTTTATTTGCTGGAGTGGGCATTTCAATATCTGAATAAAGAGGCGATTAAG
AAAAAGTTACCGTTAGCATTGATTTCTCTGGCTGTATTCCTGGTTGGTGTGGTAATCGCGTTTCCTTATATTTCCACCCG
TCTGGGCGATTTAGGTACGGAAGGATCGTCATCATATTATCGTATTGTCGGTCCGCTGGTGATGGTCGGTTATTCTTTGA
CCCATATTGACGGTGTAGTCAGATTTGGCTCACTTTATGAATATGTCGCATCATTCGGAATATTTAACGGTGCGGATGTC
GGAAAAACCATAGACAATGGTTTGTATCTGCTGATTATTTATTTTTCCTGGTTCGCGGTGTTTTTATCACTGTGGTACAT
GGGGAAAGTGATAAAAATGATGATCAACGCTTTTGGTGATAACCGCAATTTTCGCGTGCAATTATATCTTTTTACTCCGG
TATCGCTGTTTTTTACCGGTTCGATATTTAGCCCGGAATATGCATTTTTAATCGTCTGTCCGTTTATTTTGCGAAAAGCG
TTAAATATTACGAGGTAA
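
A GC content figure such as the one listed above is usually the share of G and C bases in the sequence. The following is a minimal sketch under that assumption; how the source database computed its value is not stated, and `fasta_body` and `gc_content` are hypothetical helper names.

```python
def fasta_body(text: str) -> str:
    """Join the sequence lines of one FASTA record, skipping '>' header lines."""
    lines = (line.strip() for line in text.splitlines())
    return "".join(line for line in lines if line and not line.startswith(">"))


def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy record: the first ten bases of the gene sequence above.
record = ">10_bases\nATGTCAACTT\n"
print(round(gc_content(fasta_body(record)), 2))  # 30.0
```

Passing the full 1218-base record through the same two functions gives a value to compare against the listed GC content.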

Upstream 100 bases:

>100_bases
AGCAAACCGGAAATCGCGCAGGCGATATTTGGTACCACGCTGGCTGAGTTCAGCCAACGCAGCCGCGCCGCCTACAGTGG
ACAACAGATGCTGGAGGAGT

Downstream 100 bases:

>100_bases
GAATAAGAACATGTTGCTTAGCATAATCACTGTCGCGTTTCGTAACCTCGAAGGGATAGTCAAAACACATGCCTCGCTGG
CGCATCTGGCGCAGGTGGAA

Product: putative colanic acid biosynthesis protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 405; Mature: 404
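
The two residue counts follow from the gene length if one assumes a single terminal stop codon and removal of the initiator methionine in the mature form. The record does not spell out these rules, so the sketch below is an assumption-based consistency check, with a hypothetical function name.

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence ends with one stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = n_bases // 3 - 1   # drop the stop codon
    mature = translated - 1         # drop the initiator Met
    return translated, mature


print(residue_counts(1218))  # (405, 404)
```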

Protein sequence:

>405_residues
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALGIGAGLTAFNYLFGQSLDASK
YVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYLVVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVL
NFIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVVRFGSLYEYVASFGIFNGADV
GKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKA
LNITR

Sequences:

>Translated_405_residues
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALGIGAGLTAFNYLFGQSLDASK
YVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYLVVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVL
NFIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVVRFGSLYEYVASFGIFNGADV
GKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKA
LNITR
>Mature_404_residues
STSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALGIGAGLTAFNYLFGQSLDASKY
VTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYLVVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLN
FIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIKK
KLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVVRFGSLYEYVASFGIFNGADVG
KTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKAL
NITR

Specific function: Slime polysaccharide colanic acid biosynthesis. [C]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI1788369, Length=405, Percent_Identity=100, Blast_Score=800, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): WCAD_ECOLI (P71238)

Other databases:

- EMBL:   U38473
- EMBL:   U00096
- EMBL:   AP009048
- PIR:   G64971
- RefSeq:   AP_002656.1
- RefSeq:   NP_416560.1
- ProteinModelPortal:   P71238
- DIP:   DIP-11120N
- MINT:   MINT-1239365
- STRING:   P71238
- EnsemblBacteria:   EBESCT00000004034
- EnsemblBacteria:   EBESCT00000016742
- GeneID:   946550
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW2041
- KEGG:   eco:b2056
- EchoBASE:   EB3342
- EcoGene:   EG13572
- eggNOG:   NOG10766
- GeneTree:   EBGT00050000011726
- HOGENOM:   HBG417265
- OMA:   SWFAVLL
- ProtClustDB:   PRK09953
- BioCyc:   EcoCyc:G7101-MONOMER
- Genevestigator:   P71238

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 45410; Mature: 45279

Theoretical pI: Translated: 10.01; Mature: 10.01

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

11 predicted regions (coordinates unavailable)

Cys/Met content:

0.5 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
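
The percentages above are residue counts divided by the protein length. A minimal sketch of that calculation follows; the function names are hypothetical and rounding to one decimal place is assumed from the way the values are printed.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return 100.0 * hits / len(protein)


def cys_met_report(protein: str) -> dict[str, float]:
    """Cys, Met, and combined Cys+Met content, rounded to one decimal."""
    return {
        "Cys": round(residue_percent(protein, "C"), 1),
        "Met": round(residue_percent(protein, "M"), 1),
        "Cys+Met": round(residue_percent(protein, "CM"), 1),
    }


# Toy input: the first ten residues of the translated protein.
print(cys_met_report("MSTSIRICSY"))
```

The listed values are consistent with 2 Cys and 12 Met in the 405-residue translated sequence (0.5, 3.0 and 3.5 %), and with one fewer Met in the 404-residue mature sequence (0.5, 2.7 and 3.2 %).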

Secondary structure:

>Translated Secondary Structure
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG
CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL
CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHH
VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEEEECHHHH
ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCHHHHH
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV
HHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD
HHHHHHHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR
CCCEEEEEEEECCHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCH
>Mature Secondary Structure 
STSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG
CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL
CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHH
VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEEEECHHHH
ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCHHHHH
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV
HHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD
HHHHHHHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR
CCCEEEEEEEECCHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCH
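
The state strings above can be summarised as a composition. The sketch below assumes the conventional three-state reading (H = helix, E = strand, C = coil), which the record does not define; `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict[str, float]:
    """Percentage of H, E and C states in a predicted secondary-structure string."""
    states = "".join(states.split()).upper()  # tolerate line breaks in the input
    if not states:
        raise ValueError("empty secondary-structure string")
    counts = Counter(states)
    return {s: 100.0 * counts.get(s, 0) / len(states) for s in "HEC"}


# Toy input; paste the concatenated state lines from the record for real use.
print(ss_composition("CCCHHHHHEE"))  # {'H': 50.0, 'E': 20.0, 'C': 30.0}
```

A predominantly H composition would be in line with the "Alpha" structure class reported for this protein.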

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8759852; 9278503