Definition | Shigella boydii CDC 3083-94 chromosome, complete genome. |
---|---|
Accession | NC_010658 |
Length | 4,615,997 |
The map label for this gene is wcaD
Identifier: 187734147
GI number: 187734147
Start: 1079296
End: 1080513
Strand: Direct
Name: wcaD
Synonym: SbBS512_E1176
Alternate gene names: 187734147
Gene position: 1079296-1080513 (Clockwise)
Preceding gene: 187731835
Following gene: 187733816
Centisome position: 23.38
GC content: 42.04
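The coordinate fields above can be cross-checked against one another. The following is a minimal sketch, assuming that the centisome position is the gene start expressed as a percentage of the chromosome length and that GC content is the percentage of G and C bases in the gene sequence; the record does not state either formula, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_615_997   # Length
GENE_START = 1_079_296      # Start
GENE_END = 1_080_513        # End


def gene_length(start: int, end: int) -> int:
    """Length of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome_position(start: int, genome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    return round(100.0 * (seq.count("G") + seq.count("C")) / len(seq), 2)


print(gene_length(GENE_START, GENE_END))              # 1218
print(centisome_position(GENE_START, GENOME_LENGTH))  # 23.38
```

Under these assumptions the computed length matches the 1218-base gene sequence and the centisome value matches the record. `gc_content` can be applied to the gene sequence once it has been read from the FASTA entry.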
Gene sequence:
>1218_bases
ATGTCAACTTCTATCAGAATCTGTAGCTACCTGTTGCTGCCGCTGATTTATCTGCTGGTTAACGTCAAAATCGCCCAGCT
TGGCGAAAGTTTCCCCATCACCATCGTCACTTTTTTACCTGTCTTGTTACTGCTGTTTTTAGAACGCATCAGCGTTAAAA
AATTGATGATTGCCTTAGGGATTGGCGCGGGACTCACGGCGTTTAACTATCTGTTTGGTCAGTCGCTGGATGCCAGTAAA
TACGTCACTTCAACTATGCTGTTTGTCTATATTGTGATCATTATTGGCATGGTGTGGAGTATTCGTTTTAAAACAATTTC
GCCACACAACCATCGCAAGATATTACGTTTCTTTTATCTGGTGGTCGGGCTGGTGGTGGCGCTGGCGGCGGTGGAGATGG
CACAAATTATCCTCACCGGTGGCAGCAGTATTATGGAGTCGATTTCGAAATATCTGATTTACAGCAACAGCTATGTGCTG
AATTTCATTAAATTCGGCGGCAAGCGCACGACAGCACTTTATTTCGAACCGGCATTTTTCGCTCTGGCATTAATCTCAAT
TTGGCTCAGCATCAAACAGTTTGGTATCAAAACGCCTAAAACAGATGCTATGATTCTCGCAGGGATAATATTATCCGGAT
CGTTTTCAGGGGTTATGACCTTTATCCTGTTTTATTTGCTGGAGTGGGCATTTCAATATCTGAATAAAGAGGCGATTAAG
AAAAAGTTACCGTTAGCATTGATTTCTCTGGCTGTATTCCTGGTTGGTGTGGTAATCGCGTTTCCTTATATTTCCACCCG
TCTGGGCGATTTAGGTACGGAAGGATCGTCATCATATTATCGTATTGTCGGTCCGCTGGTGATGGTCGGTTATTCTTTGA
CCCATATTGACGGTGTAGTCAGATTTGGCTCACTTTATGAATATGTCGCATCATTCGGAATATTTAACGGTGCGGATGTC
GGAAAAACCATAGACAATGGTTTGTATCTGCTGATTATTTATTTTTCCTGGTTCGCGGTGTTTTTATCACTGTGGTACAT
GGGGAAAGTGATAAAAATGATGATCAACGCTTTTGGTGATAACCGCAATTTTCGCGTGCAATTATATCTTTTTACTCCGG
TATCGCTGTTTTTTACCGGTTCGATATTTAGCCCGGAATATGCATTTTTAATCGTCTGTCCGTTTATTTTGCGAAAAGCG
TTAAATATTACGAGGTAA
Upstream 100 bases:
>100_bases
AGCAAACCGGAAATCGCGCAGGCGATATTTGGTACCACGCTGGCTGAGTTCAGCCAACGCAGCCGCGCCGCCTACAGTGG
ACAACAGATGCTGGAGGAGT
Downstream 100 bases:
>100_bases
GAATAAGAACATGTTGCTTAGCATAATCACTGTCGCGTTTCGTAACCTCGAAGGGATAGTCAAAACACATGCCTCGCTGG
CGCATCTGGCGCAGGTGGAA
Product: putative colanic acid biosynthesis protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 405; Mature: 404
Protein sequence:
>405_residues
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALGIGAGLTAFNYLFGQSLDASK
YVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYLVVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVL
NFIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVVRFGSLYEYVASFGIFNGADV
GKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKA
LNITR
Sequences:
>Translated_405_residues
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALGIGAGLTAFNYLFGQSLDASK
YVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYLVVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVL
NFIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVVRFGSLYEYVASFGIFNGADV
GKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKA
LNITR
>Mature_404_residues
STSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALGIGAGLTAFNYLFGQSLDASKY
VTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYLVVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLN
FIKFGGKRTTALYFEPAFFALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIKK
KLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVVRFGSLYEYVASFGIFNGADVG
KTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKAL
NITR
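The translated and mature lengths follow from the gene length by simple arithmetic. The sketch below assumes that the final codon of the 1218-base coding sequence is a stop codon that contributes no residue, and that the mature form differs from the translated form only by removal of the initiator methionine (consistent with the two sequences listed, but not stated explicitly in the record). The function names are hypothetical.

```python
def protein_lengths(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS length includes one terminal stop codon and that the
    mature protein lacks only the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_bases // 3 - 1   # stop codon encodes no residue
    mature = translated - 1           # initiator Met removed (assumption)
    return translated, mature


def mature_sequence(translated: str) -> str:
    """Drop a leading Met, if present, to approximate the mature form."""
    return translated[1:] if translated.startswith("M") else translated


print(protein_lengths(1218))          # (405, 404)
print(mature_sequence("MSTSIRICSY"))  # STSIRICSY
```

The first ten residues of the translated sequence are used here only as a short illustration; the full sequence can be passed in the same way.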
Specific function: Slime polysaccharide colanic acid biosynthesis. [C]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Escherichia coli, GI1788369, Length=405, Percent_Identity=100, Blast_Score=800, Evalue=0.0
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): WCAD_ECOLI (P71238)
Other databases:
- EMBL: U38473
- EMBL: U00096
- EMBL: AP009048
- PIR: G64971
- RefSeq: AP_002656.1
- RefSeq: NP_416560.1
- ProteinModelPortal: P71238
- DIP: DIP-11120N
- MINT: MINT-1239365
- STRING: P71238
- EnsemblBacteria: EBESCT00000004034
- EnsemblBacteria: EBESCT00000016742
- GeneID: 946550
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW2041
- KEGG: eco:b2056
- EchoBASE: EB3342
- EcoGene: EG13572
- eggNOG: NOG10766
- GeneTree: EBGT00050000011726
- HOGENOM: HBG417265
- OMA: SWFAVLL
- ProtClustDB: PRK09953
- BioCyc: EcoCyc:G7101-MONOMER
- Genevestigator: P71238
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 45410; Mature: 45279
Theoretical pI: Translated: 10.01; Mature: 10.01
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
11 regions listed; coordinates not recoverable
Cys/Met content:
0.5 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.7 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
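Percentages of this kind can be derived from residue counts. The following is a minimal sketch, assuming each value is the count of that residue divided by the protein length, reported to one decimal place; the record does not document its exact rounding, and the short test peptide is hypothetical.

```python
def cys_met_content(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met) for a one-letter amino acid sequence."""
    seq = protein.upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    # Round only at the end so the combined value is not a sum of rounded parts.
    return round(cys, 1), round(met, 1), round(cys + met, 1)


toy = "MSTCM"                     # hypothetical peptide, not from the record
print(cys_met_content(toy))       # (20.0, 40.0, 60.0)
print(cys_met_content(toy[1:]))   # (25.0, 25.0, 50.0), initiator Met removed
```

Removing the initiator methionine lowers the Met count by one, which is why the Met and Cys+Met values differ between the translated and mature forms while the Cys value can stay the same after rounding.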
Secondary structure:
>Translated Secondary Structure
MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG
CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL
CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHH
VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEEEECHHHH
ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCHHHHH
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV
HHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD
HHHHHHHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR
CCCEEEEEEEECCHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCH
>Mature Secondary Structure
STSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG
CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL
CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHH
VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEEEECHHHH
ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK
HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCHHHHH
KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV
HHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD
HHHHHHHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR
CCCEEEEEEEECCHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCH
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8759852; 9278503