| Definition | Shigella flexneri 2a str. 2457T, complete genome. |
|---|---|
| Accession | NC_004741 |
| Length | 4,599,354 |
The map label for this gene is plsB
Identifier: 161486422
GI number: 161486422
Start: 3435891
End: 3438314
Strand: Reverse
Name: plsB
Synonym: S3567
Alternate gene names: 161486422
Gene position: 3438314-3435891 (Counterclockwise)
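The Start and End fields can be cross-checked against the 2424-base gene sequence listed further down. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name is illustrative only.

```python
# Coordinates copied from the Start and End fields of this record.
# Assumption: both are 1-based and inclusive.
START = 3435891
END = 3438314


def gene_length(start: int, end: int) -> int:
    """Number of bases in an inclusive interval, regardless of strand."""
    return abs(end - start) + 1


print(gene_length(START, END))  # 2424, matching the ">2424_bases" header
```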
Preceding gene: 161486420
Following gene: 30064625
Centisome position: 74.76
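A centisome position expresses a coordinate as a percentage of the genome length. The sketch below reproduces the reported value under the assumption that the higher coordinate (the 5' end of this reverse-strand gene) is the one used; the record does not document its method, so this is only a consistency check.

```python
GENOME_LENGTH = 4599354   # Length field of the genome table
GENE_LOW = 3435891        # Start field
GENE_HIGH = 3438314       # End field (5' end on the reverse strand)


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


# Assumption: the record uses the higher coordinate.
print(centisome(GENE_HIGH, GENOME_LENGTH))  # 74.76 (matches the record)
print(centisome(GENE_LOW, GENOME_LENGTH))   # 74.7 (does not match)
```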
GC content: 55.16
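GC content is the percentage of G and C bases in the gene sequence. The helper below is a minimal sketch (the function name is hypothetical); it is demonstrated on the first 12 bases of the gene rather than the full sequence. The last lines work backwards from the reported 55.16% to the number of G+C bases it implies for a 2424-base gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 12 bases of the gene sequence, used only as a small demonstration.
print(gc_content("ATGTCCGGCTGG"))

# Working backwards from the record: 55.16% of 2424 bases.
GENE_LENGTH = 2424
REPORTED_GC = 55.16
implied_gc_bases = round(REPORTED_GC / 100 * GENE_LENGTH)
print(implied_gc_bases)  # 1337
```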
Gene sequence:
>2424_bases ATGTCCGGCTGGCCACGAATTTACTACAAATTACTGAATTTACCATTAAGCATCCTGGTAAAAAGCAAGTCTATTCCGGC AGATCCTGCCCCGGAACTGGGGCTGGATACCTCTCGTCCAATTATGTACGTTTTACCGTACAACTCGAAAGCAGATTTGC TGACGTTGCGCGCCCAGTGTCTGGCACATGATTTGCCTGACCCGTTAGAGCCGCTGGAAATCGACGGCACGCTACTGCCG CGCTATGTGTTCATTCACGGCGGGCCGCGTGTATTCACCTATTACACGCCGAAAGAAGAGTCTATTAAGCTTTTCCACGA CTATCTCGATTTGCACCGTAGCAACCCAAATCTGGATGTGCAGATGGTGCCAGTATCGGTGATGTTTGGTCGCGCGCCGG GGCGTGAAAAAGGCGAAGTGAACCCGCCGCTGCGTATGCTTAACGGCGTACAGAAATTTTTCGCTGTACTGTGGCTCGGT CGCGACAGTTTTGTGCGTTTCTCGCCGTCAGTGTCGCTGCGCCGTATGGCGGATGAACACGGCACGGATAAAACTATCGC TCAGAAACTGGCGCGCGTGGCGCGTATGCACTTTGCCCGTCAACGTCTGGCTGCCGTAGGCCCACGTCTGCCTGCTCGTC AGGATCTGTTTAATAAGCTGCTCGCCTCCCGCGCCATTGCCAAAGCGGTAGAAGATGAAGCGCGCAGCAAAAAAATCTCC CATGAAAAAGCGCAGCAGAACGCGATTGCGCTGATGGAAGAGATTGCGGCGAATTTCTCTTACGAGATGATCCGCCTGAC TGACCGCATTCTGGGCTTCACCTGGAACCGATTTTATCAGGGCATCAACGTCCATAACGCCGAGCGCGTTCGCCAGCTGG CCCACGACGGTCATGAGCTGGTGTATGTGCCTTGCCACCGCAGTCACATGGACTACCTGCTACTTTCTTACGTGCTGTAC CACCAGGGGCTGGTGCCGCCGCATATCGCCGCCGGGATCAACCTGAACTTCTGGCCGGCCGGGCCGATTTTCCGCCGTCT GGGGGCGTTCTTTATTCGCCGTACTTTTAAAGGCAATAAACTTTATTCCACCGTTTTCCGTGAGTATCTCGGCGAACTGT TCAGCCGTGGTTATTCCGTCGAGTACTTCGTGGAAGGCGGTCGTTCCCGCACGGGGCGTTTGCTGGATCCGAAAACCGGT ACGCTGTCGATGACCATTCAGGCGATGCTGCGTGGCGGCACGCGTCCGATTACGCTGATTCCGATCTATATCGGTTATGA GCACGTCATGGAAGTGGGTACTTACGCCAAAGAACTGCGCGGCGCGACGAAAGAGAAAGAGAGCCTGCCGCAGATGCTGC GCGGTTTAAGCAAGCTGCGTAATCTCGGTCAGGGTTACGTCAACTTCGGTGAACCAATGCCGTTGATGACCTACCTTAAC CAGCACGTACCAGACTGGCGTGAATCTATCGATCCCATCGAAGCGGTGCGTCCGGCCTGGTTAACGCCGACGGTCAATAA TATTGCTGCCGATCTGATGGTACGCATTAACAACGCAGGCGCGGCAAACGCCATGAACCTGTGCTGTACTGCGCTACTGG CATCACGTCAGCGCTCACTCACCCGCGAGCAGTTAACCGAGCAACTCAACTGCTACCTGGATCTGATGCGCAACGTACCT TACTCCACGGACTCTACCGTTCCTTCAGCCAGCGCCAGCGAGCTTATCGATCACGCGCTGCAAATGAACAAGTTTGAAGT CGAGAAAGACACTATCGGCGACATCATCATTCTGCCACGCGAGCAAGCGGTGCTGATGACCTACTATCGCAACAACATTG CGCATATGTTGGTGCTGCCTTCGCTGATGGCGGCAATTGTCACCCAGCATCGCCACATCTCCCGCGACGTATTGATGGAG 
CACGTCAATGTGCTTTACCCAATGCTGAAAGCGGAGCTGTTCCTGCGCTGGGATCGCGACGAGTTGCCGGACGTTATTGA TGCGCTGGCAAATGAGATGCAACGTCAGGGGCTGATTACCCTGCAAGATGATGAGTTGCATATCAACCCGGCGCATTCTC GCACCCTACAGCTGCTGGCCGCAGGCGCGCGCGAAACGCTGCAACGTTATGCCATCACCTTCTGGTTGTTGAGTGCCAAC CCGTCGATCAACCGCGGTACGCTGGAGAAAGAGAGCCGCACCGTTGCGCAACGTCTCTCCGTGCTGCACGGCATCAACGC GCCGGAGTTCTTCGACAAGGCGGTGTTCAGTTCTCTGGTGCTGACGCTGCGTGATGAAGGGTATATCAGCGATAGCGGCG ATGCCGAACCAGCAGAAACGATGAAGGTTTATCAGTTGCTGGCGGAGTTGATTACATCAGACGTGCGTTTGACGATTGAG AGTGCGACGCAGGGCGAAGGGTAA
Upstream 100 bases:
>100_bases AACGCCGCGCGAAACATGAGCGGATACCACAGAATTTCCCATGACTTTCTGCTATCCTTGCCGCGCATTTGCATTATTAA CCAGAGGCTTTACATCGTTT
Downstream 100 bases:
>100_bases TCAGAGAGAATTGCCGGATGCGGCGAAAACGCCTTATCCGGCCTACCATGACCTGCAAATTCAATAAATTGCGATTCACC GGGTAGGCCTGAGAAGCGCA
Product: glycerol-3-phosphate acyltransferase
Products: NA
Alternate protein names: GPAT [H]
Number of amino acids: Translated: 807; Mature: 806
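The two residue counts follow from the gene length. The sketch below assumes, as the sequences in this record suggest, that the final codon is a stop codon and that the mature form differs from the translated form only by removal of the initiator methionine.

```python
GENE_LENGTH = 2424  # bases, from the gene sequence header

codons = GENE_LENGTH // 3   # 808 codons, including the stop codon
translated = codons - 1     # stop codon encodes no residue
mature = translated - 1     # assumption: initiator Met removed

print(codons, translated, mature)  # 808 807 806
```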
Protein sequence:
>807_residues MSGWPRIYYKLLNLPLSILVKSKSIPADPAPELGLDTSRPIMYVLPYNSKADLLTLRAQCLAHDLPDPLEPLEIDGTLLP RYVFIHGGPRVFTYYTPKEESIKLFHDYLDLHRSNPNLDVQMVPVSVMFGRAPGREKGEVNPPLRMLNGVQKFFAVLWLG RDSFVRFSPSVSLRRMADEHGTDKTIAQKLARVARMHFARQRLAAVGPRLPARQDLFNKLLASRAIAKAVEDEARSKKIS HEKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWNRFYQGINVHNAERVRQLAHDGHELVYVPCHRSHMDYLLLSYVLY HQGLVPPHIAAGINLNFWPAGPIFRRLGAFFIRRTFKGNKLYSTVFREYLGELFSRGYSVEYFVEGGRSRTGRLLDPKTG TLSMTIQAMLRGGTRPITLIPIYIGYEHVMEVGTYAKELRGATKEKESLPQMLRGLSKLRNLGQGYVNFGEPMPLMTYLN QHVPDWRESIDPIEAVRPAWLTPTVNNIAADLMVRINNAGAANAMNLCCTALLASRQRSLTREQLTEQLNCYLDLMRNVP YSTDSTVPSASASELIDHALQMNKFEVEKDTIGDIIILPREQAVLMTYYRNNIAHMLVLPSLMAAIVTQHRHISRDVLME HVNVLYPMLKAELFLRWDRDELPDVIDALANEMQRQGLITLQDDELHINPAHSRTLQLLAAGARETLQRYAITFWLLSAN PSINRGTLEKESRTVAQRLSVLHGINAPEFFDKAVFSSLVLTLRDEGYISDSGDAEPAETMKVYQLLAELITSDVRLTIE SATQGEG
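The protein sequence can be spot-checked against the gene sequence by translating a few codons with the standard genetic code. The sketch below builds the codon table by hand and translates only the first 18 and last 15 bases of the gene; the function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1 until a stop codon or the end of the input."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCCGGCTGGCCACGA"))  # MSGWPR (start of the protein)
print(translate("CAGGGCGAAGGGTAA"))     # QGEG (end of the protein)
```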
Sequences:
>Translated_807_residues MSGWPRIYYKLLNLPLSILVKSKSIPADPAPELGLDTSRPIMYVLPYNSKADLLTLRAQCLAHDLPDPLEPLEIDGTLLP RYVFIHGGPRVFTYYTPKEESIKLFHDYLDLHRSNPNLDVQMVPVSVMFGRAPGREKGEVNPPLRMLNGVQKFFAVLWLG RDSFVRFSPSVSLRRMADEHGTDKTIAQKLARVARMHFARQRLAAVGPRLPARQDLFNKLLASRAIAKAVEDEARSKKIS HEKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWNRFYQGINVHNAERVRQLAHDGHELVYVPCHRSHMDYLLLSYVLY HQGLVPPHIAAGINLNFWPAGPIFRRLGAFFIRRTFKGNKLYSTVFREYLGELFSRGYSVEYFVEGGRSRTGRLLDPKTG TLSMTIQAMLRGGTRPITLIPIYIGYEHVMEVGTYAKELRGATKEKESLPQMLRGLSKLRNLGQGYVNFGEPMPLMTYLN QHVPDWRESIDPIEAVRPAWLTPTVNNIAADLMVRINNAGAANAMNLCCTALLASRQRSLTREQLTEQLNCYLDLMRNVP YSTDSTVPSASASELIDHALQMNKFEVEKDTIGDIIILPREQAVLMTYYRNNIAHMLVLPSLMAAIVTQHRHISRDVLME HVNVLYPMLKAELFLRWDRDELPDVIDALANEMQRQGLITLQDDELHINPAHSRTLQLLAAGARETLQRYAITFWLLSAN PSINRGTLEKESRTVAQRLSVLHGINAPEFFDKAVFSSLVLTLRDEGYISDSGDAEPAETMKVYQLLAELITSDVRLTIE SATQGEG >Mature_806_residues SGWPRIYYKLLNLPLSILVKSKSIPADPAPELGLDTSRPIMYVLPYNSKADLLTLRAQCLAHDLPDPLEPLEIDGTLLPR YVFIHGGPRVFTYYTPKEESIKLFHDYLDLHRSNPNLDVQMVPVSVMFGRAPGREKGEVNPPLRMLNGVQKFFAVLWLGR DSFVRFSPSVSLRRMADEHGTDKTIAQKLARVARMHFARQRLAAVGPRLPARQDLFNKLLASRAIAKAVEDEARSKKISH EKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWNRFYQGINVHNAERVRQLAHDGHELVYVPCHRSHMDYLLLSYVLYH QGLVPPHIAAGINLNFWPAGPIFRRLGAFFIRRTFKGNKLYSTVFREYLGELFSRGYSVEYFVEGGRSRTGRLLDPKTGT LSMTIQAMLRGGTRPITLIPIYIGYEHVMEVGTYAKELRGATKEKESLPQMLRGLSKLRNLGQGYVNFGEPMPLMTYLNQ HVPDWRESIDPIEAVRPAWLTPTVNNIAADLMVRINNAGAANAMNLCCTALLASRQRSLTREQLTEQLNCYLDLMRNVPY STDSTVPSASASELIDHALQMNKFEVEKDTIGDIIILPREQAVLMTYYRNNIAHMLVLPSLMAAIVTQHRHISRDVLMEH VNVLYPMLKAELFLRWDRDELPDVIDALANEMQRQGLITLQDDELHINPAHSRTLQLLAAGARETLQRYAITFWLLSANP SINRGTLEKESRTVAQRLSVLHGINAPEFFDKAVFSSLVLTLRDEGYISDSGDAEPAETMKVYQLLAELITSDVRLTIES ATQGEG
Specific function: De novo phospholipid biosynthesis; first step. [C]
COG id: COG2937
COG function: function code I; Glycerol-3-phosphate O-acyltransferase
Gene ontology:
Cell location: Cell inner membrane; Peripheral membrane protein; Cytoplasmic side [H]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the GPAT/DAPAT family [H]
Homologues:
Organism=Homo sapiens, GI7657134, Length=435, Percent_Identity=27.816091954023, Blast_Score=150, Evalue=5e-36
Organism=Homo sapiens, GI190358539, Length=272, Percent_Identity=33.8235294117647, Blast_Score=135, Evalue=1e-31
Organism=Escherichia coli, GI87082362, Length=807, Percent_Identity=99.8760842627014, Blast_Score=1664, Evalue=0.0
Organism=Caenorhabditis elegans, GI71988723, Length=278, Percent_Identity=31.294964028777, Blast_Score=142, Evalue=6e-34
Organism=Caenorhabditis elegans, GI25147672, Length=279, Percent_Identity=29.7491039426523, Blast_Score=138, Evalue=1e-32
Organism=Caenorhabditis elegans, GI71988728, Length=190, Percent_Identity=37.8947368421053, Blast_Score=134, Evalue=1e-31
Organism=Drosophila melanogaster, GI24650754, Length=499, Percent_Identity=25.6513026052104, Blast_Score=146, Evalue=5e-35
Organism=Drosophila melanogaster, GI21357731, Length=499, Percent_Identity=25.6513026052104, Blast_Score=146, Evalue=6e-35
Organism=Drosophila melanogaster, GI24650752, Length=499, Percent_Identity=25.6513026052104, Blast_Score=146, Evalue=6e-35
Organism=Drosophila melanogaster, GI17864692, Length=270, Percent_Identity=30.7407407407407, Blast_Score=143, Evalue=4e-34
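Each homologue entry follows the same `Organism=…, GI…, Length=…, Percent_Identity=…, Blast_Score=…, Evalue=…` pattern, so the list can be parsed into records with a regular expression. The sketch below is one possible parser, shown on three of the entries; the pattern and field names are assumptions based only on the entries in this record, not a documented format.

```python
import re

# Pattern inferred from the entries in this record (an assumption).
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\de.\-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in RECORD.finditer(text):
        hits.append({
            "organism": m["organism"],
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI7657134, Length=435, "
    "Percent_Identity=27.816091954023, Blast_Score=150, Evalue=5e-36, "
    "Organism=Escherichia coli, GI87082362, Length=807, "
    "Percent_Identity=99.8760842627014, Blast_Score=1664, Evalue=0.0, "
    "Organism=Drosophila melanogaster, GI17864692, Length=270, "
    "Percent_Identity=30.7407407407407, Blast_Score=143, Evalue=4e-34,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["score"])  # Escherichia coli 1664
```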
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002123
- InterPro: IPR022284 [H]
Pfam domain/function: PF01553 Acyltransferase [H]
EC number: 2.3.1.15 [H]
Molecular weight: Translated: 91417; Mature: 91286
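The 131 Da gap between the translated and mature molecular weights is what removal of a single methionine residue would give. The sketch below checks this with approximate average masses (free methionine about 149.21 Da, water about 18.015 Da); those two constants are textbook values supplied here, not figures from this record.

```python
TRANSLATED_MW = 91417  # from the record
MATURE_MW = 91286      # from the record

# Approximate average masses in Da (assumed values, not from the record).
FREE_METHIONINE = 149.21
WATER = 18.015

# A residue within a chain is the free amino acid minus one water.
met_residue = FREE_METHIONINE - WATER

print(round(met_residue, 1))                # 131.2
print(round(TRANSLATED_MW - met_residue))   # 91286
```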
Theoretical pI: Translated: 8.56; Mature: 8.56
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
3.3 %Met (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
3.2 %Met (Mature Protein)
3.8 %Cys+Met (Mature Protein)
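The combined Cys+Met figure for the translated protein (4.0%) is not the sum of the rounded Cys and Met figures (0.6% + 3.3%), which suggests each percentage is computed from residue counts and rounded separately. The sketch below checks this with counts of 5 Cys and 27 Met, tallied by hand from the protein sequence in this record and therefore best treated as an assumption until recounted; the helper names are illustrative.

```python
def cys_met_counts(seq: str) -> tuple[int, int]:
    """Count cysteine (C) and methionine (M) residues in a protein string."""
    seq = "".join(seq.split()).upper()
    return seq.count("C"), seq.count("M")


def percent(count: int, length: int) -> float:
    """Residue share of the chain as a percentage, rounded to 1 decimal."""
    return round(100.0 * count / length, 1)


# Assumption: counts tallied by hand from the translated sequence above.
CYS, MET = 5, 27

# Translated protein: 807 residues.
print(percent(CYS, 807), percent(MET, 807), percent(CYS + MET, 807))
# Mature protein: 806 residues, one methionine fewer.
print(percent(CYS, 806), percent(MET - 1, 806), percent(CYS + MET - 1, 806))
```

To recount from the sequence itself, pass the full protein string to `cys_met_counts`.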
Secondary structure:
>Translated Secondary Structure MSGWPRIYYKLLNLPLSILVKSKSIPADPAPELGLDTSRPIMYVLPYNSKADLLTLRAQC CCCCHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHH LAHDLPDPLEPLEIDGTLLPRYVFIHGGPRVFTYYTPKEESIKLFHDYLDLHRSNPNLDV HHCCCCCCCCCCCCCCEEECEEEEEECCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCEE QMVPVSVMFGRAPGREKGEVNPPLRMLNGVQKFFAVLWLGRDSFVRFSPSVSLRRMADEH EEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHC GTDKTIAQKLARVARMHFARQRLAAVGPRLPARQDLFNKLLASRAIAKAVEDEARSKKIS CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH HEKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWNRFYQGINVHNAERVRQLAHDGHEL HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCEE VYVPCHRSHMDYLLLSYVLYHQGLVPPHIAAGINLNFWPAGPIFRRLGAFFIRRTFKGNK EEECCCCCHHHHHHHHHHHHHCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCCH LYSTVFREYLGELFSRGYSVEYFVEGGRSRTGRLLDPKTGTLSMTIQAMLRGGTRPITLI HHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCEECCCCCHHHHHHHHHHCCCCCCEEEE PIYIGYEHVMEVGTYAKELRGATKEKESLPQMLRGLSKLRNLGQGYVNFGEPMPLMTYLN EEECCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH QHVPDWRESIDPIEAVRPAWLTPTVNNIAADLMVRINNAGAANAMNLCCTALLASRQRSL HCCCCHHHCCCHHHHCCCCCCCCCHHHHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHH TREQLTEQLNCYLDLMRNVPYSTDSTVPSASASELIDHALQMNKFEVEKDTIGDIIILPR HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCCCCEEEECC EQAVLMTYYRNNIAHMLVLPSLMAAIVTQHRHISRDVLMEHVNVLYPMLKAELFLRWDRD CCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCHH ELPDVIDALANEMQRQGLITLQDDELHINPAHSRTLQLLAAGARETLQRYAITFWLLSAN HHHHHHHHHHHHHHHCCCEEEECCCEEECCCHHHHHHHHHHHHHHHHHHHHHHEEEEECC PSINRGTLEKESRTVAQRLSVLHGINAPEFFDKAVFSSLVLTLRDEGYISDSGDAEPAET CCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH MKVYQLLAELITSDVRLTIESATQGEG HHHHHHHHHHHCCCCEEEEECCCCCCC >Mature Secondary Structure SGWPRIYYKLLNLPLSILVKSKSIPADPAPELGLDTSRPIMYVLPYNSKADLLTLRAQC CCCHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHH LAHDLPDPLEPLEIDGTLLPRYVFIHGGPRVFTYYTPKEESIKLFHDYLDLHRSNPNLDV HHCCCCCCCCCCCCCCEEECEEEEEECCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCEE 
QMVPVSVMFGRAPGREKGEVNPPLRMLNGVQKFFAVLWLGRDSFVRFSPSVSLRRMADEH EEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHC GTDKTIAQKLARVARMHFARQRLAAVGPRLPARQDLFNKLLASRAIAKAVEDEARSKKIS CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH HEKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWNRFYQGINVHNAERVRQLAHDGHEL HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCEE VYVPCHRSHMDYLLLSYVLYHQGLVPPHIAAGINLNFWPAGPIFRRLGAFFIRRTFKGNK EEECCCCCHHHHHHHHHHHHHCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCCH LYSTVFREYLGELFSRGYSVEYFVEGGRSRTGRLLDPKTGTLSMTIQAMLRGGTRPITLI HHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCEECCCCCHHHHHHHHHHCCCCCCEEEE PIYIGYEHVMEVGTYAKELRGATKEKESLPQMLRGLSKLRNLGQGYVNFGEPMPLMTYLN EEECCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH QHVPDWRESIDPIEAVRPAWLTPTVNNIAADLMVRINNAGAANAMNLCCTALLASRQRSL HCCCCHHHCCCHHHHCCCCCCCCCHHHHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHH TREQLTEQLNCYLDLMRNVPYSTDSTVPSASASELIDHALQMNKFEVEKDTIGDIIILPR HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCCCCEEEECC EQAVLMTYYRNNIAHMLVLPSLMAAIVTQHRHISRDVLMEHVNVLYPMLKAELFLRWDRD CCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCHH ELPDVIDALANEMQRQGLITLQDDELHINPAHSRTLQLLAAGARETLQRYAITFWLLSAN HHHHHHHHHHHHHHHCCCEEEECCCEEECCCHHHHHHHHHHHHHHHHHHHHHHEEEEECC PSINRGTLEKESRTVAQRLSVLHGINAPEFFDKAVFSSLVLTLRDEGYISDSGDAEPAET CCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH MKVYQLLAELITSDVRLTIESATQGEG HHHHHHHHHHHCCCCEEEEECCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA