Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

The map label for this gene is entF [H]

Identifier: 29142679

GI number: 29142679

Start: 2347636

End: 2351520

Strand: Reverse

Name: entF [H]

Synonym: t2280

Alternate gene names: 29142679

Gene position: 2351520-2347636 (Counterclockwise)
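
The gene length can be recovered from the coordinates above. The sketch below assumes the Start and End values are 1-based and inclusive, which matches the 3885-base sequence header in this record; the function name is illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START = 2347636
END = 2351520


def gene_length(start: int, end: int) -> int:
    """Return the number of bases spanned by inclusive coordinates."""
    return abs(end - start) + 1


print(gene_length(START, END))  # 3885
```

Because the gene lies on the reverse strand, the position is listed from the higher coordinate to the lower one; the length is the same either way.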

Preceding gene: 29142680

Following gene: 29142675

Centisome position: 49.07
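
The centisome value appears to be the gene's first base expressed as a percentage of the chromosome length. This is an assumption inferred from the numbers in this record, not a documented formula: for this reverse-strand gene the value 49.07 is reproduced from the End coordinate (2351520), not the Start coordinate.

```python
GENOME_LENGTH = 4791961   # chromosome length from the record header
GENE_FIRST_BASE = 2351520  # reverse-strand gene begins at the "End" coordinate


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


print(centisome(GENE_FIRST_BASE, GENOME_LENGTH))  # 49.07
```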

GC content: 58.94
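
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch under that assumption follows; the function name is hypothetical, and the record does not state how ambiguous bases, if any, were handled.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, rounded to 2 decimals."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return round(100.0 * gc / len(dna), 2)


# Toy input, not the gene itself.
print(gc_content("GGCCAATT"))  # 50.0
```

Applying the function to the gene sequence below, with line breaks removed, should give a value comparable to the one reported here.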

Gene sequence:

>3885_bases
ATGACGCAGCGTTTACCGTTAGTCGCCGCCCAGCCGGGGATCTGGATGGCGGAAAAACTCTCTGATTTACCCTCCGCCTG
GAGCGTGGCGCACTATGTGGAACTGAATGGCGAGCTGGATGCCCCCTTGCTGGCAAAAGCGGTAGCGGTAGGGATGCAAC
AGGCGGATACGCTACGGATGCGTTTTACCGAAGAGAACGGCGAGGTCTGGCAATGGATAGATCCTGAGCACACCTTCGGC
GAACCCCCGATTGCCGATTTACGCGACCAGCCCGATCCGCATCTCGCGGCGCTGGCGTTAATGCAGGCGGATTTACGGCA
AAACCTGCGCGCAGATAGCGGTAAGCCGCTGGCGTTTCACCAGTTAATCCGCATTGATGATACCCGCTGGTATTGGTATC
AGCGCTATCACCATTTGCTGGTGGATGGCTTTAGCTTCCCGGCGATAACTCGCCAGATTGCGGCGATTTATCGCGCCTGG
CAGAGCGACGCCCCTACGCCGGAGTCGCCTTTTACCCCCTTTGTGGATGTGGTTGAGGAATATCAGCGCTATCGCCAGAG
CGAGGCCTGGCAGCGTGACGGCGCCTTCTGGGCGCAGCAGCGCCGCGAGCTGCCGCCGCCGGCGTCAATGTCTGCCGCGC
CGTTGCCGGGGCGGTCGGCGAGCGCGGATATTCTGCGTATGAAATTGAGCGCGCCGGCGGGGGCGTTTCGCCAACTGGCG
GCGCACATGCCTGAGATACCGCGAGCGGATCTTGCCCTGGCGCTGGTGACGTTGTGGCTGGGACGATTATGTGGGCGTAT
GGATTATGCCGCCGGATTTATCTTTATGCGGCGGATGGGCTCAGCGGCGTTAACGGCGACCGGGCCTGTCCTTAACGTGT
TGCCGCTGGCGGTTAACCTTCATGCGACGGAAGATCTGCCAACGCTGGCGAAGCGTCTTGCGGCGCAGTTAAAGAAGATG
CGCCGCCACCAGCGTTACGACGCCGAACAGATTGTACGCGATAGCGGGCGAGCCGCAGGGGAAACGCCGCTATTTGGCCC
GGTGCTCAACATAAAAGTATTTGATTATCATCTGGATTTTCCTGGCATACAGGCGCAAACCCATACCCTGGCGACGGGGC
CGGTTAACGATCTTGAACTGGCGCTTTTTCCGGATGAAAACGGCGGTCTGGATATTGAATTGCTGGCGAATGCGCAGCGT
TACGATGACGCCACGCTTTCCCGCCATGCCTTACGGTTGATGGCGCTTATCACGCAGTTTGCTGATAACCCGGCGCTGCG
CTGCGGCGATGCGCAAATGCTGCTGGCGGAAGAACAAACGCAATTAACACACCTTAATAATACGGCGGTAACGATTCCCG
CCGCCACGCTTAGCGATTTGGTGGCGCAGCAGGCGCAAAAAACGCCAGAGGCTTCCGCGTTGGCAGATGCGCATTATCAC
TTTACCTACCGTGAAATGCGCGAACAGGTTGTGGCGCTGGCATACGCGCTGCGGGAACGCGGCGTTCAGCCTGGCGATAG
CGTGGCGGTGGCGTTGCCGCGGTCGGTTTTTCTGACCTTAGCGCTGCACGGCATTGTCGAAGCGGGCGCCGCCTGGCTGC
CGCTGGATACCGGTTATCCTGACGATCGGCTGCGAATGATGCTGGAAGATGCGCAGCCGAAACTGTTAATTACGACTCAG
GCGCAGCTGGCGCGCTTTCACGATATTCCGGGGATGGAATATTTGTGTTATAGCCAACCGCTACCGGTCAGTGACGCCAC
TCCGCTGGGGCTGTCGCTACCGCATCATACCGCTTACATCATTTTCACCTCTGGTTCGACGGGCAGGCCGAAAGGGGTGA
TGGTGGGACAAACGGCGATAGTCAACCGGCTGCTGTGGATGCAGGATCACTATCCGTTGACGGCGGATGATGTAGTAGCG
CAAAAAACACCGTGCAGTTTTGACGTCTCAGTATGGGAGTTTTTCTGGCCGTTTATCGCCGGGGCGAAACTGGTGATGGC
TGAACCGGAAGCGCACCGCGATCCGCTCGCGATGCAGCGGTTCTTTGCGCAATACGGCGTCACGACCACCCATTTTGTGC
CGTCGATGCTGGCGGCCTTTATTGCCTCGCTCACGCCAGCGTCGGCTGGGAAAAGCTGCGCTTCCTTAAAGCGCGTTTTC
TGTAGCGGCGAGGCCCTGCCGACGGCGCTGTGCCGCGAATGGGAGACGTTAACCAACGCGCCGCTACACAATCTGTACGG
GCCAACGGAGGCCGCGGTGGATGTGAGCTGGTATCCGGCCTGTGGCGATGAGCTGGCGGCTGTTGACGGCAACAGTATCC
CGATTGGTTATCCCGTCTGGAATACCGGTTTACGTATTCTCGATGCGCATATGCAGCCGGTGCCGCCGGGCGTGGCTGGC
GATCTCTATCTTACCGGTATCCAACTGGCGCAGGGCTATCTGGGGCGTCCGGATCTTACCGCCAGCCGTTTTATCGCCGA
TCCTTTTGCGCCAGGTGAACGGATGTACCGTACCGGAGATGTGGCGCGCTGGCTGGATTCCGGCGTGGTGGAGTACCTGG
GACGCAGCGACGATCAGCTCAAAATTCGGGGGCAACGTATTGAACTGGGTGAGATTGACCGCGTCATGCAAACGCTGCCG
GATGTTGAACAGGCGGTAGCACATGCCTGCGTCTTTAATCAGGCGGCGGCGACGGGCGGGGATGCCCGGCAACTGGTCGG
CTATCTGGTGTCGCACTCCGGTTTACCGCTGGATTTACCGGCGCTGCAGGATAAATTACGCCAAAAACTTCCCGCGCATA
TGGTGCCGGTTGTGCTGTTGCAACTTGCCAGCTTGCCGCTTAGCGCTAATGGCAAGCTGGATCGCAAAGCCCTGCCGCTG
CCGGATCTGACGCCGCGCGTGAAAGGGCGTGCGCCGCAGTCCGCAACGGAGATTGCTGTCGCCGCGGCGTTTTCCCGCCT
GCTGGGCTGTGAGATTAACGACGTTGAAAGTGATTTCTTTGCGCTGGGCGGACACTCGCTGCTGGCGATGAAACTGGCCG
CGCAGCTAAGCCAGACGTTTAACCGCCAGGTGACGCCGGGGCAGGTGATGGTGGCGTCTGACGTGGCGCAGTTGAGTAAG
CTGCTCGATACTGACGATGACGAACGCTCGCGCAATCTGGGGTTCGGGCCGCTGTTGCCGCTGCGTGAAAGCGATGGTCC
AACGTTATTTTGTTTCCATCCGGCGTCGGGGTTCGCCTGGCAATTTAGCGTATTATCACGCTATCTCAGCCCGTCGTGGT
CGATTATGGGGATTCAGTCGCCGCGCCCCGTTGGCCCTATGCAAATCGCAACCACCCTTGATGAGGTATGTGAACATCAT
CTGGCGACATTGCTCGCCCGGCAGCCGCATGGCCCTTATTATTTATTAGGCTATTCGCTGGGCGGGACGCTGGCGCAAGG
GATCGCCGCCCGGTTGCGCGCGCGCGGTGAAACGGTGGCGTTTTTGGGGTTACTGGACACCTGGCCGCCGGAAACACAAA
ACTGGCGGGGAAAAGAGGCGAATGGCCTCAATCCGGACGTATTGGCGGAAATTGAACGCGAACGCGCAGCGTTTGTCGCC
GCGCAGCAGGGAAATGCCTCCGACGCCTTATTTACCGCCATTGAAGGCAATTATGCGGATGCCGTCCGTTTGCTGACGAC
GGCGCATAGCGCACCGTTTGATGGACATGCCACGCTCTTTGTGGCGGATAAAACGGTGCCGGAAGACGTATCGCCTGAAC
AGAGCTGGTCGCCGTGGATAGCGTCTTTGGCTATTTATCGCCAGCCGTGCGCGCATGTAGATATTATTTCTCCCTCCGCT
TTTGAAACTATCGGGCCCATCATCAGCGAACTTATTAATAAATAA
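
The gene sequence above is given in coding orientation, so it can be translated directly. The sketch below uses the standard genetic code and stops at the first stop codon; the helper names are illustrative, and only the first and last few codons of the gene are used as examples.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACGCAGCGTTTACCG"))  # first six codons: MTQRLP
print(translate("ATTAATAAATAA"))        # last four codons: INK
```

Both results agree with the start (MTQRLP) and end (INK) of the protein sequence listed in this record.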

Upstream 100 bases:

>100_bases
TGGTATGCGAACCGCAATCGCAGGACGCGTGCCAGCAGTGGCTTAATACGCGCTGGACAACACTGAATCCGGCGCATTAT
GCCGATAAGCAGGAGGCGAA

Downstream 100 bases:

>100_bases
CGGGCGTTGTTTCTGCCTTTAACAAATTAAATCCTGAAACCCATAATAATGACTAATTATTATGGGTTTTTTATTGCAAC
TATTAATTCTTTTAACATAA

Product: enterobactin synthase subunit F

Products: NA

Alternate protein names: Enterochelin synthase F; Serine-activating enzyme; Seryl-AMP ligase [H]

Number of amino acids: Translated: 1294; Mature: 1293
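
The two residue counts follow from the gene length. The sketch assumes the final codon is a stop codon and that the mature form differs only by removal of the initiator methionine, which is consistent with the mature sequence in this record starting at TQRL.

```python
GENE_LENGTH = 3885  # bases, from the gene sequence header


def residue_counts(gene_length: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    codons = gene_length // 3
    translated = codons - 1   # the last codon is the stop codon
    mature = translated - 1   # assumes the initiator Met is removed
    return translated, mature


print(residue_counts(GENE_LENGTH))  # (1294, 1293)
```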

Protein sequence:

>1294_residues
MTQRLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELNGELDAPLLAKAVAVGMQQADTLRMRFTEENGEVWQWIDPEHTFG
EPPIADLRDQPDPHLAALALMQADLRQNLRADSGKPLAFHQLIRIDDTRWYWYQRYHHLLVDGFSFPAITRQIAAIYRAW
QSDAPTPESPFTPFVDVVEEYQRYRQSEAWQRDGAFWAQQRRELPPPASMSAAPLPGRSASADILRMKLSAPAGAFRQLA
AHMPEIPRADLALALVTLWLGRLCGRMDYAAGFIFMRRMGSAALTATGPVLNVLPLAVNLHATEDLPTLAKRLAAQLKKM
RRHQRYDAEQIVRDSGRAAGETPLFGPVLNIKVFDYHLDFPGIQAQTHTLATGPVNDLELALFPDENGGLDIELLANAQR
YDDATLSRHALRLMALITQFADNPALRCGDAQMLLAEEQTQLTHLNNTAVTIPAATLSDLVAQQAQKTPEASALADAHYH
FTYREMREQVVALAYALRERGVQPGDSVAVALPRSVFLTLALHGIVEAGAAWLPLDTGYPDDRLRMMLEDAQPKLLITTQ
AQLARFHDIPGMEYLCYSQPLPVSDATPLGLSLPHHTAYIIFTSGSTGRPKGVMVGQTAIVNRLLWMQDHYPLTADDVVA
QKTPCSFDVSVWEFFWPFIAGAKLVMAEPEAHRDPLAMQRFFAQYGVTTTHFVPSMLAAFIASLTPASAGKSCASLKRVF
CSGEALPTALCREWETLTNAPLHNLYGPTEAAVDVSWYPACGDELAAVDGNSIPIGYPVWNTGLRILDAHMQPVPPGVAG
DLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGDVARWLDSGVVEYLGRSDDQLKIRGQRIELGEIDRVMQTLP
DVEQAVAHACVFNQAAATGGDARQLVGYLVSHSGLPLDLPALQDKLRQKLPAHMVPVVLLQLASLPLSANGKLDRKALPL
PDLTPRVKGRAPQSATEIAVAAAFSRLLGCEINDVESDFFALGGHSLLAMKLAAQLSQTFNRQVTPGQVMVASDVAQLSK
LLDTDDDERSRNLGFGPLLPLRESDGPTLFCFHPASGFAWQFSVLSRYLSPSWSIMGIQSPRPVGPMQIATTLDEVCEHH
LATLLARQPHGPYYLLGYSLGGTLAQGIAARLRARGETVAFLGLLDTWPPETQNWRGKEANGLNPDVLAEIERERAAFVA
AQQGNASDALFTAIEGNYADAVRLLTTAHSAPFDGHATLFVADKTVPEDVSPEQSWSPWIASLAIYRQPCAHVDIISPSA
FETIGPIISELINK

Sequences:

>Translated_1294_residues
MTQRLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELNGELDAPLLAKAVAVGMQQADTLRMRFTEENGEVWQWIDPEHTFG
EPPIADLRDQPDPHLAALALMQADLRQNLRADSGKPLAFHQLIRIDDTRWYWYQRYHHLLVDGFSFPAITRQIAAIYRAW
QSDAPTPESPFTPFVDVVEEYQRYRQSEAWQRDGAFWAQQRRELPPPASMSAAPLPGRSASADILRMKLSAPAGAFRQLA
AHMPEIPRADLALALVTLWLGRLCGRMDYAAGFIFMRRMGSAALTATGPVLNVLPLAVNLHATEDLPTLAKRLAAQLKKM
RRHQRYDAEQIVRDSGRAAGETPLFGPVLNIKVFDYHLDFPGIQAQTHTLATGPVNDLELALFPDENGGLDIELLANAQR
YDDATLSRHALRLMALITQFADNPALRCGDAQMLLAEEQTQLTHLNNTAVTIPAATLSDLVAQQAQKTPEASALADAHYH
FTYREMREQVVALAYALRERGVQPGDSVAVALPRSVFLTLALHGIVEAGAAWLPLDTGYPDDRLRMMLEDAQPKLLITTQ
AQLARFHDIPGMEYLCYSQPLPVSDATPLGLSLPHHTAYIIFTSGSTGRPKGVMVGQTAIVNRLLWMQDHYPLTADDVVA
QKTPCSFDVSVWEFFWPFIAGAKLVMAEPEAHRDPLAMQRFFAQYGVTTTHFVPSMLAAFIASLTPASAGKSCASLKRVF
CSGEALPTALCREWETLTNAPLHNLYGPTEAAVDVSWYPACGDELAAVDGNSIPIGYPVWNTGLRILDAHMQPVPPGVAG
DLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGDVARWLDSGVVEYLGRSDDQLKIRGQRIELGEIDRVMQTLP
DVEQAVAHACVFNQAAATGGDARQLVGYLVSHSGLPLDLPALQDKLRQKLPAHMVPVVLLQLASLPLSANGKLDRKALPL
PDLTPRVKGRAPQSATEIAVAAAFSRLLGCEINDVESDFFALGGHSLLAMKLAAQLSQTFNRQVTPGQVMVASDVAQLSK
LLDTDDDERSRNLGFGPLLPLRESDGPTLFCFHPASGFAWQFSVLSRYLSPSWSIMGIQSPRPVGPMQIATTLDEVCEHH
LATLLARQPHGPYYLLGYSLGGTLAQGIAARLRARGETVAFLGLLDTWPPETQNWRGKEANGLNPDVLAEIERERAAFVA
AQQGNASDALFTAIEGNYADAVRLLTTAHSAPFDGHATLFVADKTVPEDVSPEQSWSPWIASLAIYRQPCAHVDIISPSA
FETIGPIISELINK
>Mature_1293_residues
TQRLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELNGELDAPLLAKAVAVGMQQADTLRMRFTEENGEVWQWIDPEHTFGE
PPIADLRDQPDPHLAALALMQADLRQNLRADSGKPLAFHQLIRIDDTRWYWYQRYHHLLVDGFSFPAITRQIAAIYRAWQ
SDAPTPESPFTPFVDVVEEYQRYRQSEAWQRDGAFWAQQRRELPPPASMSAAPLPGRSASADILRMKLSAPAGAFRQLAA
HMPEIPRADLALALVTLWLGRLCGRMDYAAGFIFMRRMGSAALTATGPVLNVLPLAVNLHATEDLPTLAKRLAAQLKKMR
RHQRYDAEQIVRDSGRAAGETPLFGPVLNIKVFDYHLDFPGIQAQTHTLATGPVNDLELALFPDENGGLDIELLANAQRY
DDATLSRHALRLMALITQFADNPALRCGDAQMLLAEEQTQLTHLNNTAVTIPAATLSDLVAQQAQKTPEASALADAHYHF
TYREMREQVVALAYALRERGVQPGDSVAVALPRSVFLTLALHGIVEAGAAWLPLDTGYPDDRLRMMLEDAQPKLLITTQA
QLARFHDIPGMEYLCYSQPLPVSDATPLGLSLPHHTAYIIFTSGSTGRPKGVMVGQTAIVNRLLWMQDHYPLTADDVVAQ
KTPCSFDVSVWEFFWPFIAGAKLVMAEPEAHRDPLAMQRFFAQYGVTTTHFVPSMLAAFIASLTPASAGKSCASLKRVFC
SGEALPTALCREWETLTNAPLHNLYGPTEAAVDVSWYPACGDELAAVDGNSIPIGYPVWNTGLRILDAHMQPVPPGVAGD
LYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGDVARWLDSGVVEYLGRSDDQLKIRGQRIELGEIDRVMQTLPD
VEQAVAHACVFNQAAATGGDARQLVGYLVSHSGLPLDLPALQDKLRQKLPAHMVPVVLLQLASLPLSANGKLDRKALPLP
DLTPRVKGRAPQSATEIAVAAAFSRLLGCEINDVESDFFALGGHSLLAMKLAAQLSQTFNRQVTPGQVMVASDVAQLSKL
LDTDDDERSRNLGFGPLLPLRESDGPTLFCFHPASGFAWQFSVLSRYLSPSWSIMGIQSPRPVGPMQIATTLDEVCEHHL
ATLLARQPHGPYYLLGYSLGGTLAQGIAARLRARGETVAFLGLLDTWPPETQNWRGKEANGLNPDVLAEIERERAAFVAA
QQGNASDALFTAIEGNYADAVRLLTTAHSAPFDGHATLFVADKTVPEDVSPEQSWSPWIASLAIYRQPCAHVDIISPSAF
ETIGPIISELINK

Specific function: Activates the carboxylate group of L-serine via ATP-dependent PPi exchange reactions to the aminoacyladenylate, preparing that molecule for the final stages of enterobactin synthesis. Holo-EntF acts as the catalyst for the formation of the three amide an

COG id: COG1020

COG function: function code Q; Non-ribosomal peptide synthetase modules and related proteins

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 acyl carrier domain [H]

Homologues:

Organism=Homo sapiens, GI45580730, Length=467, Percent_Identity=26.1241970021413, Blast_Score=119, Evalue=3e-26,
Organism=Homo sapiens, GI42544132, Length=493, Percent_Identity=22.920892494929, Blast_Score=80, Evalue=2e-14,
Organism=Homo sapiens, GI38505220, Length=519, Percent_Identity=22.9287090558767, Blast_Score=80, Evalue=2e-14,
Organism=Escherichia coli, GI1786801, Length=1294, Percent_Identity=78.6707882534776, Blast_Score=2025, Evalue=0.0,
Organism=Escherichia coli, GI145693145, Length=550, Percent_Identity=24.3636363636364, Blast_Score=104, Evalue=5e-23,
Organism=Escherichia coli, GI1788107, Length=553, Percent_Identity=22.6039783001808, Blast_Score=95, Evalue=3e-20,
Organism=Escherichia coli, GI1786810, Length=542, Percent_Identity=23.8007380073801, Blast_Score=93, Evalue=1e-19,
Organism=Caenorhabditis elegans, GI17556356, Length=534, Percent_Identity=25.8426966292135, Blast_Score=132, Evalue=1e-30,
Organism=Caenorhabditis elegans, GI17550940, Length=584, Percent_Identity=22.4315068493151, Blast_Score=102, Evalue=1e-21,
Organism=Saccharomyces cerevisiae, GI6319591, Length=687, Percent_Identity=24.745269286754, Blast_Score=191, Evalue=7e-49,
Organism=Drosophila melanogaster, GI24648676, Length=505, Percent_Identity=31.0891089108911, Blast_Score=216, Evalue=7e-56,
Organism=Drosophila melanogaster, GI24582852, Length=304, Percent_Identity=27.6315789473684, Blast_Score=80, Evalue=1e-14,
Organism=Drosophila melanogaster, GI24648257, Length=536, Percent_Identity=23.6940298507463, Blast_Score=78, Evalue=3e-14,
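
Each homologue line above is a comma-separated list of key=value fields, with the GI number given as a bare `GI...` token. A minimal parser for that layout might look like the following; the function name and the returned dictionary keys are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786801, Length=1294, "
    "Percent_Identity=78.6707882534776, Blast_Score=2025, Evalue=0.0,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])
```

Sorting the parsed records by `Blast_Score` would place the full-length Escherichia coli hit (score 2025) first.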

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010071
- InterPro:   IPR009081
- InterPro:   IPR020845
- InterPro:   IPR000873
- InterPro:   IPR001242
- InterPro:   IPR006163
- InterPro:   IPR006162
- InterPro:   IPR001031 [H]

Pfam domain/function: PF00501 AMP-binding; PF00668 Condensation; PF00550 PP-binding; PF00975 Thioesterase [H]

EC number: 2.7.7.-

Molecular weight: Translated: 141780; Mature: 141649
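
The gap between the two molecular weights can be checked with simple arithmetic. The sketch assumes the mature form lacks only the initiator methionine; the average residue mass used for methionine (about 131.2 Da) is a standard textbook value, not a figure taken from this record.

```python
TRANSLATED_MW = 141780  # from the record
MATURE_MW = 141649      # from the record
MET_RESIDUE_MASS = 131.2  # approximate average mass of a Met residue (assumed)

difference = TRANSLATED_MW - MATURE_MW
print(difference)  # 131

# The difference agrees with one methionine residue to within rounding.
print(abs(difference - MET_RESIDUE_MASS) < 1.0)  # True
```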

Theoretical pI: Translated: 5.63; Mature: 5.63

Prosite motif: PS00012 PHOSPHOPANTETHEINE ; PS50075 ACP_DOMAIN ; PS00455 AMP_BINDING

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
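
The percentages above are presumably residue counts divided by sequence length. A small sketch under that assumption, shown on a toy sequence and not on the protein itself:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


toy = "MACM"  # toy input for illustration
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

Removing the initiator methionine lowers the Met count by one, which would explain why the mature Met percentage is slightly below the translated one.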

Secondary structure:

>Translated Secondary Structure
MTQRLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELNGELDAPLLAKAVAVGMQQADTLRM
CCCCCCEEECCCCCHHHHHHHCCCCHHCEEEEEEECCCCCCHHHHHHHHHHHHHCCEEEE
RFTEENGEVWQWIDPEHTFGEPPIADLRDQPDPHLAALALMQADLRQNLRADSGKPLAFH
EEECCCCCEEEEECCCCCCCCCCCHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHH
QLIRIDDTRWYWYQRYHHLLVDGFSFPAITRQIAAIYRAWQSDAPTPESPFTPFVDVVEE
HHEEECCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH
YQRYRQSEAWQRDGAFWAQQRRELPPPASMSAAPLPGRSASADILRMKLSAPAGAFRQLA
HHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHEEEEECCCHHHHHHHH
AHMPEIPRADLALALVTLWLGRLCGRMDYAAGFIFMRRMGSAALTATGPVLNVLPLAVNL
HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHEEEE
HATEDLPTLAKRLAAQLKKMRRHQRYDAEQIVRDSGRAAGETPLFGPVLNIKVFDYHLDF
ECCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEECC
PGIQAQTHTLATGPVNDLELALFPDENGGLDIELLANAQRYDDATLSRHALRLMALITQF
CCCCCCCEEEECCCCCCCEEEEECCCCCCEEEEEECCCHHCCHHHHHHHHHHHHHHHHHH
ADNPALRCGDAQMLLAEEQTQLTHLNNTAVTIPAATLSDLVAQQAQKTPEASALADAHYH
CCCCCEECCCCCEEHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHCCCCHHHHHHHHHH
FTYREMREQVVALAYALRERGVQPGDSVAVALPRSVFLTLALHGIVEAGAAWLPLDTGYP
HHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECHHHHHHHHHHHHHHHCCCEEEECCCCCC
DDRLRMMLEDAQPKLLITTQAQLARFHDIPGMEYLCYSQPLPVSDATPLGLSLPHHTAYI
HHHHHHHHHCCCCCEEEECHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCEEEE
IFTSGSTGRPKGVMVGQTAIVNRLLWMQDHYPLTADDVVAQKTPCSFDVSVWEFFWPFIA
EEECCCCCCCCCEEECHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHH
GAKLVMAEPEAHRDPLAMQRFFAQYGVTTTHFVPSMLAAFIASLTPASAGKSCASLKRVF
CCEEEEECCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH
CSGEALPTALCREWETLTNAPLHNLYGPTEAAVDVSWYPACGDELAAVDGNSIPIGYPVW
HCCCCHHHHHHHHHHHHCCCCHHHCCCCCHHEEEEEECCCCCCCEEEECCCCCCCCCCCH
NTGLRILDAHMQPVPPGVAGDLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGD
HCCHHHEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCHHHHHHHH
VARWLDSGVVEYLGRSDDQLKIRGQRIELGEIDRVMQTLPDVEQAVAHACVFNQAAATGG
HHHHHHHHHHHHHCCCCCEEEEECCEECHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCC
DARQLVGYLVSHSGLPLDLPALQDKLRQKLPAHMVPVVLLQLASLPLSANGKLDRKALPL
HHHHHHHHHHHCCCCCEECHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCC
PDLTPRVKGRAPQSATEIAVAAAFSRLLGCEINDVESDFFALGGHSLLAMKLAAQLSQTF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHH
NRQVTPGQVMVASDVAQLSKLLDTDDDERSRNLGFGPLLPLRESDGPTLFCFHPASGFAW
CCCCCCCCEEEHHHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCEE
QFSVLSRYLSPSWSIMGIQSPRPVGPMQIATTLDEVCEHHLATLLARQPHGPYYLLGYSL
EHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECC
GGTLAQGIAARLRARGETVAFLGLLDTWPPETQNWRGKEANGLNPDVLAEIERERAAFVA
CHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHEEE
AQQGNASDALFTAIEGNYADAVRLLTTAHSAPFDGHATLFVADKTVPEDVSPEQSWSPWI
ECCCCCCCCEEEEECCCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCCCHHH
ASLAIYRQPCAHVDIISPSAFETIGPIISELINK
HHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
TQRLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELNGELDAPLLAKAVAVGMQQADTLRM
CCCCCEEECCCCCHHHHHHHCCCCHHCEEEEEEECCCCCCHHHHHHHHHHHHHCCEEEE
RFTEENGEVWQWIDPEHTFGEPPIADLRDQPDPHLAALALMQADLRQNLRADSGKPLAFH
EEECCCCCEEEEECCCCCCCCCCCHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHH
QLIRIDDTRWYWYQRYHHLLVDGFSFPAITRQIAAIYRAWQSDAPTPESPFTPFVDVVEE
HHEEECCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH
YQRYRQSEAWQRDGAFWAQQRRELPPPASMSAAPLPGRSASADILRMKLSAPAGAFRQLA
HHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHEEEEECCCHHHHHHHH
AHMPEIPRADLALALVTLWLGRLCGRMDYAAGFIFMRRMGSAALTATGPVLNVLPLAVNL
HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHEEEE
HATEDLPTLAKRLAAQLKKMRRHQRYDAEQIVRDSGRAAGETPLFGPVLNIKVFDYHLDF
ECCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEECC
PGIQAQTHTLATGPVNDLELALFPDENGGLDIELLANAQRYDDATLSRHALRLMALITQF
CCCCCCCEEEECCCCCCCEEEEECCCCCCEEEEEECCCHHCCHHHHHHHHHHHHHHHHHH
ADNPALRCGDAQMLLAEEQTQLTHLNNTAVTIPAATLSDLVAQQAQKTPEASALADAHYH
CCCCCEECCCCCEEHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHCCCCHHHHHHHHHH
FTYREMREQVVALAYALRERGVQPGDSVAVALPRSVFLTLALHGIVEAGAAWLPLDTGYP
HHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECHHHHHHHHHHHHHHHCCCEEEECCCCCC
DDRLRMMLEDAQPKLLITTQAQLARFHDIPGMEYLCYSQPLPVSDATPLGLSLPHHTAYI
HHHHHHHHHCCCCCEEEECHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCEEEE
IFTSGSTGRPKGVMVGQTAIVNRLLWMQDHYPLTADDVVAQKTPCSFDVSVWEFFWPFIA
EEECCCCCCCCCEEECHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHH
GAKLVMAEPEAHRDPLAMQRFFAQYGVTTTHFVPSMLAAFIASLTPASAGKSCASLKRVF
CCEEEEECCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH
CSGEALPTALCREWETLTNAPLHNLYGPTEAAVDVSWYPACGDELAAVDGNSIPIGYPVW
HCCCCHHHHHHHHHHHHCCCCHHHCCCCCHHEEEEEECCCCCCCEEEECCCCCCCCCCCH
NTGLRILDAHMQPVPPGVAGDLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGD
HCCHHHEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCHHHHHHHH
VARWLDSGVVEYLGRSDDQLKIRGQRIELGEIDRVMQTLPDVEQAVAHACVFNQAAATGG
HHHHHHHHHHHHHCCCCCEEEEECCEECHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCC
DARQLVGYLVSHSGLPLDLPALQDKLRQKLPAHMVPVVLLQLASLPLSANGKLDRKALPL
HHHHHHHHHHHCCCCCEECHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCC
PDLTPRVKGRAPQSATEIAVAAAFSRLLGCEINDVESDFFALGGHSLLAMKLAAQLSQTF
CCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHH
NRQVTPGQVMVASDVAQLSKLLDTDDDERSRNLGFGPLLPLRESDGPTLFCFHPASGFAW
CCCCCCCCEEEHHHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCEE
QFSVLSRYLSPSWSIMGIQSPRPVGPMQIATTLDEVCEHHLATLLARQPHGPYYLLGYSL
EHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECC
GGTLAQGIAARLRARGETVAFLGLLDTWPPETQNWRGKEANGLNPDVLAEIERERAAFVA
CHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHEEE
AQQGNASDALFTAIEGNYADAVRLLTTAHSAPFDGHATLFVADKTVPEDVSPEQSWSPWI
ECCCCCCCCEEEEECCCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCCCHHH
ASLAIYRQPCAHVDIISPSAFETIGPIISELINK
HHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHCC
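
The prediction strings above use one letter per residue. Assuming the conventional meaning (H = helix, E = strand, C = coil), the composition of a prediction can be summarized as below; the function name is hypothetical and the input is a toy string.

```python
from collections import Counter


def structure_composition(prediction: str) -> dict:
    """Percentage of each state letter in a secondary-structure string."""
    if not prediction:
        raise ValueError("empty prediction")
    counts = Counter(prediction)
    total = len(prediction)
    return {state: round(100.0 * n / total, 1) for state, n in counts.items()}


print(structure_composition("HHHHEECC"))
```

To apply this to the record, the state lines (every second line of each block) would first need to be joined into a single string.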

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: Phosphopantetheine. [C]

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Transferases; Acyltransferases; Transferring groups other than amino-acyl groups [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]