| Definition | Streptococcus pneumoniae D39, complete genome. |
|---|---|
| Accession | NC_008533 |
| Length | 2,046,115 |
The map label for this gene is fruA [H]
Identifier: 116516221
GI number: 116516221
Start: 783562
End: 785514
Strand: Direct
Name: fruA [H]
Synonym: SPD_0773
Alternate gene names: 116516221
Gene position: 783562-785514 (Clockwise)
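The coordinates above are consistent with the sequence and protein lengths reported further down in this record. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon (the function names are illustrative, not part of the database):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START = 783562
END = 785514


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_bases // 3 - 1


n_bases = gene_length(START, END)
n_residues = protein_length(n_bases)
print(n_bases, n_residues)  # 1953 650
```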
Preceding gene: 116516872
Following gene: 116517016
Centisome position: 38.3
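The centisome value can be reproduced from the gene start and the genome length in the header table. The sketch below assumes the position is the start coordinate expressed as a percentage of genome length; the record does not state which coordinate the database actually uses.

```python
# Values copied from the record.
GENOME_LENGTH = 2_046_115
GENE_START = 783_562


def centisome(position: int, genome_length: int) -> float:
    """Position on a circular genome as a percentage of its total length."""
    return 100.0 * position / genome_length


# Assumption: the database reports the start coordinate, rounded to one decimal.
print(round(centisome(GENE_START, GENOME_LENGTH), 1))  # 38.3
```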
GC content: 44.6
Gene sequence:
>1953_bases ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAGCTGTCATCGACGA GATGATTAAAAATTTGACAGAGCACGGTTATGTAACAGATTTTGAAACATTTAAAGAAGGAATTTTGGCGCGTGAAGCTT TGACTTCTACTGGTTTGGGTGATGGAATCGCAATGCCTCACAGCAAAAACGCTGCTGTCAAAGAAGCGACAGTTCTATTT GCTAAGTCAAATAAGGGTGTTGACTACGAGAGCTTGGATGGACAAGCAACTGACCTCTTCTTCATGATTGCAGCTCCAGA AGGTGCCAATGATACTCACTTGGCAGCCTTGGCAGAATTGTCTCAATACTTGATGAAAGACGGTTTTGCAGACAAACTTC GTCAAGCAACATCTGCAGACCAAGTTATCGAACTTTTTGACCAAGCTTCAGAAAAAACTGAGGAACTTGTTCAAGCACCT GCTAATGACTCTGGTGACTTTATCGTAGCTGTTACAGCTTGTACAACAGGTATTGCCCACACTTACATGGCCCAAGAAGC CCTTCAAAAAGTAGCTGCTGAAATGGGGGTTGGTATCAAGGTCGAAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAA CTGCAGAAGATATCCGTAAGGCTAAAGCTATTATCATTGCAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAA CCATTGATCAATCGTCCAGTTGCTGACGGTATCCGTAAGACAGAAGAGCTAATTAACTTGGCTCTTTCAGGAGATACTGA AGTCTACCGTGCCGCTAATGGTGCCAAAGCTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCCTTGTACAAAC ACTTGATGAGTGGTGTATCTCAAATGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCCTTCTTGATTGAC GGTGCTTTGGGTGTTCCAAATGAAAACCTTGGCAATCTTGGTTCTTACCATGAGTTAGCTTCTATGTTCATGAAAATTGG TGGAGCTGCCTTTGGTTTGATGCTTCCAGTCTTTGCGGGTTATGTTGCCTACTCTATTGCTGAAAAACCGGGTTTGGTAG CAGGTTTCGTGGCTGGTGCTATTGCCAAAGAAGGTTTTGCCTTTGGTAAAATTCCTTATGCCGCAGGTGGTGAAGCAACT TCAACTCTTGCAGGTGTCTCATCTGGTTTCCTAGGTGCCCTTGTTGGTGGATTTATCGCAGGTGCCTTGGTTCTTGCCAT CAAGAAATACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCTATTGCCACTTCTTGGAACAATCTTGA CAGGATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACTGCTATGAATGACTTCCTAGGCGGTCTTGGA GGAGGTTCAGCTGTCCTTCTTGGTATCGTCCTTGGTGGAATGATGGCTGTTGACATGGGTGGACCAGTTAATAAAGCAGC TTGTGTCTTTGGTACAGGTACGCTTGCAGCAACTGTTTCTTCAGGTGGTTCTGTAGCCATGGCAGCAGTTATGGCTGGAG GAATGGTGCCACCACTTGCAATCTTTGTCGCAACTCTTCTTTTCAAAGATAAATTTACTAAGGAAGAACGTAACTCTGGT TTGACAAACATCATCATGGGCTTGTCATTTATCACTGAGGGAGCGATTCCATTTGGTGCCGCTGACCCAGCTCGTGCGAT TCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTGGACTCGTTGGTCTTACTGGTATCAAACTCATGGCGCCACACGGAG GAATCTTCGTTATCGCCCTTACTTCAAATGCTCTCCTTTACCTCGTTTCTGTCTTGGTAGGAGCAATCGTAAGTGGTGTG GTTTATGGTTACCTACGCAAACCACAAGCATAA
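The GC content field can be recomputed from a gene sequence like the one above. A minimal sketch, assuming the sequence is plain text with optional whitespace between blocks; `gc_content` is a hypothetical helper, and the short example uses only the first 20 bases of the gene rather than the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop the spaces between sequence blocks
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence in this record: 2 G + 3 C out of 20.
print(gc_content("ATGAAAATTC AAGACCTATT"))  # 25.0
```

Passing the full 1953-base sequence should give a value comparable to the reported GC content, up to rounding.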
Upstream 100 bases:
>100_bases AATGGGGAGTGGCTTGCGGAACGGCAACTACCTTCTCAGATGACTTGGCAACGGCGGAATTTATTAAAGAAACATATGGA AAAGTTGAGGTAGAAAAAAG
Downstream 100 bases:
>100_bases AAAATAGAAAAATGAAAAGATTGGACCGTTTGGTGCAGTCTTTTTCTCTTCCCGAAATGCCTGTGAAATATGGTATAATA GAAGAATGGCAAACAAGAAT
Product: PTS system, fructose specific IIABC components
Products: NA
Alternate protein names: EIIABC-Fru; Fructose-specific phosphotransferase enzyme IIA component; EII-Fru; PTS system fructose-specific EIIA component; Fructose-specific phosphotransferase enzyme IIB component; EIII-Fru; PTS system fructose-specific EIIB component; Fructose permease IIC component; PTS system fructose-specific EIIC component [H]
Number of amino acids: Translated: 650; Mature: 650
Protein sequence:
>650_residues MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTEHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATVLF AKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEELVQAP ANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEMDRFDGK PLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMIALAFLID GALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIPYAAGGEAT STLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAMNDFLGGLG GGSAVLLGIVLGGMMAVDMGGPVNKAACVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLFKDKFTKEERNSG LTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVSVLVGAIVSGV VYGYLRKPQA
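The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own pipeline: it builds the standard codon table (the bacterial table assigns the same amino acids to each codon) and checks the first and last codons of the gene against the ends of the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAATTCAAGACCTATTG"))  # MKIQDLL (start of the protein)
print(translate("CCACAAGCATAA"))           # PQA (end of the protein, then stop)
```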
Sequences:
>Translated_650_residues MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTEHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATVLF AKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEELVQAP ANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEMDRFDGK PLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMIALAFLID GALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIPYAAGGEAT STLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAMNDFLGGLG GGSAVLLGIVLGGMMAVDMGGPVNKAACVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLFKDKFTKEERNSG LTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVSVLVGAIVSGV VYGYLRKPQA
>Mature_650_residues MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTEHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATVLF AKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEELVQAP ANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEMDRFDGK PLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMIALAFLID GALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIPYAAGGEAT STLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAMNDFLGGLG GGSAVLLGIVLGGMMAVDMGGPVNKAACVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLFKDKFTKEERNSG LTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVSVLVGAIVSGV VYGYLRKPQA
Specific function: The phosphoenolpyruvate-dependent sugar phosphotransferase system (sugar PTS), a major carbohydrate active-transport system, catalyzes the phosphorylation of incoming sugar substrates concomitantly with their translocation across the cell membrane. This
COG id: COG1299
COG function: function code G; Phosphotransferase system, fructose-specific IIC component
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 PTS EIIC type-2 domain [H]
Homologues:
- Organism=Escherichia coli, GI1788492, Length=487, Percent_Identity=42.0944558521561, Blast_Score=352, Evalue=3e-98
- Organism=Escherichia coli, GI1786951, Length=664, Percent_Identity=35.2409638554217, Blast_Score=338, Evalue=4e-94
- Organism=Escherichia coli, GI87082348, Length=466, Percent_Identity=33.6909871244635, Blast_Score=203, Evalue=4e-53
- Organism=Escherichia coli, GI1790386, Length=337, Percent_Identity=36.4985163204748, Blast_Score=181, Evalue=1e-46
- Organism=Escherichia coli, GI1788729, Length=399, Percent_Identity=29.5739348370927, Blast_Score=139, Evalue=5e-34
- Organism=Escherichia coli, GI2367327, Length=135, Percent_Identity=28.8888888888889, Blast_Score=84, Evalue=2e-17
- Organism=Escherichia coli, GI1790387, Length=93, Percent_Identity=40.8602150537634, Blast_Score=80, Evalue=3e-16
- Organism=Escherichia coli, GI1788730, Length=88, Percent_Identity=40.9090909090909, Blast_Score=67, Evalue=3e-12
- Organism=Escherichia coli, GI1790390, Length=63, Percent_Identity=47.6190476190476, Blast_Score=66, Evalue=7e-12
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016152
- InterPro: IPR002178
- InterPro: IPR013011
- InterPro: IPR003501
- InterPro: IPR003352
- InterPro: IPR013014
- InterPro: IPR004715
- InterPro: IPR003353
- InterPro: IPR006327 [H]
Pfam domain/function: PF00359 PTS_EIIA_2; PF02378 PTS_EIIC; PF02302 PTS_IIB [H]
EC number: 2.7.1.69 [H]
Molecular weight: Translated: 66876; Mature: 66876
Theoretical pI: Translated: 5.13; Mature: 5.13
Prosite motif: PS51094 PTS_EIIA_TYPE_2; PS51099 PTS_EIIB_TYPE_2; PS51104 PTS_EIIC_TYPE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
4.0 %Met (Translated Protein)
4.3 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
4.0 %Met (Mature Protein)
4.3 %Cys+Met (Mature Protein)
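Percentages like the Cys/Met figures above are simple residue frequencies. A minimal sketch, assuming they are computed as residue count over protein length and rounded to one decimal; `residue_percent` is a hypothetical helper, and the example uses only the first 60 residues of the protein in this record, so its numbers differ from the whole-protein values.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


# First 60 residues of the protein in this record (3 Met, 0 Cys).
segment = "MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTEHGYVTDFETFKEGILAREALTSTGLG"
print(residue_percent(segment, "C"))   # 0.0
print(residue_percent(segment, "M"))   # 5.0
print(residue_percent(segment, "CM"))  # 5.0
```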
Secondary structure:
>Translated Secondary Structure
MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTEHGYVTDFETFKEGILAREALTSTGLG
CCHHHHHHHCCEEEEECCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCC
DGIAMPHSKNAAVKEATVLFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAEL
CCCCCCCCCCCHHHHEEEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHH
SQYLMKDGFADKLRQATSADQVIELFDQASEKTEELVQAPANDSGDFIVAVTACTTGIAH
HHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEHHHHHHHH
TYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEMDRFDGK
HHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCEEEEECCCHHHHHCCCCC
PLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVS
CCCCCCHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH
QMLPFVIGGGIMIALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAG
HHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
YVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIPYAAGGEATSTLAGVSSGFLGALVGGFIA
HHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
GALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAMNDFLGGLG
HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC
GGSAVLLGIVLGGMMAVDMGGPVNKAACVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLA
CHHHHHHHHHHHHHHEEECCCCCCCCEEEEECCHHEEEECCCCHHHHHHHHHCCCCHHHH
IFVATLLFKDKFTKEERNSGLTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGL
HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHH
VGLTGIKLMAPHGGIFVIALTSNALLYLVSVLVGAIVSGVVYGYLRKPQA
HHHCCEEEECCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure
MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTEHGYVTDFETFKEGILAREALTSTGLG
CCHHHHHHHCCEEEEECCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCC
DGIAMPHSKNAAVKEATVLFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAEL
CCCCCCCCCCCHHHHEEEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHH
SQYLMKDGFADKLRQATSADQVIELFDQASEKTEELVQAPANDSGDFIVAVTACTTGIAH
HHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEHHHHHHHH
TYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEMDRFDGK
HHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCEEEEECCCHHHHHCCCCC
PLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVS
CCCCCCHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH
QMLPFVIGGGIMIALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAG
HHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
YVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIPYAAGGEATSTLAGVSSGFLGALVGGFIA
HHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
GALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAMNDFLGGLG
HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC
GGSAVLLGIVLGGMMAVDMGGPVNKAACVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLA
CHHHHHHHHHHHHHHEEECCCCCCCCEEEEECCHHEEEECCCCHHHHHHHHHCCCCHHHH
IFVATLLFKDKFTKEERNSGLTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGL
HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHH
VGLTGIKLMAPHGGIFVIALTSNALLYLVSVLVGAIVSGVVYGYLRKPQA
HHHCCEEEECCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9384377 [H]