Definition Streptococcus pneumoniae Taiwan19F-14, complete genome.
Accession NC_012469
Length 2,112,148

The map label for this gene is mngB [H]

Identifier: 225861961

GI number: 225861961

Start: 2003398

End: 2006043

Strand: Reverse

Name: mngB [H]

Synonym: SPT_2156

Alternate gene names: 225861961

Gene position: 2006043-2003398 (Counterclockwise)

Preceding gene: 225861962

Following gene: 225861960

Centisome position: 94.98
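
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the centisome position is the gene's first transcribed base (the higher coordinate, because the gene lies on the reverse strand) expressed as a percentage of the genome length. The function names are hypothetical and do not belong to any database API.

```python
GENOME_LENGTH = 2_112_148           # Length field of NC_012469
START, END = 2_003_398, 2_006_043   # Start and End fields (forward-strand coordinates)


def gene_length(start: int, end: int) -> int:
    """Number of bases in an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# For a reverse-strand gene the first transcribed base is the higher coordinate.
position = centisome(END, GENOME_LENGTH)
# Codons excluding the stop codon give the expected number of amino acids.
residues = (length - 3) // 3

print(length, round(position, 2), residues)  # 2646 94.98 881
```

Under these assumptions the result agrees with the 2646-base gene sequence, the centisome value of 94.98, and the 881-residue protein listed in this record.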

GC content: 44.82
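
A GC content value of this kind is usually the percentage of G and C bases in the gene sequence. The following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, and the short input is only the first ten bases of the gene, not the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence: three G, no C.
print(gc_content("ATGGAAAATG"))  # 30.0
```

Applying the same function to the full 2646-base sequence, with line breaks removed, should give a value comparable to the one reported in this record if the assumed definition matches.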

Gene sequence:

>2646_bases
ATGGAAAATGTTGTTGTACATATTATCTCACATAGTCACTGGGACCGTGAGTGGTACTTGCCTTTTGAAAGCCATCGTAT
GCAGTTGGTGGAATTGTTTGACAATCTCTTTGATCTCTTTGAAAATGACCCTGAGTTCAAGAGTTTCCACTTGGATGGAC
AAACTATTGTCCTTGACGACTACTTACAAATTCGCCCTGAAAATCGTGACAAGGTCCAACGCTACATTGACGAGGGCAAA
CTTAAAATTGGTCCCTTTTACATCTTGCAGGATGACTACTTGATCTCCAGTGAAGCCAATGTCCGCAATACCTTGATTGG
TCAACAAGAAGCTGCCAAATGGGGTAAATCAACCCAGATTGGCTACTTTCCAGATACCTTTGGAAATATGGGACAAGCGC
CTCAAATTCTTCAAAAATCAGGCATTCACGTGGCGGCCTTTGGTCGTGGTGTGAAGCCGATTGGATTTGACAACCAAGTC
CTTGAAGATGAGCAGTTTACATCCCAGTTTTCAGAAATGTACTGGCAGGGTGTGGATGGTAGTCGTGTTTTAGGTATTCT
CTTTGCCAACTGGTACAGTAACGGGAATGAAATTCCAGTTGACAAAGATGAGGCCTTGACCTTCTGGAAACAAAAATTGT
CAGATGTGCGTGCCTACGCTTCGACCAACCAATGGTTGATGATGAACGGCTGTGACCACCAGCCTGTACAGAAAAATCTG
AGCGAAGCCATTCGTGTGGCAAATGAACTCTTCCCAGATGTAATCTTTGTTCATAGTTCTTTTGATGAATATGTTCAAGC
TGTAGAAGGTGCGCTTCCTGAACACTTATCAACTGTTACAGGTGAGTTGACCAGTCAGGAAACAGATGGCTGGTACACAC
TTGCCAACACTTCTTCATCCCGCATTTACCTAAAACAAGCCTTCCAAGAAAATAGCAACCTCCTAGAGCAAGTGGTAGAA
CCCTTGACTATTATCACTGGTGGACACAACCACAAGGACCAGTTGACCTATGCTTGGAAAACACTTTTGCAGAACGCGCC
ACATGATAGTATCTGTGGCTGTAGCGTGGACGAAGTTCACCGCGAGATGGAAACGCGTTTTGCCAAGGTCAACCAAGTAG
GAAACTTTGTTAAAAGTAACTTGCTCAACGAGTGGAAGGGTAAAATTGCTACGGATAAGGCTCAAAGTGACTATCTCTTT
ACTGTCATTAACACAGGCTTGCATGATAAGGTCGATACTGTCAGCACAGTGATTGATGTGGCGACTTGTGATTTCAAGGA
ATTGCACCCAACAGAAGGCTACAAAAAGATGGCTGCTCTTATCTTGCCAAGTTACCGTGTGGAGGACTTGGATGGTCATC
CTGTAGAGGCTACAATCGAAGACCTCGGAGCTAATTTTGAGTATGATTTACCAAAAGACAAGTTCCGCCAAGCTCGTATT
GCTCGTCAAGTGCGCGTGACCATTCCAGTTCACCTAGCGCCGCTTTCTTGGACAACCTTCCAATTGCTGGAAGGAAAACA
AGAACACCGTGAGGGTATTTACCAAAACGGAGTGATTGATACACCATTCGTAACGGTGAGTGTGGATGACAACATCACAG
TCTATGACAAGACAACTCACGAAGCCTATGAAGACTTTATCCGCTTTGAAGACCGTGGGGACATCGGAAACGAGTATATC
TATTTCCAACCAAAAGGAACAGAGCCAATCTTTGCAGAGCTTAAGGGCCACGAGGTCTTGGAAAACACAGCTTGCTATGC
TAAAATCTTGCTCAAACATGAATTGACCGTGCCTGTCAGCGCGGATGAAAAGCTAGAAGAAGAGCAACAAGGTATCATCG
AGTTTATGAAGCGTGAGGCTGGACGGTCAGAAGAATTGACAAACATTCCTCTGGAAACTGAGTTGACTGTCTTCGTTGAC
AATCCACAAATCCGCTTCAAGACTCGCTTTACTAACACTGCCAAGGATCACCGTATCCGTCTCTTGGTCAAGACTCATAA
CACGCGTCCAAGCAATGATTCTGAAAGCATTTATGAGGTGGTGACACGTCCAAACAGACCAGCTGCTTCATGGGAAAATC
CTGAAAATCCTCAACACCAACAAGCCTTTGTTAGTCTGTATGACGATGAAAAAGGTGTGACTGTATCCAACAAGGGATTG
AATGAATACGAAATCCTTGGAGACGACACCATTGCAGTGACTATTTTGCGTGCATCAGGTGAGCTAGGTGACTGGGGCTA
CTTCCCAACACCAGAAGCCCAATGCTTGCGTGAGTTTGAAGTCGAGTATGCGCTTGAGTGCCACCAAGCCCAAGAACGCT
TCTCAGCCTATCGTAGTGCCAAAGCCTTGCAGACACCGTTTACCAGCCTTCAGCTTGCTAGACAGGAAGGAAGCGTGGCT
GCGACTGGTAGCCTCTTGAGTCATTCTGTTCTCAGCATACCGCAAATCTGTCCAACAGCCTTTAAAGTGGCGGAAAATGA
AGAAGGATATGTACTCCGTTACTACAATATGAGTCAAGAAAATGTGCGCATATCAGAACACCAACAAACCATTCTTGACT
TACTTGAGCGACCATATCCAGTTCATTCAGGACTATTGGATCCACAAGAGATTCGTACAGAATTCATCAAAAAAGAAGAA
ATTTAA

Upstream 100 bases:

>100_bases
TATTCGCTAAGGGGCTCGCTTTAGCTCAACCGATTCTTATCAGAATCACAAGTTTACATTTAAAACGTTAAAATTTAAAT
TTAGAATGAGGTTTTACTTC

Downstream 100 bases:

>100_bases
TTTCAAAAAGTAAACATCAAAAGAAAGGAGGGGCGAAAAAGTAAGAACTAACTGCTGATTCGCCCCTTTTTATGGTAAAA
ACAATGACCATTGCAACGAT

Product: glycosyl hydrolase, family 38

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 881; Mature: 881

Protein sequence:

>881_residues
MENVVVHIISHSHWDREWYLPFESHRMQLVELFDNLFDLFENDPEFKSFHLDGQTIVLDDYLQIRPENRDKVQRYIDEGK
LKIGPFYILQDDYLISSEANVRNTLIGQQEAAKWGKSTQIGYFPDTFGNMGQAPQILQKSGIHVAAFGRGVKPIGFDNQV
LEDEQFTSQFSEMYWQGVDGSRVLGILFANWYSNGNEIPVDKDEALTFWKQKLSDVRAYASTNQWLMMNGCDHQPVQKNL
SEAIRVANELFPDVIFVHSSFDEYVQAVEGALPEHLSTVTGELTSQETDGWYTLANTSSSRIYLKQAFQENSNLLEQVVE
PLTIITGGHNHKDQLTYAWKTLLQNAPHDSICGCSVDEVHREMETRFAKVNQVGNFVKSNLLNEWKGKIATDKAQSDYLF
TVINTGLHDKVDTVSTVIDVATCDFKELHPTEGYKKMAALILPSYRVEDLDGHPVEATIEDLGANFEYDLPKDKFRQARI
ARQVRVTIPVHLAPLSWTTFQLLEGKQEHREGIYQNGVIDTPFVTVSVDDNITVYDKTTHEAYEDFIRFEDRGDIGNEYI
YFQPKGTEPIFAELKGHEVLENTACYAKILLKHELTVPVSADEKLEEEQQGIIEFMKREAGRSEELTNIPLETELTVFVD
NPQIRFKTRFTNTAKDHRIRLLVKTHNTRPSNDSESIYEVVTRPNRPAASWENPENPQHQQAFVSLYDDEKGVTVSNKGL
NEYEILGDDTIAVTILRASGELGDWGYFPTPEAQCLREFEVEYALECHQAQERFSAYRSAKALQTPFTSLQLARQEGSVA
ATGSLLSHSVLSIPQICPTAFKVAENEEGYVLRYYNMSQENVRISEHQQTILDLLERPYPVHSGLLDPQEIRTEFIKKEE
I
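
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with the standard genetic code, whose amino acid assignments are shared by the bacterial code (translation table 11). This is a sketch, not the pipeline used to build this record; `translate` and the table construction are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 bases of the gene sequence in this record.
first_45 = "ATGGAAAATGTTGTTGTACATATTATCTCACATAGTCACTGGGAC"
print(translate(first_45))  # MENVVVHIISHSHWD
# The final six bases (ATT TAA) encode the last residue followed by a stop.
print(translate("ATTTAA"))  # I
```

The output matches the first fifteen residues and the final residue of the protein sequence above.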

Sequences:

>Translated_881_residues
MENVVVHIISHSHWDREWYLPFESHRMQLVELFDNLFDLFENDPEFKSFHLDGQTIVLDDYLQIRPENRDKVQRYIDEGK
LKIGPFYILQDDYLISSEANVRNTLIGQQEAAKWGKSTQIGYFPDTFGNMGQAPQILQKSGIHVAAFGRGVKPIGFDNQV
LEDEQFTSQFSEMYWQGVDGSRVLGILFANWYSNGNEIPVDKDEALTFWKQKLSDVRAYASTNQWLMMNGCDHQPVQKNL
SEAIRVANELFPDVIFVHSSFDEYVQAVEGALPEHLSTVTGELTSQETDGWYTLANTSSSRIYLKQAFQENSNLLEQVVE
PLTIITGGHNHKDQLTYAWKTLLQNAPHDSICGCSVDEVHREMETRFAKVNQVGNFVKSNLLNEWKGKIATDKAQSDYLF
TVINTGLHDKVDTVSTVIDVATCDFKELHPTEGYKKMAALILPSYRVEDLDGHPVEATIEDLGANFEYDLPKDKFRQARI
ARQVRVTIPVHLAPLSWTTFQLLEGKQEHREGIYQNGVIDTPFVTVSVDDNITVYDKTTHEAYEDFIRFEDRGDIGNEYI
YFQPKGTEPIFAELKGHEVLENTACYAKILLKHELTVPVSADEKLEEEQQGIIEFMKREAGRSEELTNIPLETELTVFVD
NPQIRFKTRFTNTAKDHRIRLLVKTHNTRPSNDSESIYEVVTRPNRPAASWENPENPQHQQAFVSLYDDEKGVTVSNKGL
NEYEILGDDTIAVTILRASGELGDWGYFPTPEAQCLREFEVEYALECHQAQERFSAYRSAKALQTPFTSLQLARQEGSVA
ATGSLLSHSVLSIPQICPTAFKVAENEEGYVLRYYNMSQENVRISEHQQTILDLLERPYPVHSGLLDPQEIRTEFIKKEE
I
>Mature_881_residues
MENVVVHIISHSHWDREWYLPFESHRMQLVELFDNLFDLFENDPEFKSFHLDGQTIVLDDYLQIRPENRDKVQRYIDEGK
LKIGPFYILQDDYLISSEANVRNTLIGQQEAAKWGKSTQIGYFPDTFGNMGQAPQILQKSGIHVAAFGRGVKPIGFDNQV
LEDEQFTSQFSEMYWQGVDGSRVLGILFANWYSNGNEIPVDKDEALTFWKQKLSDVRAYASTNQWLMMNGCDHQPVQKNL
SEAIRVANELFPDVIFVHSSFDEYVQAVEGALPEHLSTVTGELTSQETDGWYTLANTSSSRIYLKQAFQENSNLLEQVVE
PLTIITGGHNHKDQLTYAWKTLLQNAPHDSICGCSVDEVHREMETRFAKVNQVGNFVKSNLLNEWKGKIATDKAQSDYLF
TVINTGLHDKVDTVSTVIDVATCDFKELHPTEGYKKMAALILPSYRVEDLDGHPVEATIEDLGANFEYDLPKDKFRQARI
ARQVRVTIPVHLAPLSWTTFQLLEGKQEHREGIYQNGVIDTPFVTVSVDDNITVYDKTTHEAYEDFIRFEDRGDIGNEYI
YFQPKGTEPIFAELKGHEVLENTACYAKILLKHELTVPVSADEKLEEEQQGIIEFMKREAGRSEELTNIPLETELTVFVD
NPQIRFKTRFTNTAKDHRIRLLVKTHNTRPSNDSESIYEVVTRPNRPAASWENPENPQHQQAFVSLYDDEKGVTVSNKGL
NEYEILGDDTIAVTILRASGELGDWGYFPTPEAQCLREFEVEYALECHQAQERFSAYRSAKALQTPFTSLQLARQEGSVA
ATGSLLSHSVLSIPQICPTAFKVAENEEGYVLRYYNMSQENVRISEHQQTILDLLERPYPVHSGLLDPQEIRTEFIKKEE
I

Specific function: May hydrolyze mannosyl-D-glycerate to mannose-6-phosphate and glycerate [H]

COG id: COG0383

COG function: function code G; Alpha-mannosidase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 38 family [H]

Homologues:

Organism=Homo sapiens, GI46852164, Length=183, Percent_Identity=28.9617486338798, Blast_Score=67, Evalue=9e-11,
Organism=Escherichia coli, GI1786952, Length=875, Percent_Identity=26.4, Blast_Score=269, Evalue=5e-73,
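
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. One way to read such a line into a dictionary is sketched below. `parse_homologue` is a hypothetical helper, and the field layout is inferred only from the two lines shown in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of string fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty field left by the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI1786952
    return record


line = ("Organism=Escherichia coli, GI1786952, Length=875, "
        "Percent_Identity=26.4, Blast_Score=269, Evalue=5e-73,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```

Values are kept as strings so that the caller decides how to convert them, for example `float(hit["Evalue"])` for filtering hits by E-value.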

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011013
- InterPro:   IPR011330
- InterPro:   IPR011682
- InterPro:   IPR015341
- InterPro:   IPR000602 [H]

Pfam domain/function: PF09261 Alpha-mann_mid; PF01074 Glyco_hydro_38; PF07748 Glyco_hydro_38C [H]

EC number: NA

Molecular weight: Translated: 100688; Mature: 100688

Theoretical pI: Translated: 4.65; Mature: 4.65

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
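
Percentages of this kind are residue counts divided by the protein length. The sketch below shows the calculation on a toy ten-residue peptide rather than on the 881-residue protein. `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    if not protein:
        raise ValueError("protein must not be empty")
    protein = protein.upper()
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


toy = "MKCAAAAAAA"  # toy peptide: one Met, one Cys, ten residues
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```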

Secondary structure:

>Translated Secondary Structure
MENVVVHIISHSHWDREWYLPFESHRMQLVELFDNLFDLFENDPEFKSFHLDGQTIVLDD
CCCEEEEEEECCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEEEC
YLQIRPENRDKVQRYIDEGKLKIGPFYILQDDYLISSEANVRNTLIGQQEAAKWGKSTQI
EEEECCCCHHHHHHHHHCCCEEECCEEEEECCEEECCCCCHHHHHCCCHHHHHCCCCCCC
GYFPDTFGNMGQAPQILQKSGIHVAAFGRGVKPIGFDNQVLEDEQFTSQFSEMYWQGVDG
CCCCCCCCCCCCCHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC
SRVLGILFANWYSNGNEIPVDKDEALTFWKQKLSDVRAYASTNQWLMMNGCDHQPVQKNL
CCEEEEEEEHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH
SEAIRVANELFPDVIFVHSSFDEYVQAVEGALPEHLSTVTGELTSQETDGWYTLANTSSS
HHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCEEEEECCCCC
RIYLKQAFQENSNLLEQVVEPLTIITGGHNHKDQLTYAWKTLLQNAPHDSICGCSVDEVH
EEHHHHHHHHHHHHHHHHHCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHH
REMETRFAKVNQVGNFVKSNLLNEWKGKIATDKAQSDYLFTVINTGLHDKVDTVSTVIDV
HHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCEEEEEECCCCCCHHHHHHHHHHH
ATCDFKELHPTEGYKKMAALILPSYRVEDLDGHPVEATIEDLGANFEYDLPKDKFRQARI
HCCCCHHCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCHHHHHHHHH
ARQVRVTIPVHLAPLSWTTFQLLEGKQEHREGIYQNGVIDTPFVTVSVDDNITVYDKTTH
HHEEEEEEEEEEECCCCHHHHHHCCHHHHHCCHHHCCCCCCCEEEEEECCCEEEEECCHH
EAYEDFIRFEDRGDIGNEYIYFQPKGTEPIFAELKGHEVLENTACYAKILLKHELTVPVS
HHHHHHHHCCCCCCCCCCEEEEECCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCEECCCC
ADEKLEEEQQGIIEFMKREAGRSEELTNIPLETELTVFVDNPQIRFKTRFTNTAKDHRIR
CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCEEE
LLVKTHNTRPSNDSESIYEVVTRPNRPAASWENPENPQHQQAFVSLYDDEKGVTVSNKGL
EEEEECCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHEEECCCCCEEECCCC
NEYEILGDDTIAVTILRASGELGDWGYFPTPEAQCLREFEVEYALECHQAQERFSAYRSA
CCEEEECCCEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KALQTPFTSLQLARQEGSVAATGSLLSHSVLSIPQICPTAFKVAENEEGYVLRYYNMSQE
HHHHCCHHHHHHHHHCCCEEEHHHHHHHHHHHCHHHCCHHHHHCCCCCCEEEEEECCCCC
NVRISEHQQTILDLLERPYPVHSGLLDPQEIRTEFIKKEEI
CCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCC
>Mature Secondary Structure
MENVVVHIISHSHWDREWYLPFESHRMQLVELFDNLFDLFENDPEFKSFHLDGQTIVLDD
CCCEEEEEEECCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEEEC
YLQIRPENRDKVQRYIDEGKLKIGPFYILQDDYLISSEANVRNTLIGQQEAAKWGKSTQI
EEEECCCCHHHHHHHHHCCCEEECCEEEEECCEEECCCCCHHHHHCCCHHHHHCCCCCCC
GYFPDTFGNMGQAPQILQKSGIHVAAFGRGVKPIGFDNQVLEDEQFTSQFSEMYWQGVDG
CCCCCCCCCCCCCHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC
SRVLGILFANWYSNGNEIPVDKDEALTFWKQKLSDVRAYASTNQWLMMNGCDHQPVQKNL
CCEEEEEEEHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH
SEAIRVANELFPDVIFVHSSFDEYVQAVEGALPEHLSTVTGELTSQETDGWYTLANTSSS
HHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCEEEEECCCCC
RIYLKQAFQENSNLLEQVVEPLTIITGGHNHKDQLTYAWKTLLQNAPHDSICGCSVDEVH
EEHHHHHHHHHHHHHHHHHCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHH
REMETRFAKVNQVGNFVKSNLLNEWKGKIATDKAQSDYLFTVINTGLHDKVDTVSTVIDV
HHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCEEEEEECCCCCCHHHHHHHHHHH
ATCDFKELHPTEGYKKMAALILPSYRVEDLDGHPVEATIEDLGANFEYDLPKDKFRQARI
HCCCCHHCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCHHHHHHHHH
ARQVRVTIPVHLAPLSWTTFQLLEGKQEHREGIYQNGVIDTPFVTVSVDDNITVYDKTTH
HHEEEEEEEEEEECCCCHHHHHHCCHHHHHCCHHHCCCCCCCEEEEEECCCEEEEECCHH
EAYEDFIRFEDRGDIGNEYIYFQPKGTEPIFAELKGHEVLENTACYAKILLKHELTVPVS
HHHHHHHHCCCCCCCCCCEEEEECCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCEECCCC
ADEKLEEEQQGIIEFMKREAGRSEELTNIPLETELTVFVDNPQIRFKTRFTNTAKDHRIR
CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCEEE
LLVKTHNTRPSNDSESIYEVVTRPNRPAASWENPENPQHQQAFVSLYDDEKGVTVSNKGL
EEEEECCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHEEECCCCCEEECCCC
NEYEILGDDTIAVTILRASGELGDWGYFPTPEAQCLREFEVEYALECHQAQERFSAYRSA
CCEEEECCCEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KALQTPFTSLQLARQEGSVAATGSLLSHSVLSIPQICPTAFKVAENEEGYVLRYYNMSQE
HHHHCCHHHHHHHHHCCCEEEHHHHHHHHHHHCHHHCCHHHHHCCCCCCEEEEEECCCCC
NVRISEHQQTILDLLERPYPVHSGLLDPQEIRTEFIKKEEI
CCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCC
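
The prediction strings above use one letter per residue. Assuming the common three-state notation (H for helix, E for strand, C for coil), the composition of a prediction can be summarized as sketched below. `ss_composition` is a hypothetical helper, and the input shown is a toy string, not a line from this record.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    if not states:
        raise ValueError("states must not be empty")
    counts = Counter(states.upper())
    total = len(states)
    return {s: 100.0 * counts.get(s, 0) / total for s in "HEC"}


print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To summarize the whole protein, the prediction lines (every second line of the block) would first be concatenated into a single 881-character string.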

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11058132 [H]