Definition Mycobacterium tuberculosis F11, complete genome.
Accession NC_009565
Length 4,424,435

Click here to switch to the map view.

The map label for this gene is ppe42

Identifier: 148823802

GI number: 148823802

Start: 2948363

End: 2950105

Strand: Direct

Name: ppe42

Synonym: TBFG_12627

Alternate gene names: 148823802

Gene position: 2948363-2950105 (Clockwise)

Preceding gene: 148823801

Following gene: 148823810

Centisome position: 66.64

GC content: 65.23

Gene sequence:

>1743_bases
ATGAATTTCGCCGTTTTGCCGCCGGAGGTGAATTCGGCGCGCATATTCGCCGGTGCGGGCCTGGGCCCAATGCTGGCGGC
GGCGTCGGCCTGGGACGGGTTGGCCGAGGAGTTGCATGCCGCGGCGGGCTCGTTCGCGTCGGTGACCACCGGGTTGGCGG
GCGACGCGTGGCATGGTCCGGCGTCGCTGGCGATGACCCGCGCGGCCAGCCCGTATGTGGGGTGGTTGAACACGGCGGCG
GGTCAGGCCGCGCAGGCGGCCGGCCAGGCGCGGCTAGCGGCGAGCGCGTTCGAGGCGACGCTGGCGGCCACCGTGTCTCC
AGCGATGGTCGCGGCCAACCGGACACGGCTGGCGTCGCTGGTGGCAGCCAACTTGCTGGGCCAGAACGCCCCGGCGATCG
CGGCCGCGGAGGCTGAATACGAGCAGATATGGGCCCAGGACGTGGCCGCGATGTTCGGCTATCACTCCGCCGCGTCGGCG
GTGGCCACGCAGCTGGCGCCTATTCAAGAGGGTTTGCAGCAGCAGCTGCAAAACGTGCTGGCCCAGTTGGCTAGCGGGAA
CCTGGGCAGCGGAAATGTGGGCGTCGGCAACATCGGCAACGACAACATTGGCAACGCAAACATCGGCTTCGGAAATCGAG
GCGACGCCAACATCGGCATCGGGAATATCGGCGACAGAAACCTCGGCATTGGGAACACCGGCAATTGGAATATCGGCATC
GGCATCACCGGCAACGGACAAATCGGCTTCGGCAAGCCTGCCAACCCCGACGTCTTGGTGGTGGGCAACGGCGGCCCGGG
AGTAACCGCGTTGGTCATGGGCGGCACCGACAGCCTACTGCCGCTGCCCAACATCCCCTTACTCGAGTACGCTGCGCGGT
TCATCACCCCCGTGCATCCCGGATACACCGCTACGTTCCTGGAAACGCCATCGCAGTTTTTCCCATTCACCGGGCTGAAT
AGCCTGACCTATGACGTCTCCGTGGCCCAGGGCGTAACGAATCTGCACACCGCGATCATGGCGCAACTCGCGGCGGGAAA
CGAAGTCGTCGTCTTCGGCACCTCCCAAAGCGCCACGATAGCCACCTTCGAAATGCGCTATCTGCAATCCCTGCCAGCAC
ACCTGCGTCCGGGTCTCGACGAATTGTCCTTTACGTTGACCGGCAATCCCAACCGGCCCGACGGTGGCATTCTTACGCGT
TTTGGCTTCTCCATACCGCAGTTGGGTTTCACATTGTCCGGCGCGACGCCCGCCGACGCCTACCCCACCGTCGATTACGC
GTTCCAGTACGACGGCGTCAACGACTTCCCCAAATACCCGCTGAATGTCTTCGCGACCGCCAACGCGATCGCGGGCATCC
TTTTCCTGCACTCCGGGTTGATTGCGTTGCCGCCCGATCTTGCCTCGGGCGTGGTTCAACCGGTGTCCTCACCGGACGTC
CTGACCACCTACATCCTGCTGCCCAGCCAAGATCTGCCGCTGCTGGTCCCGCTGCGTGCTATCCCCCTGCTGGGAAACCC
GCTTGCCGACCTCATCCAGCCGGACTTGCGGGTGCTCGTCGAGTTGGGTTATGACCGCACCGCCCACCAGGACGTGCCCA
GCCCGTTCGGACTGTTTCCGGACGTCGATTGGGCCGAGGTGGCCGCGGACCTGCAGCAAGGCGCCGTGCAAGGCGTCAAC
GACGCCCTGTCCGGACTGGGGCTGCCGCCGCCGTGGCAGCCGGCGCTACCCCGACTTTTCTAA

Upstream 100 bases:

>100_bases
TTTGACCGGAATGCCCGCTGACCCGTGACGACGCGGTCACCGGGGATACCCGCCGCGGTGGTGGCCAACCGATAACGGCC
AACCGAGAAAGTACACAGCG

Downstream 100 bases:

>100_bases
GCGGTCCACAAACCGTGCACGTCAGCGGATGGGCTGAGGAACGCCGGCATCGCGCGCGGCTCCGTTGTCCAGCGCGACGT
CCACCAGCCGGTTGGCTGCC

Product: PPE family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 580; Mature: 580

Protein sequence:

>580_residues
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTTGLAGDAWHGPASLAMTRAASPYVGWLNTAA
GQAAQAAGQARLAASAFEATLAATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASA
VATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGI
GITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHPGYTATFLETPSQFFPFTGLN
SLTYDVSVAQGVTNLHTAIMAQLAAGNEVVVFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTR
FGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAIAGILFLHSGLIALPPDLASGVVQPVSSPDV
LTTYILLPSQDLPLLVPLRAIPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAADLQQGAVQGVN
DALSGLGLPPPWQPALPRLF

Sequences:

>Translated_580_residues
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTTGLAGDAWHGPASLAMTRAASPYVGWLNTAA
GQAAQAAGQARLAASAFEATLAATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASA
VATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGI
GITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHPGYTATFLETPSQFFPFTGLN
SLTYDVSVAQGVTNLHTAIMAQLAAGNEVVVFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTR
FGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAIAGILFLHSGLIALPPDLASGVVQPVSSPDV
LTTYILLPSQDLPLLVPLRAIPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAADLQQGAVQGVN
DALSGLGLPPPWQPALPRLF
>Mature_580_residues
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTTGLAGDAWHGPASLAMTRAASPYVGWLNTAA
GQAAQAAGQARLAASAFEATLAATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASA
VATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGI
GITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHPGYTATFLETPSQFFPFTGLN
SLTYDVSVAQGVTNLHTAIMAQLAAGNEVVVFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTR
FGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAIAGILFLHSGLIALPPDLASGVVQPVSSPDV
LTTYILLPSQDLPLLVPLRAIPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAADLQQGAVQGVN
DALSGLGLPPPWQPALPRLF

Specific function: Elicits a high humoral and a low T cell response. Could be involved in directing the host toward development of a more humoral type of immune response

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the mycobacterial PPE family

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): PPE42_MYCTU (Q79FC6)

Other databases:

- EMBL:   BX842580
- EMBL:   AE000516
- PIR:   G70570
- RefSeq:   NP_337185.1
- RefSeq:   YP_177893.1
- HSSP:   Q79FE1
- ProteinModelPortal:   Q79FC6
- SMR:   Q79FC6
- EnsemblBacteria:   EBMYCT00000000547
- EnsemblBacteria:   EBMYCT00000071174
- GeneID:   888204
- GeneID:   925626
- GenomeReviews:   AE000516_GR
- GenomeReviews:   AL123456_GR
- KEGG:   mtc:MT2683
- KEGG:   mtu:Rv2608
- TIGR:   MT2683
- TubercuList:   Rv2608
- GeneTree:   EBGT00050000014926
- HOGENOM:   HBG569414
- ProtClustDB:   CLSK799920
- InterPro:   IPR002989
- InterPro:   IPR013228
- InterPro:   IPR000030

Pfam domain/function: PF08237 PE-PPE; PF01469 Pentapeptide_2; PF00823 PPE

EC number: NA

Molecular weight: Translated: 59675; Mature: 59675

Theoretical pI: Translated: 4.22; Mature: 4.22

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
1.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTTGLAGDAWHGP
CCCEECCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCCC
ASLAMTRAASPYVGWLNTAAGQAAQAAGQARLAASAFEATLAATVSPAMVAANRTRLASL
HHHHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH
VAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASAVATQLAPIQEGLQQQLQNVL
HHHHHHCCCCCCEEECHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH
AQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGI
HHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE
GITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHP
EEECCCCCCCCCCCCCCEEEEECCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHCCCCC
GYTATFLETPSQFFPFTGLNSLTYDVSVAQGVTNLHTAIMAQLAAGNEVVVFGTSQSATI
CCEEEEECCCHHHCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCEE
ATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTRFGFSIPQLGFTLSGATPADA
EHHHHHHHHHCCHHHCCCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCEEECCCCCCCC
YPTVDYAFQYDGVNDFPKYPLNVFATANAIAGILFLHSGLIALPPDLASGVVQPVSSPDV
CCCCEEEEEECCCCCCCCCCCEEEEHHHHHHHHHHHHCCCEECCHHHHHHHHCCCCCCCE
LTTYILLPSQDLPLLVPLRAIPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFP
EEEEEEECCCCCCEEECHHHHCCCCCCHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCC
DVDWAEVAADLQQGAVQGVNDALSGLGLPPPWQPALPRLF
CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTTGLAGDAWHGP
CCCEECCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCCC
ASLAMTRAASPYVGWLNTAAGQAAQAAGQARLAASAFEATLAATVSPAMVAANRTRLASL
HHHHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH
VAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASAVATQLAPIQEGLQQQLQNVL
HHHHHHCCCCCCEEECHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH
AQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGI
HHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE
GITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHP
EEECCCCCCCCCCCCCCEEEEECCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHCCCCC
GYTATFLETPSQFFPFTGLNSLTYDVSVAQGVTNLHTAIMAQLAAGNEVVVFGTSQSATI
CCEEEEECCCHHHCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCEE
ATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTRFGFSIPQLGFTLSGATPADA
EHHHHHHHHHCCHHHCCCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCEEECCCCCCCC
YPTVDYAFQYDGVNDFPKYPLNVFATANAIAGILFLHSGLIALPPDLASGVVQPVSSPDV
CCCCEEEEEECCCCCCCCCCCEEEEHHHHHHHHHHHHCCCEECCHHHHHHHHCCCCCCCE
LTTYILLPSQDLPLLVPLRAIPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFP
EEEEEEECCCCCCEEECHHHHCCCCCCHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCC
DVDWAEVAADLQQGAVQGVNDALSGLGLPPPWQPALPRLF
CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9634230; 12218036