Definition: Treponema pallidum subsp. pallidum SS14, complete genome.
Accession: NC_010741
Length: 1,139,457

The map label for this gene is apbE

Identifier: 189026021

GI number: 189026021

Start: 863526

End: 864614

Strand: Reverse

Name: apbE

Synonym: TPASS_0796

Alternate gene names: 189026021

Gene position: 864614-863526 (Counterclockwise)

Preceding gene: 189026022

Following gene: 189026019

Centisome position: 75.88
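
The coordinate fields above can be cross-checked against one another. The sketch below assumes, since the record does not say so, that the centisome position is the gene's first transcribed base expressed as a percentage of the genome length; the function names are illustrative only.

```python
GENOME_LENGTH = 1_139_457        # Length field of NC_010741
START, END = 863_526, 864_614    # Start and End fields of this record
STRAND = "Reverse"               # Strand field


def gene_length(start: int, end: int) -> int:
    """Inclusive feature length in bases."""
    return abs(end - start) + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Assumed definition: first transcribed base as a percentage of the genome."""
    # On the reverse strand the gene begins at the higher coordinate.
    first_base = max(start, end) if strand == "Reverse" else min(start, end)
    return round(100 * first_base / genome_length, 2)


print(gene_length(START, END))                          # 1089
print(centisome(START, END, STRAND, GENOME_LENGTH))     # 75.88
```

Under that assumption both values agree with the record: 1089 bases (363 codons, that is 362 residues plus a stop codon) and a centisome position of 75.88.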

GC content: 56.16
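
A GC content figure like the one above is normally the share of G and C bases in the gene sequence. The following is a minimal sketch under that assumption; the record does not document how ambiguity codes such as the K in this gene are treated, so here they count toward the length but not toward G+C.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Assumption: ambiguity codes (for example K = G or T) are included in the
    length but are not counted as G or C.
    """
    seq = "".join(seq.split()).upper()   # drop line breaks and spaces
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# Toy inputs; pass the full 1089-base gene sequence to check the record's value.
print(gc_content("ATGC"))       # 50.0
print(gc_content("GCGC\nGCGC")) # 100.0
```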

Gene sequence:

>1089_bases
GTGAAAAGTTCGTGCGTATATTGGCGGATCGGGGTTCTCGTTTGTATTCTGTGTGGAGTGGGGAGCTGTGGCGGTCGTGC
GCGCGTGCGCGAGTATTCGCGTGCGGAGCTTGTTATCGGTACGCTCTGTCGCGTGCGCGTGTACTCTAAGCGACCTGCTG
CTGAAGTGCACGCGGCGCTTGAGGAGGTGTTCACGCTGCTACAACAACAGGAGATGGTGCTGAGTGCTAACCGTGATGAC
TCTGCGCTTGCTGCCCTAAACGCTCAGGCAGGTTCGGCACCGGTTGTTGTTGACAGGTCGCTGTATGCGTTGCTTGAGCG
TGCGCTTTTTTTTGCAGAAAAGAGTGGGGGTGCGTTTAACCCCGCACTAGGTGCGKTAGTCAAGCTTTGGAATATTGGCT
TTGACCGTGCTGCTGTCCCTGACCCCGACGCGCTCAAGGAGGCGCTGACACGTTGTGATTTTCGTCAGGTGCACCTGCGC
GCTGGGGTATCGGTGGGCGCGCCACACACGGTACAGTTGGCACAAGCGGGCATGCAGTTGGATTTGGGCGCCATTGCTAA
AGGATTCCTTGCGGACAAGATTGTACAACTGCTCACTGCGCATGCTTTGGATTCAGCGCTCGTTGATCTGGGAGGAAATA
TTTTTGCCCTTGGTCTTAAGTATGGAGATGTGCGCTCAGCAGCCGCGCAGCGGTTGGAATGGAACGTGGGTATTCGCGAT
CCGCACGGCACGGGGCAGAAGCCTGCACTGGTGGTGTCGGTGCGCGATTGCTCGGTGGTGACTTCTGGTGCGTACGAGCG
TTTCTTTGAGCGTGACGGGGTACGCTACCATCATATCATCGATCCGGTTACCGGGTTTCCGGCACACACTGATGTGGATT
CTGTGTCTATCTTTGCACCCCGTTCCACAGATGCAGATGCGCTTGCTACCGCCTGTTTTGTATTGGGGTATGAGAAAAGC
TGTGCGCTCTTGCGTGAATTTCCCGGTGTTGACGCGCTGTTTATTTTTCCTGACAAGCGCGTGCGCGCAAGTGCAGGGAT
TGTCGATCGCGTGCGTGTGCTCGATGCACGTTTCGTGTTAGAGCGTTAG

Upstream 100 bases:

>100_bases
CAGTTATACACGCGGTGCGCGCTGCGCTCAGTTCTTCCTAAGGGGTGGGCAGGGTGCATTGCTTGTTTTTTTTGACTGCT
GACAGTACAGTTGCACCCTT

Downstream 100 bases:

>100_bases
GACAGCACGTGTGCTGTTCGTGTGTAAAAAAGTGTGGCGGACTGTCCTCATCATGGTGTGTGTGCAGGATGCGTGCGCGG
GGGTTCGGTCAGATGTCAGG

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 362; Mature: 362

Protein sequence:

>362_residues
MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAALEEVFTLLQQQEMVLSANRDD
SALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFNPALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLR
AGVSVGAPHTVQLAQAGMQLDLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD
PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAPRSTDADALATACFVLGYEKS
CALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVLER
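
The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. The assumptions here are not stated in the record: the standard genetic code is used, an initial GTG or TTG is read as methionine (as is usual for bacterial genes), and any codon containing an ambiguity code is reported as X.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG"}   # assumed bacterial start codons


def translate(cds: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE.get(codon, "X")   # ambiguous codon -> X
        if i == 0 and codon in START_CODONS:
            aa = "M"                       # initiator codon read as Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGAAAAGTTCGTGC"))        # MKSSC
print(translate("GCACTAGGTGCGKTAGTCAAG"))  # ALGAXVK
```

The first call uses the opening 15 bases of the gene; the second uses the stretch containing the ambiguous base K, which accounts for the X in the protein sequence.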

Sequences:

>Translated_362_residues
MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAALEEVFTLLQQQEMVLSANRDD
SALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFNPALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLR
AGVSVGAPHTVQLAQAGMQLDLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD
PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAPRSTDADALATACFVLGYEKS
CALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVLER
>Mature_362_residues
MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAALEEVFTLLQQQEMVLSANRDD
SALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFNPALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLR
AGVSVGAPHTVQLAQAGMQLDLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD
PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAPRSTDADALATACFVLGYEKS
CALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVLER

Specific function: Involved in the conversion of aminoimidazole ribotide (AIR), a purine intermediate, to the 4-amino-5-hydroxymethyl-2-methylpyrimidine (HMP) moiety of thiamine

COG id: COG1477

COG function: function code H; Membrane-associated lipoprotein involved in thiamine biosynthesis

Gene ontology:

Cell location: Cell membrane; Lipid-anchor (Potential)

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the ApbE family

Homologues:

Organism=Escherichia coli, GI1788543, Length=325, Percent_Identity=26.1538461538462, Blast_Score=117, Evalue=2e-27,
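
A homologue line of this shape can be read into a dictionary for downstream filtering. This is a minimal sketch that assumes the comma-separated layout seen in this one entry, including the GI field that carries no "=" sign; the function name is hypothetical.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'key=value, ...' homologue line into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]       # 'GI1788543' -> '1788543'
    # Convert the numeric fields that are present.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788543, Length=325, "
    "Percent_Identity=26.1538461538462, Blast_Score=117, Evalue=2e-27,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```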

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): APBE_TREPA (O83774)

Other databases:

- EMBL:   AE000520
- PIR:   C71281
- RefSeq:   NP_219233.1
- ProteinModelPortal:   O83774
- GeneID:   2610802
- GenomeReviews:   AE000520_GR
- KEGG:   tpa:TP0796
- NMPDR:   fig|243276.1.peg.792
- TIGR:   TP_0796
- HOGENOM:   HBG292712
- OMA:   IYERHLE
- ProtClustDB:   CLSK218872
- BioCyc:   TPAL243276:TP_0796-MONOMER
- InterPro:   IPR003374

Pfam domain/function: PF02424 ApbE

EC number: NA

Molecular weight: Translated: 39010; Mature: 39010

Theoretical pI: Translated: 7.96; Mature: 7.96

Prosite motif: PS51257 PROKAR_LIPOPROTEIN; PS00013 PROKAR_LIPOPROTEIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.5 %Cys     (Translated Protein)
0.8 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
2.5 %Cys     (Mature Protein)
0.8 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
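
The percentages above follow from simple residue counts over the 362-residue sequence. The sketch below recomputes them from the protein sequence given in this record, assuming the figures are rounded to one decimal place.

```python
PROTEIN = (
    "MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAALEEVFTLLQQQEMVLSANRDD"
    "SALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFNPALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLR"
    "AGVSVGAPHTVQLAQAGMQLDLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD"
    "PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAPRSTDADALATACFVLGYEKS"
    "CALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVLER"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100 * hits / len(seq), 1)


print(residue_percent(PROTEIN, "C"))    # 2.5  (9 of 362)
print(residue_percent(PROTEIN, "M"))    # 0.8  (3 of 362)
print(residue_percent(PROTEIN, "CM"))   # 3.3  (12 of 362)
```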

Secondary structure:

>Translated Secondary Structure
MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAAL
CCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHEEEECCCCHHHHHHHH
EEVFTLLQQQEMVLSANRDDSALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFN
HHHHHHHHHHHHHHCCCCCCCHHEEECCCCCCCCEEECHHHHHHHHHHHHHHHCCCCCCC
PALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLRAGVSVGAPHTVQLAQAGMQL
CHHHCEEEEEECCCCCCCCCCHHHHHHHHHHCCHHHHHHHCCCCCCCCCHHHHHHCCCCC
DLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHEECCCCCC
PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAP
CCCCCCCCEEEEEECCCCEEEHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCEEEEEC
RSTDADALATACFVLGYEKSCALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVL
CCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCEEEEECCCHHHHHCCHHHHHHHHHHHHHH
ER
CC
>Mature Secondary Structure
MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAAL
CCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHEEEECCCCHHHHHHHH
EEVFTLLQQQEMVLSANRDDSALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFN
HHHHHHHHHHHHHHCCCCCCCHHEEECCCCCCCCEEECHHHHHHHHHHHHHHHCCCCCCC
PALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLRAGVSVGAPHTVQLAQAGMQL
CHHHCEEEEEECCCCCCCCCCHHHHHHHHHHCCHHHHHHHCCCCCCCCCHHHHHHCCCCC
DLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHEECCCCCC
PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAP
CCCCCCCCEEEEEECCCCEEEHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCEEEEEC
RSTDADALATACFVLGYEKSCALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVL
CCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCEEEEECCCHHHHHCCHHHHHHHHHHHHHH
ER
CC
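
The state strings above can be summarised as a composition. This sketch assumes the common three-state convention, which the record does not define: H for helix, E for strand and C for coil. Only the state lines should be passed in, not the interleaved residue lines.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    counts = Counter("".join(states.split()).upper())
    total = sum(counts[s] for s in "HEC")
    if total == 0:
        return {s: 0.0 for s in "HEC"}
    return {s: round(100 * counts[s] / total, 1) for s in "HEC"}


# Toy input; concatenate the record's state lines to summarise the full prediction.
print(ss_composition("CCHHHHEE"))   # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```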

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: PubMed ID 9665876