Definition | Treponema pallidum subsp. pallidum SS14, complete genome. |
---|---|
Accession | NC_010741 |
Length | 1,139,457 |
The map label for this gene is est [H]
Identifier: 189026125
GI number: 189026125
Start: 985442
End: 986323
Strand: Reverse
Name: est [H]
Synonym: TPASS_0902
Alternate gene names: 189026125
Gene position: 986323-985442 (Counterclockwise)
Preceding gene: 189026142
Following gene: 189026124
Centisome position: 86.56
GC content: 55.78
Gene sequence:
>882_bases ATGCGGCCGATTGTAGCCGCGCAACGCGTGACAATACAAGAGACGCGTGCGGTGCTCGCGGCACGGTTTCTCTTTACTTT TTGTTGCTTTTTTACTACCCTCGCGCGCTATCTGCTTATGGCTGAACATACTTCCTGTACGAGCATTCATCCTCTTGTGC GCAGCGCGTTTTACGCCGGGGGTGCGCATGCAGTACTGCTTATTCATGGGTACATGGGCACCCCGCGCGAGATGCAGTTT TTAGGTCGTGCGCTCCACCGGGACGGCTTTACGGTCTCTATTCCCCGTTTACCTGGTCACGGTACGAATAGAGAGGATTT TCTTGAGACCGGGTGGAGGGATTGGCTGCGGCGCGTGTGTGATGAGTACCGTGACCTTTCCGCTGCGTACCCTTCGGTAT CTGTGGGGGGGCTGTCCATGGGAGGTGTGCTGACTGCACTCGTGGCGGCGCGTTTTTGTCCCCAGAAAGCTTTCTTTTGT GCACCGGGTTTTGCAGTTTCTGATTGGAGGATAAAGCTGTCTCCTCTAGTCAGGTGGTTTGTGCGTGAGTTTGCTGCGGA CGCGGCTCCCTTCTACCCCGAGCAAGACTTTAATGACGCCACAAAGGATTACCGGAGTGCGCACTACATTGCCCAGGTGG CGCAGTTTTACGCACTGCAAAGACGTGCGATCCGTTCGCTGGCGTGCATTCGGAGTACGTTGTTAACGATCCTGTCTCGG CAGGACCCATTGGTGCCGTGTGCAGCGGTGCAAAAATTACTCGATGCGCGTGTGCGCAGCGCACACCAGTACGTAGTGCT CGAGCACAGTGGTCACGTGATCACTGATGACGTGGAGCGGGAGCAGGTTGCCTCTTGTGTCAGTGCTTTTTTACGCACGT AG
Upstream 100 bases:
>100_bases TTCTGCACAAAAGCAGGCTGCCGCGCAGCCCCCGCCGTGCACGCCGAGGCCCATGATGGTTACCGTTTTGCCTTGAAGAA GTGCGCGCGCCTGCTCCACG
Downstream 100 bases:
>100_bases TGTTACGTGATGTCTACCCAAAAGGGAGTGCGGTGCACGGGTGCTCCACATTTCTAGTTGCTGTAGGGGGAAGAGACTGA GTCGCTGTTTCGGACCGCGT
Product: carboxylesterase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 293; Mature: 293
Protein sequence:
>293_residues MRPIVAAQRVTIQETRAVLAARFLFTFCCFFTTLARYLLMAEHTSCTSIHPLVRSAFYAGGAHAVLLIHGYMGTPREMQF LGRALHRDGFTVSIPRLPGHGTNREDFLETGWRDWLRRVCDEYRDLSAAYPSVSVGGLSMGGVLTALVAARFCPQKAFFC APGFAVSDWRIKLSPLVRWFVREFAADAAPFYPEQDFNDATKDYRSAHYIAQVAQFYALQRRAIRSLACIRSTLLTILSR QDPLVPCAAVQKLLDARVRSAHQYVVLEHSGHVITDDVEREQVASCVSAFLRT
Sequences:
>Translated_293_residues MRPIVAAQRVTIQETRAVLAARFLFTFCCFFTTLARYLLMAEHTSCTSIHPLVRSAFYAGGAHAVLLIHGYMGTPREMQF LGRALHRDGFTVSIPRLPGHGTNREDFLETGWRDWLRRVCDEYRDLSAAYPSVSVGGLSMGGVLTALVAARFCPQKAFFC APGFAVSDWRIKLSPLVRWFVREFAADAAPFYPEQDFNDATKDYRSAHYIAQVAQFYALQRRAIRSLACIRSTLLTILSR QDPLVPCAAVQKLLDARVRSAHQYVVLEHSGHVITDDVEREQVASCVSAFLRT
>Mature_293_residues MRPIVAAQRVTIQETRAVLAARFLFTFCCFFTTLARYLLMAEHTSCTSIHPLVRSAFYAGGAHAVLLIHGYMGTPREMQF LGRALHRDGFTVSIPRLPGHGTNREDFLETGWRDWLRRVCDEYRDLSAAYPSVSVGGLSMGGVLTALVAARFCPQKAFFC APGFAVSDWRIKLSPLVRWFVREFAADAAPFYPEQDFNDATKDYRSAHYIAQVAQFYALQRRAIRSLACIRSTLLTILSR QDPLVPCAAVQKLLDARVRSAHQYVVLEHSGHVITDDVEREQVASCVSAFLRT
Specific function: Involved in the detoxification of xenobiotics. Shows maximal activity with C6 substrates, with gradually decreasing activity from C8 to C12 substrates. No activity with longer-chain substrates, so the enzyme acts on short-chain rather than long-chain ones [H]
COG id: COG1647
COG function: function code R; Esterase/lipase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the lipase/esterase LIP3/BchO family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR019149
- InterPro: IPR012354 [H]
Pfam domain/function: PF09752 DUF2048 [H]
EC number: 3.1.1.1 [H]
Molecular weight: Translated: 32926; Mature: 32926
Theoretical pI: Translated: 8.95; Mature: 8.95
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
3.1 %Cys (Translated Protein)
1.7 %Met (Translated Protein)
4.8 %Cys+Met (Translated Protein)
3.1 %Cys (Mature Protein)
1.7 %Met (Mature Protein)
4.8 %Cys+Met (Mature Protein)
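The Cys/Met percentages can be re-derived from the protein sequence listed in this record. The sketch below is a minimal check, assuming each percentage is the residue count divided by the sequence length and rounded to one decimal place; the record does not state how the figures were produced. The names `PROTEIN` and `percent` are hypothetical and used only for illustration.

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MRPIVAAQRVTIQETRAVLAARFLFTFCCFFTTLARYLLMAEHTSCTSIHPLVRSAFYAGGAHAVLLIHGYMGTPREMQF"
    "LGRALHRDGFTVSIPRLPGHGTNREDFLETGWRDWLRRVCDEYRDLSAAYPSVSVGGLSMGGVLTALVAARFCPQKAFFC"
    "APGFAVSDWRIKLSPLVRWFVREFAADAAPFYPEQDFNDATKDYRSAHYIAQVAQFYALQRRAIRSLACIRSTLLTILSR"
    "QDPLVPCAAVQKLLDARVRSAHQYVVLEHSGHVITDDVEREQVASCVSAFLRT"
)


def percent(residues: str, seq: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues.

    Assumption: a plain count over the sequence length, rounded to one decimal.
    """
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


print(len(PROTEIN))            # 293
print(percent("C", PROTEIN))   # 3.1
print(percent("M", PROTEIN))   # 1.7
print(percent("CM", PROTEIN))  # 4.8
```

Because the translated and mature sequences in this record are identical, the same values apply to both.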
Secondary structure:
>Translated Secondary Structure
MRPIVAAQRVTIQETRAVLAARFLFTFCCFFTTLARYLLMAEHTSCTSIHPLVRSAFYAG
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC
GAHAVLLIHGYMGTPREMQFLGRALHRDGFTVSIPRLPGHGTNREDFLETGWRDWLRRVC
CCEEEEEEECCCCCCHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHH
DEYRDLSAAYPSVSVGGLSMGGVLTALVAARFCPQKAFFCAPGFAVSDWRIKLSPLVRWF
HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCEECCCEEHHHHHHHHH
VREFAADAAPFYPEQDFNDATKDYRSAHYIAQVAQFYALQRRAIRSLACIRSTLLTILSR
HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
QDPLVPCAAVQKLLDARVRSAHQYVVLEHSGHVITDDVEREQVASCVSAFLRT
CCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCEECCHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MRPIVAAQRVTIQETRAVLAARFLFTFCCFFTTLARYLLMAEHTSCTSIHPLVRSAFYAG
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC
GAHAVLLIHGYMGTPREMQFLGRALHRDGFTVSIPRLPGHGTNREDFLETGWRDWLRRVC
CCEEEEEEECCCCCCHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHH
DEYRDLSAAYPSVSVGGLSMGGVLTALVAARFCPQKAFFCAPGFAVSDWRIKLSPLVRWF
HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCEECCCEEHHHHHHHHH
VREFAADAAPFYPEQDFNDATKDYRSAHYIAQVAQFYALQRRAIRSLACIRSTLLTILSR
HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
QDPLVPCAAVQKLLDARVRSAHQYVVLEHSGHVITDDVEREQVASCVSAFLRT
CCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCEECCHHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9384377 [H]
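
The positional fields in this record (Start, End, genome Length, Centisome position, Number of amino acids) can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming inclusive 1-based coordinates and assuming the centisome position is the gene's 5' coordinate on the reverse strand (the End value, 986323) expressed as a percentage of genome length. The record does not document that convention, but it reproduces the listed value. The function names are hypothetical.

```python
# Values copied from this record.
GENOME_LENGTH = 1_139_457   # Length
START = 985_442             # Start
END = 986_323               # End (5' end of the gene on the reverse strand)


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


length_nt = gene_length(START, END)
codons, leftover = divmod(length_nt, 3)

print(length_nt)                                  # 882
print(codons, leftover)                           # 294 0
print(codons - 1)                                 # 293 residues once the stop codon is excluded
print(round(centisome(END, GENOME_LENGTH), 2))    # 86.56
```

The 882 bases give 294 codons, which matches the 293-residue product plus one stop codon (the gene sequence ends in TAG).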