Definition Francisella tularensis subsp. holarctica LVS chromosome, complete genome.
Accession NC_007880
Length 1,895,994


The map label for this gene is pepA [H]

Identifier: 89256433

GI number: 89256433

Start: 1053698

End: 1055077

Strand: Direct

Name: pepA [H]

Synonym: FTL_1108

Alternate gene names: 89256433

Gene position: 1053698-1055077 (Clockwise)

Preceding gene: 89256432

Following gene: 161353781

Centisome position: 55.57
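
The coordinate fields above can be cross-checked with a little arithmetic. The Python sketch below assumes 1-based, inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record states neither convention, so both are assumptions, and the function names are illustrative only.

```python
GENOME_LENGTH = 1_895_994  # "Length" from the record header
GENE_START = 1_053_698     # "Start"
GENE_END = 1_055_077       # "End"

def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(GENE_START, GENE_END))               # 1380
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 55.57
```

Under these assumptions the results agree with the 1380-base gene sequence and the centisome position of 55.57 listed in this record.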

GC content: 34.64

Gene sequence:

>1380_bases
ATGTATATATCAACCAAATTAGAGTGTTTTACACAACATAAACAAAATGATTCTTTGCCAATTTTTGTGATTAATAAGGA
TAATTTTACAAATTGGCTTAGTAGTCAAGATAGTTTTTTACAGAACTTTATTAAGCAGTTTGATGAAAAAACAAAAATTA
TAACTGTTCCAAATAATTTAGGGAATATTCAAAAAGTAATTTGTCTTGTATCTGAGGATATGTTTGGCATTGCGAATTTA
CCAAATCAGCTAGCACAAGGAAATTACCATATTGAATATACTGATATCGCTGACTTGTCACTTTATTATATAGGCTTTGC
TTTAGGAAGTTATAAGTTTGAGAAATACAAATCTAAAACACAACAATCAAAAGTAAAATTATATTTACCACAACAATATC
AGCATATATTAGCAACTATAGAAGCTAATTATTTAGTAAGAGATATGATTACTACCCCAGCCGAAGATATGGGGCCAGCT
GATATTGCTAATGTGATACAACAATTAGCCAAAGAGTTTAATGCTGATTTTGAGGAGATCGTTGGTGAAGAACTTGTCGA
ACAAGGTTATATGGGTATCTATACTGTTGGTAAAGGTAGTCATAGATCTCCTAGACTTGTGCAATTAAATTGGGGTGATA
CAGCTCACCCTACGGTATCAATAGTCGGTAAAGGTGTGGCTTTTGATACAGGTGGTTTAGATGTCAAACCATCATCAGCA
ATGCAACTAATGCACAAAGATATGGGTGGTAGTGCTAATGCTATCGGTCTTGCCTATATGATTATGAAACATAAATTGCC
TATTAGATTAAGCTTGGTAATTCCTACTGTTGAAAATGCTATAGATGCAAAATCATATCGACCTAGTGATATTATTAAGA
TGAAAAATGGTACTAATGTACAAGTTACTAACACAGATGCTGAAGGGCGTTTGATTTTAGCTGAACCACTATATGAAGAG
GCACAAAAAAAACCTCAATATTTAATCGACTTCTCAACACTTACAGGAGCAGCTAGAGTCGCTGTTGGACCTGAGATTGC
AGCATTTTTCTGTAATAATGATGATGTGGCTAGCCAAGTATATAAGTATGCTCAGGCGACGCAAGATCAAGTTTGGCGTC
TACCATTAGCTGATTGTTATAGAAAGAATCTAGAAACTGAGTTTGCAGATATTTCACACTGTGATTTATCACCATTTGCA
GCAGCTGTGAAGGCGGCATTATTTATGGAGCATTTTGTTGGTATAAAAGATGCACCTACCTGGATTCATTTTGATATGAT
GGCATGGAATATTAGCTCTACGCCAGGTAAACCTAAAGGTGGAGAAATGATGGCTGTTAGAGCTATGTTTGAGATGCTTA
AGGATAAGTTTCCAGCTTAA
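
The GC content field is presumably the percentage of G and C bases in the gene sequence above; the record does not define it, so treat that as an assumption. A minimal Python sketch of that calculation, with an illustrative function name, is:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace so FASTA-style wrapped lines can be pasted directly.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Example on the first line of the gene sequence only.
first_line = (
    "ATGTATATATCAACCAAATTAGAGTGTTTTACACAACATAAACAAAATGATTCTTTGCC"
    "AATTTTTGTGATTAATAAGGA"
)
print(f"{gc_content(first_line):.2f}")
```

Passing all 1380 bases instead of the first line would give the gene-wide value for comparison with the GC content field.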

Upstream 100 bases:

>100_bases
GATTTTTAAGTTTATACAAAACAGCTGGTATTTTAACAGCTAAAACAGTGCAAGATTTTAATAATTGGTTACTATTAGGT
CAAGACGTGGAGCTATAAAG

Downstream 100 bases:

>100_bases
ATTAATTAATGGATATAGTAATACTACTCATGAATAAGTTATATTTTTTAGTATATAAACATAGCTAGTTTATTTACTAA
TGTATCTGAAGTTGTATACT

Product: cytosol aminopeptidase family protein

Products: NA

Alternate protein names: Leucine aminopeptidase; LAP; Leucyl aminopeptidase [H]

Number of amino acids: Translated: 459; Mature: 459

Protein sequence:

>459_residues
MYISTKLECFTQHKQNDSLPIFVINKDNFTNWLSSQDSFLQNFIKQFDEKTKIITVPNNLGNIQKVICLVSEDMFGIANL
PNQLAQGNYHIEYTDIADLSLYYIGFALGSYKFEKYKSKTQQSKVKLYLPQQYQHILATIEANYLVRDMITTPAEDMGPA
DIANVIQQLAKEFNADFEEIVGEELVEQGYMGIYTVGKGSHRSPRLVQLNWGDTAHPTVSIVGKGVAFDTGGLDVKPSSA
MQLMHKDMGGSANAIGLAYMIMKHKLPIRLSLVIPTVENAIDAKSYRPSDIIKMKNGTNVQVTNTDAEGRLILAEPLYEE
AQKKPQYLIDFSTLTGAARVAVGPEIAAFFCNNDDVASQVYKYAQATQDQVWRLPLADCYRKNLETEFADISHCDLSPFA
AAVKAALFMEHFVGIKDAPTWIHFDMMAWNISSTPGKPKGGEMMAVRAMFEMLKDKFPA
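
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. The sketch below uses the standard codon assignments and stops at the first stop codon; it is not necessarily the tool that produced this record, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate wrapped FASTA lines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTATATATCAACCAAATTAGAGTGT"))  # MYISTKLEC
print(translate("AAGGATAAGTTTCCAGCTTAA"))        # KDKFPA
```

The two fragments reproduce the first nine and last six residues of the protein sequence listed above.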

Sequences:

>Translated_459_residues
MYISTKLECFTQHKQNDSLPIFVINKDNFTNWLSSQDSFLQNFIKQFDEKTKIITVPNNLGNIQKVICLVSEDMFGIANL
PNQLAQGNYHIEYTDIADLSLYYIGFALGSYKFEKYKSKTQQSKVKLYLPQQYQHILATIEANYLVRDMITTPAEDMGPA
DIANVIQQLAKEFNADFEEIVGEELVEQGYMGIYTVGKGSHRSPRLVQLNWGDTAHPTVSIVGKGVAFDTGGLDVKPSSA
MQLMHKDMGGSANAIGLAYMIMKHKLPIRLSLVIPTVENAIDAKSYRPSDIIKMKNGTNVQVTNTDAEGRLILAEPLYEE
AQKKPQYLIDFSTLTGAARVAVGPEIAAFFCNNDDVASQVYKYAQATQDQVWRLPLADCYRKNLETEFADISHCDLSPFA
AAVKAALFMEHFVGIKDAPTWIHFDMMAWNISSTPGKPKGGEMMAVRAMFEMLKDKFPA
>Mature_459_residues
MYISTKLECFTQHKQNDSLPIFVINKDNFTNWLSSQDSFLQNFIKQFDEKTKIITVPNNLGNIQKVICLVSEDMFGIANL
PNQLAQGNYHIEYTDIADLSLYYIGFALGSYKFEKYKSKTQQSKVKLYLPQQYQHILATIEANYLVRDMITTPAEDMGPA
DIANVIQQLAKEFNADFEEIVGEELVEQGYMGIYTVGKGSHRSPRLVQLNWGDTAHPTVSIVGKGVAFDTGGLDVKPSSA
MQLMHKDMGGSANAIGLAYMIMKHKLPIRLSLVIPTVENAIDAKSYRPSDIIKMKNGTNVQVTNTDAEGRLILAEPLYEE
AQKKPQYLIDFSTLTGAARVAVGPEIAAFFCNNDDVASQVYKYAQATQDQVWRLPLADCYRKNLETEFADISHCDLSPFA
AAVKAALFMEHFVGIKDAPTWIHFDMMAWNISSTPGKPKGGEMMAVRAMFEMLKDKFPA

Specific function: Presumably involved in the processing and regular turnover of intracellular proteins. Catalyzes the removal of unsubstituted N-terminal amino acids from various peptides [H]

COG id: COG0260

COG function: function code E; Leucyl aminopeptidase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M17 family [H]

Homologues:

Organism=Homo sapiens, GI41393561, Length=375, Percent_Identity=31.47, Blast_Score=182, Evalue=6e-46
Organism=Homo sapiens, GI47155554, Length=293, Percent_Identity=31.74, Blast_Score=130, Evalue=3e-30
Organism=Escherichia coli, GI87082123, Length=291, Percent_Identity=38.83, Blast_Score=205, Evalue=6e-54
Organism=Escherichia coli, GI1790710, Length=334, Percent_Identity=32.93, Blast_Score=172, Evalue=3e-44
Organism=Caenorhabditis elegans, GI17556903, Length=287, Percent_Identity=32.75, Blast_Score=149, Evalue=4e-36
Organism=Caenorhabditis elegans, GI17565172, Length=326, Percent_Identity=27.30, Blast_Score=79, Evalue=5e-15
Organism=Drosophila melanogaster, GI21357381, Length=296, Percent_Identity=31.08, Blast_Score=135, Evalue=5e-32
Organism=Drosophila melanogaster, GI221379063, Length=296, Percent_Identity=31.08, Blast_Score=135, Evalue=6e-32
Organism=Drosophila melanogaster, GI221379062, Length=296, Percent_Identity=31.08, Blast_Score=135, Evalue=6e-32
Organism=Drosophila melanogaster, GI24661038, Length=300, Percent_Identity=30.33, Blast_Score=128, Evalue=7e-30
Organism=Drosophila melanogaster, GI21355725, Length=351, Percent_Identity=27.07, Blast_Score=127, Evalue=2e-29
Organism=Drosophila melanogaster, GI24662227, Length=331, Percent_Identity=26.59, Blast_Score=127, Evalue=2e-29
Organism=Drosophila melanogaster, GI21355645, Length=356, Percent_Identity=24.72, Blast_Score=121, Evalue=9e-28
Organism=Drosophila melanogaster, GI24662223, Length=356, Percent_Identity=24.72, Blast_Score=121, Evalue=9e-28
Organism=Drosophila melanogaster, GI20129969, Length=300, Percent_Identity=26.67, Blast_Score=120, Evalue=2e-27
Organism=Drosophila melanogaster, GI20129963, Length=333, Percent_Identity=24.62, Blast_Score=113, Evalue=3e-25
Organism=Drosophila melanogaster, GI161077148, Length=359, Percent_Identity=23.96, Blast_Score=110, Evalue=2e-24
Organism=Drosophila melanogaster, GI20130057, Length=359, Percent_Identity=23.96, Blast_Score=110, Evalue=2e-24
Organism=Drosophila melanogaster, GI19922386, Length=344, Percent_Identity=23.55, Blast_Score=106, Evalue=3e-23
Organism=Drosophila melanogaster, GI24646701, Length=225, Percent_Identity=26.22, Blast_Score=77, Evalue=2e-14
Organism=Drosophila melanogaster, GI24646703, Length=225, Percent_Identity=26.22, Blast_Score=77, Evalue=2e-14
Organism=Drosophila melanogaster, GI21358201, Length=225, Percent_Identity=26.22, Blast_Score=77, Evalue=2e-14
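
Each homologue line above is a comma-separated list of Key=Value fields, with the GI number given as a bare `GI...` token. The Python sketch below shows one way such a line might be parsed into a dictionary with typed values; the function name and the choice to store the GI number under a `"GI"` key are assumptions made for illustration.

```python
# Converters for the numeric fields; anything else is kept as a string.
CONVERTERS = {
    "Length": int,
    "Percent_Identity": float,
    "Blast_Score": int,
    "Evalue": float,
}

def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = CONVERTERS.get(key, str)(value)
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI token, stored without the prefix
    return record

line = ("Organism=Escherichia coli, GI87082123, Length=291, "
        "Percent_Identity=38.8316151202749, Blast_Score=205, Evalue=6e-54,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
# Escherichia coli 87082123 291 205
```

With the hits in this form, filtering is straightforward, for example keeping only records whose `Percent_Identity` is at least 30.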

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011356
- InterPro:   IPR000819
- InterPro:   IPR023042
- InterPro:   IPR008283 [H]

Pfam domain/function: PF00883 Peptidase_M17; PF02789 Peptidase_M17_N [H]

EC number: 3.4.11.1; 3.4.11.10 [H]

Molecular weight: Translated: 51357; Mature: 51357

Theoretical pI: Translated: 5.83; Mature: 5.83

Prosite motif: PS00631 CYTOSOL_AP

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.9 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)
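
The percentages above are simple residue frequencies. For a 459-residue protein, 1.1 % Cys and 3.9 % Met correspond to 5 cysteines and 18 methionines. A minimal Python sketch of the calculation follows; the function name is illustrative, and rounding to one decimal place is assumed from the precision shown in the record.

```python
from collections import Counter

def residue_percentages(protein: str, residues: str = "CM") -> dict:
    """Percentage of each requested residue in a one-letter protein sequence."""
    protein = "".join(protein.split()).upper()  # tolerate wrapped lines
    if not protein:
        raise ValueError("empty sequence")
    counts = Counter(protein)
    return {r: 100.0 * counts[r] / len(protein) for r in residues}

# Arithmetic check of the reported values from the residue counts.
print(round(100 * 5 / 459, 1))         # 1.1
print(round(100 * 18 / 459, 1))        # 3.9
print(round(100 * (5 + 18) / 459, 1))  # 5.0
```

Calling `residue_percentages` on the full protein sequence gives the per-residue values directly from the sequence text.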

Secondary structure:

>Translated Secondary Structure
MYISTKLECFTQHKQNDSLPIFVINKDNFTNWLSSQDSFLQNFIKQFDEKTKIITVPNNL
CCCCCHHHHHHHCCCCCCCEEEEEECCCCHHHHCCHHHHHHHHHHHHCCCCEEEECCCCC
GNIQKVICLVSEDMFGIANLPNQLAQGNYHIEYTDIADLSLYYIGFALGSYKFEKYKSKT
CCHHHHHHHHHCCCCCHHCCCHHHHCCCEEEEEECCCHHHHHHHHHHHCCHHHHHHHHHH
QQSKVKLYLPQQYQHILATIEANYLVRDMITTPAEDMGPADIANVIQQLAKEFNADFEEI
HHCEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCHHHH
VGEELVEQGYMGIYTVGKGSHRSPRLVQLNWGDTAHPTVSIVGKGVAFDTGGLDVKPSSA
HHHHHHHCCCCEEEEECCCCCCCCEEEEEECCCCCCCEEEEEECCEEECCCCCCCCCHHH
MQLMHKDMGGSANAIGLAYMIMKHKLPIRLSLVIPTVENAIDAKSYRPSDIIKMKNGTNV
HHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEEEECCHHHHHCCCCCCCCCEEEECCCCEE
QVTNTDAEGRLILAEPLYEEAQKKPQYLIDFSTLTGAARVAVGPEIAAFFCNNDDVASQV
EEECCCCCCEEEEECHHHHHHHCCCCEEEEEHHHCCCEEEEECCCEEEEEECCCHHHHHH
YKYAQATQDQVWRLPLADCYRKNLETEFADISHCDLSPFAAAVKAALFMEHFVGIKDAPT
HHHHHHCHHHHEECCHHHHHHHCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCC
WIHFDMMAWNISSTPGKPKGGEMMAVRAMFEMLKDKFPA
EEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MYISTKLECFTQHKQNDSLPIFVINKDNFTNWLSSQDSFLQNFIKQFDEKTKIITVPNNL
CCCCCHHHHHHHCCCCCCCEEEEEECCCCHHHHCCHHHHHHHHHHHHCCCCEEEECCCCC
GNIQKVICLVSEDMFGIANLPNQLAQGNYHIEYTDIADLSLYYIGFALGSYKFEKYKSKT
CCHHHHHHHHHCCCCCHHCCCHHHHCCCEEEEEECCCHHHHHHHHHHHCCHHHHHHHHHH
QQSKVKLYLPQQYQHILATIEANYLVRDMITTPAEDMGPADIANVIQQLAKEFNADFEEI
HHCEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCHHHH
VGEELVEQGYMGIYTVGKGSHRSPRLVQLNWGDTAHPTVSIVGKGVAFDTGGLDVKPSSA
HHHHHHHCCCCEEEEECCCCCCCCEEEEEECCCCCCCEEEEEECCEEECCCCCCCCCHHH
MQLMHKDMGGSANAIGLAYMIMKHKLPIRLSLVIPTVENAIDAKSYRPSDIIKMKNGTNV
HHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEEEECCHHHHHCCCCCCCCCEEEECCCCEE
QVTNTDAEGRLILAEPLYEEAQKKPQYLIDFSTLTGAARVAVGPEIAAFFCNNDDVASQV
EEECCCCCCEEEEECHHHHHHHCCCCEEEEEHHHCCCEEEEECCCEEEEEECCCHHHHHH
YKYAQATQDQVWRLPLADCYRKNLETEFADISHCDLSPFAAAVKAALFMEHFVGIKDAPT
HHHHHHCHHHHEECCHHHHHHHCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCC
WIHFDMMAWNISSTPGKPKGGEMMAVRAMFEMLKDKFPA
EEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA