Definition Mesorhizobium sp. BNC1, complete genome.
Accession NC_008254
Length 4,412,446

The map label for this gene is pepA [H]

Identifier: 110636277

GI number: 110636277

Start: 4256782

End: 4258149

Strand: Reverse

Name: pepA [H]

Synonym: Meso_3953

Alternate gene names: 110636277

Gene position: 4258149-4256782 (Counterclockwise)
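
The Start and End coordinates are inclusive, so the gene length and the expected protein length can be checked with simple arithmetic. A minimal sketch follows; the function and variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (1-based, inclusive).
START = 4256782
END = 4258149

def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return abs(end - start) + 1

length = gene_length(START, END)
codons = length // 3        # codon count, including the terminal stop codon
residues = codons - 1       # the stop codon is not translated

print(length, codons, residues)  # 1368 456 455
```

The result agrees with the 1368-base gene sequence and the 455-residue translated protein listed in this record.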

Preceding gene: 110636283

Following gene: 110636276

Centisome position: 96.5
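
Centisome position expresses the gene's location as a percentage of the genome length. The sketch below assumes the value is derived from the Start coordinate and the genome Length given at the top of the record; that derivation is an assumption, although it reproduces the listed value.

```python
GENOME_LENGTH = 4412446   # "Length" field of the record
START = 4256782           # "Start" field of the record

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to one decimal."""
    return round(100.0 * position / genome_length, 1)

print(centisome(START, GENOME_LENGTH))  # 96.5
```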

GC content: 63.6
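
GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of the calculation, shown on the first ten bases of the gene rather than the full sequence (the function name is illustrative):

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)

# First ten bases of the gene sequence listed in this record.
example = "ATGCCCCTAA"
print(gc_percent(example))  # 50.0
```

Applying the same function to the full 1368-base sequence should give a figure comparable to the value listed above, assuming the record uses the same definition.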

Gene sequence:

>1368_bases
ATGCCCCTAAACCTCACGGACCGCAAAGTGCCCGCTTCATTGCCTGTCTATGTCGCTGCCGGCGGCGCCCTGGAGGAATC
CGGCGCGGACGCGGCCGCCTGCGCGTGGGCGCGCTCACATGGATTTTCCGGGGCGGAAGGAGCAGTGCTCACCTTCCCCA
GCCGGGACGGTGCGCTGGCAGGAGCTTTCTTCGGCATTGGCGGCAACGGCTCCTCGCCGCTCATTTACGGCAAGCTCGCC
CGTGCACTGCCGGGTGGGGATTGGCACTTCGCTGTTCCCCCCAAGGACCCGCAGCTTATGGCCGTTGCCCTTCTCCTGGG
CGGTTATGCCTTCACTTCCTACGGCGCGAAGCCAACGGCCGACATTCGCTTCCTCGTGCCGGAAGGCGCGAGTGCGGACG
AGGCCCGCCGGGTGGCGGCGGGCGTGTTCCTGGCCCGCGATCTCGTCAACACACCGACGAACGATCTGGGGCCAGATGCA
TTGGAACGAGCCGCGCGGGAACTTGCCGAGACGCATGGCGCTGCCTTTGCCTCGGTCGTTGGCGAGAATCTTCTGCAACA
GAACTTCCCCATGATTCACGCTGTCGGCCGCGCAAGCGCCATTGCGCCCCGGCTTATGGACATGCGCTGGGGAAAGGAAA
ACGCGCCGAAGGTGACCCTCGTCGGCAAGGGCGTGTGTTTCGATACTGGCGGGCTCGACATAAAGACGTTGAGCGGCATG
CTGCTCATGAAGAAGGACATGGGCGGGGCAGCCAATGTGCTGGGCCTTGCATCCATGATCATGGCCGCAAACTTGCCGAT
CCGGCTTCGGGTCCTGATCCCCGCCGTTGAAAATTCCATCGCCGGCAATGCCTTCCGCCCCGGTGACGTTTTGAGAAGCC
GCAAGGGCATCACGGTCGAGATCGGCAATACGGACGCGGAGGGCCGGCTGATCCTGGCCGACGCGCTTGCACTCGCCGAC
GAGGAAGAACCGGAGCTTCTGGTGGATATGGCAACGCTCACAGGCGCGGCGCGCGTGGCTTTGGGGCCTGATTTGCCGCC
ATTTTACACGCGCGACGATCTCTTTGCGTCCGCATTGGCCGCAGCGTCCAACCGGACGGATGATCCACTCTGGCGCATGC
CGCTCTGGCAGCCTTATGCCGACAAATTGCGCTCCCGCATCGCCGACATAAACAATGTGACCAGCGACGGATTCGCGGGA
TCCGTCACGGCGGCTTTGTTTCTCCAGCGCTTCGTGGAGAGGGCGGCAACCTGGGTTCATTTCGATATTTTCGCATGGAA
CCCGACCGAGAAACCCACCTGCCCTGTTGGCGGCGAAGCGCAGGGTATCCGCGCGCTCTTCGCCTTGCTGCGTGAGCGGT
ATGGATAG

Upstream 100 bases:

>100_bases
CCGTAATCCCCTTATTGGCAAAGACGCCGCCTTTCCACCGCCAGAGGTAATCTGTTAACCCTAATGGAGTGTTAATTGAC
TGCCGAACAGGACGCCGCCA

Downstream 100 bases:

>100_bases
GCTTTCGCGCGGGCGAGGAAAGGCGTAATGATACGGTCCCGAAACGAATTCGGACGTTCTGATGCCGGTTGTGCTTCGAC
CGATCCAGGCATTGAGGCTT

Product: peptidase M17, leucyl aminopeptidase-like

Products: NA

Alternate protein names: Leucine aminopeptidase; LAP; Leucyl aminopeptidase [H]

Number of amino acids: Translated: 455; Mature: 454

Protein sequence:

>455_residues
MPLNLTDRKVPASLPVYVAAGGALEESGADAAACAWARSHGFSGAEGAVLTFPSRDGALAGAFFGIGGNGSSPLIYGKLA
RALPGGDWHFAVPPKDPQLMAVALLLGGYAFTSYGAKPTADIRFLVPEGASADEARRVAAGVFLARDLVNTPTNDLGPDA
LERAARELAETHGAAFASVVGENLLQQNFPMIHAVGRASAIAPRLMDMRWGKENAPKVTLVGKGVCFDTGGLDIKTLSGM
LLMKKDMGGAANVLGLASMIMAANLPIRLRVLIPAVENSIAGNAFRPGDVLRSRKGITVEIGNTDAEGRLILADALALAD
EEEPELLVDMATLTGAARVALGPDLPPFYTRDDLFASALAAASNRTDDPLWRMPLWQPYADKLRSRIADINNVTSDGFAG
SVTAALFLQRFVERAATWVHFDIFAWNPTEKPTCPVGGEAQGIRALFALLRERYG
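
The protein sequence follows from the gene sequence by codon-wise translation. The gene sequence in this record is already given in coding orientation (it starts with ATG and ends with TAG), so no reverse complement is needed. The sketch below builds the standard codon table and translates the first 30 bases; it assumes the standard genetic code and is not necessarily the tool used to produce the record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCCCCTAAACCTCACGGACCGCAAAGTG"))  # MPLNLTDRKV
```

The output matches the first ten residues of the translated protein.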

Sequences:

>Translated_455_residues
MPLNLTDRKVPASLPVYVAAGGALEESGADAAACAWARSHGFSGAEGAVLTFPSRDGALAGAFFGIGGNGSSPLIYGKLA
RALPGGDWHFAVPPKDPQLMAVALLLGGYAFTSYGAKPTADIRFLVPEGASADEARRVAAGVFLARDLVNTPTNDLGPDA
LERAARELAETHGAAFASVVGENLLQQNFPMIHAVGRASAIAPRLMDMRWGKENAPKVTLVGKGVCFDTGGLDIKTLSGM
LLMKKDMGGAANVLGLASMIMAANLPIRLRVLIPAVENSIAGNAFRPGDVLRSRKGITVEIGNTDAEGRLILADALALAD
EEEPELLVDMATLTGAARVALGPDLPPFYTRDDLFASALAAASNRTDDPLWRMPLWQPYADKLRSRIADINNVTSDGFAG
SVTAALFLQRFVERAATWVHFDIFAWNPTEKPTCPVGGEAQGIRALFALLRERYG
>Mature_454_residues
PLNLTDRKVPASLPVYVAAGGALEESGADAAACAWARSHGFSGAEGAVLTFPSRDGALAGAFFGIGGNGSSPLIYGKLAR
ALPGGDWHFAVPPKDPQLMAVALLLGGYAFTSYGAKPTADIRFLVPEGASADEARRVAAGVFLARDLVNTPTNDLGPDAL
ERAARELAETHGAAFASVVGENLLQQNFPMIHAVGRASAIAPRLMDMRWGKENAPKVTLVGKGVCFDTGGLDIKTLSGML
LMKKDMGGAANVLGLASMIMAANLPIRLRVLIPAVENSIAGNAFRPGDVLRSRKGITVEIGNTDAEGRLILADALALADE
EEPELLVDMATLTGAARVALGPDLPPFYTRDDLFASALAAASNRTDDPLWRMPLWQPYADKLRSRIADINNVTSDGFAGS
VTAALFLQRFVERAATWVHFDIFAWNPTEKPTCPVGGEAQGIRALFALLRERYG

Specific function: Presumably involved in the processing and regular turnover of intracellular proteins. Catalyzes the removal of unsubstituted N-terminal amino acids from various peptides [H]

COG id: COG0260

COG function: function code E; Leucyl aminopeptidase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M17 family [H]

Homologues:

Organism=Homo sapiens, GI41393561, Length=370, Percent_Identity=34.3243243243243, Blast_Score=183, Evalue=4e-46
Organism=Homo sapiens, GI47155554, Length=293, Percent_Identity=34.1296928327645, Blast_Score=135, Evalue=9e-32
Organism=Escherichia coli, GI87082123, Length=315, Percent_Identity=39.0476190476191, Blast_Score=205, Evalue=6e-54
Organism=Escherichia coli, GI1790710, Length=287, Percent_Identity=39.3728222996516, Blast_Score=183, Evalue=2e-47
Organism=Caenorhabditis elegans, GI17556903, Length=316, Percent_Identity=36.0759493670886, Blast_Score=142, Evalue=5e-34
Organism=Caenorhabditis elegans, GI17565172, Length=219, Percent_Identity=35.1598173515982, Blast_Score=88, Evalue=1e-17
Organism=Drosophila melanogaster, GI24661038, Length=287, Percent_Identity=33.4494773519164, Blast_Score=152, Evalue=4e-37
Organism=Drosophila melanogaster, GI21355725, Length=287, Percent_Identity=32.7526132404181, Blast_Score=151, Evalue=9e-37
Organism=Drosophila melanogaster, GI221379063, Length=288, Percent_Identity=35.4166666666667, Blast_Score=142, Evalue=3e-34
Organism=Drosophila melanogaster, GI221379062, Length=288, Percent_Identity=35.4166666666667, Blast_Score=142, Evalue=3e-34
Organism=Drosophila melanogaster, GI21357381, Length=288, Percent_Identity=35.4166666666667, Blast_Score=142, Evalue=5e-34
Organism=Drosophila melanogaster, GI21355645, Length=286, Percent_Identity=26.9230769230769, Blast_Score=118, Evalue=7e-27
Organism=Drosophila melanogaster, GI24662223, Length=286, Percent_Identity=26.9230769230769, Blast_Score=118, Evalue=7e-27
Organism=Drosophila melanogaster, GI24662227, Length=283, Percent_Identity=27.208480565371, Blast_Score=115, Evalue=4e-26
Organism=Drosophila melanogaster, GI20129969, Length=284, Percent_Identity=27.8169014084507, Blast_Score=112, Evalue=7e-25
Organism=Drosophila melanogaster, GI19922386, Length=283, Percent_Identity=28.6219081272085, Blast_Score=108, Evalue=9e-24
Organism=Drosophila melanogaster, GI161077148, Length=285, Percent_Identity=29.1228070175439, Blast_Score=101, Evalue=9e-22
Organism=Drosophila melanogaster, GI20130057, Length=285, Percent_Identity=29.1228070175439, Blast_Score=101, Evalue=9e-22
Organism=Drosophila melanogaster, GI20129963, Length=285, Percent_Identity=27.719298245614, Blast_Score=100, Evalue=3e-21
Organism=Drosophila melanogaster, GI24646701, Length=224, Percent_Identity=33.0357142857143, Blast_Score=93, Evalue=4e-19
Organism=Drosophila melanogaster, GI24646703, Length=224, Percent_Identity=33.0357142857143, Blast_Score=93, Evalue=4e-19
Organism=Drosophila melanogaster, GI21358201, Length=224, Percent_Identity=33.0357142857143, Blast_Score=93, Evalue=4e-19
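
Each homologue line is a comma-separated list of fields, so it can be parsed into a record for sorting or filtering. A minimal sketch, assuming the field layout seen in the lines above; the function name and the returned dictionary keys are illustrative.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    hit = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue                      # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]         # bare GI number, no key in the line
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit

line = ("Organism=Escherichia coli, GI87082123, Length=315, "
        "Percent_Identity=39.0476190476191, Blast_Score=205, Evalue=6e-54,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

With all lines parsed, the hits can be ranked by `Evalue` or `Blast_Score`, for example with `sorted(hits, key=lambda h: h["Evalue"])`.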

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011356
- InterPro:   IPR000819
- InterPro:   IPR023042
- InterPro:   IPR008283 [H]

Pfam domain/function: PF00883 Peptidase_M17; PF02789 Peptidase_M17_N [H]

EC number: 3.4.11.1; 3.4.11.10 [H]

Molecular weight: Translated: 48006; Mature: 47875
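
The Translated and Mature values differ by 131, roughly the average mass of one methionine residue, which fits the Mature sequence lacking the initial M. The sketch below shows how such a weight can be estimated by summing average residue masses and adding one water molecule. The mass table holds commonly used average values and is an assumption; the tool behind this record may use a different table or rounding.

```python
# Average residue masses in Da (amino acid minus water); assumed values.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per chain for the free N- and C-termini

def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER

# Removing an N-terminal Met lowers the weight by one Met residue mass.
with_met = molecular_weight("MPLNLTDRKV")
without_met = molecular_weight("PLNLTDRKV")
print(round(with_met - without_met, 2))  # 131.19
```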

Theoretical pI: Translated: 5.51; Mature: 5.51

Prosite motif: PS00631 CYTOSOL_AP

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
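
These percentages are residue counts divided by sequence length. Counting the translated sequence by hand gives 3 Cys and 12 Met in 455 residues; the sketch below uses those counts, together with a generic helper shown on a short N-terminal fragment. The function names are illustrative.

```python
def percent(count: int, total: int) -> float:
    """Share of `count` in `total`, as a percentage with one decimal."""
    return round(100.0 * count / total, 1)

def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = protein.upper()
    hits = sum(protein.count(r) for r in residues.upper())
    return percent(hits, len(protein))

# Translated protein: 3 Cys and 12 Met in 455 residues (hand count).
print(percent(3, 455), percent(12, 455), percent(15, 455))   # 0.7 2.6 3.3
# Mature protein: one Met fewer, 454 residues.
print(percent(3, 454), percent(11, 454), percent(14, 454))   # 0.7 2.4 3.1

# Generic helper on the first ten residues of the translated protein.
print(residue_percent("MPLNLTDRKV", "CM"))  # 10.0
```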

Secondary structure:

>Translated Secondary Structure
MPLNLTDRKVPASLPVYVAAGGALEESGADAAACAWARSHGFSGAEGAVLTFPSRDGALA
CCCCCCCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCEEEECCCCCCEE
GAFFGIGGNGSSPLIYGKLARALPGGDWHFAVPPKDPQLMAVALLLGGYAFTSYGAKPTA
EEEEEECCCCCCCEEHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCC
DIRFLVPEGASADEARRVAAGVFLARDLVNTPTNDLGPDALERAARELAETHGAAFASVV
CEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
GENLLQQNFPMIHAVGRASAIAPRLMDMRWGKENAPKVTLVGKGVCFDTGGLDIKTLSGM
HHHHHHHCCCEEEECCCHHHHHHHHHHHHCCCCCCCEEEEEECCEEEECCCCCHHHHCCE
LLMKKDMGGAANVLGLASMIMAANLPIRLRVLIPAVENSIAGNAFRPGDVLRSRKGITVE
EEEECCCCCHHHHHHHHHHHHHCCCCEEEEEEEEHHHCCCCCCCCCCHHHHHCCCCCEEE
IGNTDAEGRLILADALALADEEEPELLVDMATLTGAARVALGPDLPPFYTRDDLFASALA
ECCCCCCCCEEEEEHHHHCCCCCCHHEEEHHHHCCCEEEEECCCCCCCCCHHHHHHHHHH
AASNRTDDPLWRMPLWQPYADKLRSRIADINNVTSDGFAGSVTAALFLQRFVERAATWVH
HHCCCCCCCEECCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCEEE
FDIFAWNPTEKPTCPVGGEAQGIRALFALLRERYG
EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
PLNLTDRKVPASLPVYVAAGGALEESGADAAACAWARSHGFSGAEGAVLTFPSRDGALA
CCCCCCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCEEEECCCCCCEE
GAFFGIGGNGSSPLIYGKLARALPGGDWHFAVPPKDPQLMAVALLLGGYAFTSYGAKPTA
EEEEEECCCCCCCEEHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCC
DIRFLVPEGASADEARRVAAGVFLARDLVNTPTNDLGPDALERAARELAETHGAAFASVV
CEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
GENLLQQNFPMIHAVGRASAIAPRLMDMRWGKENAPKVTLVGKGVCFDTGGLDIKTLSGM
HHHHHHHCCCEEEECCCHHHHHHHHHHHHCCCCCCCEEEEEECCEEEECCCCCHHHHCCE
LLMKKDMGGAANVLGLASMIMAANLPIRLRVLIPAVENSIAGNAFRPGDVLRSRKGITVE
EEEECCCCCHHHHHHHHHHHHHCCCCEEEEEEEEHHHCCCCCCCCCCHHHHHCCCCCEEE
IGNTDAEGRLILADALALADEEEPELLVDMATLTGAARVALGPDLPPFYTRDDLFASALA
ECCCCCCCCEEEEEHHHHCCCCCCHHEEEHHHHCCCEEEEECCCCCCCCCHHHHHHHHHH
AASNRTDDPLWRMPLWQPYADKLRSRIADINNVTSDGFAGSVTAALFLQRFVERAATWVH
HHCCCCCCCEECCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCEEE
FDIFAWNPTEKPTCPVGGEAQGIRALFALLRERYG
EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCC
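
In the blocks above, each residue line is paired with a state line in which H, E and C conventionally denote helix, strand and coil. A minimal sketch for summarising such a state string is shown below on a short made-up string, not on the record's data; the function name is illustrative.

```python
from collections import Counter

def state_fractions(states: str) -> dict:
    """Fraction of each secondary-structure state (H, E, C) in a string."""
    counts = Counter(states.upper())
    total = sum(counts.values())
    return {s: counts.get(s, 0) / total for s in "HEC"}

# Hypothetical ten-residue prediction string, for illustration only.
toy = "CCEEEHHHHC"
print(state_fractions(toy))  # {'H': 0.4, 'E': 0.3, 'C': 0.3}
```

To apply it to this record, concatenate the state lines (every second line of a block) before calling the function.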

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11792842 [H]