Definition Nocardioides sp. JS614 chromosome, complete genome.
Accession NC_008699
Length 4,985,871

The map label for this gene is pepN [H]

Identifier: 119717456

GI number: 119717456

Start: 3433521

End: 3434981

Strand: Reverse

Name: pepN [H]

Synonym: Noca_3232

Alternate gene names: 119717456

Gene position: 3434981-3433521 (Counterclockwise)

Preceding gene: 119717460

Following gene: 119717453

Centisome position: 68.89
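
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the first base of the gene (for a reverse-strand gene, the larger coordinate) expressed as a percentage of the genome length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Length, Start, End and Strand fields of this record.
GENOME_LENGTH = 4_985_871
START, END = 3_433_521, 3_434_981
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome.

    Assumption: on the reverse strand the gene begins at the larger coordinate.
    """
    first_base = max(start, end) if strand == "Reverse" else min(start, end)
    return round(100 * first_base / genome_length, 2)


length_bp = gene_length(START, END)
codons = length_bp // 3  # includes the stop codon
print(length_bp, codons, centisome(START, END, STRAND, GENOME_LENGTH))
```

Under these assumptions the gene spans 1461 bases, that is 487 codons (486 residues plus a stop codon), which agrees with the sequence headers in this record.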

GC content: 72.14

Gene sequence:

>1461_bases
GTGGCCACCCGTCCCATCGCCCTCCCGACCGCCGCACTGGCCGCTGTCCTGGCGCTCACCGGTGCCCTCGCCCTGACCGG
TCCGTCCACCTCCGCCGAGCCGGCCGGGGCCCGTGCCGCAGCTCCCGCGGGCGCAGCCGGCATCGGCGACCCGTACTTCC
CGCTGGACGGCAACGGCGGCATCGACGTGCTGCGCTACGACGTCCACGACCGCTACCGCTTCGGCGACCGGCACCTGTCC
GGATGGACCACGGTGACCGTGCGCGCGACCGAGTCGCTGTCGAGCTTCAACCTCGACCTCCTGCTCCCGGTCCGCTCCGT
CACCGTCGACGGCCGCGACGCCGCCTTCGAGAAGACGCATCACGAGCTGGTCGTGAAGCGCCCGGTCGCGGTGGGCGAGA
CCGTGCGCGTGGTGGTGCGGTACGCCGGGTGGCCCTCGGCGTACTCCTACGAGGGCGAGGGCAACTGGCTGGCTGGCGAG
CGCGAGGTCGTCGCGATGAACCAGCCGCACATGGCGCCGTGGTGGTTCCCGGCCAACGACCACCCGCTCGACCGGGCGCA
CATCCGGATCAGCATCACGGTCCCGAAGGAGAACCAGGTCATCGCCGGCGGCCGCCTGGTCGGTCGGGAGGTGCACGGCC
GGCTGGCGACCACGACCTGGCAGGCGCGGGAACCGATGGTGCCGTACCTGGCGTTCTTCGCCGCCGGCCGGTTCGAGGTC
GAGCGCGGTGACAGCCTCGGCCGGCCCTGGTACTCGGCGGTCTCGAAGCAGCTGAGCCCCGCCCAGCGACAGGTGGCGAT
GGGGCTGATGAAGAAGACGCCGCGGCTGCTGCGGTGGCTCGAGCGGCAGGTCGGCGACTACCCGTTCACCACGACCGGCG
GGCTGACCACGGCCCTCCCGGTCGGCTTCGCGCTCGAGAACCAGACCCGGCCGACCTATCCGTGGATGGGCGACGGCCCC
GGCGCGGTGAAGACGGTGGTGCACGAGCTCGCCCACCAGTGGTTCGGGGACTCGGTCGCGGTCGAGGGCTGGACCGACAT
CTGGCTCAACGAGGGCTGGGCGACGTACTTCGAGCAGTACTACAGCGAGCAGCACGGCGGGCCGACGACGGACGCGTGGT
TGCGCGAGGCGTACCAGTCGGATGCCGACGACGCGTTCTGGAGCCACGAGGTCGCGGACCCGTGCCCGGGCCGCGAGGAC
TGCGTGAGCTGGATCTTCGCTCCCTTCGTGTACCAGCGTGGGGGCATGGCGCTGGCGGCGCTGCGCAACCGGATCGGTGA
CGCCGACTTCACCACCCTGACCCGGCAGTGGGCGCTCGAGCGTGCGGGCAGCACCGGCAACACCGCGCAGTTCCAGGCTC
TCGCCGAGCAGGTCAGCGGCCAGGACCTGGGCGGGTTCTTCGACGCCTGGGTGCGCTCGACGACCAAGCCGGCGGACACG
GCGGCCAACGGACTCGGGTGA
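
The gene sequence is listed on its coding strand and begins with GTG rather than ATG. A minimal sketch of how the GC content and the protein sequence could be derived from it follows. It assumes the standard codon assignments and treats ATG, GTG and TTG as possible start codons read as Met at the first position, as is common for bacterial genes; the record itself does not say which translation table was used, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons commonly accepted for bacterial genes.
START_CODONS = {"ATG", "GTG", "TTG"}


def gc_content(dna: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    dna = dna.upper()
    return round(100 * (dna.count("G") + dna.count("C")) / len(dna), 2)


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        # A start codon in first position is read as Met.
        aa = "M" if index == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCCACCCGTCCCATCGCCCTCCCGACC"))  # MATRPIALPT
```

Applied to the first ten codons, the sketch reproduces the start of the protein sequence given in this record; the same functions can be run on the full 1461-base sequence.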

Upstream 100 bases:

>100_bases
TCGAGTATCGCCGCGGCGGTTGCGGCACCGCAACCGAATATGCGCCGGCCCGGCACCCGTCGAGGCCACCGACAGTCCGC
CCGTCCACTAGGTTGCTCGC

Downstream 100 bases:

>100_bases
TCAGCCCAGCAGCGGCGAGACCACCCGGGCCACGACGCTCGGCAGCCGGGTGACGGACTCCTCCGCGTCGGCGAGGGTCA
TCGCCCGCAGGCCGACGTTG

Product: peptidase M1, membrane alanine aminopeptidase

Products: NA

Alternate protein names: Alanine aminopeptidase; Lysyl aminopeptidase; Lys-AP [H]

Number of amino acids: Translated: 486; Mature: 485

Protein sequence:

>486_residues
MATRPIALPTAALAAVLALTGALALTGPSTSAEPAGARAAAPAGAAGIGDPYFPLDGNGGIDVLRYDVHDRYRFGDRHLS
GWTTVTVRATESLSSFNLDLLLPVRSVTVDGRDAAFEKTHHELVVKRPVAVGETVRVVVRYAGWPSAYSYEGEGNWLAGE
REVVAMNQPHMAPWWFPANDHPLDRAHIRISITVPKENQVIAGGRLVGREVHGRLATTTWQAREPMVPYLAFFAAGRFEV
ERGDSLGRPWYSAVSKQLSPAQRQVAMGLMKKTPRLLRWLERQVGDYPFTTTGGLTTALPVGFALENQTRPTYPWMGDGP
GAVKTVVHELAHQWFGDSVAVEGWTDIWLNEGWATYFEQYYSEQHGGPTTDAWLREAYQSDADDAFWSHEVADPCPGRED
CVSWIFAPFVYQRGGMALAALRNRIGDADFTTLTRQWALERAGSTGNTAQFQALAEQVSGQDLGGFFDAWVRSTTKPADT
AANGLG

Sequences:

>Translated_486_residues
MATRPIALPTAALAAVLALTGALALTGPSTSAEPAGARAAAPAGAAGIGDPYFPLDGNGGIDVLRYDVHDRYRFGDRHLS
GWTTVTVRATESLSSFNLDLLLPVRSVTVDGRDAAFEKTHHELVVKRPVAVGETVRVVVRYAGWPSAYSYEGEGNWLAGE
REVVAMNQPHMAPWWFPANDHPLDRAHIRISITVPKENQVIAGGRLVGREVHGRLATTTWQAREPMVPYLAFFAAGRFEV
ERGDSLGRPWYSAVSKQLSPAQRQVAMGLMKKTPRLLRWLERQVGDYPFTTTGGLTTALPVGFALENQTRPTYPWMGDGP
GAVKTVVHELAHQWFGDSVAVEGWTDIWLNEGWATYFEQYYSEQHGGPTTDAWLREAYQSDADDAFWSHEVADPCPGRED
CVSWIFAPFVYQRGGMALAALRNRIGDADFTTLTRQWALERAGSTGNTAQFQALAEQVSGQDLGGFFDAWVRSTTKPADT
AANGLG
>Mature_485_residues
ATRPIALPTAALAAVLALTGALALTGPSTSAEPAGARAAAPAGAAGIGDPYFPLDGNGGIDVLRYDVHDRYRFGDRHLSG
WTTVTVRATESLSSFNLDLLLPVRSVTVDGRDAAFEKTHHELVVKRPVAVGETVRVVVRYAGWPSAYSYEGEGNWLAGER
EVVAMNQPHMAPWWFPANDHPLDRAHIRISITVPKENQVIAGGRLVGREVHGRLATTTWQAREPMVPYLAFFAAGRFEVE
RGDSLGRPWYSAVSKQLSPAQRQVAMGLMKKTPRLLRWLERQVGDYPFTTTGGLTTALPVGFALENQTRPTYPWMGDGPG
AVKTVVHELAHQWFGDSVAVEGWTDIWLNEGWATYFEQYYSEQHGGPTTDAWLREAYQSDADDAFWSHEVADPCPGREDC
VSWIFAPFVYQRGGMALAALRNRIGDADFTTLTRQWALERAGSTGNTAQFQALAEQVSGQDLGGFFDAWVRSTTKPADTA
ANGLG

Specific function: Aminopeptidase with broad substrate specificity for several peptides. It has a higher affinity for oligopeptides than for dipeptides. It plays an essential role in metabolism and may be involved in nitrogen supply or protein turnover [H]

COG id: COG0308

COG function: function code E; Aminopeptidase N

Gene ontology:

Cell location: Cytoplasm. Note: it may be secreted through an unknown mechanism (by similarity) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M1 family [H]

Homologues:

Organism=Homo sapiens, GI132814467, Length=336, Percent_Identity=25.8928571428571, Blast_Score=92, Evalue=2e-18,
Organism=Homo sapiens, GI158937236, Length=326, Percent_Identity=26.3803680981595, Blast_Score=89, Evalue=1e-17,
Organism=Homo sapiens, GI310123622, Length=280, Percent_Identity=29.2857142857143, Blast_Score=82, Evalue=8e-16,
Organism=Homo sapiens, GI310133497, Length=280, Percent_Identity=28.9285714285714, Blast_Score=81, Evalue=3e-15,
Organism=Homo sapiens, GI194306629, Length=304, Percent_Identity=25.6578947368421, Blast_Score=74, Evalue=3e-13,
Organism=Homo sapiens, GI11641261, Length=304, Percent_Identity=25.6578947368421, Blast_Score=74, Evalue=3e-13,
Organism=Caenorhabditis elegans, GI115533276, Length=440, Percent_Identity=24.0909090909091, Blast_Score=74, Evalue=2e-13,
Organism=Caenorhabditis elegans, GI115533278, Length=440, Percent_Identity=24.0909090909091, Blast_Score=74, Evalue=2e-13,
Organism=Caenorhabditis elegans, GI17565628, Length=240, Percent_Identity=29.5833333333333, Blast_Score=67, Evalue=2e-11,
Organism=Saccharomyces cerevisiae, GI9755335, Length=344, Percent_Identity=23.8372093023256, Blast_Score=71, Evalue=4e-13,
Organism=Saccharomyces cerevisiae, GI6321837, Length=323, Percent_Identity=23.2198142414861, Blast_Score=64, Evalue=8e-11,
Organism=Drosophila melanogaster, GI281362221, Length=401, Percent_Identity=25.1870324189526, Blast_Score=77, Evalue=2e-14,
Organism=Drosophila melanogaster, GI21358341, Length=300, Percent_Identity=26, Blast_Score=75, Evalue=7e-14,
Organism=Drosophila melanogaster, GI221330574, Length=337, Percent_Identity=27.299703264095, Blast_Score=75, Evalue=7e-14,
Organism=Drosophila melanogaster, GI24648784, Length=401, Percent_Identity=25.1870324189526, Blast_Score=75, Evalue=1e-13,
Organism=Drosophila melanogaster, GI161078673, Length=117, Percent_Identity=35.042735042735, Blast_Score=72, Evalue=1e-12,
Organism=Drosophila melanogaster, GI28571901, Length=117, Percent_Identity=35.8974358974359, Blast_Score=72, Evalue=1e-12,
Organism=Drosophila melanogaster, GI24655257, Length=153, Percent_Identity=28.1045751633987, Blast_Score=68, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24655252, Length=159, Percent_Identity=27.6729559748428, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24655274, Length=159, Percent_Identity=27.6729559748428, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24655260, Length=159, Percent_Identity=27.6729559748428, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24655265, Length=159, Percent_Identity=27.6729559748428, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24655268, Length=159, Percent_Identity=27.6729559748428, Blast_Score=67, Evalue=2e-11,
Organism=Drosophila melanogaster, GI24651025, Length=293, Percent_Identity=25.2559726962457, Blast_Score=67, Evalue=3e-11,
Organism=Drosophila melanogaster, GI24651023, Length=293, Percent_Identity=25.2559726962457, Blast_Score=67, Evalue=3e-11,
Organism=Drosophila melanogaster, GI24651021, Length=293, Percent_Identity=25.2559726962457, Blast_Score=67, Evalue=3e-11,
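
Each homologue line above is a comma-separated list of key=value pairs, except for the GI number, which appears without an equals sign. The following sketch shows one way such a line might be parsed into a typed record; the `Hit` class and `parse_hit` function are hypothetical names introduced here for illustration.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


def parse_hit(line: str) -> Hit:
    """Parse one 'Organism=..., GI..., Length=..., ...' line."""
    fields = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            # The GI field has no '=' in this record format.
            fields["GI"] = token[2:]
    return Hit(
        organism=fields["Organism"],
        gi=fields["GI"],
        length=int(fields["Length"]),
        percent_identity=float(fields["Percent_Identity"]),
        blast_score=int(fields["Blast_Score"]),
        evalue=float(fields["Evalue"]),
    )


hit = parse_hit(
    "Organism=Homo sapiens, GI132814467, Length=336, "
    "Percent_Identity=25.8928571428571, Blast_Score=92, Evalue=2e-18,"
)
print(hit.organism, hit.gi, round(hit.percent_identity, 2))
```

For the first human hit, the listed percent identity corresponds to 87 identical positions out of 336, which suggests that Length is the alignment length rather than the length of the matched protein; the record does not state this explicitly.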

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001930
- InterPro:   IPR014782 [H]

Pfam domain/function: PF01433 Peptidase_M1 [H]

EC number: 3.4.11.2 [H]

Molecular weight: Translated: 53132; Mature: 53001
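
The two molecular weights differ by 131, which is consistent with the mature form lacking only the initiator methionine (a Met residue contributes about 131.19 Da). The record does not say how the weights were computed. A minimal sketch, assuming the usual approach of summing average residue masses and adding one water molecule, is given below; the mass table and function name are assumptions of this example and not part of the record.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified polypeptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Removing an N-terminal Met lowers the weight by one Met residue mass.
with_met = molecular_weight("MATRPIALPT")
without_met = molecular_weight("ATRPIALPT")
print(round(with_met - without_met, 2))
```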

Theoretical pI: Translated: 5.94; Mature: 5.94

Prosite motif: PS00142 ZINC_PROTEASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)
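
The percentages above can be reproduced by counting residues in the protein sequence. The sketch below assumes that the mature protein is the translated sequence without its first residue, as the two sequences in this record indicate, and that the combined Cys+Met figure is computed from the summed counts rather than by adding the two rounded percentages. The function name is illustrative.

```python
# Translated protein sequence, copied from this record.
TRANSLATED = (
    "MATRPIALPTAALAAVLALTGALALTGPSTSAEPAGARAAAPAGAAGIGDPYFPLDGNGGIDVLRYDVHDRYRFGDRHLS"
    "GWTTVTVRATESLSSFNLDLLLPVRSVTVDGRDAAFEKTHHELVVKRPVAVGETVRVVVRYAGWPSAYSYEGEGNWLAGE"
    "REVVAMNQPHMAPWWFPANDHPLDRAHIRISITVPKENQVIAGGRLVGREVHGRLATTTWQAREPMVPYLAFFAAGRFEV"
    "ERGDSLGRPWYSAVSKQLSPAQRQVAMGLMKKTPRLLRWLERQVGDYPFTTTGGLTTALPVGFALENQTRPTYPWMGDGP"
    "GAVKTVVHELAHQWFGDSVAVEGWTDIWLNEGWATYFEQYYSEQHGGPTTDAWLREAYQSDADDAFWSHEVADPCPGRED"
    "CVSWIFAPFVYQRGGMALAALRNRIGDADFTTLTRQWALERAGSTGNTAQFQALAEQVSGQDLGGFFDAWVRSTTKPADT"
    "AANGLG"
)
# Assumption: only the initiator Met is removed in the mature form.
MATURE = TRANSLATED[1:]


def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met, each rounded to one decimal."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


print(cys_met_content(TRANSLATED))
print(cys_met_content(MATURE))
```

The translated sequence contains 2 Cys and 8 Met residues, so summing the rounded values (0.4 + 1.6) would give 2.0 rather than the listed 2.1.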

Secondary structure:

>Translated Secondary Structure
MATRPIALPTAALAAVLALTGALALTGPSTSAEPAGARAAAPAGAAGIGDPYFPLDGNGG
CCCCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
IDVLRYDVHDRYRFGDRHLSGWTTVTVRATESLSSFNLDLLLPVRSVTVDGRDAAFEKTH
CEEEEECCCHHHHCCCCCCCCEEEEEEEEECCCCCCCCEEEEEEEEEEECCCHHHHHHHH
HELVVKRPVAVGETVRVVVRYAGWPSAYSYEGEGNWLAGEREVVAMNQPHMAPWWFPAND
HHHEEECCCCCCHHHEEHHHCCCCCCCEEECCCCCEECCCCEEEEECCCCCCCEEECCCC
HPLDRAHIRISITVPKENQVIAGGRLVGREVHGRLATTTWQAREPMVPYLAFFAAGRFEV
CCCCCEEEEEEEEECCCCCEEECCEEECHHHCCCEEEEEECCCCCCHHHHHHHHCCCEEE
ERGDSLGRPWYSAVSKQLSPAQRQVAMGLMKKTPRLLRWLERQVGDYPFTTTGGLTTALP
ECCCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCHHHHH
VGFALENQTRPTYPWMGDGPGAVKTVVHELAHQWFGDSVAVEGWTDIWLNEGWATYFEQY
CCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCEEEECCCHHHHHHHH
YSEQHGGPTTDAWLREAYQSDADDAFWSHEVADPCPGREDCVSWIFAPFVYQRGGMALAA
HHHHCCCCCHHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHH
LRNRIGDADFTTLTRQWALERAGSTGNTAQFQALAEQVSGQDLGGFFDAWVRSTTKPADT
HHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCHH
AANGLG
HCCCCC
>Mature Secondary Structure 
ATRPIALPTAALAAVLALTGALALTGPSTSAEPAGARAAAPAGAAGIGDPYFPLDGNGG
CCCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
IDVLRYDVHDRYRFGDRHLSGWTTVTVRATESLSSFNLDLLLPVRSVTVDGRDAAFEKTH
CEEEEECCCHHHHCCCCCCCCEEEEEEEEECCCCCCCCEEEEEEEEEEECCCHHHHHHHH
HELVVKRPVAVGETVRVVVRYAGWPSAYSYEGEGNWLAGEREVVAMNQPHMAPWWFPAND
HHHEEECCCCCCHHHEEHHHCCCCCCCEEECCCCCEECCCCEEEEECCCCCCCEEECCCC
HPLDRAHIRISITVPKENQVIAGGRLVGREVHGRLATTTWQAREPMVPYLAFFAAGRFEV
CCCCCEEEEEEEEECCCCCEEECCEEECHHHCCCEEEEEECCCCCCHHHHHHHHCCCEEE
ERGDSLGRPWYSAVSKQLSPAQRQVAMGLMKKTPRLLRWLERQVGDYPFTTTGGLTTALP
ECCCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCHHHHH
VGFALENQTRPTYPWMGDGPGAVKTVVHELAHQWFGDSVAVEGWTDIWLNEGWATYFEQY
CCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCEEEECCCHHHHHHHH
YSEQHGGPTTDAWLREAYQSDADDAFWSHEVADPCPGREDCVSWIFAPFVYQRGGMALAA
HHHHCCCCCHHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHH
LRNRIGDADFTTLTRQWALERAGSTGNTAQFQALAEQVSGQDLGGFFDAWVRSTTKPADT
HHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCHH
AANGLG
HCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11337471 [H]