Definition | Bacillus anthracis str. Sterne chromosome, complete genome. |
---|---|
Accession | NC_005945 |
Length | 5,228,663 |
Click here to switch to the map view.
The map label for this gene is pepA
Identifier: 49187782
GI number: 49187782
Start: 4681940
End: 4683424
Strand: Reverse
Name: pepA
Synonym: BAS4792
Alternate gene names: 49187782
Gene position: 4683424-4681940 (Counterclockwise)
Preceding gene: 49187783
Following gene: 49187780
Centisome position: 89.57
GC content: 39.93
Gene sequence:
>1485_bases ATGTTCCAAGTACAAAAAGAATTAGCAAGCCATGAAGCGGTGGTTGTTGCCTTATTTGAGGAAGAGAAAACGAGTAGCTT TGTACAAGAACTGGACAAAGCGTTTGAAGGACAATTACAAGTTTTATTAGAAGAGAAAGAACTTTCTACGAAGAAGAAAG CTATTTCAAAAGTACATAGTTTAGGAAAAACAGATGTGAAACGTTATTATTTCGTTGGTTTAGGTAAGAAAGAATCTTAT ACGACAGAAACATTACGTTCGGCTCTTGGTAAGACGTTTAAAACACTGCAAGCAGCAAAAGTACAAGATGCAGCAATCTT ACTTGATTCTTTCGTAACAGAGAAATTAGACGCGATTGATGTCGCTCATATTGCGGCAGAAGTACAAGGCCTTGGTACAT ATGAGTTACAAACGTATAAATCAGATAAAAAGGATCGTGTAGAGTTAGAGAAGTTTACGGCTATTACAGCGGAAGATGCA CAGGAAATTGAAGCGGCATTAACAGTTGGCTACGTACATGGACGTGCAACAAATTCAGCTCGTACACTTGTAAATATGCC GCCAAACGTATTAACAGCAACAAAGCTTGCTGAGTATGCTGTTGAGTTAGCGGAAAAGTATGATATGGATTATAAAGTTC TTGAGAAAGAAGAGATGGAAGAGCTTGGTATGGGTGCATTACTTGCAGTAAACCAAGGTAGTGTAGAGCCACCAAAAATG ATCGCTCTTATTTATAAAGGAAAAGAAGAGTGGACAGATGTTATTGGATTCGTTGGAAAAGGAATCACATATGATACAGG TGGTTATTCTTTAAAACCACGTGAAGGTATGGTCGGTATGAAAGGTGATATGGGCGGTGCAGCTGCTGTATTAGGTGCGA TGGAAATTATCGGTGAACTTCGCCCAGAGCAAAATGTGATCGCTGTTATTCCATCAACAGATAACGTTGTAAGTGGTACA GCATTTAAGCCAGACGATGTAATCACATCTATGAGCGGAAAAACAATTGAAGTATTAAATACAGATGCAGAAGGTCGCTT AGCATTAGCTGACGGTATTACGTATGCGAAAAAATTAGGTGCAAACTATTTGATTGATGTTGCGACATTAACAGGTGGTG TAATTGTTGCACTTGGAAATCATACGACAGGTGCGATGACAAACAATGAAGAGTTGTTTGAGCAAGTACTAGAGGCATCT ATGGAAACAGATGAATCTATTTGGCAATTACCGATTTTTGATCGTGATAAAGAGCGCGTGAGAAACAGTAAGTTTGCTGA TTTAAATAACTCGCCAGGCCGCGAAGGACATGCGGTTATGGCAGGTACATTCATAGGTGAATTCGCTGAAGATACACCTT GGGTACACTTAGACATCGCTGGAACATCTGAATCATCAGGAGCGCATGATTTAGGACCAGCTGGAGCAACAGGTGCAATG GTACGTACACTTGCAACACTTGTTGAACGTTTCGGCGAAGAATAA
Upstream 100 bases:
>100_bases ATAGGGACTAGATAGAATGTAACAGAAGTTAGAATTGTCGCATTTTATCGAAAATAAAGTTATAATAAAGGTAGAGTAGA GAAATTTAGGGGGTCTGAAC
Downstream 100 bases:
>100_bases GATTATAAGAGAAGGCTGTTCCTACATTTTGGAACAGCCTTTTTATTGATATATGTGGTGTTACGGCTGATGTAATTTCA CTTTATAAAGAAATTGTATA
Product: leucyl aminopeptidase
Products: NA
Alternate protein names: Leucine aminopeptidase; LAP; Leucyl aminopeptidase
Number of amino acids: Translated: 494; Mature: 494
Protein sequence:
>494_residues MFQVQKELASHEAVVVALFEEEKTSSFVQELDKAFEGQLQVLLEEKELSTKKKAISKVHSLGKTDVKRYYFVGLGKKESY TTETLRSALGKTFKTLQAAKVQDAAILLDSFVTEKLDAIDVAHIAAEVQGLGTYELQTYKSDKKDRVELEKFTAITAEDA QEIEAALTVGYVHGRATNSARTLVNMPPNVLTATKLAEYAVELAEKYDMDYKVLEKEEMEELGMGALLAVNQGSVEPPKM IALIYKGKEEWTDVIGFVGKGITYDTGGYSLKPREGMVGMKGDMGGAAAVLGAMEIIGELRPEQNVIAVIPSTDNVVSGT AFKPDDVITSMSGKTIEVLNTDAEGRLALADGITYAKKLGANYLIDVATLTGGVIVALGNHTTGAMTNNEELFEQVLEAS METDESIWQLPIFDRDKERVRNSKFADLNNSPGREGHAVMAGTFIGEFAEDTPWVHLDIAGTSESSGAHDLGPAGATGAM VRTLATLVERFGEE
Sequences:
>Translated_494_residues MFQVQKELASHEAVVVALFEEEKTSSFVQELDKAFEGQLQVLLEEKELSTKKKAISKVHSLGKTDVKRYYFVGLGKKESY TTETLRSALGKTFKTLQAAKVQDAAILLDSFVTEKLDAIDVAHIAAEVQGLGTYELQTYKSDKKDRVELEKFTAITAEDA QEIEAALTVGYVHGRATNSARTLVNMPPNVLTATKLAEYAVELAEKYDMDYKVLEKEEMEELGMGALLAVNQGSVEPPKM IALIYKGKEEWTDVIGFVGKGITYDTGGYSLKPREGMVGMKGDMGGAAAVLGAMEIIGELRPEQNVIAVIPSTDNVVSGT AFKPDDVITSMSGKTIEVLNTDAEGRLALADGITYAKKLGANYLIDVATLTGGVIVALGNHTTGAMTNNEELFEQVLEAS METDESIWQLPIFDRDKERVRNSKFADLNNSPGREGHAVMAGTFIGEFAEDTPWVHLDIAGTSESSGAHDLGPAGATGAM VRTLATLVERFGEE >Mature_494_residues MFQVQKELASHEAVVVALFEEEKTSSFVQELDKAFEGQLQVLLEEKELSTKKKAISKVHSLGKTDVKRYYFVGLGKKESY TTETLRSALGKTFKTLQAAKVQDAAILLDSFVTEKLDAIDVAHIAAEVQGLGTYELQTYKSDKKDRVELEKFTAITAEDA QEIEAALTVGYVHGRATNSARTLVNMPPNVLTATKLAEYAVELAEKYDMDYKVLEKEEMEELGMGALLAVNQGSVEPPKM IALIYKGKEEWTDVIGFVGKGITYDTGGYSLKPREGMVGMKGDMGGAAAVLGAMEIIGELRPEQNVIAVIPSTDNVVSGT AFKPDDVITSMSGKTIEVLNTDAEGRLALADGITYAKKLGANYLIDVATLTGGVIVALGNHTTGAMTNNEELFEQVLEAS METDESIWQLPIFDRDKERVRNSKFADLNNSPGREGHAVMAGTFIGEFAEDTPWVHLDIAGTSESSGAHDLGPAGATGAM VRTLATLVERFGEE
Specific function: Presumably involved in the processing and regular turnover of intracellular proteins. Catalyzes the removal of unsubstituted N-terminal amino acids from various peptides
COG id: COG0260
COG function: function code E; Leucyl aminopeptidase
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M17 family
Homologues:
Organism=Homo sapiens, GI41393561, Length=494, Percent_Identity=33.1983805668016, Blast_Score=233, Evalue=3e-61, Organism=Homo sapiens, GI47155554, Length=345, Percent_Identity=33.9130434782609, Blast_Score=160, Evalue=3e-39, Organism=Escherichia coli, GI1790710, Length=487, Percent_Identity=37.782340862423, Blast_Score=296, Evalue=2e-81, Organism=Escherichia coli, GI87082123, Length=390, Percent_Identity=34.3589743589744, Blast_Score=186, Evalue=4e-48, Organism=Caenorhabditis elegans, GI17556903, Length=295, Percent_Identity=36.6101694915254, Blast_Score=163, Evalue=2e-40, Organism=Caenorhabditis elegans, GI17565172, Length=147, Percent_Identity=42.8571428571429, Blast_Score=96, Evalue=6e-20, Organism=Drosophila melanogaster, GI24661038, Length=361, Percent_Identity=33.7950138504155, Blast_Score=205, Evalue=5e-53, Organism=Drosophila melanogaster, GI21355725, Length=393, Percent_Identity=33.3333333333333, Blast_Score=203, Evalue=2e-52, Organism=Drosophila melanogaster, GI20129969, Length=469, Percent_Identity=27.5053304904051, Blast_Score=174, Evalue=1e-43, Organism=Drosophila melanogaster, GI21355645, Length=500, Percent_Identity=28.4, Blast_Score=169, Evalue=4e-42, Organism=Drosophila melanogaster, GI24662223, Length=500, Percent_Identity=28.4, Blast_Score=169, Evalue=4e-42, Organism=Drosophila melanogaster, GI161077148, Length=472, Percent_Identity=26.9067796610169, Blast_Score=165, Evalue=6e-41, Organism=Drosophila melanogaster, GI20130057, Length=472, Percent_Identity=26.9067796610169, Blast_Score=165, Evalue=6e-41, Organism=Drosophila melanogaster, GI21357381, Length=294, Percent_Identity=36.3945578231293, Blast_Score=164, Evalue=9e-41, Organism=Drosophila melanogaster, GI24662227, Length=381, Percent_Identity=28.0839895013123, Blast_Score=164, Evalue=1e-40, Organism=Drosophila melanogaster, GI221379063, Length=294, Percent_Identity=36.3945578231293, Blast_Score=164, Evalue=1e-40, Organism=Drosophila melanogaster, GI221379062, Length=294, Percent_Identity=36.3945578231293, Blast_Score=164, Evalue=1e-40, Organism=Drosophila melanogaster, GI19922386, Length=463, Percent_Identity=27.645788336933, Blast_Score=155, Evalue=8e-38, Organism=Drosophila melanogaster, GI20129963, Length=363, Percent_Identity=28.6501377410468, Blast_Score=148, Evalue=1e-35, Organism=Drosophila melanogaster, GI24646701, Length=324, Percent_Identity=30.2469135802469, Blast_Score=109, Evalue=4e-24, Organism=Drosophila melanogaster, GI24646703, Length=324, Percent_Identity=30.2469135802469, Blast_Score=109, Evalue=4e-24, Organism=Drosophila melanogaster, GI21358201, Length=324, Percent_Identity=30.2469135802469, Blast_Score=109, Evalue=4e-24,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): AMPA_BACAA (C3PDF7)
Other databases:
- EMBL: CP001598 - RefSeq: YP_002869167.1 - ProteinModelPortal: C3PDF7 - SMR: C3PDF7 - EnsemblBacteria: EBBACT00000128627 - GeneID: 7849302 - GenomeReviews: CP001598_GR - KEGG: bai:BAA_5191 - GeneTree: EBGT00050000002613 - ProtClustDB: PRK00913 - GO: GO:0005737 - GO: GO:0006508 - HAMAP: MF_00181 - InterPro: IPR011356 - InterPro: IPR000819 - InterPro: IPR023042 - InterPro: IPR008283 - PANTHER: PTHR11963:SF3 - PRINTS: PR00481
Pfam domain/function: PF00883 Peptidase_M17; PF02789 Peptidase_M17_N
EC number: =3.4.11.1; =3.4.11.10
Molecular weight: Translated: 53514; Mature: 53514
Theoretical pI: Translated: 4.50; Mature: 4.50
Prosite motif: PS00631 CYTOSOL_AP
Important sites: ACT_SITE 272-272 ACT_SITE 346-346
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 3.0 %Met (Translated Protein) 3.0 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 3.0 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MFQVQKELASHEAVVVALFEEEKTSSFVQELDKAFEGQLQVLLEEKELSTKKKAISKVHS CCCCHHHHCCCCEEEEEEECCHHHHHHHHHHHHHHCCHHHHEEEHHHHHHHHHHHHHHHH LGKTDVKRYYFVGLGKKESYTTETLRSALGKTFKTLQAAKVQDAAILLDSFVTEKLDAID CCCHHCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VAHIAAEVQGLGTYELQTYKSDKKDRVELEKFTAITAEDAQEIEAALTVGYVHGRATNSA HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHEECHHHHHHHHHHHHEEEEECCCCCCC RTLVNMPPNVLTATKLAEYAVELAEKYDMDYKVLEKEEMEELGMGALLAVNQGSVEPPKM CEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHHCHHHHHHCCCCEEEEECCCCCCCCCE IALIYKGKEEWTDVIGFVGKGITYDTGGYSLKPREGMVGMKGDMGGAAAVLGAMEIIGEL EEEEECCCHHHHHHHHHHCCCCEECCCCCEECCCCCCCCCCCCCCHHHHHHHHHHHHHHC RPEQNVIAVIPSTDNVVSGTAFKPDDVITSMSGKTIEVLNTDAEGRLALADGITYAKKLG CCCCCEEEEECCCCCEECCCCCCCHHHHHCCCCCEEEEEECCCCCCEEEHHCHHHHHHCC ANYLIDVATLTGGVIVALGNHTTGAMTNNEELFEQVLEASMETDESIWQLPIFDRDKERV CCCEEEEEECCCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCCHHHH RNSKFADLNNSPGREGHAVMAGTFIGEFAEDTPWVHLDIAGTSESSGAHDLGPAGATGAM CCCCCCCCCCCCCCCCCEEEEEHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCHHHH VRTLATLVERFGEE HHHHHHHHHHHCCC >Mature Secondary Structure MFQVQKELASHEAVVVALFEEEKTSSFVQELDKAFEGQLQVLLEEKELSTKKKAISKVHS CCCCHHHHCCCCEEEEEEECCHHHHHHHHHHHHHHCCHHHHEEEHHHHHHHHHHHHHHHH LGKTDVKRYYFVGLGKKESYTTETLRSALGKTFKTLQAAKVQDAAILLDSFVTEKLDAID CCCHHCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VAHIAAEVQGLGTYELQTYKSDKKDRVELEKFTAITAEDAQEIEAALTVGYVHGRATNSA HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHEECHHHHHHHHHHHHEEEEECCCCCCC RTLVNMPPNVLTATKLAEYAVELAEKYDMDYKVLEKEEMEELGMGALLAVNQGSVEPPKM CEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHHCHHHHHHCCCCEEEEECCCCCCCCCE IALIYKGKEEWTDVIGFVGKGITYDTGGYSLKPREGMVGMKGDMGGAAAVLGAMEIIGEL EEEEECCCHHHHHHHHHHCCCCEECCCCCEECCCCCCCCCCCCCCHHHHHHHHHHHHHHC RPEQNVIAVIPSTDNVVSGTAFKPDDVITSMSGKTIEVLNTDAEGRLALADGITYAKKLG CCCCCEEEEECCCCCEECCCCCCCHHHHHCCCCCEEEEEECCCCCCEEEHHCHHHHHHCC ANYLIDVATLTGGVIVALGNHTTGAMTNNEELFEQVLEASMETDESIWQLPIFDRDKERV CCCEEEEEECCCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCCHHHH RNSKFADLNNSPGREGHAVMAGTFIGEFAEDTPWVHLDIAGTSESSGAHDLGPAGATGAM CCCCCCCCCCCCCCCCCEEEEEHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCHHHH VRTLATLVERFGEE HHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA