Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is pepB [H]
Identifier: 157161998
GI number: 157161998
Start: 2692714
End: 2693997
Strand: Reverse
Name: pepB [H]
Synonym: EcHS_A2674
Alternate gene names: 157161998
Gene position: 2693997-2692714 (Counterclockwise)
Preceding gene: 157161999
Following gene: 157161997
Centisome position: 58.02
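The gene length and centisome position follow from the coordinates listed above. The record does not state how the centisome value is derived, so this minimal sketch assumes it is the gene's start coordinate (the higher coordinate, since the gene lies on the reverse strand) expressed as a percentage of the genome length. The function name `centisome` is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Express a genome coordinate as a percentage of the genome length.

    Assumed definition; the record does not give the formula.
    """
    return 100.0 * position / genome_length


GENOME_LENGTH = 4_643_538  # "Length" field of this record
GENE_START = 2_693_997     # first coordinate under "Gene position" (reverse strand)
GENE_END = 2_692_714       # second coordinate under "Gene position"

# Coordinates are inclusive, so add 1 to the difference.
gene_length = abs(GENE_START - GENE_END) + 1

print(gene_length)                                     # 1284
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 58.02
```

Under that assumption the result matches the 1284-base gene sequence and the reported centisome position of 58.02.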
GC content: 56.62
Gene sequence:
>1284_bases
ATGACAGAAGCGATGAAAATTACCCTCTCTACCCAACCTGCCGACGCGCGCTGGGGAGAAAAAGCAACTTACAGCATTAA
TAATGATGGCATTACCCTGCATTTGAACGGGGCAGACGATCTGGGGCTGATCCAGCGTGCGGCCCGCAAGATTGACGGTC
TGGGCATCAAGCATGTTCAGTTAAGCGGTGAAGGTTGGGATGCGGATCGCTGCTGGGCATTCTGGCAAGGTTACAAAGCC
CCGAAAGGCACGCGTAAAGTGGAGTGGCCGGATCTGGACGATGCCCAGCGCCAGGAACTGGATAACCGCCTGATGATCAT
CGACTGGGTGCGTGACACCATCAACGCACCGGCAGAAGAATTGGGACCATCGCAACTGGCACAGCGTGCTGTTGATCTGA
TCAGCAACGTCGCGGGCGATCGTGTGACTTATCGGATCACCAAAGGCGAAGATCTGCGTGAGCAAGGTTATATGGGGCTG
CACACCGTCGGACGCGGTTCAGAACGTTCTCCGGTATTGCTGGCGCTGGATTACAACCCAACTGGCGATAAAGAAGCGCC
AGTGTACGCGTGCCTGGTAGGTAAAGGTATCACTTTTGACTCCGGCGGCTACAGCATCAAACAGACTGCGTTTATGGACT
CGATGAAGTCGGACATGGGCGGCGCGGCAACGGTTACCGGGGCGCTGGCATTTGCCATTACGCGCGGACTGAACAAGCGC
GTGAAGCTGTTCCTCTGCTGTGCGGATAACCTGATTAGCGGCAATGCGTTCAAGCTGGGCGATATCATCACCTATCGCAA
CGGTAAAAAAGTTGAAGTGATGAACACTGATGCCGAAGGGCGTCTGGTGCTTGCCGATGGTCTGATTGATGCCAGTGCGC
AGAAACCGGAAATGATCATTGATGCGGCGACCCTCACCGGGGCGGCGAAAACTGCGCTGGGTAATGATTATCACGCGCTG
TTCAGTTTTGACGATGCGCTGGCCGGTCGCTTGCTGGCGAGTGCCGCGCAGGAGAACGAACCGTTCTGGCGTCTGCCGCT
GGCGGAGTTCCACCGCAGCCAGCTGCCGTCTAACTTTGCCGAACTGAACAATACCGGAAGCGCGGCGTATCCGGCAGGCG
CGAGCACGGCGGCGGGCTTCCTGTCGCACTTTGTTGAGAACTATCAGCAAGGCTGGTTGCATATCGACTGCTCGGCGACT
TACCGTAAAGCGCCGGTTGAACAGTGGTCTGCGGGCGCTACGGGACTTGGTGTGCGCACGATAGCTAATCTGTTAACGGC
GTAA
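The GC content field can be recomputed from the gene sequence. The sketch below assumes the value is the percentage of G and C bases over the full gene sequence, which the record does not state explicitly. The function name `gc_content` is hypothetical, and whitespace is stripped because the sequence is wrapped.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    cleaned = "".join(seq.split()).upper()  # drop wrapping whitespace
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Toy input; paste the 1284-base gene sequence from the record to check its value.
print(round(gc_content("GGCC AT"), 2))  # 66.67
```

If the reported 56.62 is a percentage over the 1284 bases, it corresponds to 727 G or C bases (727 / 1284 = 0.5662).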
Upstream 100 bases:
>100_bases
CCTCGAAGCGATTTTGTTAGTCTGGCTGGACGAGGCCGAATGAATAAACGGGCTGCCTGGGGTAGCCCGTTTTGCTGTTA
AACGAATAAGGATAACTAAA
Downstream 100 bases:
>100_bases
ACTAATATGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACAAATTCTGCAAATTCAACAAATTGCGAATCCCTTGTAG
GCCTGATAAGCGTAGCGCAT
Product: aminopeptidase B
Products: NA
Alternate protein names: Aminopeptidase B [H]
Number of amino acids: Translated: 427; Mature: 426
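The two residue counts are consistent with the 1284-base gene length. The sketch below assumes the final codon is a stop codon (the gene sequence ends in TAA) and that the mature form differs only by removal of the initiator methionine, which the record implies but does not state. The function name `residue_counts` is hypothetical.

```python
from __future__ import annotations


def residue_counts(gene_length_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes one terminal stop codon and removal of the initiator Met
    in the mature protein.
    """
    if gene_length_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = gene_length_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed (assumption)
    return translated, mature


print(residue_counts(1284))  # (427, 426)
```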
Protein sequence:
>427_residues
MTEAMKITLSTQPADARWGEKATYSINNDGITLHLNGADDLGLIQRAARKIDGLGIKHVQLSGEGWDADRCWAFWQGYKA
PKGTRKVEWPDLDDAQRQELDNRLMIIDWVRDTINAPAEELGPSQLAQRAVDLISNVAGDRVTYRITKGEDLREQGYMGL
HTVGRGSERSPVLLALDYNPTGDKEAPVYACLVGKGITFDSGGYSIKQTAFMDSMKSDMGGAATVTGALAFAITRGLNKR
VKLFLCCADNLISGNAFKLGDIITYRNGKKVEVMNTDAEGRLVLADGLIDASAQKPEMIIDAATLTGAAKTALGNDYHAL
FSFDDALAGRLLASAAQENEPFWRLPLAEFHRSQLPSNFAELNNTGSAAYPAGASTAAGFLSHFVENYQQGWLHIDCSAT
YRKAPVEQWSAGATGLGVRTIANLLTA
Sequences:
>Translated_427_residues
MTEAMKITLSTQPADARWGEKATYSINNDGITLHLNGADDLGLIQRAARKIDGLGIKHVQLSGEGWDADRCWAFWQGYKA
PKGTRKVEWPDLDDAQRQELDNRLMIIDWVRDTINAPAEELGPSQLAQRAVDLISNVAGDRVTYRITKGEDLREQGYMGL
HTVGRGSERSPVLLALDYNPTGDKEAPVYACLVGKGITFDSGGYSIKQTAFMDSMKSDMGGAATVTGALAFAITRGLNKR
VKLFLCCADNLISGNAFKLGDIITYRNGKKVEVMNTDAEGRLVLADGLIDASAQKPEMIIDAATLTGAAKTALGNDYHAL
FSFDDALAGRLLASAAQENEPFWRLPLAEFHRSQLPSNFAELNNTGSAAYPAGASTAAGFLSHFVENYQQGWLHIDCSAT
YRKAPVEQWSAGATGLGVRTIANLLTA
>Mature_426_residues
TEAMKITLSTQPADARWGEKATYSINNDGITLHLNGADDLGLIQRAARKIDGLGIKHVQLSGEGWDADRCWAFWQGYKAP
KGTRKVEWPDLDDAQRQELDNRLMIIDWVRDTINAPAEELGPSQLAQRAVDLISNVAGDRVTYRITKGEDLREQGYMGLH
TVGRGSERSPVLLALDYNPTGDKEAPVYACLVGKGITFDSGGYSIKQTAFMDSMKSDMGGAATVTGALAFAITRGLNKRV
KLFLCCADNLISGNAFKLGDIITYRNGKKVEVMNTDAEGRLVLADGLIDASAQKPEMIIDAATLTGAAKTALGNDYHALF
SFDDALAGRLLASAAQENEPFWRLPLAEFHRSQLPSNFAELNNTGSAAYPAGASTAAGFLSHFVENYQQGWLHIDCSATY
RKAPVEQWSAGATGLGVRTIANLLTA
Specific function: Probably plays an important role in intracellular peptide degradation [H]
COG id: COG0260
COG function: function code E; Leucyl aminopeptidase
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M17 family [H]
Homologues:
Organism=Homo sapiens, GI41393561, Length=317, Percent_Identity=36.5930599369085, Blast_Score=182, Evalue=7e-46
Organism=Homo sapiens, GI47155554, Length=289, Percent_Identity=35.9861591695502, Blast_Score=136, Evalue=3e-32
Organism=Escherichia coli, GI87082123, Length=427, Percent_Identity=99.7658079625293, Blast_Score=877, Evalue=0.0
Organism=Escherichia coli, GI1790710, Length=315, Percent_Identity=38.4126984126984, Blast_Score=188, Evalue=7e-49
Organism=Caenorhabditis elegans, GI17556903, Length=327, Percent_Identity=32.7217125382263, Blast_Score=137, Evalue=7e-33
Organism=Caenorhabditis elegans, GI17565172, Length=207, Percent_Identity=28.5024154589372, Blast_Score=77, Evalue=2e-14
Organism=Drosophila melanogaster, GI221379063, Length=344, Percent_Identity=32.2674418604651, Blast_Score=159, Evalue=3e-39
Organism=Drosophila melanogaster, GI221379062, Length=344, Percent_Identity=32.2674418604651, Blast_Score=159, Evalue=3e-39
Organism=Drosophila melanogaster, GI21357381, Length=344, Percent_Identity=32.2674418604651, Blast_Score=159, Evalue=3e-39
Organism=Drosophila melanogaster, GI24662227, Length=322, Percent_Identity=28.8819875776398, Blast_Score=125, Evalue=4e-29
Organism=Drosophila melanogaster, GI21355725, Length=289, Percent_Identity=29.757785467128, Blast_Score=125, Evalue=5e-29
Organism=Drosophila melanogaster, GI24661038, Length=320, Percent_Identity=28.4375, Blast_Score=124, Evalue=1e-28
Organism=Drosophila melanogaster, GI161077148, Length=344, Percent_Identity=28.4883720930233, Blast_Score=120, Evalue=3e-27
Organism=Drosophila melanogaster, GI20130057, Length=344, Percent_Identity=28.4883720930233, Blast_Score=120, Evalue=3e-27
Organism=Drosophila melanogaster, GI20129969, Length=294, Percent_Identity=28.9115646258503, Blast_Score=117, Evalue=2e-26
Organism=Drosophila melanogaster, GI21355645, Length=292, Percent_Identity=27.0547945205479, Blast_Score=113, Evalue=3e-25
Organism=Drosophila melanogaster, GI24662223, Length=292, Percent_Identity=27.0547945205479, Blast_Score=113, Evalue=3e-25
Organism=Drosophila melanogaster, GI20129963, Length=316, Percent_Identity=27.5316455696203, Blast_Score=105, Evalue=4e-23
Organism=Drosophila melanogaster, GI19922386, Length=311, Percent_Identity=27.3311897106109, Blast_Score=105, Evalue=5e-23
Organism=Drosophila melanogaster, GI24646701, Length=228, Percent_Identity=29.3859649122807, Blast_Score=72, Evalue=7e-13
Organism=Drosophila melanogaster, GI24646703, Length=228, Percent_Identity=29.3859649122807, Blast_Score=72, Evalue=7e-13
Organism=Drosophila melanogaster, GI21358201, Length=228, Percent_Identity=29.3859649122807, Blast_Score=72, Evalue=7e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008330
- InterPro: IPR011356
- InterPro: IPR000819 [H]
Pfam domain/function: PF00883 Peptidase_M17 [H]
EC number: 3.4.11.23 [H]
Molecular weight: Translated: 46211; Mature: 46079
Theoretical pI: Translated: 5.48; Mature: 5.48
Prosite motif: PS00631 CYTOSOL_AP
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
3.3 %Cys+Met (Translated Protein)
1.2 %Cys (Mature Protein)
1.9 %Met (Mature Protein)
3.1 %Cys+Met (Mature Protein)
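The Cys/Met percentages can be checked by counting residues in the protein sequences. The sketch below assumes each figure is the residue count divided by the sequence length, rounded to one decimal place. The function name `residue_percent` is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    cleaned = "".join(protein.split()).upper()  # drop wrapping whitespace
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(1 for aa in cleaned if aa in residues.upper())
    return round(100.0 * count / len(cleaned), 1)


# Toy input; paste the translated or mature sequence from the record to check it.
toy = "MACM"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

For this record, the translated figures are consistent with 5 Cys and 9 Met among 427 residues (5 / 427 = 1.2 %, 9 / 427 = 2.1 %), and the mature figures with 5 Cys and 8 Met among 426 residues once the initiator Met is removed.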
Secondary structure:
>Translated Secondary Structure
MTEAMKITLSTQPADARWGEKATYSINNDGITLHLNGADDLGLIQRAARKIDGLGIKHVQ
CCCEEEEEEECCCCCCCCCCCEEEEECCCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEE
LSGEGWDADRCWAFWQGYKAPKGTRKVEWPDLDDAQRQELDNRLMIIDWVRDTINAPAEE
ECCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCEEEEEEHHHHHCCCHHH
LGPSQLAQRAVDLISNVAGDRVTYRITKGEDLREQGYMGLHTVGRGSERSPVLLALDYNP
CCHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHCCCCEEEECCCCCCCCCEEEEEECCC
TGDKEAPVYACLVGKGITFDSGGYSIKQTAFMDSMKSDMGGAATVTGALAFAITRGLNKR
CCCCCCCEEEEEECCCEEECCCCCEEHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCE
VKLFLCCADNLISGNAFKLGDIITYRNGKKVEVMNTDAEGRLVLADGLIDASAQKPEMII
EEEEEEECCCCCCCCCEEECCEEEECCCCEEEEEECCCCCEEEEECCCCCCCCCCCCEEE
DAATLTGAAKTALGNDYHALFSFDDALAGRLLASAAQENEPFWRLPLAEFHRSQLPSNFA
EEHHHCCHHHHHCCCCEEEEEECCHHHHHHHHHHHHHCCCCEEECCHHHHHHHHCCHHHH
ELNNTGSAAYPAGASTAAGFLSHFVENYQQGWLHIDCSATYRKAPVEQWSAGATGLGVRT
HHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCHHCCCHHHHCCCCCCCHHHH
IANLLTA
HHHHHCC
>Mature Secondary Structure
TEAMKITLSTQPADARWGEKATYSINNDGITLHLNGADDLGLIQRAARKIDGLGIKHVQ
CCEEEEEEECCCCCCCCCCCEEEEECCCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEE
LSGEGWDADRCWAFWQGYKAPKGTRKVEWPDLDDAQRQELDNRLMIIDWVRDTINAPAEE
ECCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCEEEEEEHHHHHCCCHHH
LGPSQLAQRAVDLISNVAGDRVTYRITKGEDLREQGYMGLHTVGRGSERSPVLLALDYNP
CCHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHCCCCEEEECCCCCCCCCEEEEEECCC
TGDKEAPVYACLVGKGITFDSGGYSIKQTAFMDSMKSDMGGAATVTGALAFAITRGLNKR
CCCCCCCEEEEEECCCEEECCCCCEEHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCE
VKLFLCCADNLISGNAFKLGDIITYRNGKKVEVMNTDAEGRLVLADGLIDASAQKPEMII
EEEEEEECCCCCCCCCEEECCEEEECCCCEEEEEECCCCCEEEEECCCCCCCCCCCCEEE
DAATLTGAAKTALGNDYHALFSFDDALAGRLLASAAQENEPFWRLPLAEFHRSQLPSNFA
EEHHHCCHHHHHCCCCEEEEEECCHHHHHHHHHHHHHCCCCEEECCHHHHHHHHCCHHHH
ELNNTGSAAYPAGASTAAGFLSHFVENYQQGWLHIDCSATYRKAPVEQWSAGATGLGVRT
HHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCHHCCCHHHHCCCCCCCHHHH
IANLLTA
HHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]