Definition Buchnera aphidicola str. Sg (Schizaphis graminum), complete genome.
Accession NC_004061
Length 641,454

The map label for this gene is htrA

Identifier: 21672503

GI number: 21672503

Start: 252941

End: 254377

Strand: Direct

Name: htrA

Synonym: BUsg222

Alternate gene names: 21672503

Gene position: 252941-254377 (Clockwise)

Preceding gene: 21672487

Following gene: 21672506

Centisome position: 39.43
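
The coordinate fields above are linked by simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming that the centisome value is the start position expressed as a percentage of the genome length (an interpretation that reproduces the listed 39.43, not a definition given in this record). The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(252941, 254377)
print(length)            # 1437 bases, matching the gene sequence header
print(length // 3 - 1)   # 478 codons once the stop codon is excluded
print(round(centisome(252941, 641454), 2))
```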

GC content: 30.62

Gene sequence:

>1437_bases
ATGAAAAGAATAAATATAGTATTAAGCGGGATAATGCTTTTTTTAACGCTATTACTAAGTTTTGGAATGTCTTGGGGGAA
TAAAAATTTTACTTCTTCTCAAAATGTTTCTTCTGTACAACTAGCTCCTAGTTTAGCTCCTATGTTAGAAAAAGTTATGC
CTTCAGTAATCAGTATCAATATAGAAGGAAGCACTGTAGTTCATACTTCTCGTTTACCTCATCAATTTCAACCCTTTTTT
GGTCATAATTCTCCTTTTTGTCAGGGTAATTCACCATTTCGAAATTCTCCTTTTTGTCGTTCTAATCCAAATTCTAATAG
TATGCATGAAAAATTTCATGCTCTGGGTTCCGGTGTAATTATTAATGCTGATAAGGCATATGCTGTAACAAATAATCATG
TTGTAGAAAATGCAAATAAAATTCAAGTACAATTAAGTGACGGACGTCGTTATGAAGCCTCTATAATTGGTAAAGACTCT
CGTTCCGATATTGCTTTAATACAGCTAAAAAATGCGAAAAATTTAAGTGCAATAAAAATTGCTGATTCTGATACTCTTCG
AGTAGGTGATTATACTGTAGCTATTGGTAATCCATACGGTCTTGGTGAAACGGTAACTTCTGGTATTATTTCGGCGTTGG
GACGAAGTGGATTAAACATTGAGCATTACGAAAATTTTATTCAAACTGATGCAGCTATTAACAGAGGTAATTCTGGTGGT
GCATTAGTTAATTTAAAAGGTGAATTAATAGGTATCAATACTGCAATATTAGCACCAGACGGAGGTAATATTGGAATTGG
ATTTGCTATTCCTGGAAATATGGTTAAAAATCTTACAGAACAAATGGTTAAATTTGGACAAGTAAAACGTGGAGAATTAG
GCATAATAGGCATGGAACTAAATTCAGATTTAGCTCATGTAATGAAAATAAATGCTCAAAAAGGTGCATTTGTAAGTCAA
GTTTTACCTAATTCTTCTGCTTTTCATGCAGGTATTAAAGCAGGTGATATTATTGTTTCTTTAAATAAAAAAACAATTTC
TAGTTTTGCAGCATTACGTGCTGAAGTGGGATCTTTGCCAGTATCTACTAAAATGGAATTAGGAATATTCCGAAATGGAA
TAACTAAAAACGTGATTGTTGAATTAAAACCATCTTTGAAAAATAGTGTTAGTCTAGGAGATATTTATACAGGAATTGAA
GGTGCTGATTTAAGTGATTGTTCATTAAATGGACAAAAAGGTGTAAAAATAGAAAATATAAAATTGAATACTCAAGCTTC
AAAAATTGGTTTTAAGAAAGATGATATTATTGTAGAAGTCAATCAAAAAGTAATAAATAATTTAAATGATTTAAAAAATA
TTTTAGATTCAAAACCAAATATATTAGTTTTTAGTGTGAAGAGAGGGAATAATAGTATTTACTTAGTTAGTGAATAA
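
The GC content field can in principle be recomputed from the gene sequence above. A minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, and the short input is just the first four codons of the gene, used for illustration.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First four codons of the gene sequence; pass the full 1437-base
# sequence (line breaks removed) to check the value listed in the record.
first_codons = "ATGAAAAGAATA"
print(round(gc_content(first_codons), 2))
```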

Upstream 100 bases:

>100_bases
TCAAATATCGAGTGTATACTAAAATTTATTGTGATTTAATAATAATATTTTTTATTAGTTTTATTTATAAGATCAGATTG
TAATTTACGAGAAAGAAAAG

Downstream 100 bases:

>100_bases
CTACTTTATTGTTCCGCCCGATATAATCACCGGGCGGTTTTCATTTTTCTATTTATCACGCAAAAGTTGATTTATTTCTA
CTTTATTAAGTGTTTTAGAG

Product: serine endoprotease

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 478; Mature: 478

Protein sequence:

>478_residues
MKRINIVLSGIMLFLTLLLSFGMSWGNKNFTSSQNVSSVQLAPSLAPMLEKVMPSVISINIEGSTVVHTSRLPHQFQPFF
GHNSPFCQGNSPFRNSPFCRSNPNSNSMHEKFHALGSGVIINADKAYAVTNNHVVENANKIQVQLSDGRRYEASIIGKDS
RSDIALIQLKNAKNLSAIKIADSDTLRVGDYTVAIGNPYGLGETVTSGIISALGRSGLNIEHYENFIQTDAAINRGNSGG
ALVNLKGELIGINTAILAPDGGNIGIGFAIPGNMVKNLTEQMVKFGQVKRGELGIIGMELNSDLAHVMKINAQKGAFVSQ
VLPNSSAFHAGIKAGDIIVSLNKKTISSFAALRAEVGSLPVSTKMELGIFRNGITKNVIVELKPSLKNSVSLGDIYTGIE
GADLSDCSLNGQKGVKIENIKLNTQASKIGFKKDDIIVEVNQKVINNLNDLKNILDSKPNILVFSVKRGNNSIYLVSE
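
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments and stops at the first stop codon; `translate` and the table layout are illustrative and not part of this record.

```python
from itertools import product

# Standard codon assignments, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First twelve codons of the gene sequence.
print(translate("ATGAAAAGAATAAATATAGTATTAAGCGGGATAATG"))  # MKRINIVLSGIM
```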

Sequences:

>Translated_478_residues
MKRINIVLSGIMLFLTLLLSFGMSWGNKNFTSSQNVSSVQLAPSLAPMLEKVMPSVISINIEGSTVVHTSRLPHQFQPFF
GHNSPFCQGNSPFRNSPFCRSNPNSNSMHEKFHALGSGVIINADKAYAVTNNHVVENANKIQVQLSDGRRYEASIIGKDS
RSDIALIQLKNAKNLSAIKIADSDTLRVGDYTVAIGNPYGLGETVTSGIISALGRSGLNIEHYENFIQTDAAINRGNSGG
ALVNLKGELIGINTAILAPDGGNIGIGFAIPGNMVKNLTEQMVKFGQVKRGELGIIGMELNSDLAHVMKINAQKGAFVSQ
VLPNSSAFHAGIKAGDIIVSLNKKTISSFAALRAEVGSLPVSTKMELGIFRNGITKNVIVELKPSLKNSVSLGDIYTGIE
GADLSDCSLNGQKGVKIENIKLNTQASKIGFKKDDIIVEVNQKVINNLNDLKNILDSKPNILVFSVKRGNNSIYLVSE
>Mature_478_residues
MKRINIVLSGIMLFLTLLLSFGMSWGNKNFTSSQNVSSVQLAPSLAPMLEKVMPSVISINIEGSTVVHTSRLPHQFQPFF
GHNSPFCQGNSPFRNSPFCRSNPNSNSMHEKFHALGSGVIINADKAYAVTNNHVVENANKIQVQLSDGRRYEASIIGKDS
RSDIALIQLKNAKNLSAIKIADSDTLRVGDYTVAIGNPYGLGETVTSGIISALGRSGLNIEHYENFIQTDAAINRGNSGG
ALVNLKGELIGINTAILAPDGGNIGIGFAIPGNMVKNLTEQMVKFGQVKRGELGIIGMELNSDLAHVMKINAQKGAFVSQ
VLPNSSAFHAGIKAGDIIVSLNKKTISSFAALRAEVGSLPVSTKMELGIFRNGITKNVIVELKPSLKNSVSLGDIYTGIE
GADLSDCSLNGQKGVKIENIKLNTQASKIGFKKDDIIVEVNQKVINNLNDLKNILDSKPNILVFSVKRGNNSIYLVSE

Specific function: Serine protease that is required at high temperature. Involved in the degradation of damaged proteins. It can degrade IciA, Ada, casein and globin. Shared specificity with DegQ. [C]

COG id: COG0265

COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain

Gene ontology:

Cell location: Periplasmic Protein [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 PDZ (DHR) domains

Homologues:

Organism=Homo sapiens, GI4506141, Length=270, Percent_Identity=38.89, Blast_Score=156, Evalue=5e-38
Organism=Homo sapiens, GI22129776, Length=323, Percent_Identity=33.75, Blast_Score=144, Evalue=2e-34
Organism=Homo sapiens, GI24308541, Length=341, Percent_Identity=29.33, Blast_Score=135, Evalue=1e-31
Organism=Homo sapiens, GI7019477, Length=246, Percent_Identity=32.11, Blast_Score=122, Evalue=6e-28
Organism=Escherichia coli, GI1786356, Length=448, Percent_Identity=65.625, Blast_Score=604, Evalue=1e-174
Organism=Escherichia coli, GI1789629, Length=484, Percent_Identity=49.17, Blast_Score=447, Evalue=1e-127
Organism=Escherichia coli, GI1789630, Length=256, Percent_Identity=43.75, Blast_Score=189, Evalue=4e-49
Organism=Drosophila melanogaster, GI24646839, Length=252, Percent_Identity=38.10, Blast_Score=137, Evalue=2e-32
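
The homologue entries are comma-separated `key=value` fields plus a bare GI identifier, so they can be read into records for sorting or filtering. A minimal sketch, assuming every entry follows the layout shown above; `parse_homologue` is a hypothetical helper, not a tool associated with this database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dict with typed numeric fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786356, Length=448, "
    "Percent_Identity=65.625, Blast_Score=604, Evalue=1e-174,"
)
print(hit["Organism"], hit["GI"], hit["Percent_Identity"])
```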

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): DEGP_BUCAP (O85291)

Other databases:

- EMBL:   AF060492
- EMBL:   AE013218
- RefSeq:   NP_660570.1
- ProteinModelPortal:   O85291
- SMR:   O85291
- MEROPS:   S01.273
- EnsemblBacteria:   EBBUCT00000000506
- GeneID:   1005421
- GenomeReviews:   AE013218_GR
- KEGG:   bas:BUsg222
- GeneTree:   EBGT00050000008056
- HOGENOM:   HBG585708
- OMA:   ISINIEG
- ProtClustDB:   PRK10942
- BioCyc:   BAPH198804:BUSG222-MONOMER
- GO:   GO:0006508
- InterPro:   IPR001478
- InterPro:   IPR009003
- InterPro:   IPR011782
- InterPro:   IPR001254
- InterPro:   IPR001940
- PRINTS:   PR00834
- SMART:   SM00228
- TIGRFAMs:   TIGR02037

Pfam domain/function: PF00595 PDZ; PF00089 Trypsin; SSF50156 PDZ; SSF50494 Pept_Ser_Cys

EC number: 3.4.21.-

Molecular weight: Translated: 51304; Mature: 51304

Theoretical pI: Translated: 9.88; Mature: 9.88

Prosite motif: PS50106 PDZ

Important sites: ACT_SITE 133; ACT_SITE 163; ACT_SITE 238
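
The active-site positions can be looked up in the protein sequence. The sketch below assumes the positions are 1-based and refer to the translated sequence listed in this record; only the first 240 residues are embedded, which is enough to cover all three sites. Under that assumption the residues come out as His, Asp and Ser, the catalytic triad typical of trypsin-like serine proteases.

```python
# First 240 residues of the translated protein sequence from this record.
PROTEIN_PREFIX = (
    "MKRINIVLSGIMLFLTLLLSFGMSWGNKNFTSSQNVSSVQLAPSLAPMLEKVMPSVISINIEGSTVVHTSRLPHQFQPFF"
    "GHNSPFCQGNSPFRNSPFCRSNPNSNSMHEKFHALGSGVIINADKAYAVTNNHVVENANKIQVQLSDGRRYEASIIGKDS"
    "RSDIALIQLKNAKNLSAIKIADSDTLRVGDYTVAIGNPYGLGETVTSGIISALGRSGLNIEHYENFIQTDAAINRGNSGG"
)

ACTIVE_SITES = (133, 163, 238)  # 1-based positions (assumed numbering)


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position."""
    return seq[position - 1]


triad = "".join(residue_at(PROTEIN_PREFIX, p) for p in ACTIVE_SITES)
print(triad)  # HDS
```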

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
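
The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch, assuming one-letter amino acid codes; `residue_percent` is a hypothetical helper, and the short input is the first twelve residues of the protein, used only for illustration.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in set(residues))
    return 100.0 * hits / len(protein)


sample = "MKRINIVLSGIM"  # first twelve residues of the protein
print(round(residue_percent(sample, "C"), 1))
print(round(residue_percent(sample, "M"), 1))
print(round(residue_percent(sample, "CM"), 1))
```

Applied to the full 478-residue sequence, the same function gives the three values to compare with the table above.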

Secondary structure:

>Translated Secondary Structure
MKRINIVLSGIMLFLTLLLSFGMSWGNKNFTSSQNVSSVQLAPSLAPMLEKVMPSVISIN
CCEEHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHCCCEEEEE
IEGSTVVHTSRLPHQFQPFFGHNSPFCQGNSPFRNSPFCRSNPNSNSMHEKFHALGSGVI
ECCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEE
INADKAYAVTNNHVVENANKIQVQLSDGRRYEASIIGKDSRSDIALIQLKNAKNLSAIKI
EECCCEEEEECCEEEECCCEEEEEECCCCEEEEEEECCCCCCCEEEEEEECCCCCEEEEE
ADSDTLRVGDYTVAIGNPYGLGETVTSGIISALGRSGLNIEHYENFIQTDAAINRGNSGG
ECCCEEEECCEEEEECCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCC
ALVNLKGELIGINTAILAPDGGNIGIGFAIPGNMVKNLTEQMVKFGQVKRGELGIIGMEL
EEEEECCEEEEEEEEEECCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEEEC
NSDLAHVMKINAQKGAFVSQVLPNSSAFHAGIKAGDIIVSLNKKTISSFAALRAEVGSLP
CCCCEEHEEECCCCCCHHHHHCCCCCCEECCCCCCCEEEEECHHHHHHHHHHHHHHCCCC
VSTKMELGIFRNGITKNVIVELKPSLKNSVSLGDIYTGIEGADLSDCSLNGQKGVKIENI
CCCCEEHHHHHCCCCCEEEEEECCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCEEEEE
KLNTQASKIGFKKDDIIVEVNQKVINNLNDLKNILDSKPNILVFSVKRGNNSIYLVSE
EEECCHHHCCCCCCCEEEEECHHHHCCHHHHHHHHCCCCCEEEEEEEECCCEEEEEEC
>Mature Secondary Structure
MKRINIVLSGIMLFLTLLLSFGMSWGNKNFTSSQNVSSVQLAPSLAPMLEKVMPSVISIN
CCEEHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHCCCEEEEE
IEGSTVVHTSRLPHQFQPFFGHNSPFCQGNSPFRNSPFCRSNPNSNSMHEKFHALGSGVI
ECCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEE
INADKAYAVTNNHVVENANKIQVQLSDGRRYEASIIGKDSRSDIALIQLKNAKNLSAIKI
EECCCEEEEECCEEEECCCEEEEEECCCCEEEEEEECCCCCCCEEEEEEECCCCCEEEEE
ADSDTLRVGDYTVAIGNPYGLGETVTSGIISALGRSGLNIEHYENFIQTDAAINRGNSGG
ECCCEEEECCEEEEECCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCC
ALVNLKGELIGINTAILAPDGGNIGIGFAIPGNMVKNLTEQMVKFGQVKRGELGIIGMEL
EEEEECCEEEEEEEEEECCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEEEC
NSDLAHVMKINAQKGAFVSQVLPNSSAFHAGIKAGDIIVSLNKKTISSFAALRAEVGSLP
CCCCEEHEEECCCCCCHHHHHCCCCCCEECCCCCCCEEEEECHHHHHHHHHHHHHHCCCC
VSTKMELGIFRNGITKNVIVELKPSLKNSVSLGDIYTGIEGADLSDCSLNGQKGVKIENI
CCCCEEHHHHHCCCCCEEEEEECCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCEEEEE
KLNTQASKIGFKKDDIIVEVNQKVINNLNDLKNILDSKPNILVFSVKRGNNSIYLVSE
EEECCHHHCCCCCCCEEEEECHHHHCCHHHHHHHHCCCCCEEEEEEEECCCEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases) [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9688822; 12089438