Definition: Mycobacterium tuberculosis H37Ra, complete genome.
Accession: NC_009525
Length: 4,419,977 bp

The map label for this gene is degP [H]

Identifier: 148663534

GI number: 148663534

Start: 4120768

End: 4121961

Strand: Reverse

Name: degP [H]

Synonym: MRA_3706

Alternate gene names: 148663534

Gene position: 4121961-4120768 (Counterclockwise)

Preceding gene: 148663535

Following gene: 148663531

Centisome position: 93.26
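
Note: the coordinate fields above are mutually consistent, which the short sketch below checks. It is a minimal illustration, not the database's own procedure. The function names are hypothetical, and taking the centisome position from the higher coordinate (the first base of a reverse-strand gene) is an assumption that happens to reproduce the listed value.

```python
# Values copied from the record; function names are illustrative only.
GENOME_LENGTH = 4_419_977  # "Length" field
START = 4_120_768          # "Start" field
END = 4_121_961            # "End" field


def gene_length(start: int, end: int) -> int:
    """Inclusive length of the gene in bases."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
codons = length // 3  # includes the terminal stop codon

# Assumption: the reverse-strand gene begins at END, so END is used here.
print(length, codons - 1, centisome(END, GENOME_LENGTH))
```

Run as written, this prints 1194 397 93.26, matching the 1194-base gene sequence, the 397 translated residues, and the listed centisome position. Using the Start coordinate instead gives 93.23.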

GC content: 66.25%

Gene sequence:

>1194_bases
ATGACCCCGTCGCAGTGGCTGGATATCGCCGTCTTGGCGGTCGCATTTATTGCAGCCATCTCCGGCTGGCGTGCCGGTGC
GCTGGGCTCAATGCTGTCGTTTGGCGGGGTGCTGCTGGGCGCGACAGCCGGCGTGCTGCTGGCGCCGCATATCGTCAGTC
AAATCAGCGCTCCGCGGGCCAAACTGTTTGCCGCGCTGTTCCTGATCCTGGCACTGGTCGTAGTCGGCGAGGTCGCTGGT
GTGGTGCTGGGCCGCGCCGTCCGCGGGGCGATCCGTAACCGGCCGATCCGGTTGATCGACTCGGTCATTGGGGTAGGGGT
GCAGCTGGTCGTGGTGCTCACCGCGGCGTGGTTGTTGGCGATGCCGCTGACACAGTCGAAAGAGCAGCCCGAGCTGGCTG
CCGCGGTGAAGGGTTCGCGGGTGCTCGCCCGGGTCAACGAGGCGGCACCCACCTGGCTGAAGACGGTGCCCAAGCGGCTG
TCGGCCCTGCTGAACACCTCCGGCCTGCCCGCGGTTTTGGAGCCGTTCAGCCGCACGCCGGTCATTCCAGTGGCCTCACC
CGACCCAGCGCTGGTCAACAATCCGGTGGTGGCGGCCACCGAGCCAAGTGTCGTCAAAATCCGCAGCCTGGCACCCAGAT
GCCAGAAAGTGTTGGAGGGCACCGGCTTCGTGATCTCACCCGATCGGGTGATGACCAACGCGCACGTGGTGGCCGGATCC
AACAACGTCACGGTGTATGCCGGCGACAAGCCCTTCGAGGCCACGGTGGTGTCCTACGACCCGTCGGTCGACGTAGCGAT
CCTGGCCGTTCCGCACTTGCCGCCGCCGCCGCTGGTCTTCGCTGCGGAGCCGGCGAAAACCGGTGCCGACGTTGTGGTGC
TGGGTTATCCCGGCGGCGGCAATTTCACTGCCACACCCGCCAGGATTCGCGAGGCCATCAGACTCAGTGGCCCCGATATT
TACGGGGACCCGGAGCCGGTTACCCGCGACGTGTACACCATCAGAGCCGATGTGGAGCAAGGTGATTCGGGTGGGCCCCT
GATCGACCTCAACGGTCAGGTGCTCGGTGTGGTGTTCGGCGCAGCCATCGACGACGCCGAAACTGGGTTTGTGCTGACGG
CCGGCGAGGTGGCGGGGCAGCTTGCCAAAATCGGTGCTACCCAACCGGTCGGCACCGGGGCCTGCGTCAGCTGA

Upstream 100 bases:

>100_bases
ATGTCGCGGGCTGGGCACAACCCTGGGACACCGGTGACATCCGCGAGTTGGACGCGGCAATGGTGCTGATCGACGACGAG
AGTGACCCGCGATGAATTCG

Downstream 100 bases:

>100_bases
GCTGGTGCACCTGCTCGAGGAAACGCATCAGATGTCGGTTGACTTCCTCCGGCGCCTCTTCGTGACTGAAATGTCCTGCG
CCGGCAATGGATATGTACCG

Product: membrane-associated serine protease

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 397; Mature: 396

Protein sequence:

>397_residues
MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAG
VVLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRL
SALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGS
NNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDI
YGDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFGAAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS

Sequences:

>Translated_397_residues
MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAG
VVLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRL
SALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGS
NNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDI
YGDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFGAAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS
>Mature_396_residues
TPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAGV
VLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRLS
ALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGSN
NVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDIY
GDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFGAAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS

Specific function: Protease with a shared specificity with DegP. [C]

COG id: COG0265

COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain

Gene ontology:

Cell location: Periplasmic Protein [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 PDZ (DHR) domains [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001478
- InterPro:   IPR009003
- InterPro:   IPR011782
- InterPro:   IPR001254
- InterPro:   IPR001940 [H]

Pfam domain/function: PF00595 PDZ; PF00089 Trypsin [H]

EC number: 3.4.21.- [C]

Molecular weight (Da): Translated: 40722; Mature: 40591

Theoretical pI: Translated: 6.80; Mature: 6.80

Prosite motif: PS00135 TRYPSIN_SER

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
0.8 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
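
Note: the percentages above can be recomputed from the protein sequence in this record. The sketch below is one way to do so. The helper name `cys_met_percent` is hypothetical, and rounding to one decimal place is an assumption chosen to match the listed figures. The mature protein is taken as the translated protein without its initial Met, as in the Mature_396_residues sequence.

```python
# Translated protein sequence copied from the record (397 residues).
TRANSLATED = (
    "MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAG"
    "VVLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRL"
    "SALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGS"
    "NNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDI"
    "YGDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFGAAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS"
)
# Mature form: the same sequence with the initiator Met removed.
MATURE = TRANSLATED[1:]


def cys_met_percent(seq: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met) of a protein sequence, 1 decimal each."""
    n = len(seq)
    cys = seq.count("C")
    met = seq.count("M")
    return (
        round(100 * cys / n, 1),
        round(100 * met / n, 1),
        round(100 * (cys + met) / n, 1),
    )


print(cys_met_percent(TRANSLATED))
print(cys_met_percent(MATURE))
```

The translated sequence contains 2 Cys and 4 Met residues, so the two calls give (0.5, 1.0, 1.5) and (0.5, 0.8, 1.3), in agreement with the values listed above.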

Secondary structure:

>Translated Secondary Structure
MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRA
CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCHH
KLFAALFLILALVVVGEVAGVVLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
MPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRLSALLNTSGLPAVLEPFSRTP
CCCCCCCCCCHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC
VIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGS
EEEECCCCCCCCCCCEEEECCCCEEEHHHHHHHHHHHHCCCCEEECCCCEEECEEEEECC
NNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGG
CCEEEEECCCCCEEEEEEECCCCCEEEEEECCCCCCCEEEEECCCCCCCCEEEEEECCCC
NFTATPARIREAIRLSGPDIYGDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFG
CCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEEECCCEEEEEEE
AAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS
CCCCCCCCCEEEECHHHHHHHHHCCCCCCCCCCCCCC
>Mature Secondary Structure 
TPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRA
CCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCHH
KLFAALFLILALVVVGEVAGVVLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
MPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRLSALLNTSGLPAVLEPFSRTP
CCCCCCCCCCHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC
VIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGS
EEEECCCCCCCCCCCEEEECCCCEEEHHHHHHHHHHHHCCCCEEECCCCEEECEEEEECC
NNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGG
CCEEEEECCCCCEEEEEEECCCCCEEEEEECCCCCCCEEEEECCCCCCCCEEEEEECCCC
NFTATPARIREAIRLSGPDIYGDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFG
CCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEEECCCEEEEEEE
AAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS
CCCCCCCCCEEEECHHHHHHHHHCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 10192388; 10684935; 10871362 [H]