| Definition | Geobacillus thermodenitrificans NG80-2 chromosome, complete genome. |
|---|---|
| Accession | NC_009328 |
| Length | 3,550,319 |
The map label for this gene is hhoA [H]
Identifier: 138893962
GI number: 138893962
Start: 323905
End: 325113
Strand: Direct
Name: hhoA [H]
Synonym: GTNG_0286
Alternate gene names: 138893962
Gene position: 323905-325113 (Clockwise)
Preceding gene: 138893956
Following gene: 138893963
Centisome position: 9.12
GC content (%): 52.61
Gene sequence:
>1209_bases ATGGACATGATGACATTCGAACCAACACCACCCCCACAACCGAAACGCCGCGGTCGCTTCGTGTCATGGATGGCGGCTTC TGTCGCGGGAGCGGTGATCGGCAGCGTGGCGACATGGTATGTGGCGCCCAAATGGCTGAATCAAGGAAACGTCGCCCAAA CCCAATTGACCCAAACGAATGGGAAAAAAAGTGAAACATTGCCGTTGCAGCCGACTGCGAGCACAAACACGAATATGATC GCGGCGATCAACGAAGTGGCTGATGCTGTCGTTGGGGTCATCAACATCCAGAAGCAAGCGGACTTTTTCTCTAACCAAGT GCAAAATACAGAAGCGGGAACAGGCTCGGGCGTGATTTTCAAAAAAGATGGCAATACGGCATACATTGTGACGAACAACC ATGTCATTGAAGGAGCGAACGAAGTCGAAGTGGCACTTGCGAACGGAAAAAAAGTGAAAGCGGACATCGTTGGCGCCGAT GCGTTAACCGATTTGGCCGTCTTGAAAATTCCGGCCAATGGCGTAACGAAAGTAGCGAGTTTCGGCGACTCGTCAAATGT ACCAATCGGCGAACCAGTCGCGGCAATCGGCAATCCGCTTGGCCTTGACTTGTCGCGGACGGTGACGGAAGGGATCGTCA GCGGCAAACGGACGATGCCCGTATCCACCTCAGCGGGTAATTGGGAAATCGATGTCATCCAAACCGACGCGGCGATCAAT CCGGGCAACAGCGGCGGTGCGCTCATTAACAGCGCCGGGCAAGTCATTGGCATCAACAGCATGAAAATTGCCCAAATGGG TGTTGAAGGACTCGGCTTTGCCATCCCGAGCGAAAACGCTCAACCGATCGTCGAGCAGCTCATGAAAGATGGAACAGTCA AGCGCCCGTACCTTGGTGTCCAACTCGTCGATGTCGCTGATTTGTCTGCCGACGTACGCAACGACGAACTGAAACTTCCA TCAAGCGTGACACAAGGCGCTGCTGTCACCGCGGTCGAACCGTTCTCGCCTGCGGCCGAGGCCGGGTTGAAGTCGAAAGA TGTCATCGTGGCAATCAATGGGGATAAAATCGACAGCGTCAGCGCCTTGTGCAAATATTTGTATACAAAAACATCGGTAG ATGAACGCATCAAACTGACCATTTATCGCGATGGATTCGAGACAACCGTTTCGGTGACGCTCAAAACCAAACAAAACAAT CAATCATAA
Upstream 100 bases:
>100_bases TGGTGTTTCATCCGCCTTTTCGCTGTTTCGCCAGAATGCGAGCCAATCTGTCTAATGCTTGGTTATCGATGACGATTCAT CATCAAGGAGGAGAAGAGCT
Downstream 100 bases:
>100_bases GAAAATAACGCCTCTTTTCGGCTGTATCAGCTGGGACGGGGCGGTTTTTTATTTGAAAAGCCCCCCCTGCGAATACAATC GACTTGACGGTGGTGATCAC
Product: protease HhoA
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 402; Mature: 402
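The coordinate fields above can be cross-checked against each other with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the centisome position is the gene start expressed as a percentage of the chromosome length and that the last codon of the gene is a stop codon (both assumptions are consistent with the values in this record, but the database's own definitions are not stated here).

```python
# Values copied from the record above.
GENOME_LENGTH = 3_550_319   # Length of NC_009328
GENE_START = 323_905        # Start
GENE_END = 325_113          # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


def protein_length(n_bases: int) -> int:
    """Number of residues encoded, assuming the final codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bases // 3 - 1


n_bases = gene_length(GENE_START, GENE_END)
print(n_bases)                                            # 1209
print(protein_length(n_bases))                            # 402
print(round(centisome(GENE_START, GENOME_LENGTH), 2))     # 9.12
```

Under these assumptions the computed values agree with the 1209-base gene sequence, the 402-residue protein and the centisome position of 9.12 listed in the record.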
Protein sequence:
>402_residues MDMMTFEPTPPPQPKRRGRFVSWMAASVAGAVIGSVATWYVAPKWLNQGNVAQTQLTQTNGKKSETLPLQPTASTNTNMI AAINEVADAVVGVINIQKQADFFSNQVQNTEAGTGSGVIFKKDGNTAYIVTNNHVIEGANEVEVALANGKKVKADIVGAD ALTDLAVLKIPANGVTKVASFGDSSNVPIGEPVAAIGNPLGLDLSRTVTEGIVSGKRTMPVSTSAGNWEIDVIQTDAAIN PGNSGGALINSAGQVIGINSMKIAQMGVEGLGFAIPSENAQPIVEQLMKDGTVKRPYLGVQLVDVADLSADVRNDELKLP SSVTQGAAVTAVEPFSPAAEAGLKSKDVIVAINGDKIDSVSALCKYLYTKTSVDERIKLTIYRDGFETTVSVTLKTKQNN QS
Sequences:
>Translated_402_residues MDMMTFEPTPPPQPKRRGRFVSWMAASVAGAVIGSVATWYVAPKWLNQGNVAQTQLTQTNGKKSETLPLQPTASTNTNMI AAINEVADAVVGVINIQKQADFFSNQVQNTEAGTGSGVIFKKDGNTAYIVTNNHVIEGANEVEVALANGKKVKADIVGAD ALTDLAVLKIPANGVTKVASFGDSSNVPIGEPVAAIGNPLGLDLSRTVTEGIVSGKRTMPVSTSAGNWEIDVIQTDAAIN PGNSGGALINSAGQVIGINSMKIAQMGVEGLGFAIPSENAQPIVEQLMKDGTVKRPYLGVQLVDVADLSADVRNDELKLP SSVTQGAAVTAVEPFSPAAEAGLKSKDVIVAINGDKIDSVSALCKYLYTKTSVDERIKLTIYRDGFETTVSVTLKTKQNN QS
>Mature_402_residues MDMMTFEPTPPPQPKRRGRFVSWMAASVAGAVIGSVATWYVAPKWLNQGNVAQTQLTQTNGKKSETLPLQPTASTNTNMI AAINEVADAVVGVINIQKQADFFSNQVQNTEAGTGSGVIFKKDGNTAYIVTNNHVIEGANEVEVALANGKKVKADIVGAD ALTDLAVLKIPANGVTKVASFGDSSNVPIGEPVAAIGNPLGLDLSRTVTEGIVSGKRTMPVSTSAGNWEIDVIQTDAAIN PGNSGGALINSAGQVIGINSMKIAQMGVEGLGFAIPSENAQPIVEQLMKDGTVKRPYLGVQLVDVADLSADVRNDELKLP SSVTQGAAVTAVEPFSPAAEAGLKSKDVIVAINGDKIDSVSALCKYLYTKTSVDERIKLTIYRDGFETTVSVTLKTKQNN QS
Specific function: Serine protease that is required at high temperature. Involved in the degradation of damaged proteins. It can degrade IciA, Ada, casein and globin. Shared specificity with DegQ. [C]
COG id: COG0265
COG function: function code O; Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
Gene ontology:
Cell location: Cell membrane; Single-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 PDZ (DHR) domain [H]
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Homo sapiens | GI4506141 | 286 | 36.71 | 144 | 2e-34 |
| Homo sapiens | GI24308541 | 370 | 32.70 | 139 | 4e-33 |
| Homo sapiens | GI22129776 | 304 | 31.58 | 122 | 8e-28 |
| Homo sapiens | GI7019477 | 317 | 31.55 | 119 | 4e-27 |
| Escherichia coli | GI1786356 | 292 | 40.07 | 166 | 2e-42 |
| Escherichia coli | GI1789629 | 302 | 37.75 | 159 | 3e-40 |
| Escherichia coli | GI1789630 | 327 | 38.53 | 155 | 3e-39 |
| Drosophila melanogaster | GI24646839 | 315 | 34.29 | 137 | 1e-32 |
Paralogues:
None
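To work with the homologue hits programmatically, one option is to hold them as small records and pick the strongest hit per organism. The following is a minimal sketch, not part of the database: the `Hit` type and the `best_hit_per_organism` function are hypothetical names, and only the organism, GI, BLAST score and E-value fields from the list above are used.

```python
from typing import Dict, Iterable, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    blast_score: int
    evalue: float


# Hits copied from the Homologues field of this record.
HITS = [
    Hit("Homo sapiens", "GI4506141", 144, 2e-34),
    Hit("Homo sapiens", "GI24308541", 139, 4e-33),
    Hit("Homo sapiens", "GI22129776", 122, 8e-28),
    Hit("Homo sapiens", "GI7019477", 119, 4e-27),
    Hit("Escherichia coli", "GI1786356", 166, 2e-42),
    Hit("Escherichia coli", "GI1789629", 159, 3e-40),
    Hit("Escherichia coli", "GI1789630", 155, 3e-39),
    Hit("Drosophila melanogaster", "GI24646839", 137, 1e-32),
]


def best_hit_per_organism(hits: Iterable[Hit]) -> Dict[str, Hit]:
    """Return the hit with the highest BLAST score for each organism."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        current = best.get(hit.organism)
        if current is None or hit.blast_score > current.blast_score:
            best[hit.organism] = hit
    return best


for organism, hit in best_hit_per_organism(HITS).items():
    print(f"{organism}: {hit.gi} (score {hit.blast_score}, E-value {hit.evalue:g})")
```

Ranking by BLAST score is a choice made for this sketch; ranking by E-value would give the same top hit per organism for the hits listed here.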
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001478
- InterPro: IPR009003
- InterPro: IPR001254
- InterPro: IPR001940 [H]
Pfam domain/function: PF00089 Trypsin [H]
EC number: 3.4.21.- [C]
Molecular weight (Da): Translated: 42126; Mature: 42126
Theoretical pI: Translated: 4.95; Mature: 4.95
Prosite motif: PS50106 PDZ
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
2.2 %Met (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
2.5 %Cys+Met (Mature Protein)
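The Cys/Met percentages are residue counts divided by the protein length. The sketch below shows one way to compute them; `residue_percent` is a hypothetical helper, and the counts used in the final lines (1 Cys and 9 Met in 402 residues) were obtained by counting the protein sequence given in this record. Rounding to one decimal place is an assumption about how the database reports the values.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return 100.0 * hits / len(seq)


# Toy example: 1 Cys and 2 Met in a 4-residue peptide.
print(residue_percent("MACM", "C"))    # 25.0
print(residue_percent("MACM", "CM"))   # 75.0

# Counts for this protein: 1 Cys, 9 Met, 402 residues.
print(round(100 * 1 / 402, 1))         # 0.2
print(round(100 * 9 / 402, 1))         # 2.2
print(round(100 * (1 + 9) / 402, 1))   # 2.5
```

The combined value (2.5) is not the sum of the two rounded values (0.2 + 2.2) because each percentage is rounded separately from the raw counts.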
Secondary structure:
>Translated Secondary Structure
MDMMTFEPTPPPQPKRRGRFVSWMAASVAGAVIGSVATWYVAPKWLNQGNVAQTQLTQTN
CCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHEEECHHHCCCCCCEEEEEECCC
GKKSETLPLQPTASTNTNMIAAINEVADAVVGVINIQKQADFFSNQVQNTEAGTGSGVIF
CCCCCCCCCCCCCCCCCCEEEHHHHHHHHHHHHEEEHHHHHHHHHHHCCCCCCCCCEEEE
KKDGNTAYIVTNNHVIEGANEVEVALANGKKVKADIVGADALTDLAVLKIPANGVTKVAS
EECCCEEEEEECCEEECCCCEEEEEEECCCEEEEEEECCHHHCCEEEEEECCCCHHHHHH
FGDSSNVPIGEPVAAIGNPLGLDLSRTVTEGIVSGKRTMPVSTSAGNWEIDVIQTDAAIN
CCCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHCCCCCCCEECCCCCEEEEEEEECCCCC
PGNSGGALINSAGQVIGINSMKIAQMGVEGLGFAIPSENAQPIVEQLMKDGTVKRPYLGV
CCCCCCEEEECCCCEEECCCEEEEHHCCCCCEEECCCCCCHHHHHHHHHCCCCCCCCCCE
QLVDVADLSADVRNDELKLPSSVTQGAAVTAVEPFSPAAEAGLKSKDVIVAINGDKIDSV
EEEEEECCCCCCCCCCEECCCCCCCCCEEEEECCCCCHHHCCCCCCCEEEEECCCCCHHH
SALCKYLYTKTSVDERIKLTIYRDGFETTVSVTLKTKQNNQS
HHHHHHHHHCCCCCCEEEEEEEECCCCEEEEEEEEECCCCCC
>Mature Secondary Structure
MDMMTFEPTPPPQPKRRGRFVSWMAASVAGAVIGSVATWYVAPKWLNQGNVAQTQLTQTN
CCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHEEECHHHCCCCCCEEEEEECCC
GKKSETLPLQPTASTNTNMIAAINEVADAVVGVINIQKQADFFSNQVQNTEAGTGSGVIF
CCCCCCCCCCCCCCCCCCEEEHHHHHHHHHHHHEEEHHHHHHHHHHHCCCCCCCCCEEEE
KKDGNTAYIVTNNHVIEGANEVEVALANGKKVKADIVGADALTDLAVLKIPANGVTKVAS
EECCCEEEEEECCEEECCCCEEEEEEECCCEEEEEEECCHHHCCEEEEEECCCCHHHHHH
FGDSSNVPIGEPVAAIGNPLGLDLSRTVTEGIVSGKRTMPVSTSAGNWEIDVIQTDAAIN
CCCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHCCCCCCCEECCCCCEEEEEEEECCCCC
PGNSGGALINSAGQVIGINSMKIAQMGVEGLGFAIPSENAQPIVEQLMKDGTVKRPYLGV
CCCCCCEEEECCCCEEECCCEEEEHHCCCCCEEECCCCCCHHHHHHHHHCCCCCCCCCCE
QLVDVADLSADVRNDELKLPSSVTQGAAVTAVEPFSPAAEAGLKSKDVIVAINGDKIDSV
EEEEEECCCCCCCCCCEECCCCCCCCCEEEEECCCCCHHHCCCCCCCEEEEECCCCCHHH
SALCKYLYTKTSVDERIKLTIYRDGFETTVSVTLKTKQNNQS
HHHHHHHHHCCCCCCEEEEEEEECCCCEEEEEEEEECCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: Serine endopeptidases [C]
General reaction: Hydrolase; Acting on peptide bonds (Peptidases) [C]
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9384377; 8113162 [H]