Definition | Streptococcus pyogenes M1 GAS chromosome, complete genome. |
---|---|
Accession | NC_002737 |
Length | 1,852,441 |
The map label for this gene is hlyX [C]
Identifier: 15674525
GI number: 15674525
Start: 322545
End: 323879
Strand: Direct
Name: hlyX [C]
Synonym: SPy_0378
Alternate gene names: 15674525
Gene position: 322545-323879 (Clockwise)
Preceding gene: 15674524
Following gene: 15674526
Centisome position: 17.41
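The gene length and centisome position can be recomputed from the coordinates in this record. The following is a minimal sketch, assuming the centisome is the start coordinate expressed as a percentage of the genome length (an assumption, though it reproduces the listed value); the function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 1_852_441       # NC_002737 length from the header table
START, END = 322_545, 323_879   # gene coordinates, 1-based and inclusive


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 1335
print(centisome(START, GENOME_LENGTH))  # 17.41
```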
GC content: 38.65
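GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation is shown here; the helper name is hypothetical, and whitespace is stripped first because the sequence in this record is printed in space-separated blocks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input: 2 of the 4 bases are G or C.
print(gc_content("AT GC"))
```

Passing the full gene sequence from this record to `gc_content` should give a value comparable to the one listed above.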
Gene sequence:
>1335_bases ATGGAAGACCCTGTGAGTCAGTCCTTAGTGATTCAATTTTTATTGTTAGTTGTTTTAACCTTGTTAAATGCTTTTTTTTC AGCCAGTGAAATGGCCTTAGTTTCTCTCAATCGTTCTCGGGTGGAACAAAAAGCAGCAGACGGTGATAAAAAATACGCTC GTTTGTTGCGGGTTTTAGAGGAACCTAATCATTTTTTATCAACGATTCAAGTTGGGATTACCTTTATTAGTTTACTATCA GGAGCAAGTTTATCAGCTTCTTTGGGTAAGGTGATCTCAGGTTGGCTAGGTAATTCAGCGACCGCAAGGACAGCTGGTAC TATCATCTCCTTGGTTTTCTTGACTTATGTCTCTATTGTTTTAGGAGAATTGTATCCAAAACGGATTGCCATGAACCTCA AAGACAAGTTGGCGATTGTTTCAGCCCCTATTATCATTGGGTTAGGGAGACTGGTTAGTCCCTTTGTATGGCTCTTATCA GCTTCTACTAATTTACTGAGCCGACTTACCCCTATGACCTTTGATGATGCAGATGAGCAAATGACACGTGATGAAATCGA GTATATGTTATCAAAAAGTGAGGCGACCCTTGATGCTGAAGAAATTGAGATGTTGCAAGGAGTTTTCTCACTTGATGAAA TGATGGCGCGTGAAGTCATGGTCCCAAGGACCGATGCTTTCATGATTGACATTAACGATGATCCGCTTGAAAATATTCAG GAAATCTTAAAACAAAGTTTTTCACGCATTCCTGTTTATGATGTGGATAAAGATAAAATTATCGGTCTCATCCACACTAA GCGTCTCTTGGAGTCAGGTTTCCGCCAGGGATTTGATCAGATTAACATGCGAAAAATGTTACAAGAACCTCTTTTTGTTC CCGAAACCATTTTTGTAGATGATCTCTTACGCCAGCTGCGCAATACCCAAAATCAGATGGCTATTTTGCTAGATGAATAT GGTGGTGTGGCAGGACTTGTGACTTTGGAAGACTTGCTTGAAGAAATCGTCGGTGAAATCGATGATGAAACCGATAAAGC AGAACAATTTGTTCATGAGATTGGAGACAATACCTATATTGTTGTTGGTACTATGACTTTAAATGAGTTTAATGACTATT TTGATACCGAACTAGAATCAGATGATGTAGATACCATTGCTGGTTTTTATTTGACAGGTATCGGAACCATTCCAAGCCAG GAGCAAAAAGAAGCCTACGAAATAGATAACAAAGACAAACATTTAGTTCTAATCAACGATAAAGTCAAAGATGGCCGTAT TACGAAATTAAAATTAATCCTGTCTAATATAGAACAGATTATTGAGGAAGACTAG
Upstream 100 bases:
>100_bases CTGCGATTATTTCGGTGGAAAAAGAAGAAATCCCATAGGCTTTGGCTATAAATGTGATATAATGAAGTCACTATATTATT TTTTTAGGAGTTACATTAAC
Downstream 100 bases:
>100_bases GCTCCATGCCTAGTCTTTTTGATGAAAAGTGTTATAATGAAAATGTAAAAACGTTTACAAGTAATGAAGGAGATGCAATG ACTGAGAAAGATTATGGACA
Product: putative hemolysin
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 444; Mature: 444
Protein sequence:
>444_residues MEDPVSQSLVIQFLLLVVLTLLNAFFSASEMALVSLNRSRVEQKAADGDKKYARLLRVLEEPNHFLSTIQVGITFISLLS GASLSASLGKVISGWLGNSATARTAGTIISLVFLTYVSIVLGELYPKRIAMNLKDKLAIVSAPIIIGLGRLVSPFVWLLS ASTNLLSRLTPMTFDDADEQMTRDEIEYMLSKSEATLDAEEIEMLQGVFSLDEMMAREVMVPRTDAFMIDINDDPLENIQ EILKQSFSRIPVYDVDKDKIIGLIHTKRLLESGFRQGFDQINMRKMLQEPLFVPETIFVDDLLRQLRNTQNQMAILLDEY GGVAGLVTLEDLLEEIVGEIDDETDKAEQFVHEIGDNTYIVVGTMTLNEFNDYFDTELESDDVDTIAGFYLTGIGTIPSQ EQKEAYEIDNKDKHLVLINDKVKDGRITKLKLILSNIEQIIEED
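The protein length follows from the gene length: 1335 bases are 445 codons, that is 444 residues plus a stop codon. The sketch below translates DNA with the standard codon assignments (assumed to be appropriate here; the bacterial table uses the same assignments for elongation). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last seven codons of the gene sequence in this record.
print(translate("ATGGAAGACCCTGTG"))        # MEDPV
print(translate("CAGATTATTGAGGAAGACTAG"))  # QIIEED
```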
Sequences:
>Translated_444_residues MEDPVSQSLVIQFLLLVVLTLLNAFFSASEMALVSLNRSRVEQKAADGDKKYARLLRVLEEPNHFLSTIQVGITFISLLS GASLSASLGKVISGWLGNSATARTAGTIISLVFLTYVSIVLGELYPKRIAMNLKDKLAIVSAPIIIGLGRLVSPFVWLLS ASTNLLSRLTPMTFDDADEQMTRDEIEYMLSKSEATLDAEEIEMLQGVFSLDEMMAREVMVPRTDAFMIDINDDPLENIQ EILKQSFSRIPVYDVDKDKIIGLIHTKRLLESGFRQGFDQINMRKMLQEPLFVPETIFVDDLLRQLRNTQNQMAILLDEY GGVAGLVTLEDLLEEIVGEIDDETDKAEQFVHEIGDNTYIVVGTMTLNEFNDYFDTELESDDVDTIAGFYLTGIGTIPSQ EQKEAYEIDNKDKHLVLINDKVKDGRITKLKLILSNIEQIIEED
>Mature_444_residues MEDPVSQSLVIQFLLLVVLTLLNAFFSASEMALVSLNRSRVEQKAADGDKKYARLLRVLEEPNHFLSTIQVGITFISLLS GASLSASLGKVISGWLGNSATARTAGTIISLVFLTYVSIVLGELYPKRIAMNLKDKLAIVSAPIIIGLGRLVSPFVWLLS ASTNLLSRLTPMTFDDADEQMTRDEIEYMLSKSEATLDAEEIEMLQGVFSLDEMMAREVMVPRTDAFMIDINDDPLENIQ EILKQSFSRIPVYDVDKDKIIGLIHTKRLLESGFRQGFDQINMRKMLQEPLFVPETIFVDDLLRQLRNTQNQMAILLDEY GGVAGLVTLEDLLEEIVGEIDDETDKAEQFVHEIGDNTYIVVGTMTLNEFNDYFDTELESDDVDTIAGFYLTGIGTIPSQ EQKEAYEIDNKDKHLVLINDKVKDGRITKLKLILSNIEQIIEED
Specific function: Unknown
COG id: COG1253
COG function: function code R; Hemolysins and related proteins containing CBS domains
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 CBS domains [H]
Homologues:
Organism=Homo sapiens, GI310128564, Length=382, Percent_Identity=32.9842931937173, Blast_Score=196, Evalue=3e-50
Organism=Homo sapiens, GI40068055, Length=362, Percent_Identity=25.1381215469613, Blast_Score=71, Evalue=2e-12
Organism=Homo sapiens, GI40068053, Length=362, Percent_Identity=25.1381215469613, Blast_Score=71, Evalue=2e-12
Organism=Homo sapiens, GI94681046, Length=341, Percent_Identity=26.099706744868, Blast_Score=67, Evalue=3e-11
Organism=Escherichia coli, GI145693175, Length=382, Percent_Identity=29.5811518324607, Blast_Score=169, Evalue=5e-43
Organism=Escherichia coli, GI1790664, Length=399, Percent_Identity=24.5614035087719, Blast_Score=128, Evalue=7e-31
Organism=Escherichia coli, GI1786879, Length=225, Percent_Identity=32.4444444444444, Blast_Score=120, Evalue=2e-28
Organism=Escherichia coli, GI87082033, Length=246, Percent_Identity=28.4552845528455, Blast_Score=98, Evalue=1e-21
Organism=Escherichia coli, GI1788119, Length=225, Percent_Identity=31.1111111111111, Blast_Score=85, Evalue=1e-17
Organism=Saccharomyces cerevisiae, GI6324512, Length=358, Percent_Identity=22.0670391061453, Blast_Score=64, Evalue=6e-11
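Each homologue entry carries the same six fields (organism, GI number, length, percent identity, BLAST score, E-value), so the list can be parsed mechanically. The sketch below is one possible parser; the `Hit` type, the regular expression and `parse_hits` are assumptions made for illustration, not part of the database.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the text, in order."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI310128564, Length=382, "
    "Percent_Identity=32.9842931937173, Blast_Score=196, Evalue=3e-50, "
    "Organism=Escherichia coli, GI145693175, Length=382, "
    "Percent_Identity=29.5811518324607, Blast_Score=169, Evalue=5e-43,"
)
hits = parse_hits(sample)
best = min(hits, key=lambda h: h.evalue)  # smallest E-value = strongest hit
print(best.organism, best.gi)
```

If Length is the alignment length (an assumption), the first entry corresponds to about 126 identical positions out of 382.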
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016169
- InterPro: IPR000644
- InterPro: IPR002550
- InterPro: IPR005170 [H]
Pfam domain/function: PF00571 CBS; PF03471 CorC_HlyC; PF01595 DUF21 [H]
EC number: NA
Molecular weight: Translated: 49834; Mature: 49834
Theoretical pI: Translated: 4.16; Mature: 4.16
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.4 %Met (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.4 %Met (Mature Protein)
3.4 %Cys+Met (Mature Protein)
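The Cys/Met figures are residue counts expressed as a percentage of the protein length. A minimal sketch of that calculation, assuming one-decimal rounding as in this record; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# First ten residues of the protein in this record, used as a toy input.
fragment = "MEDPVSQSLV"
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 10.0
```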
Secondary structure:
>Translated Secondary Structure
MEDPVSQSLVIQFLLLVVLTLLNAFFSASEMALVSLNRSRVEQKAADGDKKYARLLRVLE
CCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHH
EPNHFLSTIQVGITFISLLSGASLSASLGKVISGWLGNSATARTAGTIISLVFLTYVSIV
CCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
LGELYPKRIAMNLKDKLAIVSAPIIIGLGRLVSPFVWLLSASTNLLSRLTPMTFDDADEQ
HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHH
MTRDEIEYMLSKSEATLDAEEIEMLQGVFSLDEMMAREVMVPRTDAFMIDINDDPLENIQ
HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHH
EILKQSFSRIPVYDVDKDKIIGLIHTKRLLESGFRQGFDQINMRKMLQEPLFVPETIFVD
HHHHHHHHCCCCEECCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCHHHHHH
DLLRQLRNTQNQMAILLDEYGGVAGLVTLEDLLEEIVGEIDDETDKAEQFVHEIGDNTYI
HHHHHHHCCHHHHEEHHHHCCCEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCEE
VVGTMTLNEFNDYFDTELESDDVDTIAGFYLTGIGTIPSQEQKEAYEIDNKDKHLVLIND
EEEEEEHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCEEEEEEC
KVKDGRITKLKLILSNIEQIIEED
CCCCCCHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MEDPVSQSLVIQFLLLVVLTLLNAFFSASEMALVSLNRSRVEQKAADGDKKYARLLRVLE
CCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHH
EPNHFLSTIQVGITFISLLSGASLSASLGKVISGWLGNSATARTAGTIISLVFLTYVSIV
CCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
LGELYPKRIAMNLKDKLAIVSAPIIIGLGRLVSPFVWLLSASTNLLSRLTPMTFDDADEQ
HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHH
MTRDEIEYMLSKSEATLDAEEIEMLQGVFSLDEMMAREVMVPRTDAFMIDINDDPLENIQ
HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHH
EILKQSFSRIPVYDVDKDKIIGLIHTKRLLESGFRQGFDQINMRKMLQEPLFVPETIFVD
HHHHHHHHCCCCEECCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCHHHHHH
DLLRQLRNTQNQMAILLDEYGGVAGLVTLEDLLEEIVGEIDDETDKAEQFVHEIGDNTYI
HHHHHHHCCHHHHEEHHHHCCCEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCEE
VVGTMTLNEFNDYFDTELESDDVDTIAGFYLTGIGTIPSQEQKEAYEIDNKDKHLVLIND
EEEEEEHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCEEEEEEC
KVKDGRITKLKLILSNIEQIIEED
CCCCCCHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8905231 [H]