Definition | Streptococcus pyogenes M1 GAS chromosome, complete genome. |
---|---|
Accession | NC_002737 |
Length | 1,852,441 |
The map label for this gene is nupC [C]
Identifier: 15675687
GI number: 15675687
Start: 1549807
End: 1551009
Strand: Reverse
Name: nupC [C]
Synonym: SPy_1868
Alternate gene names: 15675687
Gene position: 1551009-1549807 (Counterclockwise)
Preceding gene: 15675688
Following gene: 15675686
Centisome position: 83.73
GC content: 35.99
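The coordinate-derived fields above can be checked with a little arithmetic. The following is a minimal sketch, not the database's documented method: the function names are illustrative, and the choice of the higher coordinate (the 5' end of this reverse-strand gene) for the centisome calculation is an assumption inferred from the listed value.

```python
def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return round(100.0 * position / genome_length, 2)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    return round(100.0 * (seq.count("G") + seq.count("C")) / len(seq), 2)


# Values copied from the record; the variable names are hypothetical.
GENOME_LENGTH = 1_852_441
START, END = 1_549_807, 1_551_009

length = gene_length(START, END)
position = centisome(END, GENOME_LENGTH)
print(length, position)
```

Under these assumptions the length agrees with the 1203 bases reported for the gene sequence, and the centisome value agrees with the listed 83.73. `gc_percent` can be applied to the gene sequence in the same way.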
Gene sequence:
>1203_bases ATGCAATTTATTTATAGTATTATTGGTATTTTATTGGTATTAGGAATTGTGTATGCAATTTCTTTCAATCGTAAGAGTGT TTCTCTAAGTTTAATTGGAAAAGCTCTTATCGTTCAATTCATTATTGCGCTAATCTTAGTACGTATCCCACTAGGCCAAC AAATTGTTAGTGTTGTTTCAACTGGAGTTACTAGCGTAATCAACTGTGGTCAAGCTGGTTTAAATTTTGTGTTTGGGTCA TTAGCAGATAGTGGCGCAAAAACTGGTTTTATTTTCGCTATTCAAACGCTTGGTAATATTGTTTTCTTATCTGCCCTAGT TAGTCTACTTTATTATGTAGGAATCCTTGGATTTGTAGTAAAATGGATAGGTAAGGGCGTTGGTAAAATTATGAAATCCT CAGAGGTTGAGAGTTTTGTTGCTGTAGCTAATATGTTTCTTGGTCAAACAGACAGTCCAATCTTGGTTAGCAAATACCTA GGTCGTATGACTGATAGTGAGATAATGGTTGTGTTGGTATCAGGTATGGGAAGTATGTCAGTTTCTATTCTTGGTGGCTA TATTGCATTAGGCATTCCAATGGAATATCTCTTGATTGCTTCAACAATGGTTCCTATTGGCAGTATTCTCATTGCTAAAA TCTTATTGCCTCAAACAGAACCTGTTCAAAAAATTGATGACATTAAGATGGATAATAAAGGTAATAACGCCAATGTGATT GATGCAATCGCTGAGGGTGCAAGCACAGGTGCACAAATGGCTTTCTCAATTGGTGCTAGTTTGATTGCCTTTGTTGGTTT AGTTTCTTTGATTAATATGATGTTAAGTGGATTGGGAATCCGCTTAGAACAAATCTTTTCATATGTTTTTGCTCCATTTG GTTTTCTTATGGGATTTGACCACAAAAACATTCTTCTAGAAGGAAACCTTCTTGGAAGTAAGTTGATTTTAAATGAGTTT GTTTCGTTCCAACAATTGGGTCACCTAATCAAATCTTTAGATTATCGTACAGCATTGGTAGCAACTATTTCACTCTGTGG TTTTGCTAATTTATCAAGTTTAGGTATTTGTGTTTCAGGTATTGCTGTTCTTTGCCCGGAGAAACGTAGCACCCTAGCTC GACTTGTTTTCCGTGCAATGATTGGTGGTATTGCTGTAAGTATGCTTAGCGCCTTTATCGTCGGTATTGTAACTCTATTC TAA
Upstream 100 bases:
>100_bases TGGCTCATGATACAGAAGCAGCTATTCAAGTTGCTGTTGAAGCACTCCGCACACTTATTGAAAATGATAAATCACAATAA GCGAGGTTTTGGAAGTCATC
Downstream 100 bases:
>100_bases ACGTTGACAAAAGAAAGAAGGATAGTAACGTGGAAGTAAAAGATATTTTAAAAACGGTAGACCATACTTTGCTAGCAACA ACAGCAACGTGGCCAGAAAT
Product: putative nucleoside transporter
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 400; Mature: 400
Protein sequence:
>400_residues MQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQIVSVVSTGVTSVINCGQAGLNFVFGS LADSGAKTGFIFAIQTLGNIVFLSALVSLLYYVGILGFVVKWIGKGVGKIMKSSEVESFVAVANMFLGQTDSPILVSKYL GRMTDSEIMVVLVSGMGSMSVSILGGYIALGIPMEYLLIASTMVPIGSILIAKILLPQTEPVQKIDDIKMDNKGNNANVI DAIAEGASTGAQMAFSIGASLIAFVGLVSLINMMLSGLGIRLEQIFSYVFAPFGFLMGFDHKNILLEGNLLGSKLILNEF VSFQQLGHLIKSLDYRTALVATISLCGFANLSSLGICVSGIAVLCPEKRSTLARLVFRAMIGGIAVSMLSAFIVGIVTLF
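The relationship between the 1203-base gene sequence and the 400-residue protein can be sketched as below. This is an illustrative translation routine that assumes the standard codon-to-amino-acid assignments; the names `CODON_TABLE`, `translate`, and `expected_residues` are not part of the record.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def expected_residues(n_bases: int) -> int:
    """Residue count for a complete ORF: codons minus the terminal stop codon."""
    return n_bases // 3 - 1


# First 21 and last 15 bases of the gene sequence listed in this record.
n_terminal = translate("ATGCAATTTATTTATAGTATT")
c_terminal = translate("GTAACTCTATTCTAA")
print(n_terminal, c_terminal, expected_residues(1203))
```

The two fragments correspond to the start (`MQFIYSI`) and end (`VTLF`) of the listed protein sequence, and 1203 bases give 401 codons, that is 400 residues plus the stop codon.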
Sequences:
>Translated_400_residues MQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQIVSVVSTGVTSVINCGQAGLNFVFGS LADSGAKTGFIFAIQTLGNIVFLSALVSLLYYVGILGFVVKWIGKGVGKIMKSSEVESFVAVANMFLGQTDSPILVSKYL GRMTDSEIMVVLVSGMGSMSVSILGGYIALGIPMEYLLIASTMVPIGSILIAKILLPQTEPVQKIDDIKMDNKGNNANVI DAIAEGASTGAQMAFSIGASLIAFVGLVSLINMMLSGLGIRLEQIFSYVFAPFGFLMGFDHKNILLEGNLLGSKLILNEF VSFQQLGHLIKSLDYRTALVATISLCGFANLSSLGICVSGIAVLCPEKRSTLARLVFRAMIGGIAVSMLSAFIVGIVTLF
>Mature_400_residues MQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQIVSVVSTGVTSVINCGQAGLNFVFGS LADSGAKTGFIFAIQTLGNIVFLSALVSLLYYVGILGFVVKWIGKGVGKIMKSSEVESFVAVANMFLGQTDSPILVSKYL GRMTDSEIMVVLVSGMGSMSVSILGGYIALGIPMEYLLIASTMVPIGSILIAKILLPQTEPVQKIDDIKMDNKGNNANVI DAIAEGASTGAQMAFSIGASLIAFVGLVSLINMMLSGLGIRLEQIFSYVFAPFGFLMGFDHKNILLEGNLLGSKLILNEF VSFQQLGHLIKSLDYRTALVATISLCGFANLSSLGICVSGIAVLCPEKRSTLARLVFRAMIGGIAVSMLSAFIVGIVTLF
Specific function: Unknown
COG id: COG1972
COG function: function code F; Nucleoside permease
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the concentrative nucleoside transporter (CNT) (TC 2.A.41) family [H]
Homologues:
Organism=Homo sapiens, GI11545853, Length=364, Percent_Identity=35.4395604395604, Blast_Score=211, Evalue=7e-55
Organism=Homo sapiens, GI227116277, Length=413, Percent_Identity=31.7191283292978, Blast_Score=208, Evalue=7e-54
Organism=Homo sapiens, GI42542381, Length=416, Percent_Identity=31.25, Blast_Score=204, Evalue=2e-52
Organism=Escherichia coli, GI1788485, Length=411, Percent_Identity=35.5231143552311, Blast_Score=248, Evalue=4e-67
Organism=Escherichia coli, GI1788488, Length=412, Percent_Identity=36.1650485436893, Blast_Score=246, Evalue=2e-66
Organism=Escherichia coli, GI1788737, Length=384, Percent_Identity=26.3020833333333, Blast_Score=145, Evalue=5e-36
Organism=Caenorhabditis elegans, GI17560276, Length=393, Percent_Identity=30.2798982188295, Blast_Score=192, Evalue=3e-49
Organism=Caenorhabditis elegans, GI71991794, Length=397, Percent_Identity=28.9672544080605, Blast_Score=180, Evalue=1e-45
Organism=Caenorhabditis elegans, GI25146537, Length=187, Percent_Identity=30.4812834224599, Blast_Score=98, Evalue=6e-21
Organism=Drosophila melanogaster, GI45552517, Length=389, Percent_Identity=31.6195372750643, Blast_Score=210, Evalue=2e-54
Organism=Drosophila melanogaster, GI19921868, Length=389, Percent_Identity=31.6195372750643, Blast_Score=210, Evalue=2e-54
Organism=Drosophila melanogaster, GI45552519, Length=389, Percent_Identity=31.6195372750643, Blast_Score=209, Evalue=2e-54
Organism=Drosophila melanogaster, GI281360430, Length=407, Percent_Identity=30.4668304668305, Blast_Score=178, Evalue=6e-45
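The homologue entries follow a regular `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be loaded into structured records for sorting or filtering. The sketch below is one possible parser; the `Homologue` type, the regular expression, and `parse_homologues` are illustrative assumptions rather than a tool supplied with this record. It works whether the entries are comma-joined on one line or listed one per line.

```python
import re
from typing import List, NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One match per homologue entry; separators between entries are ignored.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_homologues(text: str) -> List[Homologue]:
    """Extract every homologue entry found in the given text."""
    return [
        Homologue(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in RECORD.finditer(text)
    ]


# Two entries copied from the list above.
sample = (
    "Organism=Homo sapiens, GI11545853, Length=364, "
    "Percent_Identity=35.4395604395604, Blast_Score=211, Evalue=7e-55, "
    "Organism=Escherichia coli, GI1788485, Length=411, "
    "Percent_Identity=35.5231143552311, Blast_Score=248, Evalue=4e-67,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.blast_score)
```

Ranking by `blast_score` (or ascending `evalue`) then gives the closest homologue among the parsed entries.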
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008276
- InterPro: IPR018270
- InterPro: IPR011642
- InterPro: IPR011657
- InterPro: IPR002668 [H]
Pfam domain/function: PF07670 Gate; PF07662 Nucleos_tra2_C; PF01773 Nucleos_tra2_N [H]
EC number: NA
Molecular weight: Translated: 42394; Mature: 42394
Theoretical pI: Translated: 9.08; Mature: 9.08
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
4.0 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
4.0 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
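The Cys/Met percentages are simple residue frequencies. As a rough sketch (the helper name `residue_percent` is hypothetical, and rounding to one decimal place is assumed from the way the values are printed), they could be computed like this:

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = sequence.upper()
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * hits / len(seq), 1)


# First ten residues of the protein sequence in this record, for illustration.
fragment = "MQFIYSIIGI"
cys = residue_percent(fragment, "C")
met = residue_percent(fragment, "M")
cys_met = residue_percent(fragment, "CM")
print(cys, met, cys_met)
```

For the full 400-residue protein, the listed 1.0 % Cys and 4.0 % Met correspond to 4 cysteine and 16 methionine residues.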
Secondary structure:
>Translated Secondary Structure
MQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQIVSVVS
CHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
TGVTSVINCGQAGLNFVFGSLADSGAKTGFIFAIQTLGNIVFLSALVSLLYYVGILGFVV
HHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KWIGKGVGKIMKSSEVESFVAVANMFLGQTDSPILVSKYLGRMTDSEIMVVLVSGMGSMS
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCHHEEHEECCCCCHH
VSILGGYIALGIPMEYLLIASTMVPIGSILIAKILLPQTEPVQKIDDIKMDNKGNNANVI
HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCCHHH
DAIAEGASTGAQMAFSIGASLIAFVGLVSLINMMLSGLGIRLEQIFSYVFAPFGFLMGFD
HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCC
HKNILLEGNLLGSKLILNEFVSFQQLGHLIKSLDYRTALVATISLCGFANLSSLGICVSG
CCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH
IAVLCPEKRSTLARLVFRAMIGGIAVSMLSAFIVGIVTLF
HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
>Mature Secondary Structure
MQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQIVSVVS
CHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
TGVTSVINCGQAGLNFVFGSLADSGAKTGFIFAIQTLGNIVFLSALVSLLYYVGILGFVV
HHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KWIGKGVGKIMKSSEVESFVAVANMFLGQTDSPILVSKYLGRMTDSEIMVVLVSGMGSMS
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCHHEEHEECCCCCHH
VSILGGYIALGIPMEYLLIASTMVPIGSILIAKILLPQTEPVQKIDDIKMDNKGNNANVI
HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCCHHH
DAIAEGASTGAQMAFSIGASLIAFVGLVSLINMMLSGLGIRLEQIFSYVFAPFGFLMGFD
HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCC
HKNILLEGNLLGSKLILNEFVSFQQLGHLIKSLDYRTALVATISLCGFANLSSLGICVSG
CCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH
IAVLCPEKRSTLARLVFRAMIGGIAVSMLSAFIVGIVTLF
HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7542800 [H]