Definition Streptococcus pyogenes M1 GAS chromosome, complete genome.
Accession NC_002737
Length 1,852,441


The map label for this gene is ybgI [C]

Identifier: 15674951

GI number: 15674951

Start: 773810

End: 774598

Strand: Direct

Name: ybgI [C]

Synonym: SPy_0931

Alternate gene names: 15674951

Gene position: 773810-774598 (Clockwise)

Preceding gene: 15674950

Following gene: 15674952

Centisome position: 41.77
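
The gene length and the centisome position can be re-derived from the coordinates listed above. The sketch below assumes that coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the chromosome length; that assumption reproduces the 41.77 reported in this record. The function names are illustrative and do not belong to any database API.

```python
# Values copied from this record.
GENOME_LENGTH = 1_852_441   # chromosome length in bases
START, END = 773_810, 774_598


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # number of bases in the gene
print(centisome(START, GENOME_LENGTH))  # position as a percentage of the chromosome
```

With the coordinates above, the length comes out as a multiple of three, which is consistent with an open reading frame of 262 codons plus a stop codon.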

GC content: 40.56
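
A GC content figure like the one above is usually the percentage of G and C bases in the gene sequence. The following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, and it does not treat ambiguity codes (such as N) specially, so they simply count toward the total length.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First nine codons of the gene sequence in this record, used as a short example.
print(gc_content("ATGAAAGCTAAAACCCTAATTGATGCC"))
```

Applied to the full 789-base sequence with line breaks removed, the same function should give a value comparable to the one reported in the record.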

Gene sequence:

>789_bases
ATGAAAGCTAAAACCCTAATTGATGCCTATGAAGCCTTTTGCCCACTTGATTTATCTATGGAAGGAGACGTTAAAGGTCT
TCAAATGGGTTCATTGGATAAAGACATTCGTAAGGTTATGATTACTTTAGATATCCGAGAGTCAACAGTAGCAGAAGCCA
TCAAAAATGAAGTTGATCTCATTATAACTAAGCATGCTCCCATCTTTAAACCGCTTAAAGATCTGGTTTCATCCCCTCAG
CGTGATATCCTTCTAGATTTGGTCAAGCATGACATTTCTGTCTATGTGAGCCATACCAATATTGACATTGTACCAGGTGG
GCTAAATGATTGGTTTTGTGACCTGCTAGAAATCAAGGAAGCAACTTACTTGTCTGAAACCAAAGAAGGCTTTGGCATTG
GTCGTATTGGAACAGTTAAAGAACAAGCGTTAGAAGAGTTAGCAAGTAAGGTCAAAAGGGTGTTTGATTTAGATACCGTG
CGACTTATCCGTTACGATAAGGAAAACCCTTTGATTTCAAAGATTGCTATCTGTGGCGGAAGTGGTGGTGAATTTTATCA
GGATGCTGTGCAAAAAGGCGCAGACGTTTATATTACAGGGGATATCTATTACCATACAGCACAGGAGATGCTGACAGAAG
GACTATTTGCGGTTGACCCGGGGCATCATATTGAGGTGCTTTTTACGGAGAAATTAAAGGAGAAACTCCAAGGGTGGAAA
GAAGAGAACGGTTGGGATGTCAGCATTATTTCAAGCAAAGCCTCGACCAATCCATTTAGCCATTTATAG

Upstream 100 bases:

>100_bases
GATAAGTTAGCTTATGCCTTGTCCTGCATCCCAGAAGAAAAAACTCAAGAGCGTCAGCTTCTCTTGACAAAAATACAACA
AATAAAAGAGGTGATTGTTC

Downstream 100 bases:

>100_bases
GCTGTTAAGGTCTTAAAATAAGATAAAAGGATACCAGGAATGAAAATAGCCATTATTGGGGCGGGGATTGTAGGATCAAC
CGCCGCTTATTATTTACAAC

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 262; Mature: 262

Protein sequence:

>262_residues
MKAKTLIDAYEAFCPLDLSMEGDVKGLQMGSLDKDIRKVMITLDIRESTVAEAIKNEVDLIITKHAPIFKPLKDLVSSPQ
RDILLDLVKHDISVYVSHTNIDIVPGGLNDWFCDLLEIKEATYLSETKEGFGIGRIGTVKEQALEELASKVKRVFDLDTV
RLIRYDKENPLISKIAICGGSGGEFYQDAVQKGADVYITGDIYYHTAQEMLTEGLFAVDPGHHIEVLFTEKLKEKLQGWK
EENGWDVSIISSKASTNPFSHL
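
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with the standard codon assignments, which are assumed to apply to this gene; `translate` is an illustrative helper, not a function from an existing library. Unknown codons are reported as X, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1 until the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first six codons and the last three codons of the gene in this record.
print(translate("ATGAAAGCTAAAACCCTA"))
print(translate("CATTTATAG"))
```

The two fragments match the start (MKAKTL) and the end (HL) of the protein sequence listed in the record.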

Sequences:

>Translated_262_residues
MKAKTLIDAYEAFCPLDLSMEGDVKGLQMGSLDKDIRKVMITLDIRESTVAEAIKNEVDLIITKHAPIFKPLKDLVSSPQ
RDILLDLVKHDISVYVSHTNIDIVPGGLNDWFCDLLEIKEATYLSETKEGFGIGRIGTVKEQALEELASKVKRVFDLDTV
RLIRYDKENPLISKIAICGGSGGEFYQDAVQKGADVYITGDIYYHTAQEMLTEGLFAVDPGHHIEVLFTEKLKEKLQGWK
EENGWDVSIISSKASTNPFSHL
>Mature_262_residues
MKAKTLIDAYEAFCPLDLSMEGDVKGLQMGSLDKDIRKVMITLDIRESTVAEAIKNEVDLIITKHAPIFKPLKDLVSSPQ
RDILLDLVKHDISVYVSHTNIDIVPGGLNDWFCDLLEIKEATYLSETKEGFGIGRIGTVKEQALEELASKVKRVFDLDTV
RLIRYDKENPLISKIAICGGSGGEFYQDAVQKGADVYITGDIYYHTAQEMLTEGLFAVDPGHHIEVLFTEKLKEKLQGWK
EENGWDVSIISSKASTNPFSHL

Specific function: Unknown

COG id: COG0327

COG function: function code S; Uncharacterized conserved protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the UPF0135 (NIF3) family

Homologues:

Organism=Saccharomyces cerevisiae, GI6321217, Length=256, Percent_Identity=28.515625, Blast_Score=88, Evalue=2e-18,
Organism=Drosophila melanogaster, GI19921430, Length=214, Percent_Identity=30.8411214953271, Blast_Score=94, Evalue=9e-20,
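
The homologue entries above are comma-separated fields, most of them in `key=value` form, with the GI number written without a separator. A minimal parsing sketch is shown below; `parse_homologue` is a hypothetical helper written for this layout only, and it tolerates a trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dictionary of string fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty field left by a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # the GI number has no '=' in this layout
    return record


line = ("Organism=Saccharomyces cerevisiae, GI6321217, Length=256, "
        "Percent_Identity=28.515625, Blast_Score=88, Evalue=2e-18,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```

All values are kept as strings so that the caller can decide how to convert them, for example `float` for the E-value and `int` for the length.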

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y644_STRP3 (P67275)

Other databases:

- EMBL: AE014074
- EMBL: BA000034
- RefSeq: NP_664448.1
- RefSeq: NP_802470.1
- ProteinModelPortal: P67275
- SMR: P67275
- EnsemblBacteria: EBSTRT00000034052
- EnsemblBacteria: EBSTRT00000036497
- GeneID: 1008958
- GeneID: 1065348
- GenomeReviews: AE014074_GR
- GenomeReviews: BA000034_GR
- KEGG: spg:SpyM3_0644
- KEGG: sps:SPs1208
- GeneTree: EBGT00050000028053
- HOGENOM: HBG554751
- OMA: LAVEELW
- ProtClustDB: CLSK877132
- BioCyc: SPYO193567:SPS1208-MONOMER
- BioCyc: SPYO198466:SPYM3_0644-MONOMER
- InterPro: IPR002678
- PANTHER: PTHR13799
- TIGRFAMs: TIGR00486

Pfam domain/function: PF01784 NIF3; SSF102705 interacting_NIF3

EC number: NA

Molecular weight: Translated: 29290; Mature: 29290

Theoretical pI: Translated: 4.79; Mature: 4.79

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
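
The percentages above can be checked by counting residues in the protein sequence. The sketch below copies the 262-residue sequence from this record and assumes the values are rounded to one decimal place; `residue_percent` is an illustrative helper name.

```python
# Protein sequence copied from this record (translated and mature forms are identical).
PROTEIN = (
    "MKAKTLIDAYEAFCPLDLSMEGDVKGLQMGSLDKDIRKVMITLDIRESTVAEAIKNEVDLIITKHAPIFKPLKDLVSSPQ"
    "RDILLDLVKHDISVYVSHTNIDIVPGGLNDWFCDLLEIKEATYLSETKEGFGIGRIGTVKEQALEELASKVKRVFDLDTV"
    "RLIRYDKENPLISKIAICGGSGGEFYQDAVQKGADVYITGDIYYHTAQEMLTEGLFAVDPGHHIEVLFTEKLKEKLQGWK"
    "EENGWDVSIISSKASTNPFSHL"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


print(residue_percent(PROTEIN, "C"))   # cysteine
print(residue_percent(PROTEIN, "M"))   # methionine
print(residue_percent(PROTEIN, "CM"))  # cysteine plus methionine
```

The combined figure is computed from the summed counts rather than by adding the two rounded percentages, which is why it can differ from their sum by 0.1.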

Secondary structure:

>Translated Secondary Structure
MKAKTLIDAYEAFCPLDLSMEGDVKGLQMGSLDKDIRKVMITLDIRESTVAEAIKNEVDL
CCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHEEEEEEECCHHHHHHHHHCCEEE
IITKHAPIFKPLKDLVSSPQRDILLDLVKHDISVYVSHTNIDIVPGGLNDWFCDLLEIKE
EEECCCCHHHHHHHHHCCCHHHHHHHHHHCCEEEEEEECCEEEEECCCCHHHHHHHHHHH
ATYLSETKEGFGIGRIGTVKEQALEELASKVKRVFDLDTVRLIRYDKENPLISKIAICGG
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEECC
SGGEFYQDAVQKGADVYITGDIYYHTAQEMLTEGLFAVDPGHHIEVLFTEKLKEKLQGWK
CCCHHHHHHHHCCCCEEEECCEEEHHHHHHHHCCCEEECCCCEEEEEEHHHHHHHHCCCC
EENGWDVSIISSKASTNPFSHL
CCCCCEEEEEECCCCCCCCCCC
>Mature Secondary Structure
MKAKTLIDAYEAFCPLDLSMEGDVKGLQMGSLDKDIRKVMITLDIRESTVAEAIKNEVDL
CCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHEEEEEEECCHHHHHHHHHCCEEE
IITKHAPIFKPLKDLVSSPQRDILLDLVKHDISVYVSHTNIDIVPGGLNDWFCDLLEIKE
EEECCCCHHHHHHHHHCCCHHHHHHHHHHCCEEEEEEECCEEEEECCCCHHHHHHHHHHH
ATYLSETKEGFGIGRIGTVKEQALEELASKVKRVFDLDTVRLIRYDKENPLISKIAICGG
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEECC
SGGEFYQDAVQKGADVYITGDIYYHTAQEMLTEGLFAVDPGHHIEVLFTEKLKEKLQGWK
CCCHHHHHHHHCCCCEEEECCEEEHHHHHHHHCCCEEECCCCEEEEEEHHHHHHHHCCCC
EENGWDVSIISSKASTNPFSHL
CCCCCEEEEEECCCCCCCCCCC
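
The secondary structure strings above pair each residue with a one-letter state. The sketch below summarises such a string as percentages, assuming the conventional reading of H as helix, E as strand, and C as coil; the record itself does not define the letters, and `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    counts = Counter(ss.strip().upper())
    total = sum(counts.values())  # any unexpected characters also count toward the total
    if total == 0:
        raise ValueError("empty secondary structure string")
    return {state: round(100.0 * counts.get(state, 0) / total, 1) for state in "HEC"}


# First 60 predicted states from this record, used as a short example.
print(ss_composition("CCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHEEEEEEECCHHHHHHHHHCCEEE"))
```

A mix of helix and strand states in the full prediction would be in line with the "Alpha Beta" structure class listed in this record.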

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 12122206; 12799345