Definition Mesoplasma florum L1, complete genome.
Accession NC_006055
Length 793,224

The map label for this gene is sau3AIR [H]

Identifier: 50365123

GI number: 50365123

Start: 353411

End: 354769

Strand: Reverse

Name: sau3AIR [H]

Synonym: Mfl307

Alternate gene names: 50365123

Gene position: 354769-353411 (Counterclockwise)

Preceding gene: 50365135

Following gene: 50365063

Centisome position: 44.72
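
The coordinate fields above appear to be consistent with one another. The following is a minimal sketch of how they relate, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's 5' coordinate (the End value for a reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative and are not part of this record.

```python
GENOME_LENGTH = 793_224  # from the Length field of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(353_411, 354_769)
# Assumption: for a reverse-strand gene the 5' end is the End coordinate.
cs = centisome(354_769, GENOME_LENGTH)
print(length, cs)
```

Under these assumptions the length agrees with the 1359_bases header of the gene sequence, and the percentage agrees with the Centisome position field.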

GC content: 22.96
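
The GC content value is presumably the percentage of G and C bases in the gene sequence; 22.96 % of 1359 bases corresponds to roughly 312 G or C bases. A minimal sketch of that calculation follows, shown on the first four codons of the gene. The function name gc_content is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, rounded to 2 decimals."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return round(100.0 * gc / len(dna), 2)


# First 12 bases of the gene sequence in this record.
print(gc_content("ATGAATAAGACA"))
```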

Gene sequence:

>1359_bases
ATGAATAAGACAATAGATGATATATTTCGTAGAGCAAAAGAAGCGGAAAATAAACAATTAGGAGAAATAATTAATTTTGA
TGAAATTGATTATAAACTTAAAGATAAAGGAAGAATAGGAAATTTAATTCAAAAGTATTATTTCAATATTGACATAAATA
ACAGATCTGAAAGCGATTTTAAAGAGTTAGGTTTAGATTTAGAATTAAAAGTTACTGGTTTAAAACAAAACAAGAAAAAA
CAGTGAATTGCAAAAGAAAGAGTTTCTCTTTCAATGATTGATTTTAATAATATAGAAAAAGATTTTACTGATTCAAAGGT
TCTTAATAAAAATAAAAATATACTTTTAATAGGTCACGAATACAACAAGGACAAAAATCGTAAAAATTTAAAAATCATAA
AATCTATACTATGAAAGTATGAAAATATTTCTGAAAATGAAAAAAAAATTATTGAAAATGATTACTCAATAATTTATCAA
AAAATACAAAATGGAATTGCTCATGAGCTTTCTTGTTCAGACACTAAAATTTTAGAAGCGACAACAAAGGGGCAAGGTAA
AAATAAAGATTTAAGAAATCAACCAAATAGTGATTTTAAAGCAAAAAGTAGAGCTTTTGCTTTAAAAAAGTGATATGTAA
ATTACATTTTAACAGGAAGTGATTTTGAAAGTATAAATGCTTATTTTAATACTTCTAGTTTAAAAGCTTATAACTTTTTA
AAGAATTTTATAAACGCAGATTTATCTGATTATGCTAAAAAAAATAATATCAATACTTCTGCAAAACAAGCAAACGGAAT
AATTCTAAAAAAGATGTTAGAAGAATATGATGATTCAATTTTAGAACTAATTAGTGAAGATCAAATAAAAGTAAGAACCA
TGAGACTAAGTAAATCTGGTGAAATAACACAGGAAAGTATGCCAATTTCGTCTCATCACATTAAAATTGATGAAATATTA
AAAGAAAACAAATTTGAGGATTCTTTATTTTATCAAGAGCTTACAAAACCATATATTATAGTTACAATAGATGAAAATAT
TTTTAGTAAAAAAATCATTAAAGATGTCGTTCTGCTTGATGGTATTATTTCAAAAAAAGATATTAATGAAAGAATTTATA
AAAATGCAGAGAACATTTGAAAGCAAACAAAACTTTCTTTTGAAACAATAAATAAAGGTAAAAAAGAAGATTTTATTCTA
ACTAAACAAAAAGAAGACAAAGATTTTCATTTAAGACCTAAAGCAAAAAATTCAAATGCCAAAAATAAATATGAAAATTC
TAATTTAACATTTCAAGCATTTTGATTAAATCGCAAAACAATTTCATTTTTAATCAAAAAACATAGAGGCCTTATTTAA

Upstream 100 bases:

>100_bases
ACGACTTTGATTTTATCAGACATTATTTCACACTCTTTCATAAAATATTTTATAATAACATTAATAAATTATACACTAAA
ATAGGAAGAGAAAATATAAT

Downstream 100 bases:

>100_bases
AAAGACATCTATGTTTTTTTACTCTGTTAACAATGTTAATATTTTAGTTGCTACTAAATCAACTGCAACTTCATTCCCCT
CTTTATAAGGAATTATTAAA
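
Because this gene lies on the reverse strand, its upstream and downstream flanks are presumably reported in the orientation of the gene rather than of the genome. The sketch below shows one way such flanks could be extracted, assuming 1-based inclusive coordinates and ignoring wrap-around at the origin of a circular genome. The function names and the toy genome are illustrative only.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def upstream_flank(genome: str, start: int, end: int, strand: str, n: int = 100) -> str:
    """Up to n bases 5' of the gene, read in the gene's own orientation."""
    if strand == "+":
        lo = max(0, start - 1 - n)
        return genome[lo:start - 1]
    # Reverse strand: take the bases after End on the forward strand, then flip.
    hi = min(len(genome), end + n)
    return reverse_complement(genome[end:hi])


toy_genome = "ACGTTTGGGCCA"  # hypothetical 12-base genome
print(upstream_flank(toy_genome, 4, 9, "+", n=3))
print(upstream_flank(toy_genome, 4, 9, "-", n=3))
```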

Product: type II restriction enzyme Sau3AI, GATC site

Products: NA

Alternate protein names: R.Sau3AI; Endonuclease Sau3AI; Type II restriction enzyme Sau3AI [H]

Number of amino acids: Translated: 452; Mature: 452

Protein sequence:

>452_residues
MNKTIDDIFRRAKEAENKQLGEIINFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESDFKELGLDLELKVTGLKQNKKK
QWIAKERVSLSMIDFNNIEKDFTDSKVLNKNKNILLIGHEYNKDKNRKNLKIIKSILWKYENISENEKKIIENDYSIIYQ
KIQNGIAHELSCSDTKILEATTKGQGKNKDLRNQPNSDFKAKSRAFALKKWYVNYILTGSDFESINAYFNTSSLKAYNFL
KNFINADLSDYAKKNNINTSAKQANGIILKKMLEEYDDSILELISEDQIKVRTMRLSKSGEITQESMPISSHHIKIDEIL
KENKFEDSLFYQELTKPYIIVTIDENIFSKKIIKDVVLLDGIISKKDINERIYKNAENIWKQTKLSFETINKGKKEDFIL
TKQKEDKDFHLRPKAKNSNAKNKYENSNLTFQAFWLNRKTISFLIKKHRGLI

Sequences:

>Translated_452_residues
MNKTIDDIFRRAKEAENKQLGEIINFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESDFKELGLDLELKVTGLKQNKKK
Q*IAKERVSLSMIDFNNIEKDFTDSKVLNKNKNILLIGHEYNKDKNRKNLKIIKSIL*KYENISENEKKIIENDYSIIYQ
KIQNGIAHELSCSDTKILEATTKGQGKNKDLRNQPNSDFKAKSRAFALKK*YVNYILTGSDFESINAYFNTSSLKAYNFL
KNFINADLSDYAKKNNINTSAKQANGIILKKMLEEYDDSILELISEDQIKVRTMRLSKSGEITQESMPISSHHIKIDEIL
KENKFEDSLFYQELTKPYIIVTIDENIFSKKIIKDVVLLDGIISKKDINERIYKNAENI*KQTKLSFETINKGKKEDFIL
TKQKEDKDFHLRPKAKNSNAKNKYENSNLTFQAF*LNRKTISFLIKKHRGLI
>Mature_452_residues
MNKTIDDIFRRAKEAENKQLGEIINFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESDFKELGLDLELKVTGLKQNKKK
Q*IAKERVSLSMIDFNNIEKDFTDSKVLNKNKNILLIGHEYNKDKNRKNLKIIKSIL*KYENISENEKKIIENDYSIIYQ
KIQNGIAHELSCSDTKILEATTKGQGKNKDLRNQPNSDFKAKSRAFALKK*YVNYILTGSDFESINAYFNTSSLKAYNFL
KNFINADLSDYAKKNNINTSAKQANGIILKKMLEEYDDSILELISEDQIKVRTMRLSKSGEITQESMPISSHHIKIDEIL
KENKFEDSLFYQELTKPYIIVTIDENIFSKKIIKDVVLLDGIISKKDINERIYKNAENI*KQTKLSFETINKGKKEDFIL
TKQKEDKDFHLRPKAKNSNAKNKYENSNLTFQAF*LNRKTISFLIKKHRGLI
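
The asterisks in the Translated and Mature sequences sit where the Protein sequence above has W, and the corresponding codons in the gene sequence are TGA. That is what one would expect if those two sequences were produced with the standard genetic code, whereas mycoplasmas and related Mollicutes read TGA as tryptophan (NCBI translation table 4). The sketch below illustrates the difference without external libraries; the function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(tga_is_trp: bool) -> dict:
    """Codon -> amino acid map; optionally reassign TGA from stop to Trp."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD))
    if tga_is_trp:
        table["TGA"] = "W"
    return table


def translate(dna: str, tga_is_trp: bool = True) -> str:
    """Translate frame 1 of a DNA string; unknown codons become X."""
    table = codon_table(tga_is_trp)
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


print(translate("ATGAATAAGACA"))                 # start of the gene
print(translate("CAGTGAATT"))                    # TGA read as Trp
print(translate("CAGTGAATT", tga_is_trp=False))  # TGA read as stop
```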

Specific function: Recognizes the double-stranded sequence GATC and cleaves before G-1 [H]
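
As an illustration of the recognition rule, the sketch below lists the positions in a DNA string where GATC occurs, reporting the 0-based index of the G; a cut "before G" would fall immediately 5' of that index. This is only a string search under that reading of the rule and models no enzymology. The function name is hypothetical.

```python
def gatc_cut_sites(dna: str, site: str = "GATC") -> list:
    """0-based indices of each occurrence of the site (the cut is 5' of the G)."""
    dna = dna.upper()
    sites = []
    pos = dna.find(site)
    while pos != -1:
        sites.append(pos)
        pos = dna.find(site, pos + 1)
    return sites


# Fragment taken from the gene sequence in this record.
print(gatc_cut_sites("ACAGATCTGAAAG"))
```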

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011337
- InterPro:   IPR011335 [H]

Pfam domain/function: PF02976 MutH [H]

EC number: 3.1.21.4 [H]

Molecular weight: Translated: 51980; Mature: 51980

Theoretical pI: Translated: 9.97; Mature: 9.97

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.3 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
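
These percentages are consistent with 1 Cys and 5 Met among the 452 residues of the Protein sequence. A minimal sketch of the calculation follows, assuming the value is the residue count divided by the total length and rounded to one decimal. The function name and the toy sequence are illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


toy = "MNKTCMAAAA"  # hypothetical 10-residue sequence
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```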

Secondary structure:

>Translated Secondary Structure
MNKTIDDIFRRAKEAENKQLGEIINFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESDF
CCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHEEEEEECCCCCCHH
KELGLDLELKVTGLKQNKKKQIAKERVSLSMIDFNNIEKDFTDSKVLNKNKNILLIGHEY
HHCCCCEEEEEECCCCHHHHHHHHHHHEEEEECCCCCCCCCCCHHHHCCCCCEEEEECCC
NKDKNRKNLKIIKSILKYENISENEKKIIENDYSIIYQKIQNGIAHELSCSDTKILEATT
CCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCHHCCCCCCCCEEEEEEC
KGQGKNKDLRNQPNSDFKAKSRAFALKKYVNYILTGSDFESINAYFNTSSLKAYNFLKNF
CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCCHHHHHHEECCCHHHHHHHHHHH
INADLSDYAKKNNINTSAKQANGIILKKMLEEYDDSILELISEDQIKVRTMRLSKSGEIT
HCCCHHHHHHHCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCC
QESMPISSHHIKIDEILKENKFEDSLFYQELTKPYIIVTIDENIFSKKIIKDVVLLDGII
HHCCCCCCCCEEHHHHHHHCCCHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHH
SKKDINERIYKNAENIKQTKLSFETINKGKKEDFILTKQKEDKDFHLRPKAKNSNAKNKY
HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCEECCCCCCCCCCCCC
ENSNLTFQAFLNRKTISFLIKKHRGLI
CCCCEEEEEHHCHHHHHHHHHHHCCCC
>Mature Secondary Structure
MNKTIDDIFRRAKEAENKQLGEIINFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESDF
CCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHEEEEEECCCCCCHH
KELGLDLELKVTGLKQNKKKQIAKERVSLSMIDFNNIEKDFTDSKVLNKNKNILLIGHEY
HHCCCCEEEEEECCCCHHHHHHHHHHHEEEEECCCCCCCCCCCHHHHCCCCCEEEEECCC
NKDKNRKNLKIIKSILKYENISENEKKIIENDYSIIYQKIQNGIAHELSCSDTKILEATT
CCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCHHCCCCCCCCEEEEEEC
KGQGKNKDLRNQPNSDFKAKSRAFALKKYVNYILTGSDFESINAYFNTSSLKAYNFLKNF
CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCCHHHHHHEECCCHHHHHHHHHHH
INADLSDYAKKNNINTSAKQANGIILKKMLEEYDDSILELISEDQIKVRTMRLSKSGEIT
HCCCHHHHHHHCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCC
QESMPISSHHIKIDEILKENKFEDSLFYQELTKPYIIVTIDENIFSKKIIKDVVLLDGII
HHCCCCCCCCEEHHHHHHHCCCHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHH
SKKDINERIYKNAENIKQTKLSFETINKGKKEDFILTKQKEDKDFHLRPKAKNSNAKNKY
HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCEECCCCCCCCCCCCC
ENSNLTFQAFLNRKTISFLIKKHRGLI
CCCCEEEEEHHCHHHHHHHHHHHCCCC
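
In the blocks above each residue line is followed by a line of predicted states, which appear to use the common three-state alphabet (H for helix, E for strand, C for coil). The sketch below summarises such a state string as percentages; that interpretation of the letters is an assumption, and the function name is hypothetical.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of each predicted state (H, E, C) in a state string."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states.upper())
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


print(state_composition("CCCCHHHHEE"))
```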

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2227451 [H]