LOCUS NC_012972 9816 bp DNA circular BCT 16-JUL-2009 DEFINITION Methylovorus sp. SIP3-4 plasmid pMsip02, complete sequence. ACCESSION NC_012972 VERSION NC_012972.1 GI:254028270 DBLINK Project:33241 KEYWORDS . SOURCE Methylovorus sp. SIP3-4 ORGANISM Methylovorus sp. SIP3-4 Bacteria; Proteobacteria; Betaproteobacteria; Methylophilales; Methylophilaceae; Methylovorus. REFERENCE 1 (bases 1 to 9816) AUTHORS Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Clum,A., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Kayluzhnaya,M. and Chistoserdova,L. CONSRTM US DOE Joint Genome Institute TITLE Complete sequence of plasmid 2 of Methylovorus sp. SIP3-4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 9816) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (15-JUL-2009) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 9816) AUTHORS Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Clum,A., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikailova,N., Kayluzhnaya,M. and Chistoserdova,L. CONSRTM US DOE Joint Genome Institute TITLE Direct Submission JOURNAL Submitted (13-JUL-2009) US DOE Joint Genome Institute, 2800 Mitchell Drive B310, Walnut Creek, CA 94598-1698, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from CP001676. URL -- http://www.jgi.doe.gov JGI Project ID: 4086190 Source DNA and bacteria available from Ludmila Chistoserdova (milachis@u.washington.edu) Contacts: Ludmila Chistoserdova (milachis@u.washington.edu) David Bruce (microbe@cuba.jgi-psf.org) Annotation done by JGI-ORNL and JGI-PGF Finishing done by JGI-PGF Finished microbial genomes have been curated to close all gaps with greater than 98% coverage of at least two independent clones. Each base pair has a minimum q (quality) value of 30 and the total error rate is less than one per 50000. The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. it is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376). COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..9816 /organism="Methylovorus sp. SIP3-4" /mol_type="genomic DNA" /strain="SIP3-4" /db_xref="taxon:582744" /plasmid="pMsip02" gene 103..351 /locus_tag="Msip34_2831" /db_xref="GeneID:8174800" CDS 103..351 /locus_tag="Msip34_2831" /inference="ab initio prediction:Prodigal:1.4" /note="KEGG: asa:ASA_1269 transcriptional protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056831.1" /db_xref="GI:254028271" /db_xref="InterPro:IPR001387" /db_xref="GeneID:8174800" /translation="MSTGYGKRIRLVREHLGLGRAEFCNETGIPKQSLINYETERTKA NAEVLGAIANKWPEYAAYLLTDKTFVIQKNPEHDQINN" gene complement(376..1059) /locus_tag="Msip34_2832" /db_xref="GeneID:8174788" CDS complement(376..1059) /locus_tag="Msip34_2832" /inference="ab initio prediction:Prodigal:1.4" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056832.1" /db_xref="GI:254028272" /db_xref="GeneID:8174788" /translation="MILFIGVIRMNLYVVVEGVVETIVYPDWISIVNEDLKKVSYLDE VVDNNYILFSAGGYPNVFNVIDSAVEDISESALFDRLVVAMDAESFSYNERYDEIYNH LLQKKVKVDFKIIIQNPCFEAWALGNSGIAPRNPSDEKLKKYKRIFDILRKDPETIPD LPEEQLNKAQFSYEYLKLAIRDKGQHLHYSKNSPKLVCHPKYFSSLKERMEKKNHIRS FNDFITAFI" gene complement(1034..2119) /locus_tag="Msip34_2833" /db_xref="GeneID:8174789" CDS complement(1034..2119) /locus_tag="Msip34_2833" /inference="ab initio prediction:Prodigal:1.4" /note="KEGG: sde:Sde_2755 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056833.1" /db_xref="GI:254028273" /db_xref="GeneID:8174789" /translation="MILNNYSYKDHDGHGWNFSPVKLRRNNLLVGISGSGKSRWLNSF SNICSYVFNNDQFRAGVWNLEFSAEEKEYAWEFSSNLEQGGSILSEVLKTKSGDVWEV IFSRDENHIYYGGQQLPRLAKNVTGINLLRDEDAIAPVYRAFGKVMRRNFNASDLIEA GAIIQFPSEPESFDFPANISAINPRLYFLKKRNKKMFDYVIDEFKSIFPFIINVDFTS GAELSDNNSFSPLLIFQERNVKESIYLHDLSSGLIKVLLIMTDISTLPEESIYIIDEY ENSLGMNAINFLPDFIRSYSSKIQFITTSHHPYLINNIPIDDWVVFGRNGSNVAVRLG ENLVSRYGASKQDAFVKLINDPFYRGY" gene complement(2153..3346) /locus_tag="Msip34_2834" /db_xref="GeneID:8174790" CDS complement(2153..3346) /locus_tag="Msip34_2834" /inference="protein motif:PFAM:PF00589" /note="PFAM: integrase family protein; KEGG: nmu:Nmul_A2184 phage integrase" /codon_start=1 /transl_table=11 /product="integrase family protein" /protein_id="YP_003056834.1" /db_xref="GI:254028274" /db_xref="InterPro:IPR002104" /db_xref="GeneID:8174790" /translation="MALTDLAVKNADPKPKPYKLTDGNGLYLLIKPSGKYWRFDYRFN YKRKTLALGVYPDTTLAHAREKLDKARKLLANDPPVDPGENRKAERASRTAKNESCFE VIAREWVASYMLNKSDLHRQRVFRRLEVFMFPWIGAVSIDQLTAPEILKCIRRIQAQN KIETAHRTLQATGQVFRYAVQTGRAIRDVTHDLRGALPSHNVKHMASFTEPEQVAQLL KAIEGFTGSFTVQTALRLAPLVFVRPSELRKARWADIDLEKKEWRFRVSKTNIDHLVP LSTQAVELLEGIKPVSGHGEWVFMGGHDPKKPMSAAAINAALKRMGYNTQTEITGHGF RAMARTILHERLNFDPYVIEHQLAHKVPDALGAAYNRTKFIEQRKDMMQKWADFLTYL VIRAV" gene complement(3711..4676) /locus_tag="Msip34_2835" /db_xref="GeneID:8174791" CDS complement(3711..4676) /locus_tag="Msip34_2835" /inference="similar to AA sequence:KEGG:Neut_1485" /note="KEGG: net:Neut_1485 SppA protein" /codon_start=1 /transl_table=11 /product="SppA protein" /protein_id="YP_003056835.1" /db_xref="GI:254028275" /db_xref="GeneID:8174791" /translation="MAVKEIVNSLSEALDVDIHLYSGQISREGYEQLSVILRDEAAHP RAILMLSTVGGDPHAGYRIARALCHHYPEGFSVMIPGLCKSAGTLICLGAKALIMGDR SELGPLDVQVRKKDELFELGSGLDTIQALSYMQSQAMSAFRTYLIELKADAGLSTRMA AEISTKLATGLFSPVYSQVDPVRLGELQRSIEVAYSYGNRLSKKSSNLKSDALNRLVS RYPAHGFVIDRSEARELFENVRSPAGDELDLLNFWYDNPNNDPSGQPMVINLTKLFGD EDDEQQQANVANDGPHEGDGGVDAVEQAIGEDRGNGDPVPIQDAD" gene complement(4861..6021) /locus_tag="Msip34_2836" /db_xref="GeneID:8174792" CDS complement(4861..6021) /locus_tag="Msip34_2836" /inference="protein motif:PFAM:PF05707" /note="PFAM: Zonular occludens toxin; KEGG: rpi:Rpic_1399 zonular occludens toxin" /codon_start=1 /transl_table=11 /product="Zonular occludens toxin" /protein_id="YP_003056836.1" /db_xref="GI:254028276" /db_xref="InterPro:IPR008900" /db_xref="GeneID:8174792" /translation="MIHLITGLPGSGKSLYTLSTVKSRADKENRPVFYHGIPELTLDW QQLESADKWVDCPKGAIIVIDECQSTFRPRATGAAVPRHVSQLETHRHDGHDLYLITQ HPMLVDSNLRRLVNYHYHVERFFGFAKSKIHEFHKVRENVDKSTKNSIESHFVYPKEV YTWYKSADMHTVKKRIPMRLMLMVLLPVLFFAIVWYGYRALTGISKSPDLDQLPTVEE SGSAQPNQPAPPQFQKPIYSWGEAQRPRVPDLPFTAPKYDDITKPVMAPRIAAAVLIR GKCTAYTQQGTKIAMDEAVCLQFVENGMFQDFDDGSARFNKDQRDLNQQTMTPNALDD AQRPKGEPVAPAAVAVIPYPTDEESERSPANRVRESQSQNRYPNQPQPYAGS" gene complement(6018..6317) /locus_tag="Msip34_2837" /db_xref="GeneID:8174793" CDS complement(6018..6317) /locus_tag="Msip34_2837" /inference="similar to AA sequence:KEGG:Dtpsy_0843" /note="KEGG: dia:Dtpsy_0843 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056837.1" /db_xref="GI:254028277" /db_xref="GeneID:8174793" /translation="MPLFISMFWGFAALSLKTLVGRVLVALAITYVTYQGLDVLLSGA RQAALGLLTNVPADVIGAVGLLRLGESINIVFSAIAARYIVQGLTGGVLTRQVIK" gene complement(6318..7634) /locus_tag="Msip34_2838" /db_xref="GeneID:8174794" CDS complement(6318..7634) /locus_tag="Msip34_2838" /inference="ab initio prediction:Prodigal:1.4" /note="KEGG: rpi:Rpic_2546 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056838.1" /db_xref="GI:254028278" /db_xref="GeneID:8174794" /translation="MAHFLRLILLLSMVLSSYAHAANEEIPSIIDYWSKDGMTVTKKP TPEESCQAYTAYSGVSYGSVTKVDNWKYFCNSKNGNIGNITAHEVCPGGVTPTILHTC LVPQCAAGQVRNPTTGTCQESCKSGFVSGVTGSKYFGSDGSTDCLSNCTYSIAVDMCI KLTNGQGACVGQYGNGTGAACSAATNNTALTPEANCMSQGKSFISVGGVTSCVSAGSQ GSQPVTTKNNSSSSSNSSTKDANGNPTGSSSSNSTTNSSATFNGDGTVVVVTTTEKTN EDGTKETKTETKTVSQTSYCAENPTAAQCKAATDSDISGACDAVACKGDAIQCAMAKE QARRNCEWFKENAEWKSKGEALANGTDTTANPAEENNRTIVNLPTALDASSPISATGI QDRTFAVFGRSYTLRLSELNPYLAVVGYVFMALAYVAAGRILAGAV" sig_peptide complement(6318..6383) /locus_tag="Msip34_2838" /note="Signal predicted by SignalP 3.0 HMM (Signal peptide probability 1.000) with cleavage site probability 0.999 at residue 22" gene complement(7748..7954) /locus_tag="Msip34_2839" /db_xref="GeneID:8174795" CDS complement(7748..7954) /locus_tag="Msip34_2839" /inference="ab initio prediction:Prodigal:1.4" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056839.1" /db_xref="GI:254028279" /db_xref="InterPro:IPR017441" /db_xref="GeneID:8174795" /translation="MKKVNLYSRKVAAYLAVGMTLAAAQAQAAIDVSAVTEKLGEGET AVAAIGGAILVVWAIKKVYSMIRG" sig_peptide complement(7748..7834) /locus_tag="Msip34_2839" /note="Signal predicted by SignalP 3.0 HMM (Signal peptide probability 1.000) with cleavage site probability 0.946 at residue 29" gene complement(7941..8315) /locus_tag="Msip34_2840" /db_xref="GeneID:8174796" CDS complement(7941..8315) /locus_tag="Msip34_2840" /inference="ab initio prediction:Prodigal:1.4" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056840.1" /db_xref="GI:254028280" /db_xref="GeneID:8174796" /translation="MKAFNRIASIHLSRSQVNAMALIMGLLLGGFLTFIWGMTDAYAA ECLTDIGAGQYQIASPQPTEISECVYLIAQPKELTAGAWSLTVEQGQQLAAAIALLWA IGAVFRVLISLLKQKESQHEES" sig_peptide complement(7941..8072) /locus_tag="Msip34_2840" /note="Signal predicted by SignalP 3.0 HMM (Signal peptide probability 0.989) with cleavage site probability 0.831 at residue 44" gene complement(8302..9180) /locus_tag="Msip34_2841" /db_xref="GeneID:8174797" CDS complement(8302..9180) /locus_tag="Msip34_2841" /inference="similar to AA sequence:KEGG:Mfla_1456" /note="KEGG: mfa:Mfla_1456 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056841.1" /db_xref="GI:254028281" /db_xref="GeneID:8174797" /translation="MGTSLALDQDHVYTELPDYHLTIREFASGMVEMIQRRVNFMDEL RKSRSHGIRGLPRQKTEEEIEASQEENLRRAERRARQKVRFLIQSIGADHLLTLVYRE NMTDSDKLNEDFTRFVRLVREKYPDWLYVGVKEYQERGALHMHVACVGKQDVHHLRKC WYIAIGGNADDAGENTKGQINVRYRKKRFSGQSPIFTCLQLAAYLSKYISKTFAHSRE LGERRYKASRGIPAPKVMRQYLGAFAAIHKEQTFPVATQHTLGIADLMGVFDYQIWAT KDLDILILRGSISESL" gene complement(9250..9516) /locus_tag="Msip34_2842" /db_xref="GeneID:8174798" CDS complement(9250..9516) /locus_tag="Msip34_2842" /inference="ab initio prediction:Prodigal:1.4" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056842.1" /db_xref="GI:254028282" /db_xref="GeneID:8174798" /translation="MELKIQVLAISSRTGKSERGEYVRHDLQCFVDQVVGVIPSYDPE FKPEAGFYMASIGWSNYQGRLQPRIEKLTPISQTREHAREQVKA" gene complement(9604..9810) /locus_tag="Msip34_2843" /db_xref="GeneID:8174799" CDS complement(9604..9810) /locus_tag="Msip34_2843" /inference="ab initio prediction:Prodigal:1.4" /note="KEGG: bpy:Bphyt_0267 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_003056843.1" /db_xref="GI:254028283" /db_xref="GeneID:8174799" /translation="MDTHFDASQLTVVPLMSRTKFAQHVGVTEDTVTGWINKGYLPVV EIGKYRLVNLALVTKNALEQSFYE" ORIGIN 1 ttaggctcct ttagagttca ttacaaatca taaatataac cagatcagac taaatcttaa 61 tttggataca ttagggctga ctgtataacg aactggatac acgtgtcaac aggctatgga 121 aaaagaattc gtttggtgag agaacaccta ggactaggca gggcagagtt ttgcaatgaa 181 acaggtatcc caaagcaaag cttaattaac tacgaaacag agaggactaa agcgaacgct 241 gaagtattag gggccattgc caataaatgg cctgaatatg ctgcatactt actgacagat 301 aagactttcg tgatacaaaa gaatcctgaa catgaccaaa taaacaacta gcctcaggat 361 taaaagggcc tgtatttaaa taaatgctgt aatgaaatca ttaaaacttc ttatgtgatt 421 tttcttttcc attctctctt taagagaaga gaaatatttt ggatgacata ctaatttagg 481 agagttcttt gagtaatgca gatgctgacc tttatcacgt atagccagct ttaagtattc 541 atatgagaac tgagccttat ttaattgctc ctcaggcaaa tctggaatcg tctcaggatc 601 tttccttaaa atatcgaata ttcttttata ctttttcaat ttctcatctg acggatttct 661 tggtgctatt ccagaattgc ctaaagccca agcttcaaaa catggatttt gtataattat 721 tttaaaatcc acttttactt ttttctgaag taaatggtta tatatctcat cgtacctctc 781 gttataacta aaactttctg catccatcgc aactacgagg cgatcaaata aagcggattc 841 agatatatcc tctactgcac tatcaataac attgaagaca ttggggtaac ctccagcaga 901 aaatagaatg taattattat caaccacctc atccaaataa cttactttct tgagatcctc 961 attaacgatt gatatccaat caggatacac aatagtttca accactcctt caactacaac 1021 atataaattc attctaataa ccccgataaa aaggatcatt aatcaacttc acaaatgcgt 1081 cttgttttga tgcaccatat ctagatacta gattctcacc caaccgaaca gctacatttg 1141 atccatttcg accaaacaca acccaatcat caattggaat gttattaatc aagtatggat 1201 gatgagaagt tgttataaat tgaattttac tagaatacga cctgatgaaa tcaggaagaa 1261 aattaattgc attcattcct aaagaatttt cgtactcatc aataatataa atactttctt 1321 ctggcaaagt agaaatatcc gtcattatta ataaaacttt tattaaacca gaagataaat 1381 catgaagata aattgactct ttaacatttc tttcttgaaa tattaggaga ggagaaaaac 1441 tattgttatc acttagctcc gctccgcttg taaaatcaac attaataatg aatggaaata 1501 tagacttaaa ctcatctata acataatcaa acatcttctt gttgcgtttc tttagaaagt 1561 agagccttgg attaatagca gagatgttgg caggaaagtc gaaagattca ggctcactcg 1621 gaaactgaat aatggctcct gcctctataa gatctgaggc attaaaattt cgcctcataa 1681 ccttaccaaa tgctctataa actggagcaa tcgcatcctc atctctgaga agattaatcc 1741 ctgtcacatt ctttgccaac cggggaagct gctgtcctcc atagtaaata tgattttcat 1801 cgcgagaaaa aatcacctcc caaacatcac ccgattttgt ttttagaact tctgagagaa 1861 tgctcccacc ttgttcaaga tttgagctaa attcccaggc atactctttt tcttcggctg 1921 aaaattcaag attccagaca cctgctctaa attgatcgtt attaaaaaca taactgcaaa 1981 tatttgaaaa tgaatttagc catctggatt tccctgatcc acttataccc actagaagat 2041 tatttcgtct tagtttgaca ggggaaaagt tccaaccatg tccatcgtga tctttataag 2101 aatagttatt taatatcata tcttgctccc gaattgggtt cccatgcgaa tatcatacag 2161 cccgtataac taaatatgta aggaaatcag cccatttttg catcatatct tttctctgct 2221 cgataaattt agtgcgattg taggctgcac cgagtgcatc ggggactttg tgggctaact 2281 gatgctcgat aacgtaggga tcgaagttta agcgctcgtg caatattgtt cgggccatgg 2341 cgcggaaacc gtggcctgtt atctctgttt gggtgttgta gcccattctc tttaaggcag 2401 cattgattgc cgctgcactc atgggctttt ttggatcgtg tccacccata aatacccatt 2461 ccccatgccc tgatacaggc ttgatccctt ccagcagttc aaccgcctga gttgagagcg 2521 gcaccaggtg atcgatgttt gttttgctca ccctaaaccg ccactccttc ttttcaagat 2581 caatatctgc ccatcttgcc ttgcgtagct cgctagggcg cacgaatacg agtggggcta 2641 acctcagggc tgtctgtacc gtgaaagagc ctgtgaagcc ttctattgcc ttcagcagct 2701 gggcaacttg ctcgggttcg gtgaatgagg ccatatgttt gacgttatgc gatggcaacg 2761 cccctctcaa atcatgcgta acgtcacgta ttgcccgccc tgtttgaact gcatatctga 2821 aaacctgacc tgttgcctgc aatgtgcggt gcgcggtctc tattttgttt tgcgcctgaa 2881 ttcgccggat gcatttcaga atctcagggg cagttaactg atcaatagaa actgctccga 2941 tccaaggaaa catgaagact tccaaacggc gaaacacacg ctggcggtgt aagtcgcttt 3001 tgttaagcat ataactggcg acccactccc gcgctatgac ctcaaagcag ctttcatttt 3061 tggcggttcg tgaggcgcgc tccgctttcc ggttctcacc tggatcaacg ggcggatcat 3121 tggcgagtag cttacgggct ttatcaagtt tttctcgggc atgggctagt gttgtatctg 3181 gatacacgcc aagagcgagg gttttgcgtt tgtagttaaa ccgataatcg aagcgccagt 3241 atttgccgga tggcttaata agcagatata gaccattgcc atccgtcagc ttgtagggct 3301 tgggtttggg atcagcgttc tttactgcga ggtctgtaag ggccatgacg gtatctggta 3361 atgacggtac ttgagatacc gcgaaagata ccgtcagtga ttatggactg tcaatgaacc 3421 acaatagaaa tctataaacg aggaaaattt ataactacct gtatttattg atatttaaaa 3481 atcccatcag acgtcattgg atgactatgg attgcattgt gcggggaatg gtgccggaga 3541 tagctacaaa acaaatgggc ctaacccagt attgacaagg gtaaggccca tttatgaaaa 3601 aagataccga cggatatacc gtcaaaatta aagattcaga ttaacatctc gacccaagca 3661 cacttttcaa ggtggcatta ttggtcttgt ggtttacatc aaaattcgac tcagtctgca 3721 tcttgtattg ggactggatc accatttccg cgatcttcgc cgattgcctg ctcgactgca 3781 tcaacgcccc catcaccttc atgtggtccg tcatttgcga cgttagcttg ctgctgctca 3841 tcatcctcat ccccaaaaag tttagttaaa ttaataacca taggttgccc agaagggtca 3901 ttattaggat tatcatacca aaaattcaat aaatctagct catcaccagc aggactacga 3961 acattttcaa aaagctctct agcctcacta cgatcaataa caaacccatg cgctgggtaa 4021 cggcttacca gcctgtttaa ggcatcactc tttaagttag aggatttttt tgacaatcgg 4081 tttccatatg aataagcaac ttcgattgac cgttgcaatt cgcctaacct aacgggatcg 4141 acttgactgt atacaggaga aaacaaccca gtagctagct tagttgaaat ctcggcagcc 4201 attctagtac tcaagcctgc atctgctttc aattcaatta aataggttct gaaagcactc 4261 atggcctgcg attgcatgta acttagcgcc tggatagtgt ccagtccaga gcctaactca 4321 aagagttcgt cttttttacg cacttgaaca tctaaggggc ctagctcact tctatccccc 4381 ataatgagtg cctttgcgcc taaacatatc aaagtaccgg cacttttgca caatcccgga 4441 atcattactg agaacccttc cgggtaatga tggcataacg ccctagcaat tctatatcct 4501 gcatgaggat cgccgccaac agtggataac atcaaaatcg cacgcgggtg agctgcctcg 4561 tctcttaaga taacagacag ctgttcataa ccttcccgag aaatttggcc cgaataaaga 4621 tgtatatcga catccagtgc ttctgaaagt gaattaacaa tttctttcac agccattttc 4681 atctccctaa gactttattt ttattgttaa taatcttaca tccaaacatc ttggttgtat 4741 tgtgtccgta atataaattt aggcggcgct tcgcgccggg ctacgctccc ggcaccaagc 4801 accgccttaa cccccataaa aaaagcgcct catgggcgct tttttcttct cctgactcta 4861 tcaacttccc gcgtaaggtt gcggctggtt tgggtagcgg ttttgagact gcgactcacg 4921 cactcgattc gccgggctac gctctgattc ctcatccgtg ggataaggaa tcacggctac 4981 cgccgcggga gctaccggct cacccttcgg gcgctgcgcg tcgtccagtg cgttcggagt 5041 catggtctgt tggttgagat cgcgttggtc tttattaaac cgagcgctgc catcgtcgaa 5101 atcctgaaac atcccgtttt cgacaaactg gaggcacact gcctcatcca tcgcgatctt 5161 ggtgccctgt tgggtgtatg ccgtgcattt accccgaata agcacagcag cggcgattct 5221 gggggccatg acgggcttgg tgatgtcgtc atacttggga gccgtaaacg gcaggtctgg 5281 gacgcgtggg cgttgcgctt caccccatga atagatcggc ttttggaatt gcggcggggc 5341 tggctgattg ggttgagcag atccactttc ttccaccgtt ggcagttggt cgagatccgg 5401 cgattttgag atcccggtta gagcacgata gccataccag acgatggcaa agaacagcac 5461 gggcaacagg accataagca tcaagcgcat cgggatacgc tttttaaccg tatgcatgtc 5521 cgccgactta taccaggtgt agacctcttt cggatataca aagtggctct caatgctgtt 5581 cttggtggat ttatcgacgt tctcccgcac cttgtggaat tcgtggattt tggacttggc 5641 aaagccaaaa aaccgctcga cgtggtaatg gtagttcacg agccggcgca gattggaatc 5701 aaccagcatg gggtgctgcg taatcagata gagatcatgg ccatcatgcc ggtgcgtttc 5761 caattggcta acatggcgag gcaccgcagc accggtggca cgaggtctaa aggtggactg 5821 acattcatca atcacgataa tcgccccctt ggggcaatct acccatttat cagcagattc 5881 gagctgttgc caatctaggg tgagttccgg gatgccgtga tagaacaccg ggcgattttc 5941 tttatccgcc ctactcttca ccgtggacaa ggtatagagg cttttacccg agcctgggag 6001 gcctgttatc aggtgaatca tttgatcacc tgccttgtga ggacgccacc cgtcagacct 6061 tgcacgatgt atcgcgccgc tatggcacta aatacgatat tgatcgattc gcccaggcgc 6121 aggaggccca ctgccccaat gacatccgcc gggacattgg ttaacagacc aagggcggct 6181 tgtcttgcgc cagacagcag cacatccagc ccttgatagg tcacataggt aattgccaag 6241 gccaccagta cacgcccgac caaggtttta agcgacaggg cagcaaagcc ccagaacatg 6301 gaaatgaata atggcattta taccgctccc gctaagattc ggcctgcagc gacataagcc 6361 aaggccatga agacataccc gaccactgcc agatatggat tgagttcaga tagacgcagg 6421 gtgtaactgc gaccaaatac agcgaaggtc cggtcttgaa tcccggtggc ggagatcggg 6481 ctggaggcat ccaaggcagt gggcaagtta acaatggtcc ggttgttctc ttctgccgga 6541 ttggcggtcg tatcagtgcc attggctaag gcttcgccct tgcttttcca ttcggcgttt 6601 tctttaaacc actcgcagtt acgtcgggcc tgttcttttg ccatggcgca ctggatcgca 6661 tcgcccttgc acgccaccgc atcgcatgcc ccggaaatat cggaatccgt cgccgccttg 6721 cactgcgcag ccgtgggatt ctcagcgcaa tagctggtct ggctgaccgt tttcgtttcg 6781 gtcttcgttt ccttggtgcc gtcttcattg gtcttttcgg tggtagtgac gacaacaacc 6841 gtaccatcgc cgttgaaggt agcgctcgaa ttggtggtgc tattgctgga gctgctccct 6901 gtcgggttgc cgttggcgtc tttggtcgag ctgttagagc tggaagacga attgtttttc 6961 gtggtgacag gttgcgagcc ttgagagcca gcggaaacgc aggaggtaac accacccaca 7021 ctgataaagc ttttgccttg gctcatgcaa ttagcctcag gggttaaggc cgtgttattg 7081 gtggcagccg agcaagcagc cccggtgcca ttgccatatt gaccaacaca agcaccctgc 7141 ccgttggtga gtttgatgca catatcaacc gcgatggagt aagtgcaatt gctcaggcaa 7201 tcggtggagc catcggagcc gaagtattta gagccagtga cgccagagac aaaaccggat 7261 ttgcaagatt cctggcatgt acctgttgtg ggatttcgca cctgaccagc ggcacattgc 7321 gggactaagc aggtgtgaag aatggtcggc gtgacaccac cggggcaaac ttcgtgggcg 7381 gtaatattcc cgatatttcc atttttgctg ttacagaaat atttccaatt atcgacctta 7441 gtgacactgc cataagaaac gcctgaataa gccgtatagg cttggcaaga ttcctcgggt 7501 gtaggctttt ttgtgacggt catgccatct tttgaccagt aatcaataat ggaggggatt 7561 tcttcattgg ccgcatgcgc atagctactc agcaccattg aaagtaggag aatcaggcgg 7621 agaaaatgag ccatagcgcc cccagtaatc cgatcaacac taaccatcct tccattttcc 7681 gtaccccctt aagaaaaaag gcccggaggg cttagaagct ccgggccgat tggtcttgat 7741 cgtggaatta accccggatc atgctgtata ccttcttgat ggcccagacc accaagatgg 7801 cgccaccgat cgcagctaca gccgtttcac cttcaccgag tttttcggta acagcggaaa 7861 cgtcgatagc ggcttgggct tgtgcagcag ccaaggtcat accaacagcc aggtaagcgg 7921 caactttgcg ggaatacagg ttaactttct tcatgttgag actccttttg tttaagtaat 7981 gaaatgagga cgcggaaaac cgctccgatt gcccacagta gcgctatggc agccgctagc 8041 tgttgaccct gttcgactgt cagggaccat gccccggcgg tcagttcttt tggctgtgca 8101 atgaggtata cacactctga aatctcggta ggttgcgggc tggcgatctg gtattgcccg 8161 gctccgatat cggttaagca ttcggcggcg taggcgtctg tcattcccca gatgaacgtg 8221 aggaatccgc cgagcaaaag gcccataatc agcgccatgg cattgacctg gctgcgtgaa 8281 agatgaatgg aggcgatccg attaaaggct ttcactgatc gaacctcgca atatgagtat 8341 gtccaggtct ttcgtggccc atatctggta atcaaagacg cccatcaaat cggcgatgcc 8401 tagcgtgtgt tgggtggcca ccgggaacgt ctgttcttta tggatggcag cgaatgcgcc 8461 gaggtattgg cgcatgacct tgggcgcggg tatgccccgg cttgctttat agcggcgctc 8521 gcccaattcg cggctatggg cgaatgtctt gctgatgtac ttgcttaaat aagcggcgag 8581 ctgcaagcat gtgaaaatcg gcgactgacc agagaagcgc tttttacggt agcgcacatt 8641 gatctgacct ttggtgttct cgcctgcatc gtctgcattg ccaccgatgg cgatatacca 8701 gcacttgcgg aggtggtgga catcctgttt accgacgcag gcgacgtgca tatggagtgc 8761 gccgcgttcc tggtattcct tgacgccgac atagagccaa tcggggtatt tttcgcgaac 8821 gaggcgcacg aagcgggtga agtcttcgtt cagtttgtcg ctatcggtca tgttctcgcg 8881 atagaccaac gtcaggaggt gatcggcccc gatggactgg atcagaaagc ggactttttg 8941 ccgggcgcgg cgttcggcac ggcgaaggtt ttcttcctgg gaggcttcga tttcctcttc 9001 ggttttttgg cgtggcaagc ccctgatccc atgcgagcgg gatttccgta gctcgtccat 9061 aaagttgaca cgcctttgta tcatttccac cattcctgag gcgaactccc tgattgttaa 9121 gtgataatcg gggagctccg tatatacatg atcctgatct agtgctaagc ttgtacccat 9181 gttttagcca tccgcttatg gttaaaaata gaaagccctg attggttgcc gcctttcagg 9241 gcttttgttt tatgccttta cctgttcccg tgcatgctct ctggtctgac tgattggggt 9301 gagtttttcg atacgaggtt gcagccgtcc ctgatagtta ctccagccga tagacgccat 9361 ataaaagcca gcctcaggct tgaattccgg gtcataactt gggatgactc ctactacctg 9421 atctacgaag cattgcagat cgtggcggac gtattccccg cgttcacttt tgccagtgcg 9481 tgagctgatt gccagtactt gaatcttgag ttccatggtg tgtttccttt agttgattga 9541 gtgcttatgt acccgctgat tgggtgttaa gcccctacgg tgcgggcgtt gccgagggga 9601 gtcttattca taaaaggatt gctctaacgc gtttttggtg acgagcgcga ggttgaccag 9661 gcggtatttg ccgatctcga caacggggag gtagcccttg ttaatccagc ccgtaacggt 9721 gtcttctgtc acacctacgt gttgggcgaa tttagtgcgg gacatgaggg gcacgacggt 9781 caattgactt gcgtcgaagt gagtgtccat aatgcg //